WO2001075067A9 - Novel nucleic acids and polypeptides - Google Patents

Novel nucleic acids and polypeptides

Info

Publication number
WO2001075067A9
WO2001075067A9 PCT/US2001/008631 US0108631W WO0175067A9 WO 2001075067 A9 WO2001075067 A9 WO 2001075067A9 US 0108631 W US0108631 W US 0108631W WO 0175067 A9 WO0175067 A9 WO 0175067A9
Authority
WO
WIPO (PCT)
Prior art keywords
polypeptide
polynucleotide
protein
cells
sequence
Prior art date
Application number
PCT/US2001/008631
Other languages
French (fr)
Other versions
WO2001075067A2 (en
WO2001075067A8 (en
WO2001075067A3 (en
Inventor
Rodoje T Drmanac
Chenghua Liu
Y Tom Tang
Original Assignee
Hyseq Inc
Rodoje T Drmanac
Chenghua Liu
Y Tom Tang
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hyseq Inc, Rodoje T Drmanac, Chenghua Liu, Y Tom Tang filed Critical Hyseq Inc
Priority to US10/450,763 priority Critical patent/US20050196754A1/en
Priority to AU2001249251A priority patent/AU2001249251A1/en
Publication of WO2001075067A2 publication Critical patent/WO2001075067A2/en
Publication of WO2001075067A3 publication Critical patent/WO2001075067A3/en
Priority to US10/128,558 priority patent/US20040219521A1/en
Priority to US10/243,552 priority patent/US20030224379A1/en
Publication of WO2001075067A9 publication Critical patent/WO2001075067A9/en
Priority to US10/302,689 priority patent/US20080050393A1/en
Publication of WO2001075067A8 publication Critical patent/WO2001075067A8/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/81Protease inhibitors
    • C07K14/8107Endopeptidase (E.C. 3.4.21-99) inhibitors
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61KPREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
    • A61K38/00Medicinal preparations containing peptides

Definitions

  • the present invention provides novel polynucleotides and proteins encoded by such polynucleotides, along with uses for these polynucleotides and proteins, for example in therapeutic, diagnostic and research methods.
  • Identified polynucleotide and polypeptide sequences have numerous applications in, for example, diagnostics, forensics, gene mapping; identification of mutations responsible for genetic disorders or other traits, to assess biodiversity, and to produce many other types of data and products dependent on DNA and amino acid sequences.
  • compositions of the present invention include novel isolated polypeptides, novel isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules, cloned genes or degenerate variants thereof, especially naturally occurring variants such as allelic variants, antisense polynucleotide molecules, and antibodies that specifically recognize one or more epitopes present on such polypeptides, as well as hybridomas producing such antibodies.
  • compositions of the present invention additionally include vectors, including expression vectors, containing the polynucleotides of the invention, cells genetically engineered to contain such polynucleotidesand cells genetically engineered to express such polynucleotides.
  • the present invention relates to a collection or library of at least one novel nucleic acid sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by hybridization (SBH), and in some cases, sequences obtained from one or more public databases.
  • ESTs expressed sequence tags isolated mainly by sequencing by hybridization
  • the invention relates also to the proteins encoded by such polynucleotides, along with therapeutic, diagnostic and research utilities for these polynucleotides and proteins. These nucleic acid sequences are designated as SEQ ID NO: 1 -30368.
  • the polypeptides sequences are designated SEQ ID NO: 30369-60736.
  • the nucleic acids and polypeptides are provided in the Sequence Listing.
  • A is adenosine
  • C is cytosine
  • G is guanine
  • T is thymine
  • N is any of the four bases.
  • * corresponds to the stop codon.
  • nucleic acid sequences of the present invention also include, nucleic acid sequences that hybridize to the complement of SEQ ID NO: 1-30368 under stringent hybridization conditions; nucleic acid sequences which are allelic variants or species homologues of any of the nucleic acid sequences recited above, or nucleic acid sequences that encode a peptide comprising a specific domain or truncation of the peptides encoded by SEQ ID NO: 1-30368.
  • a polynucleotide comprising a nucleotide sequence having at least 90% identity to an identifying sequence of SEQ ID NO: 1-30368 or a degenerate variant or fragment thereof.
  • the identifying sequence can be 100 base pairs in length.
  • the nucleic acid sequences of the present invention also include the sequence information from the nucleic acid sequences of SEQ ID NO: 1 -30368.
  • the sequence information can be a segment of any one of SEQ ID NO: 1 -30368 that uniquely identifies or represents the sequence information of SEQ ID NO: 1-30368.
  • a collection as used in this application can be a collection of only one polynucleotide.
  • the collection of sequence information or identifying information of each sequence can be provided on a nucleic acid array.
  • segments of sequence information is provided on a nucleic acid array to detect the polynucleotide that contains the segment.
  • the array can be designed to detect full-match or mismatch to the polynucleotide that contains the segment.
  • the collection can also be provided in a computer-readable format.
  • This invention also includes the reverse or direct complement of any of the nucleic acid sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and host cells or organisms transformed with these expression vectors.
  • Nucleic acid sequences (or their reverse or direct complements) according to the invention have numerous applications in a variety of techniques known to those skilled in the art of molecular biology, such as use as hybridization probes, use as primers for PCR, use in an array, use in computer-readable media, use in sequencing full-length genes, use for chromosome and gene mapping, use in the recombinant production of protein, and use in the generation of anti-sense DNA or RNA, their chemical analogs and the like.
  • nucleic acid sequences of SEQ ID NO: 1 -30368 or novel segments or parts of the nucleic acids of the invention are used as primers in expression assays that are well known in the art.
  • nucleic acid sequences of SEQ ID NO: 1 -30368 or novel segments or parts of the nucleic acids provided herein are used in diagnostics for identifying expressed genes or, as well known in the art and exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for physical mapping of the human genome.
  • the isolated polynucleotides of the invention include, but are not limited to, a polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1-30368; a polynucleotide comprising any of the full length protein coding sequences of SEQ ID NO: 1 -30368; and a polynucleotide comprising any of the nucleotide sequences of the mature protein coding sequences of SEQ ID NO: 1-30368.
  • the polynucleotides of the present invention also include, but are not limited to, a polynucleotide that hybridizes under stringent hybridization conditions to (a) the complement of any one of the nucleotide sequences set forth in SEQ ID NO: 1 -30368; (b) a nucleotide sequence encoding any one of the amino acid sequences set forth in the Sequence Listing (e.g., SEQ ID NO: 30369-60736); (c) a polynucleotide which is an allelic variant of any polynucleotides recited above; (d) a polynucleotide which encodes a species homolog (e.g.
  • the isolated polypeptides of the invention include, but are not limited to, a polypeptide comprising any of the amino acid sequences set forth in the Sequence Listing; or the corresponding full length or mature protein.
  • Polypeptides of the invention also include polypeptides with biological activity that are encoded by (a) any of the polynucleotides having a nucleotide sequence set forth in SEQ ID NO: 1-30368; or (b) polynucleotides that hybridize to the complement of the polynucleotides of (a) under stringent hybridization conditions.
  • Biologically or immunologically active variants of any of the polypeptide sequences in the Sequence Listing, and "substantial equivalents" thereof e.g.
  • polypeptides of the invention may be wholly or partially chemically synthesized but are preferably produced by recombinant means using the genetically engineered cells (e.g. host cells) of the invention.
  • the invention also provides compositions comprising a polypeptide of the invention.
  • Polypeptide compositions of the invention may further comprise an acceptable carrier, such as a hydrophilic, e.g., pharmaceutically acceptable, carrier.
  • an acceptable carrier such as a hydrophilic, e.g., pharmaceutically acceptable, carrier.
  • the invention also provides host cells transformed or transfected with a polynucleotide of the invention.
  • the invention also relates to methods for producing a polypeptide of the invention comprising growing a culture of the host cells of the invention in a suitable culture medium under conditions permitting expression of the desired polypeptide, and purifying the polypeptide from the culture or from the host cells.
  • Preferred embodiments include those in which the protein produced by such process is a mature form of the protein.
  • Polynucleotides according to the invention have numerous applications in a variety of techniques known to those skilled in the art of molecular biology. These techniques include use as hybridization probes, use as oligomers, or primers, for PCR, use for chromosome and gene mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA or RNA, their chemical analogs and the like. For example, when the expression of an mRNA is largely restricted to a particular cell or tissue type, polynucleotides of the invention can be used as hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample using, e.g., in situ hybridization.
  • the polynucleotides are used in diagnostics as expressed sequence tags for identifying expressed genes or, as well known in the art and exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for physical mapping of the human genome.
  • polypeptides according to the invention can be used in a variety of conventional procedures and methods that are currently applied to other proteins.
  • a polypeptide of the invention can be used to generate an antibody that specifically binds the polypeptide.
  • Such antibodies, particularly monoclonal antibodies, are useful for detecting or quantitating the polypeptide in tissue.
  • the polypeptides of the invention can also be used as molecular weight markers, and as a food supplement.
  • Methods are also provided for preventing, treating, or ameliorating a medical condition which comprises the step of administering to a mammalian subject a therapeutically effective amount of a composition comprising a polypeptide of the present invention and a pharmaceutically acceptable carrier.
  • polypeptides and polynucleotides of the invention can be utilized, for example, in methods for the prevention and/or treatment of disorders involving aberrant protein expression or biological activity.
  • the present invention further relates to methods for detecting the presence of the polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, be utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the identification of subjects exhibiting a predisposition to such conditions.
  • the invention provides a method for detecting the polynucleotides of the invention in a sample, comprising contacting the sample with a compound that binds to and forms a complex with the polynucleotide of interest for a period sufficient to form the complex and under conditions sufficient to form a complex and detecting the complex such that if a complex is detected, the polynucleotide of interest is detected.
  • the invention also provides a method for detecting the polypeptides of the invention in a sample comprising contacting the sample with a compound that binds to and forms a complex with the polypeptide under conditions and for a period sufficient to form the complex and detecting the formation of the complex such that if a complex is formed, the polypeptide is detected.
  • kits comprising polynucleotide probes and/or monoclonal antibodies, and optionally quantitative standards, for carrying out methods of the invention. Furthermore, the invention provides methods for evaluating the efficacy of drugs, and monitoring the progress of patients, involved in clinical trials for the treatment of disorders as recited above.
  • the invention also provides methods for the identification of compounds that modulate (t.e., increase or decrease) the expression or activity of the polynucleotides and/or polypeptides of the invention. Such methods can be utilized, for example, for the identification of compounds that can ameliorate symptoms of disorders as recited herein. Such methods can include, but are not limited to, assays for identifying compounds and other substances that interact with (e.g., bind to) the polypeptides of the invention.
  • the invention provides a method for identifying a compound that binds to the polypeptides of the invention comprising contacting the compound with a polypeptide of the invention in a cell for a time sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a reporter gene sequence in the cell; and detecting the complex by detecting the reporter gene sequence expression such that if expression of the reporter gene is detected the compound that binds to a polypeptide of the invention is identified.
  • the methods of the invention also provides methods for treatment which involve the administration of the polynucleotides or polypeptides of the invention to individuals exhibiting symptoms or tendencies.
  • the invention encompasses methods for treating diseases or disorders as recited herein comprising administering compounds and other substances that modulate the overall activity of the target gene products. Compounds and other substances can effect such modulation either on the level of target gene/protein expression or target protein activity.
  • polypeptides of the present invention and the polynucleotides encoding them are also useful for the same functions known to one of skill in the art as the polypeptides and polynucleotides to which they have homology (set forth in the sequence listing). If no homology is set forth for a sequence, then the polypeptides and polynucleotides of the present invention are useful for a variety of applications, as described herein, including use in arrays for detection.
  • active refers to those forms of the polypeptide which retain the biologic and/or immunologic activities of any naturally occurring polypeptide.
  • biologically active refers to a protein or peptide having structural, regulatory or biochemical functions of a naturally occurring molecule.
  • immunologically active refers to the capability of the natural, recombinant or synthetic polypeptide to induce a specific immune response in appropriate animals or cells and to bind with specific antibodies.
  • activated cells are those cells which are engaged in extracellular or intracellular membrane trafficking, including the export of secretory or enzymatic molecules as part of a normal or disease process.
  • complementary or “complementarity” refer to the natural binding of polynucleotides by base pairing.
  • sequence 5'-AGT-3' binds to the complementary sequence 3'-TCA-5'.
  • Complementarity between two single-stranded molecules may be "partial” such that only some of the nucleic acids bind or it may be "complete” such that total complementarity exists between the single stranded molecules.
  • the degree of complementarity between the nucleic acid strands has significant effects on the efficiency and strength of the hybridization between the nucleic acid strands.
  • Embryonic stem cells refers to a cell that can give rise to many differentiated cell types in an embryo or an adult, including the germ cells.
  • GSCs germ line stem cells
  • primordial stem cells that provide a steady and continuous source of germ cells for the production of gametes.
  • primordial germ cells PGCs
  • PGCs primary germ cells
  • PGCs are the source from which GSCs and ES cells are derived The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells not only populate the germ line and give rise to a plurality of terminally differentiated cells that comprise the adult specialized organs, but are able to regenerate themselves.
  • EMF expression modulating fragment
  • a sequence is said to "modulate the expression of an operably linked sequence” when the expression of the sequence is altered by the presence of the EMF.
  • EMFs include, but are not limited to, promoters, and promoter modulating sequences (inducible elements).
  • One class of EMFs are nucleic acid fragments which induce the expression of an operably linked ORF in response to a specific regulatory factor or physiological event.
  • the terms "nucleotide sequence” or “nucleic acid” or “polynucleotide” or “oligonucleotide” are used interchangeably and refer to a heteropolymer of nucleotides or the sequence of these nucleotides.
  • DNA or RNA of genomic or synthetic origin which may be single-stranded or double-stranded and may represent the sense or the antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like material.
  • PNA peptide nucleic acid
  • A is adenine
  • C is cytosine
  • T is thymine
  • G is guanine
  • N is A, C, G or T (U).
  • T thymine
  • U uracil
  • nucleic acid segments provided by this invention may be assembled from fragments of the genome and short oligonucleotide linkers, or from a series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic acid which is capable of being expressed in a recombinant transcriptional unit comprising regulatory elements derived from a microbial or viral operon, or a eukaryotic gene.
  • oligonucleotide fragment or a "polynucleotide fragment", "portion,” or “segment” or “probe” or “primer” are used interchangeably and refer to a sequence of nucleotide residues which are at least about 5 nucleotides, more preferably at least about 7 nucleotides, more preferably at least about 9 nucleotides, more preferably at least about 1 1 nucleotides and most preferably at least about 17 nucleotides.
  • the fragment is preferably less than about 500 nucleotides, preferably less than about 200 nucleotides, more preferably less than about 100 nucleotides, more preferably less than about 50 nucleotides and most preferably less than 30 nucleotides.
  • the probe is from about 6 nucleotides to about 200 nucleotides, preferably from about 15 to about 50 nucleotides, more preferably from about 17 to 30 nucleotides and most preferably from about 20 to 25 nucleotides.
  • the fragments can be used in polymerase chain reaction (PCR), various hybridization procedures or microarray procedures to identify or amplify identical or related parts of mRNA or DNA molecules.
  • a fragment or segment may uniquely identify each polynucleotide sequence of the present invention.
  • the fragment comprises a sequence substantially similar to any one of SEQ ID NO: 1-30368.
  • Probes may, for example, be used to determine whether specific mRNA molecules are present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as described by Walsh et al. (Walsh, P.S. et al., 1992, PCR Methods Appl 1 :241-250). They may be labeled by nick translation, Klenow fill-in reaction, PCR, or other methods well known in the art. Probes of the present invention, their preparation and/or labeling are elaborated in
  • the nucleic acid sequences of the present invention also include the sequence information from the nucleic acid sequences of SEQ ID NO: 1-30368.
  • the sequence information can be a segment of any one of SEQ ID NO: 1-30368 that uniquely identifies or represents the sequence information of that sequence of SEQ ID NO: 1-30368.
  • One such segment can be a twenty-mer nucleic acid sequence because the probability that a twenty-mer is fully matched in the human genome is 1 in 300. In the human genome, there are three billion base pairs in one set of chromosomes. Because 4 possible twenty-mers exist, there are 300 times more twenty-mers than there are base pairs in a set of human chromosomes. Using the same analysis, the probability for a seventeen-mer to be fully matched in the human genome is approximately 1 in 5. When these segments are used in arrays for expression studies, fifteen-mer segments can be used. The probability that the fifteen-mer is fully matched in the expressed sequences is also approximately one in five because expressed sequences comprise less than approximately 5% of the entire genome sequence.
  • a segment when using sequence information for detecting a single mismatch, a segment can be a twenty-five mer.
  • the probability that the twenty-five mer would appear in a human genome with a single mismatch is calculated by multiplying the probability for a full match (1 ⁇ 4 2 ⁇ ) times the increased probability for mismatch at each nucleotide position (3 x 25).
  • the probability that an eighteen mer with a single mismatch can be detected in an array for expression studies is approximately one in five.
  • the probability that a twenty-mer with a single mismatch can be detected in a human genome is approximately one in five.
  • the term "open reading frame," ORF means a series of nucleotide triplets coding for amino acids without any termination codons and is a sequence translatable into protein.
  • operably linked refers to functionally related nucleic acid sequences.
  • a promoter is operably associated or operably linked with a coding sequence if the promoter controls the transcription of the coding sequence.
  • operably linked nucleic acid sequences can be contiguous and in the same reading frame, certain genetic elements e.g. repressor genes are not contiguously linked to the coding sequence but still control transcription/translation of the coding sequence.
  • pluripotent refers to the capability of a cell to differentiate into a number of differentiated cell types that are present in an adult organism.
  • a pluripotent cell is restricted in its differentiation capability in comparison to a totipotent cell.
  • polypeptide or “peptide” or “amino acid sequence” refer to an oligopeptide, peptide, polypeptide or protein sequence or fragment thereof and to naturally occurring or synthetic molecules.
  • a polypeptide "fragment,” “portion,” or “segment” is a stretch of amino acid residues of at least about 5 amino acids, preferably at least about 7 amino acids, more preferably at least about 9 amino acids and most preferably at least about 17 or more amino acids.
  • the peptide preferably is not greater than about 200 amino acids, more preferably less than 150 amino acids and most preferably less than 100 amino acids.
  • the peptide is from about 5 to about 200 amino acids.
  • any polypeptide must have sufficient length to display biological and/or immunological activity.
  • naturally occurring polypeptide refers to polypeptides produced by cells that have not been genetically engineered and specifically contemplates various polypeptides arising from post-translational modifications of the polypeptide including, but not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation.
  • transcriptional modifications including, but not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation.
  • transcriptslated protein coding portion means a sequence which encodes for the full length protein which may include any leader sequence or any processing sequence.
  • mature protein coding sequence means a sequence which encodes a peptide or protein without a signal or leader sequence.
  • the "mature protein portion” means that portion of the protein which does not include a signal or leader sequence.
  • the peptide may have been produced by processing in the cell which removes any leader/signal sequence.
  • the mature protein portion may or may not include an initial methionine residue. The methionine residue may be removed from the protein during processing in the cell.
  • the peptide may be produced synthetically or the protein may have been produced using a polynucleotide only encoding for the mature protein coding sequence.
  • derivative refers to polypeptides chemically modified by such techniques as ubiquitination, labeling (e.g., with radionuclides or various enzymes), covalent polymer attachment such as pegylation (derivatization with polyethylene glycol) and insertion or substitution by chemical synthesis of amino acids such as ornithine, which do not normally occur in human proteins.
  • variant refers to any polypeptide differing from naturally occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, e g., recombinant DNA techniques.
  • Guidance in determining which amino acid residues may be replaced, added or deleted without abolishing activities of interest, may be found by comparing the sequence of the particular polypeptide with that of homologous peptides and minimizing the number of amino acid sequence changes made in regions of high homology (conserved regions) or by replacing amino acids with consensus sequence.
  • recombinant variants encoding these same or similar polypeptides may be synthesized or selected by making use of the "redundancy" in the genetic code.
  • Various codon substitutions such as the silent changes which produce various restriction sites, may be introduced to optimize cloning into a plasmid or viral vector or expression in a particular prokaryotic or eukaryotic system.
  • Mutations in the polynucleotide sequence may be reflected in the polypeptide or domains of other peptides added to the polypeptide to modify the properties of any part of the polypeptide, to change characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover rate.
  • amino acid substitutions are the result of replacing one amino acid with another amino acid having similar structural and/or chemical properties, i.e., conservative amino acid replacements.
  • conservative amino acid replacements may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues involved.
  • nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; and negatively charged (acidic) amino acids include aspartic acid and glutamic acid.
  • “Insertions” or “deletions” are preferably in the range of about 1 to 20 amino acids, more preferably 1 to 10 amino acids.
  • the variation allowed may be experimentally determined by systematically making insertions, deletions, or substitutions of amino acids in a polypeptide molecule using recombinant DNA techniques and assaying the resulting recombinant variants for activity.
  • insertions, deletions or non-conservative alterations can be engineered to produce altered polypeptides.
  • Such alterations can, for example, alter one or more of the biological functions or biochemical characteristics of the polypeptides of the invention.
  • such alterations may change polypeptide characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover rate.
  • Such alterations can be selected so as to generate polypeptides that are better suited for expression, scale up and the like in the host cells chosen for expression.
  • cysteine residues can be deleted or substituted with another amino acid residue in order to eliminate disulfide bridges.
  • purified or “substantially purified” as used herein denotes that the indicated nucleic acid or polypeptide is present in the substantial absence of other biological macromolecules, e.g., polynucleotides, proteins, and the like.
  • the polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more preferably at least 99% by weight, of the indicated biological macromolecules present (but water, buffers, and other small molecules, especially molecules having a molecular weight of less than 1000 daltons, can be present).
  • isolated refers to a nucleic acid or polypeptide separated from at least one other component (e.g., nucleic acid or polypeptide) present with the nucleic acid or polypeptide in its natural source.
  • the nucleic acid or polypeptide is found in the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a solution of the same.
  • isolated and purified do not encompass nucleic acids or polypeptides present in their natural source.
  • recombinant when used herein to refer to a polypeptide or protein, means that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or mammalian) expression systems.
  • Microbial refers to recombinant polypeptides or proteins made in bacterial or fungal (e.g., yeast) expression systems.
  • recombinant microbial defines a polypeptide or protein essentially free of native endogenous substances and unaccompanied by associated native glycosylation. Polypeptides or proteins expressed in most bacterial cultures, e.g., E.
  • recombinant expression vehicle or vector refers to a plasmid or phage or virus or vector, for expressing a polypeptide from a DNA (RNA) sequence.
  • An expression vehicle can comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural or coding sequence which is transcribed into mRNA and translated into protein, and (3) appropriate transcription initiation and termination sequences.
  • Structural units intended for use in yeast or eukaryotic expression systems preferably include a leader sequence enabling extracellular secretion of translated protein by a host cell.
  • recombinant protein may include an amino terminal methionine residue. This residue may or may not be subsequently cleaved from the expressed recombinant protein to provide a final product.
  • recombinant expression system means host cells which have stably integrated a recombinant transcriptional unit into chromosomal DNA or carry the recombinant transcriptional unit extrachromosomally.
  • Recombinant expression systems as defined herein will express heterologous polypeptides or proteins upon induction of the regulatory elements linked to the DNA segment or synthetic gene to be expressed.
  • This term also means host cells which have stably integrated a recombinant genetic element or elements having a regulatory role in gene expression, for example, promoters or enhancers.
  • Recombinant expression systems as defined herein will express polypeptides or proteins endogenous to the cell upon induction of the regulatory elements linked to the endogenous DNA segment or gene to be expressed.
  • the cells can be prokaryotic or eukaryotic.
  • secreted includes a protein that is transported across or through a membrane, including transport as a result of signal sequences in its amino acid sequence when it is expressed in a suitable host cell.
  • “Secreted” proteins include without limitation proteins secreted wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell in which they are expressed.
  • “Secreted” proteins also include without limitation proteins that are transported across the membrane of the endoplasmic reticulum.
  • “Secreted” proteins are also intended to include proteins containing non-typical signal sequences (e.g. Interleukin-1 Beta, see Krasney, P.A. and Young, P.R.
  • an expression vector may be designed to contain a "signal or leader sequence" which will direct the polypeptide through the membrane of a cell.
  • a sequence may be naturally present on the polypeptides of the present invention or provided from heterologous protein sources by recombinant DNA techniques.
  • stringent is used to refer to conditions that are commonly understood in the art as stringent.
  • Stringent conditions can include highly stringent conditions (i.e., hybridization to filter-bound DNA in 0.5 M NaHPO 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and washing in 0.1 X SSC/0.1% SDS at 68°C), and moderately stringent conditions (i.e., washing in 0.2X SSC/0.1% SDS at 42°C).
  • highly stringent conditions i.e., hybridization to filter-bound DNA in 0.5 M NaHPO 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and washing in 0.1 X SSC/0.1% SDS at 68°C
  • moderately stringent conditions i.e., washing in 0.2X SSC/0.1% SDS at 42°C.
  • Other exemplary hybridization conditions are described herein in the examples.
  • additional exemplary stringent hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate at 37°C (for 14-base oligonucleotides), 48°C (for 17-base oligos), 55°C (for 20-base oligonucleotides), and 60°C (for 23-base oligonucleotides).
  • substantially equivalent can refer both to nucleotide and amino acid sequences, for example a mutant sequence, that varies from a reference sequence by one or more substitutions, deletions, or additions, the net effect of which does not result in an adverse functional dissimilarity between the reference and subject sequences.
  • such a substantially equivalent sequence varies from one of those listed herein by no more than about 35% (i.e., the number of individual residue substitutions, additions, and/or deletions in a substantially equivalent sequence, as compared to the corresponding reference sequence, divided by the total number of residues in the substantially equivalent sequence is about 0.35 or less).
  • Such a sequence is said to have 65% sequence identity to the listed sequence.
  • a substantially equivalent, e.g., mutant, sequence of the invention varies from a listed sequence by no more than 30% (70% sequence identity); in a variation of this embodiment, by no more than 25% (75% sequence identity); and in a further variation of this embodiment, by no more than 20% (80% sequence identity) and in a further variation of this embodiment, by no more than 10% (90% sequence identity) and in a further variation of this embodiment, by no more that 5% (95% sequence identity).
  • Substantially equivalent, e.g., mutant, amino acid sequences according to the invention preferably have at least 80% sequence identity with a listed amino acid sequence, more preferably at least 85% sequence identity, more preferably at least 90% sequence identity, more preferably at least 95% identity, more preferably at least 98% identity, and most preferably at least 99% identity.
  • Substantially equivalent nucleotide sequences of the invention can have lower percent sequence identities, taking into account, for example, the redundancy or degeneracy of the genetic code.
  • nucleotide sequence has at least about 65% identity, more preferably at least about 75% identity, more preferably at least about 80% sequence identity, more preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, and most preferably at least about 95% identity, more preferably at least about 98% sequence identity, and most preferably at least about 99% sequence identity.
  • sequences having substantially equivalent biological activity and substantially equivalent expression characteristics are considered substantially equivalent.
  • truncation of the mature sequence e.g., via a mutation which creates a spurious stop codon
  • Sequence identity may be determined, e.g., using the Jotun Hein method (Hein, J. (1990) Methods Enzymol. 183:626-645). Identity between sequences can also be determined by other methods known in the art, e.g. by varying hybridization conditions.
  • totipotent refers to the capability of a cell to differentiate into all of the cell types of an adult organism.
  • transformation means introducing DNA into a suitable host cell so that the
  • UMF uptake modulating fragment
  • Nucleotide sequences of the invention are set forth in the Sequence Listing.
  • the isolated polynucleotides of the invention include a polynucleotide comprising the nucleotide sequences of SEQ ID NO: 1-30368; a polynucleotide encoding any one of the peptide sequences of SEQ ID NO: 30369-60736; and a polynucleotide comprising the nucleotide sequence encoding the mature protein coding sequence of the polypeptides of any one of SEQ ID NO: 30369-60736.
  • the polynucleotides of the present invention also include, but are not limited to, a polynucleotide that hybridizes under stringent conditions to (a) the complement of any of the nucleotides sequences of SEQ ID NO: 1-30368; (b) nucleotide sequences encoding any one of the amino acid sequences set forth in the Sequence Listing; (c) a polynucleotide which is an allelic variant of any polynucleotide recited above; (d) a polynucleotide which encodes a species homolog of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide comprising a specific domain or truncation of the polypeptides of SEQ ID NO: 30369-60736.
  • Domains of interest may depend on the nature of the encoded polypeptide; e.g., domains in receptor-like polypeptides include ligand-binding, extracellular, transmembrane, or cytoplasmic domains, or combinations thereof; domains in immunoglobulin-like proteins include the variable immunoglobulin-like domains; domains in enzyme-like polypeptides include catalytic and substrate binding domains; and domains in ligand polypeptides include receptor-binding domains.
  • the polynucleotides of the invention include naturally occurring or wholly or partially synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA.
  • the polynucleotides may include all of the coding region of the cDNA or may represent a portion of the coding region of the cDNA.
  • the present invention also provides genes corresponding to the cDNA sequences disclosed herein.
  • the corresponding genes can be isolated in accordance with known methods using the sequence information disclosed herein. Such methods include the preparation of probes or primers from the disclosed sequence information for identification and/or amplification of genes in appropriate genomic libraries or other sources of genomic materials. Further 5' and 3' sequence can be obtained using methods known in the art. For example, full length cDNA or genomic DNA that corresponds to any of the polynucleotides of SEQ ID NO: 1-30368 can be obtained by screening appropriate cDNA or genomic DNA libraries under suitable hybridization conditions using any of the polynucleotides of SEQ ID NO: 1-30368 or a portion thereof as a probe.
  • polynucleotides of SEQ ID NO: 1-30368 may be used as the basis for suitable primer(s) that allow identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries.
  • suitable primer(s) that allow identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries.
  • the nucleic acid sequences of the invention can be assembled from ESTs and sequences
  • polynucleotides of the invention also provide polynucleotides including nucleotide sequences that are substantially equivalent to the polynucleotides recited above.
  • Polynucleotides according to the invention can have, e.g., at least about 65%, at least about 70%, at least about 75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least about 85%, 86%, 87%, 88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, and even more typically at least about 95%, 96%, 97%, 98%, 99%, sequence identity to a polynucleotide recited above.
  • nucleic acid sequence fragments that hybridize under stringent conditions to any of the nucleotide sequences of SEQ ID NO: 1-30368, or complements thereof, which fragment is greater than about 5 nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and most preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, or 20 nucleotides or more that are selective for (i.e. specifically hybridize to any one of the polynucleotides of the invention) are contemplated.
  • Probes capable of specifically hybridizing to a polynucleotide can differentiate polynucleotide sequences of the invention from other polynucleotide sequences in the same family of genes or can differentiate human genes from genes of other species, and are preferably based on unique nucleotide sequences.
  • sequences falling within the scope of the present invention are not limited to these specific sequences, but also include allelic and species variations thereof. Allelic and species variations can be routinely determined by comparing the sequence provided in SEQ ID NO: 1 - 30368, a representative fragment thereof, or a nucleotide sequence at least 90% identical, preferably 95% identical, to SEQ ID NO: 1 -30368 with a sequence from another isolate of the same species. Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules coding for the same amino acid sequences as do the specific ORFs disclosed herein. In other words, in the coding region of an ORF, substitution of one codon for another codon that encodes the same amino acid is expressly contemplated.
  • the nearest neighbor or homology result for the nucleic acids of the present invention can be obtained by searching a database using an algorithm or a program.
  • a BLAST which stands for Basic Local Alignment Search Tool is used to search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 (1993) and Altschul S.F. et al. J. Mol. Biol. 21 :403-410(1990)).
  • a FASTA version 3 search against Genpept using Fastxy algorithm.
  • Species homologs (or orthologs) of the disclosed polynucleotides and proteins are also provided by the present invention. Species homologs may be isolated and identified by making suitable probes or primers from the sequences provided herein and screening a suitable nucleic acid source from the desired species.
  • the invention also encompasses allelic variants of the disclosed polynucleotides or proteins; that is, naturally occurring alternative forms of the isolated polynucleotide which also encode proteins which are identical, homologous or related to that encoded by the polynucleotides.
  • nucleic acid sequences of the invention are further directed to sequences which encode variants of the described nucleic acids.
  • These amino acid sequence variants may be prepared by methods known in the art by introducing appropriate nucleotide changes into a native or variant polynucleotide. There are two variables in the construction of amino acid sequence variants: the location of the mutation and the nature of the mutation. Nucleic acids encoding the amino acid sequence variants are preferably constructed by mutating the polynucleotide to encode an amino acid sequence that does not occur in nature. These nucleic acid alterations can be made at sites that differ in the nucleic acids from different species
  • Amino acid sequence deletions generally range from about 1 to 30 residues, preferably about 1 to 10 residues, and are typically contiguous.
  • Amino acid insertions include amino- and/or carboxyl-terminal fusions ranging in length from one to one hundred or more residues, as well as intrasequence insertions of single or multiple amino acid residues.
  • Intrasequence insertions may range generally from about 1 to 10 amino residues, preferably from 1 to 5 residues.
  • terminal insertions include the heterologous signal sequences necessary for secretion or for intracellular targeting in different host cells and sequences such as FLAG or poly-histidine sequences useful for purifying the expressed protein.
  • polynucleotides encoding the novel amino acid sequences are changed via site-directed mutagenesis. This method uses oligonucleotide sequences to alter a polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent nucleotides on both sides of the changed amino acid to form a stable duplex on either side of the site of being changed.
  • PCR amplification results in a population of product DNA fragments that differ from the polynucleotide template encoding the polypeptide at the position specified by the primer.
  • the product DNA fragments replace the corresponding region in the plasmid and this gives a polynucleotide encoding the desired amino acid variant.
  • a further technique for generating amino acid variants is the cassette mutagenesis technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques well known in the art, such as, for example, the techniques in Sambrook et al., supra, and Current Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent amino acid sequence may be used in the practice of the invention for the cloning and expression of these novel nucleic acids. Such DNA sequences include those which are capable of hybridizing to the appropriate novel nucleic acid sequence under stringent conditions.
  • Polynucleotides encoding preferred polypeptide truncations of the invention can be used to generate polynucleotides encoding chimeric or fusion proteins comprising one or more domains of the invention and heterologous protein sequences.
  • the polynucleotides of the invention additionally include the complement of any of the polynucleotides recited above.
  • the polynucleotide can be DNA (genomic, cDNA, amplified, or synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well known to those of skill in the art and can include, for example, methods for determining hybridization conditions that can routinely isolate polynucleotides of the desired sequence identities.
  • polynucleotide sequences comprising the mature protein coding sequences corresponding to any one of SEQ ID NO: 1-30368, or functional equivalents thereof, may be used to generate recombinant DNA molecules that direct the expression of that nucleic acid, or a functional equivalent thereof, in appropriate host cells. Also included are the cDNA inserts of any of the clones identified herein.
  • a polynucleotide according to the invention can be joined to any of a variety of other nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY).
  • Useful nucleotide sequences for joining to polynucleotides include an assortment of vectors, e.g., plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the art. Accordingly, the invention also provides a vector including a polynucleotide of the invention and a host cell containing the polynucleotide.
  • the vector contains an origin of replication functional in at least one organism, convenient restriction endonuclease sites, and a selectable marker for the host cell.
  • Vectors according to the invention include expression vectors, replication vectors, probe generation vectors, and sequencing vectors.
  • a host cell according to the invention can be a prokaryotic or eukaryotic cell and can be a unicellular organism or part of a multicellular organism.
  • the present invention further provides recombinant constructs comprising a nucleic acid having any of the nucleotide sequences of SEQ ID NO: 1-30368 or a fragment thereof or any other polynucleotides of the invention.
  • the recombinant constructs of the present invention comprise a vector, such as a plasmid or viral vector, into which a nucleic acid having any of the nucleotide sequences of SEQ ID NO: 1-30368 or a fragment thereof is inserted, in a forward or reverse orientation.
  • the vector may further comprise regulatory sequences, including for example, a promoter, operably linked to the ORF.
  • Bacterial pBs, phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, pNH16a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia).
  • Eukaryotic pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia).
  • the isolated polynucleotide of the invention may be operably linked to an expression control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al, Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinantly.
  • an expression control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al, Nucleic Acids Res. 19, 4485-4490 (1991)
  • Many suitable expression control sequences are known in the art. General methods of expressing recombinant proteins are also known and are exemplified in R. Kaufman, Methods in Enzymology 185, 537-566 (1990).
  • operably linked means that the isolated polynucleotide of the invention and an expression control sequence are situated within a vector or cell in such a way that the protein is expressed by a host cell which has been transformed (transfected) with the ligated polynucleotide/expression control sequence.
  • Promoter regions can be selected from any desired gene using CAT (chloramphenicol transferase) vectors or other vectors with selectable markers.
  • Two appropriate vectors are pKK232-8 and pCM7.
  • Particular named bacterial promoters include lad, lacZ, T3, T7, gpt, lambda PR, and trc.
  • Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the level of ordinary skill in the art.
  • recombinant expression vectors will include origins of replication and selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance gene ofE coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to direct transcription of a downstream structural sequence.
  • promoters can be derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), a-factor, acid phosphatase, or heat shock proteins, among others.
  • PGK 3-phosphoglycerate kinase
  • the heterologous structural sequence is assembled in appropriate phase with translation initiation and termination sequences, and preferably, a leader sequence capable of directing secretion of translated protein into the periplasmic space or extracellular medium.
  • the heterologous sequence can encode a fusion protein including an amino terminal identification peptide imparting desired characteristics, e.g., stabilization or simplified purification of expressed recombinant product.
  • Useful expression vectors for bacterial use are constructed by inserting a structural DNA sequence encoding a desired protein together with suitable translation initiation and termination signals in operable reading phase with a functional promoter.
  • the vector will comprise one or more phenotypic selectable markers and an origin of replication to ensure maintenance of the vector and to, if desirable, provide amplification within the host.
  • Suitable prokaryotic hosts for transformation include E. coli, Bacillus subtilis, Salmonella typhimu ⁇ um and various species within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may also be employed as a matter of choice.
  • useful expression vectors for bacterial use can comprise a selectable marker and bacterial origin of replication derived from commercially available plasmids comprising genetic elements of the well known cloning vector pBR322 (ATCC 37017).
  • cloning vector pBR322 ATCC 37017
  • Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, USA). These pBR322 "backbone" sections are combined with an appropriate promoter and the structural sequence to be expressed.
  • the selected promoter is induced or derepressed by appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an additional period.
  • appropriate means e.g., temperature shift or chemical induction
  • Cells are typically harvested by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained for further purification.
  • Polynucleotides of the invention can also be used to induce immune responses.
  • nucleic acid sequences encoding a polypeptide may be used to generate antibodies against the encoded polypeptide following topical administration of naked plasmid D ⁇ A or following injection, and preferably intramuscular injection of the D ⁇ A.
  • the nucleic acid sequences are preferably inserted in a recombinant expression vector and may be in the form of naked D ⁇ A.
  • Another aspect of the invention pertains to isolated antisense nucleic acid molecules that are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO: 1-30368, or fragments, analogs or derivatives thereof.
  • An "antisense" nucleic acid comprises a nucleotide sequence that is complementary to a "sense" nucleic acid encoding a protein, e.g. , complementary to the coding strand of a double-stranded cDNA molecule or complementary to an mRNA sequence.
  • antisense nucleic acid molecules comprise a sequence complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire coding strand, or to only a portion thereof.
  • an antisense nucleic acid molecule is antisense to a "coding region" of the coding strand of a nucleotide sequence of the invention.
  • the term “coding region” refers to the region of the nucleotide sequence comprising codons which are translated into amino acid residues.
  • the antisense nucleic acid molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence of the invention.
  • the term “noncoding region” refers to 5' and 3' sequences which flank the coding region that are not translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions).
  • antisense nucleic acids of the invention can be designed according to the rules of Watson and Crick or Hoogsteen base pairing.
  • the antisense nucleic acid molecule can be complementary to the entire coding region of a mRNA, but more preferably is an oligonucleotide that is antisense to only a portion of the coding or noncoding region of a mRNA.
  • the antisense oligonucleotide can be complementary to the region surrounding the translation start site of a mRNA.
  • An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length.
  • an antisense nucleic acid of the invention can be constructed using chemical synthesis or enzymatic ligation reactions using procedures known in the art.
  • an antisense nucleic acid e.g., an antisense oligonucleotide
  • an antisense nucleic acid can be chemically synthesized using naturally occurring nucleotides or variously modified nucleotides designed to increase the biological stability of the molecules or to increase the physical stability of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine substituted nucleotides can be used.
  • modified nucleotides that can be used to generate the antisense nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1 -methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5'-me
  • the antisense nucleic acid can be produced biologically using an expression vector into which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described further in the following subsection).
  • the antisense nucleic acid molecules of the invention are typically administered to a subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or genomic DNA encoding a protein according to the invention to thereby inhibit expression of the protein, e.g., by inhibiting transcription and/or translation.
  • the hybridization can be by conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in the major groove of the double helix.
  • An example of a route of administration of antisense nucleic acid molecules of the invention includes direct injection at a tissue site.
  • antisense nucleic acid molecules can be modified to target selected cells and then administered systemically.
  • antisense molecules can be modified such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g.
  • antisense nucleic acid molecules by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface receptors or antigens.
  • the antisense nucleic acid molecules can also be delivered to cells using the vectors described herein.
  • vector constructs in which the antisense nucleic acid molecule is placed under the control of a strong pol II or pol III promoter are preferred.
  • the antisense nucleic acid molecule of the invention is an -a n omeric nucleic acid molecule.
  • An -a nomeric nucleic acid molecule forms specific double-stranded hybrids with complementary RNA in which, contrary to the usual -uni ts, the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res 15: 6625-6641).
  • the antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide (Inoue et al.
  • an antisense nucleic acid of the invention is a ribozyme.
  • Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a single-stranded nucleic acid, such as a mRNA, to which they have a complementary region.
  • ribozymes e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988) Nature 334:585-591)
  • a ribozyme having specificity for a nucleic acid of the invention can be designed based upon the nucleotide sequence of a DNA disclosed herein (i.e., SEQ ID NO: 1 -
  • a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the active site is complementary to the nucleotide sequence to be cleaved in an mRNA of SEQ ID NO: 1-30368 (see, e.g., Cech et al. U.S. Pat. No. 4,987,071; and Cech et al. U.S. Pat. No. 5,116,742).
  • polynucleotides of the invention can be used to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules. See, e.g. , Bartel et al., (1993) Science 261 :141 1-1418.
  • gene expression can be inhibited by targeting nucleotide sequences complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple helical structures that prevent transcription of the gene in target cells.
  • the regulatory region e.g., promoter and/or enhancers
  • gene expression can be inhibited by targeting nucleotide sequences complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple helical structures that prevent transcription of the gene in target cells.
  • the regulatory region e.g., promoter and/or enhancers
  • the nucleic acids of the invention can be modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or solubility of the molecule.
  • the deoxyribose phosphate backbone of the nucleic acids can be modified to generate peptide nucleic acids (see Hyrup et al. (1996) Bioorg Med Chem 4: 5-23).
  • the terms "peptide nucleic acids” or "PNAs” refer to nucleic acid mimics, e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases are retained.
  • PNAs The neutral backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under conditions of low ionic strength.
  • the synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996) above; Perry-O'Keefe et al. (1996) PNAS 93: 14670-675.
  • PNAs of the invention can be used in therapeutic and diagnostic applications.
  • PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene expression by, e.g., inducing transcription or translation arrest or inhibiting replication.
  • PNAs of the invention can also be used, e.g., in the analysis of single base pair mutations in a gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in combination with other enzymes, e.g., SI nucleases (Hyrup B. (1996) above); or as probes or primers for DNA sequence and hybridization (Hyrup et al. (1996), above; Perry-O'Keefe (1996), above).
  • PNAs of the invention can be modified, e.g., to enhance their stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery known in the art.
  • PNA-DNA chimeras can be generated that may combine the advantageous properties of PNA and DNA.
  • Such chimeras allow DNA recognition enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA portion would provide high binding affinity and specificity.
  • PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of base stacking, number of bonds between the nucleobases, and orientation (Hyrup (1996) above).
  • the synthesis of PNA-DNA chimeras can be performed as described in Hyrup (1996) above and Finn et al. (1996) Nucl Acids Res 24: 3357-63.
  • a DNA chain can be synthesized on a solid support using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used between the PNA and the 5' end of DNA (Mag et al.
  • PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al. (1996) above).
  • chimeric molecules can be synthesized with a 5' DNA segment and a 3' PNA segment. See, Petersen et al. (1975) Bioorg Med Chem Lett 5: 11 19-11124.
  • the oligonucleotide may include other appended groups such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. W088/09810) or the blood-brain barrier (see, e.g., PCT Publication No. W089/10134).
  • peptides e.g., for targeting host cell receptors in vivo
  • agents facilitating transport across the cell membrane see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; Lemaitre et al., 1987
  • oligonucleotides can be modified with hybridization triggered cleavage agents (See, e.g. , Krol et al., 1988, BioTechniques 6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549).
  • the oligonucleotide may be conjugated to another molecule, e.g., a peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered cleavage agent, etc.
  • the present invention further provides host cells genetically engineered to contain the polynucleotides of the invention.
  • host cells may contain nucleic acids of the invention introduced into the host cell using known transformation, transfection or infection methods.
  • the present invention still further provides host cells genetically engineered to express the polynucleotides of the invention, wherein such polynucleotides are in operative association with a regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in the cell.
  • nucleic acid sequences allows for modification of cells to permit, or increase, expression of endogenous polypeptide.
  • Cells can be modified (e.g., by homologous recombination) to provide increased polypeptide expression by replacing, in whole or in part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells express the polypeptide at higher levels.
  • the heterologous promoter is inserted in such a manner that it is operatively linked to the encoding sequences. See, for example, PCT International Publication No. WO94/12650, PCT International Publication No. WO92/20808, and PCT International
  • amplifiable marker DNA e.g., ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase
  • intron DNA may be inserted along with the heterologous promoter DNA. If linked to the coding sequence, amplification of the marker DNA by standard selection methods results in co- amplification of the desired protein coding sequences in the cells.
  • the host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell.
  • Introduction of the recombinant construct into the host cell can be effected by calcium phosphate transfection, DEAE, dextran mediated transfection, or electroporation (Davis, L. et al., Basic Methods in Molecular Biology (1986)).
  • the host cells containing one of the polynucleotides of the invention can be used in conventional manners to produce the gene product encoded by the isolated fragment (in the case of an ORF) or can be used to produce a heterologous protein under the control of the EMF.
  • Any host/vector system can be used to express one or more of the ORFs of the present invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, Cv-1 cell, COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. coli and B. subtilis.
  • the most preferred cells are those which do not normally express the particular polypeptide or protein or which expresses the polypeptide or protein at low natural level.
  • Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate promoters. Cell-free translation systems can also be employed to produce such proteins using RNAs derived from the DNA constructs of the present invention. Appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et al., in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New York (1989), the disclosure of which is hereby incorporated by reference.
  • mammalian cell culture systems can also be employed to express recombinant protein.
  • mammalian expression systems include the COS-7 lines of monkey kidney fibroblasts, described by Gluzman, Cell 23:175 (1981).
  • Other cell lines capable of expressing a compatible vector are, for example, the C127, monkey COS cells, Chinese Hamster Ovary (CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell strains derived from in vitro culture of primary tissue, primary explants, HeLa cells, mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells.
  • Mammalian expression vectors will comprise an origin of replication, a suitable promoter and also any necessary ribosome binding sites, polyadenylation site, splice donor and acceptor sites, transcriptional termination sequences, and 5 ' flanking nontranscribed sequences.
  • DNA sequences derived from the SV40 viral genome for example, SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide the required nontranscribed genetic elements.
  • Recombinant polypeptides and proteins produced in bacterial culture are usually isolated by initial extraction from cell pellets, followed by one or more salting-out, aqueous ion exchange or size exclusion chromatography steps.
  • Protein refolding steps can be used, as necessary, in completing configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) can be employed for final purification steps. Microbial cells employed in expression of proteins can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents.
  • yeast strains include Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or any yeast strain capable of expressing heterologous proteins.
  • yeast strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial strain capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, it may be necessary to modify the protein produced therein, for example by phosphorylation or glycosylation of the appropriate sites, in order to obtain the functional protein.
  • cells and tissues may be engineered to express an endogenous gene comprising the polynucleotides of the invention under the control of inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may be replaced by homologous recombination.
  • gene targeting can be used to replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene or a novel regulatory sequence synthesized by genetic engineering methods.
  • regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations of said sequences.
  • sequences which affect the structure or stability of the RNA or protein produced may be replaced, removed, added, or otherwise modified by targeting.
  • sequences which affect the structure or stability of the RNA or protein produced may be replaced, removed, added, or otherwise modified by targeting.
  • These sequence include polyadenylation signals, mRNA stability elements, splice sites, leader sequences for enhancing or modifying transport or secretion properties of the protein, or other sequences which alter or improve the function or stability of protein or RNA molecules.
  • the targeting event may be a simple insertion of the regulatory sequence, placing the gene under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both upstream of a gene.
  • the targeting event may be a simple deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory element.
  • the targeting event may replace an existing element; for example, a tissue-specific enhancer can be replaced by an enhancer that has broader or different cell-type specificity than the naturally occurring elements.
  • the naturally occurring sequences are deleted and new sequences are added.
  • the identification of the targeting event may be facilitated by the use of one or more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection of cells in which the exogenous DNA has integrated into the host cell genome.
  • the identification of the targeting event may also be facilitated by the use of one or more marker genes exhibiting the property of negative selection, such that the negatively selectable marker is linked to the exogenous DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and such that a correct homologous recombination event with sequences in the host cell genome does not result in the stable integration of the negatively selectable marker.
  • Markers useful for this purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine phosphoribosyl-transferase (gpt) gene.
  • the isolated polypeptides of the invention include, but are not limited to, a polypeptide comprising: the amino acid sequences set forth as any one of SEQ ID NO: 30369-60736 or an amino acid sequence encoded by any one of the nucleotide sequences SEQ ID NO: 1-30368 or the corresponding full length or mature protein.
  • Polypeptides of the invention also include polypeptides preferably with biological or immunological activity that are encoded by: (a) a polynucleotide having any one of the nucleotide sequences set forth in SEQ ID NO: 1-30368 or (b) polynucleotides encoding any one of the amino acid sequences set forth as SEQ ID NO:
  • the invention also provides biologically active or immunologically active variants of any of the amino acid sequences set forth as SEQ ID NO: 30369-60736 or the corresponding full length or mature protein; and
  • allelic variants may have a similar, increased, or decreased activity compared to polypeptides comprising SEQ ID NO: 30369-60736.
  • Fragments of the proteins of the present invention which are capable of exhibiting biological activity are also encompassed by the present invention.
  • Fragments of the protein may be in linear form or they may be cyclized using known methods, for example, as described in H. U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. McDowell, et al., J. Amer. Chem. Soc. 114, 9245-9253 (1992), both of which are incorporated herein by reference.
  • Such fragments may be fused to carrier molecules such as immunoglobulins for many purposes, including increasing the valency of protein binding sites.
  • the present invention also provides both full-length and mature forms (for example, without a signal sequence or precursor sequence) of the disclosed proteins.
  • the protein coding sequence is identified in the sequence listing by translation of the disclosed nucleotide sequences.
  • the mature form of such protein may be obtained by expression of a full-length polynucleotide in a suitable mammalian cell or other host cell.
  • the sequence of the mature form of the protein is also determinable from the amino acid sequence of the full-length form.
  • proteins of the present invention are membrane bound, soluble forms of the proteins are also provided. In such forms, part or all of the regions causing the proteins to be membrane bound are deleted so that the proteins are fully secreted from the cell in which they are expressed.
  • Protein compositions of the present invention may further comprise an acceptable carrier, such as a hydrophilic, e.g., pharmaceutically acceptable, carrier.
  • the present invention further provides isolated polypeptides encoded by the nucleic acid fragments of the present invention or by degenerate variants of the nucleic acid fragments of the present invention.
  • degenerate variant is intended nucleotide fragments which differ from a nucleic acid fragment of the present invention (e.g., an ORF) by nucleotide sequence but, due to the degeneracy of the genetic code, encode an identical polypeptide sequence.
  • Preferred nucleic acid fragments of the present invention are the ORFs that encode proteins.
  • the amino acid sequence can be synthesized using commercially available peptide synthesizers.
  • the synthetically-constructed protein sequences by virtue of sharing primary, secondary or tertiary structural and/or conformational characteristics with proteins may possess biological properties in common therewith, including protein activity. This technique is particularly useful in producing small peptides and fragments of larger polypeptides. Fragments are useful, for example, in generating antibodies against the native polypeptide. Thus, they may be employed as biologically active or immunological substitutes for natural, purified proteins in screening of therapeutic compounds and in immunological processes for the development of antibodies.
  • polypeptides and proteins of the present invention can alternatively be purified from cells which have been altered to express the desired polypeptide or protein.
  • a cell is said to be altered to express a desired polypeptide or protein when the cell, through genetic manipulation, is made to produce a polypeptide or protein which it normally does not produce or which the cell normally produces at a lower level.
  • One skilled in the art can readily adapt procedures for introducing and expressing either recombinant or synthetic sequences into eukaryotic or prokaryotic cells in order to generate a cell which produces one of the polypeptides or proteins of the present invention.
  • the invention also relates to methods for producing a polypeptide comprising growing a culture of host cells of the invention in a suitable culture medium, and purifying the protein from the cells or the culture in which the cells are grown.
  • the methods of the invention include a process for producing a polypeptide in which a host cell containing a suitable expression vector that includes a polynucleotide of the invention is cultured under conditions that allow expression of the encoded polypeptide.
  • the polypeptide can be. recovered from the culture, conveniently from the culture medium, or from a lysate prepared from the host cells and further purified.
  • Preferred embodiments include those in which the protein produced by such process is a full length or mature form of the protein.
  • the polypeptide or protein is purified from bacterial cells which naturally produce the polypeptide or protein.
  • One skilled in the art can readily follow known methods for isolating polypeptides and proteins in order to obtain one of the isolated polypeptides or proteins of the present invention. These include, but are not limited to, immunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, and immuno-affinity chromatography. See, e.g., Scopes, Protein Purification: Principles and Practice, Springer-Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory Manual; Ausubel et al., Current Protocols in Molecular Biology. Polypeptide fragments that retain biological/immunological activity include fragments comprising greater than about 100 amino acids, or greater than about 200 amino acids, and fragments that encode specific protein domains.
  • the purified polypeptides can be used in in vitro binding assays which are well known in the art to identify molecules which bind to the polypeptides. These molecules include but are not limited to, for e.g., small molecules, molecules from combinatorial libraries, antibodies or other proteins.
  • the molecules identified in the binding assay are then tested for antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the art. In brief, the molecules are titrated into a plurality of cell cultures or animals and then tested for either cell/animal death or prolonged survival of the animal/cells.
  • the peptides of the invention or molecules capable of binding to the peptides may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to cells.
  • the toxin-binding molecule complex is then targeted to a tumor or other cell by the specificity of the binding molecule for SEQ ID NO: 30369-60736.
  • the protein of the invention may also be expressed as a product of transgenic animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized by somatic or germ cells containing a nucleotide sequence encoding the protein.
  • the proteins provided herein also include proteins characterized by amino acid sequences similar to those of purified proteins but into which modification are naturally provided or deliberately engineered.
  • modifications in the peptide or DNA sequence can be made by those skilled in the art using known techniques.
  • Modifications of interest in the protein sequences may include the alteration, substitution, replacement, insertion or deletion of a selected amino acid residue in the coding sequence.
  • one or more of the cysteine residues may be deleted or replaced with another amino acid to alter the conformation of the molecule. Techniques for such alteration, substitution, replacement, insertion or deletion are well known to those skilled in the art (see, e.g., U.S. Pat. No. 4,518,584).
  • such alteration, substitution, replacement, insertion or deletion retains the desired activity of the protein.
  • Regions of the protein that are important for the protein function can be determined by various methods known in the art including the alanine-scanning method which involved systematic substitution of single or strings of amino acids with alanine, followed by testing the resulting alanine-containing variant for biological activity. This type of analysis determines the importance of the substituted amino acid(s) in biological activity. Regions of the protein that are important for protein function may be determined by the eMATRIX program.
  • the protein may also be produced by operably linking the isolated polynucleotide of the invention to suitable control sequences in one or more insect expression vectors, and employing an insect expression system.
  • suitable control sequences in one or more insect expression vectors, and employing an insect expression system.
  • Materials and methods for baculovirus/insect cell expression systems are commercially available in kit form from, e.g., Invitrogen, San Diego, Calif, U.S.A. (the MaxBatTM kit), and such methods are well known in the art, as described in Summers and Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), incorporated herein by reference.
  • an insect cell capable of expressing a polynucleotide of the present invention is "transformed.”
  • the protein of the invention may be prepared by culturing transformed host cells under culture conditions suitable to express the recombinant protein.
  • the resulting expressed protein may then be purified from such culture (i.e., from culture medium or cell extracts) using known purification processes, such as gel filtration and ion exchange chromatography.
  • the purification of the protein may also include an affinity column containing agents which will bind to the protein; one or more column steps over such affinity resins as concanavalin A-agarose, heparin-toyopearlTM or Cibacrom blue 3GA SepharoseTM; one or more steps involving hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl ether; or immunoaffinity chromatography.
  • the protein of the invention may also be expressed in a form that will facilitate purification.
  • it may be expressed as a fusion protein, such as those of maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as a His-tag. Kits for expression and purification of such fusion proteins are commercially available from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and Invitrogen, respectively.
  • the protein can also be tagged with an epitope and subsequently purified by using a specific antibody directed to such epitope.
  • FLAG® is commercially available from Kodak (New Haven, Conn.).
  • RP- HPLC reverse-phase high performance liquid chromatography
  • hydrophobic RP-HPLC media e.g. , silica gel having pendant methyl or other aliphatic groups
  • RP- HPLC reverse-phase high performance liquid chromatography
  • the protein thus purified is substantially free of other mammalian proteins and is defined in accordance with the present invention as an "isolated protein.”
  • the polypeptides of the invention include analogs (variants). This embraces fragments, as well as peptides in which one or more amino acids has been deleted, inserted, or substituted.
  • analogs of the polypeptides of the invention embrace fusions of the polypeptides or modifications of the polypeptides of the invention, wherein the polypeptide or analog is fused to another moiety or moieties, e.g. , targeting moiety or another therapeutic agent. Such analogs may exhibit improved properties such as activity and/or stability.
  • moieties which may be fused to the polypeptide or an analog include, for example, targeting moieties which provide for the delivery of polypeptide to pancreatic cells, e.g., antibodies to pancreatic cells, antibodies to immune cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well as receptor and ligands expressed on pancreatic or immune cells.
  • moieties which may be fused to the polypeptide include therapeutic agents which are used for treatment, for example, immunosuppressive drugs such as cyclosporin, SK506, azathioprine, CD3 antibodies and steroids. Also, polypeptides may be fused to immune modulators, and other cytokines such as alpha or beta interferon.
  • Preferred identity and/or similarity are designed to give the largest match between the sequences tested.
  • Methods to determine identity and similarity are codified in computer programs including, but are not limited to, the GCG program package, including GAP
  • BLAST programs are publicly available from the National Center for Biotechnology Information (NCBI) and other sources (BLAST Manual, Altschul, S., et al. NCB NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. Biol. 215:403-410 (1990).
  • a "chimeric protein" or “fusion protein” comprises a polypeptide of the invention operatively linked to another polypeptide.
  • the polypeptide according to the invention can correspond to all or a portion of a protein according to the invention.
  • a fusion protein comprises at least one biologically active portion of a protein according to the invention.
  • a fusion protein comprises at least two biologically active portions of a protein according to the invention.
  • the term "operatively linked" is intended to indicate that the polypeptide according to the invention and the other polypeptide are fused in-frame to each other.
  • the polypeptide can be fused to the N-terminus or C-terminus.
  • a fusion protein comprises a polypeptide according to the invention operably linked to the extracellular domain of a second protein.
  • the fusion protein is a GST-fusion protein in which the polypeptide sequences of the invention are fused to the C-terminus of the GST (i.e., glutathione S-transferase) sequences.
  • the fusion protein is an immunoglobulin fusion protein in which the polypeptide sequences according to the invention comprises one or more domains are fused to sequences derived from a member of the immunoglobulin protein family.
  • the immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical compositions and administered to a subject to inhibit an interaction between a ligand and a protein of the invention on the surface of a cell, to thereby suppress signal transduction in vivo.
  • the immunoglobulin fusion proteins can be used to affect the bioavailability of a cognate ligand.
  • Inhibition of the ligand/protein interaction may be useful therapeutically for both the treatment of proliferative and differentiative disorders, e,g., cancer as well as modulating (e.g., promoting or inhibiting) cell survival.
  • the immunoglobulin fusion proteins of the invention can be used as immunogens to produce antibodies in a subject, to purify ligands, and in screening assays to identify molecules that inhibit the interaction of a polypeptide of the invention with a ligand.
  • a chimeric or fusion protein of the invention can be produced by standard recombinant DNA techniques.
  • DNA fragments coding for the different polypeptide sequences are ligated together in-frame in accordance with conventional techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation.
  • the fusion gene can be synthesized by conventional techniques including automated DNA synthesizers.
  • PCR amplification of gene fragments can be carried out using anchor primers that give rise to complementary overhangs between two consecutive gene fragments that can subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for example, Ausubel et al. (eds.) CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, 1992).
  • anchor primers that give rise to complementary overhangs between two consecutive gene fragments that can subsequently be annealed and reamplified to generate a chimeric gene sequence
  • expression vectors are commercially available that already encode a fusion moiety (e.g., a GST polypeptide).
  • a nucleic acid encoding a polypeptide of the invention can be cloned into such an expression vector such that the fusion moiety is linked in-frame to the protein of the invention.
  • Mutations in the polynucleotides of the invention gene may result in loss of normal function of the encoded protein.
  • the invention thus provides gene therapy to restore normal activity of the polypeptides of the invention; or to treat disease states involving polypeptides of the invention.
  • Delivery of a functional gene encoding polypeptides of the invention to appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for example, Anderson, Nature, supplement to vol. 392, no. 6679, pp.25-20 (1998).
  • polypeptides of the invention in other human disease states, preventing the expression of or inhibiting the activity of polypeptides of the invention will be useful in treating the disease states. It is contemplated that antisense therapy or gene therapy could be applied to negatively regulate the expression of polypeptides of the invention.
  • RNA sequences are known in the art.
  • the polypeptides of the present invention can be inhibited by using targeted deletion methods, or the insertion of a negative regulatory element such as a silencer, which is tissue specific.
  • the present invention still further provides cells genetically engineered in vivo to express the polynucleotides of the invention, wherein such polynucleotides are in operative association with a regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in the cell. These methods can be used to increase or decrease the expression of the polynucleotides of the present invention.
  • DNA sequences allows for modification of cells to permit, increase, or decrease, expression of endogenous polypeptide.
  • Cells can be modified (e.g. , by homologous recombination) to provide increased polypeptide expression by replacing, in whole or in part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells express the protein at higher levels.
  • the heterologous promoter is inserted in such a manner that it is operatively linked to the desired protein encoding sequences. See, for example, PCT International Publication No. WO 94/12650, PCT International Publication No. WO 92/20808, and PCT International Publication No. WO 91/09955.
  • amplifiable marker DNA e.g. , ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase
  • intron DNA may be inserted along with the heterologous promoter DNA. If linked to the desired protein coding sequence, amplification of the marker DNA by standard selection methods results in co-amplification of the desired protein coding sequences in the cells.
  • cells and tissues may be engineered to express an endogenous gene comprising the polynucleotides of the invention under the control of inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may be replaced by homologous recombination.
  • gene targeting can be used to replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene or a novel regulatory sequence synthesized by genetic engineering methods.
  • Such regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations of said sequences.
  • sequences which affect the structure or stability of the RNA or protein produced may be replaced, removed, added, or otherwise modified by targeting. These sequences include polyadenylation signals, mRNA stability elements, splice sites, leader sequences for enhancing or modifying transport or secretion properties of the protein, or other sequences which alter or improve the function or stability of protein or RNA molecules.
  • the targeting event may be a simple insertion of the regulatory sequence, placing the gene under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both upstream of a gene.
  • the targeting event may be a simple deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory element.
  • the targeting event may replace an existing element; for example, a tissue-specific enhancer can be replaced by an enhancer that has broader or different cell-type specificity than the naturally occurring elements.
  • the naturally occurring sequences are deleted and new sequences are
  • the identification of the targeting event may be facilitated by the use of one or more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection of cells in which the exogenous DNA has integrated into the cell genome.
  • the identification of the targeting event may also be facilitated by the use of one or more marker genes exhibiting the property of negative selection, such that the negatively selectable marker is linked to the exogenous DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and such that a correct homologous recombination event with sequences in the host cell genome does not result in the stable integration of the negatively selectable marker.
  • Markers useful for this purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine phosphoribosyl-transferase (gpt) gene.
  • one or more genes provided by the invention are either over expressed or inactivated in the germ line of animals using homologous recombination [Capecchi, Science 244:1288-1292 (1989)].
  • Animals in which the gene is over expressed, under the regulatory control of exogenous or endogenous promoter elements, are known as transgenic animals.
  • Animals in which an endogenous gene has been inactivated by homologous recombination are referred to as "knockout" animals.
  • Knockout animals preferably non-human mammals, can be prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference.
  • Transgenic animals are useful to determine the roles polypeptides of the invention play in biological processes, and preferably in disease states. Transgenic animals are useful as model systems to identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT Publication No. WO94/28122, incorporated herein by reference.
  • Transgenic animals can be prepared wherein all or part of a promoter of the polynucleotides of the invention is either activated or inactivated to alter the level of expression of the polypeptides of the invention. Inactivation can be carried out using homologous recombination methods described above. Activation can be achieved by supplementing or even replacing the homologous promoter to provide for increased protein expression.
  • the homologous promoter can be supplemented by insertion of one or more heterologous enhancer elements known to confer promoter activation in a particular tissue.
  • polynucleotides of the present invention also make possible the development, through, e.g., homologous recombination or knock out strategies, of animals that fail to express polypeptides of the invention or that express a variant polypeptide. Such animals are useful as models for studying the in vivo activities of polypeptide as well as for studying modulators of the polypeptides of the invention.
  • one or more genes provided by the invention are either over expressed or inactivated in the germ line of animals using homologous recombination [Capecchi, Science 244:1288-1292 (1989)].
  • Animals in which the gene is over expressed, under the regulatory control of exogenous or endogenous promoter elements, are known as transgenic animals.
  • Animals in which an endogenous gene has been inactivated by homologous recombination are referred to as "knockout" animals.
  • Knockout animals preferably non-human mammals, can be prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference.
  • Transgenic animals are useful to determine the roles polypeptides of the invention play in biological processes, and preferably in disease states. Transgenic animals are useful as model systems to identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT Publication No. WO94/28122, incorporated herein by reference.
  • Transgenic animals can be prepared wherein all or part of the polynucleotides of the invention promoter is either activated or inactivated to alter the level of expression of the polypeptides of the invention. Inactivation can be carried out using homologous recombination methods described above. Activation can be achieved by supplementing or even replacing the homologous promoter to provide for increased protein expression.
  • the homologous promoter can be supplemented by insertion of one or more heterologous enhancer elements known to confer promoter activation in a particular tissue.
  • polynucleotides and proteins of the present invention are expected to exhibit one or more of the uses or biological activities (including those associated with assays cited herein) identified herein.
  • Uses or activities described for proteins of the present invention may be provided by administration or use of such proteins or of polynucleotides encoding such proteins (such as, for example, in gene therapies or vectors suitable for introduction of DNA).
  • the mechanism underlying the particular condition or pathology will dictate whether the polypeptides of the invention, the polynucleotides of the invention or modulators (activators or inhibitors) thereof would be beneficial to the subject in need of treatment.
  • compositions of the invention include compositions comprising isolated polynucleotides (including recombinant DNA molecules, cloned genes and degenerate variants thereof) or polypeptides of the invention (including full length protein, mature protein and truncations or domains thereof), or compounds and other substances that modulate the overall activity of the target gene products, either at the level of target gene/protein expression or target protein activity.
  • modulators include polypeptides, analogs, (variants), including fragments and fusion proteins, antibodies and other binding proteins; chemical compounds that directly or indirectly activate or inhibit the polypeptides of the invention (identified, e.g., via drug screening assays as described herein); antisense polynucleotides and polynucleotides suitable for triple helix formation; and in particular antibodies or other binding partners that specifically recognize one or more epitopes of the polypeptides of the invention.
  • polypeptides of the present invention may likewise be involved in cellular activation or in one of the other physiological pathways described herein.
  • the polynucleotides provided by the present invention can be used by the research community for various purposes.
  • the polynucleotides can be used to express recombinant protein for analysis, characterization or therapeutic use; as markers for tissues in which the corresponding protein is preferentially expressed (either constitutively or at a particular stage of tissue differentiation or development or in disease states); as molecular weight markers on gels; as chromosome markers or tags (when labeled) to identify chromosomes or to map related gene positions; to compare with endogenous DNA sequences in patients to identify potential genetic disorders; as probes to hybridize and thus discover novel, related DNA sequences; as a source of information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known sequences in the process of discovering other novel polynucleotides; for selecting and making oligomers for attachment to a "gene chip” or other support, including for examination of expression patterns; to raise anti-protein antibodies using DNA immunization techniques; and
  • the polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for example, in a receptor-ligand interaction)
  • the polynucleotide can also be used in interaction trap assays (such as, for example, that described in Gyuris et al., Cell 75:791-803 (1993)) to identify polynucleotides encoding the other protein with which binding occurs or to identify inhibitors of the binding interaction.
  • polypeptides provided by the present invention can similarly be used in assays to determine biological activity, including in a panel of multiple proteins for high-throughput screening; to raise antibodies or to elicit another immune response; as a reagent (including the labeled reagent) in assays designed to quantitatively determine levels of the protein (or its receptor) in biological fluids; as markers for tissues in which the corresponding polypeptide is preferentially expressed (either constitutively or at a particular stage of tissue differentiation or development or in a disease state); and, of course, to isolate correlative receptors or ligands. Proteins involved in these binding interactions can also be used to screen for peptide or small molecule inhibitors or agonists of the binding interaction. Any or all of these research utilities are capable of being developed into reagent grade or kit format for commercialization as research products.
  • Polynucleotides and polypeptides of the present invention can also be used as nutritional sources or supplements. Such uses include without limitation use as a protein or amino acid supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate.
  • the polypeptide or polynucleotide of the invention can be added to the feed of a particular organism or can be administered as a separate solid or liquid preparation, such as in the form of powder, pills, solutions, suspensions or capsules.
  • the polypeptide or polynucleotide of the invention can be added to the medium in or on which the microorganism is cultured.
  • a polypeptide of the present invention may exhibit activity relating to cytokine, cell proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting) activity or may induce production of other cytokines in certain cell populations.
  • a polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Many protein factors discovered to date, including all known cytokines, have exhibited activity in one or more factor-dependent cell proliferation assays, and hence the assays serve as a convenient confirmation of cytokine activity.
  • compositions of the present invention is evidenced by any one of a number of routine factor dependent cell proliferation assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3, MC9/G, M+(preB M+), 2E8, RB5, DAI, 123, Tl 165, HT2, CTLL2, TF-1, Mo7e, CMK, HUVEC, and Caco.
  • Therapeutic compositions of the invention can be used in the following: Assays for T-cell or thymocyte proliferation include without limitation those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M.
  • Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of mouse and human interleukin- ⁇ , Schreiber, R. D. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994.
  • Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-121 1, 1991; Moreau et al., Nature 336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci. U.S.A.
  • Assays for T-cell clone responses to antigens include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse
  • a polypeptide of the present invention may exhibit stem cell growth factor activity and be involved in the proliferation, differentiation and survival of pluripotent and totipotent stem cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells and/or germ line stem cells.
  • Administration of the polypeptide of the invention to stem cells in vivo or ex vivo is expected to maintain and expand cell populations in a totipotential or pluripotential state which would be useful for re-engineering damaged or diseased tissues, transplantation, manufacture of bio-pharmaceuticals and the development of bio-sensors.
  • the ability to produce large quantities of human cells has important working applications for the production of human proteins which currently must be obtained from non-human sources or donors, implantation of cells to treat diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases; tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs for transplantation such as kidney, liver, pancreas (including islet cells), heart and lung.
  • diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases
  • tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others
  • organs for transplantation such as kidney, liver, pancreas (including islet cells), heart and lung.
  • exogenous growth factors and/or cytokines may be administered in combination with the polypeptide of the invention to achieve the desired effect, including any of the growth factors listed herein, other stem cell maintenance factors, and specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), Flt-3 ligand (Flt- 3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, macrophage inflammatory protein 1 -alpha (MIP-1 -alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors and basic fibroblast growth factor (bFGF).
  • SCF stem cell factor
  • LIF leukemia inhibitory factor
  • Flt-3 ligand Flt-3 ligand
  • MIP-1 -alpha macrophage inflammatory protein 1 -alpha
  • G-CSF G-CSF
  • GM-CSF thro
  • stroma cells transfected with a polynucleotide that encodes for the polypeptide of the invention can be used as a feeder layer for the stem cell populations in culture or in vivo.
  • Stromal support cells for feeder layers may include embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926).
  • Stem cells themselves can be transfected with a polynucleotide of the invention to induce autocrine expression of the polypeptide of the invention. This will allow for generation of undifferentiated totipotential/pluripotential stem cell lines that are useful as is or that can then be differentiated into the desired mature cell types. These stable cell lines can also serve as a source of undifferentiated totipotential/pluripotential mRNA to create cDNA libraries and templates for polymerase chain reaction experiments. These studies would allow for the isolation and identification of differentially expressed genes in stem cell populations that regulate stem cell proliferation and/or maintenance.
  • polypeptides of the present invention may be used to manipulate stem cells in culture to give rise to neuroepithelial cells that can be used to augment or replace cells damaged by illness, autoimmune disease, accidental damage or genetic disorders.
  • the polypeptide of the invention may be useful for inducing the proliferation of neural cells and for the regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic disorders which involve degeneration, death or trauma to neural cells or nerve tissue.
  • the expanded stem cell populations can also be genetically altered for gene therapy purposes and to decrease host rejection of replacement tissues after grafting or implantation.
  • a broadly applicable method of obtaining pure populations of a specific differentiated cell type from undifferentiated stem cell populations involves the use of a cell-type specific promoter driving a selectable marker.
  • the selectable marker allows only cells of the desired type to survive.
  • stem cells can be induced to differentiate into cardiomyocytes (Wobus et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998)) or skeletal muscle cells (Browder, L. W. In: Principles of Tissue Engineering eds.
  • directed differentiation of stem cells can be accomplished by culturing the stem cells in the presence of a differentiation factor such as retinoic acid and an antagonist of the polypeptide of the invention which would inhibit the effects of endogenous stem cell factor activity and allow differentiation to proceed.
  • a differentiation factor such as retinoic acid and an antagonist of the polypeptide of the invention which would inhibit the effects of endogenous stem cell factor activity and allow differentiation to proceed.
  • stem cells In vitro cultures of stem cells can be used to determine if the polypeptide of the invention exhibits stem cell growth factor activity.
  • Stem cells are isolated from any one of various cell sources (including hematopoietic stem cells and embryonic stem cells) and cultured on a feeder layer, as described by Thompson et al. Proc. Natl. Acad. Sci, U.S.A., 92: 7844-7848 (1995), in the presence of the polypeptide of the invention alone or in combination with other growth factors or cytokines.
  • the ability of the polypeptide of the invention to induce stem cells proliferation is determined by colony formation on semi-solid support e.g. as described by Bernstein et al., Blood, 77: 2316-2321 (1991).
  • a polypeptide of the present invention may be involved in regulation of hematopoiesis and, consequently, in the treatment of myeloid or lymphoid cell disorders. Even marginal biological activity in support of colony forming cells or of factor-dependent cell lines indicates involvement in regulating hematopoiesis, e.g.
  • erythroid progenitor cells alone or in combination with other cytokines, thereby indicating utility, for example, in treating various anemias or for use in conjunction with irradiation/chemotherapy to stimulate the production of erythroid precursors and/or erythroid cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and monocytes/macrophages (i.e., traditional CSF activity) useful, for example, in conjunction with chemotherapy to prevent or treat consequent myelo-suppression; in supporting the growth and proliferation of megakaryocytes and consequently of platelets thereby allowing prevention or treatment of various platelet disorders such as thrombocytopenia, and generally for use in place of or complimentary to platelet transfusions; and/or in supporting the growth and proliferation of hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned hematopoietic cells and therefore find therapeutic utility in various stem cell disorders (such as those usually treated with
  • compositions of the invention can be used in the following: Suitable assays for proliferation and differentiation of various hematopoietic lines are cited above.
  • Assays for embryonic stem cell differentiation include, without limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 81 :2903-2915, 1993.
  • Assays for stem cell survival and differentiation include, without limitation, those described in: Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds.
  • a polypeptide of the present invention also may be involved in bone, cartilage, tendon, ligament and/or nerve tissue growth or regeneration, as well as in wound healing and tissue repair and replacement, and in healing of burns, incisions and ulcers.
  • a polypeptide of the present invention which induces cartilage and/or bone growth in circumstances where bone is not normally formed, has application in the healing of bone fractures and cartilage damage or defects in humans and other animals.
  • Compositions of a polypeptide, antibody, binding partner, or other modulator of the invention may have prophylactic use in closed as well as open fracture reduction and also in the improved fixation of artificial joints. De novo bone formation induced by an osteogenic agent contributes to the repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and also is useful in cosmetic plastic surgery.
  • a polypeptide of this invention may also be involved in attracting bone-forming cells, stimulating growth of bone-forming cells, or inducing differentiation of progenitors of bone-forming cells.
  • Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking inflammation or processes of tissue destruction (collagenase activity, osteoclast activity, etc.) mediated by inflammatory processes may also be possible using the composition of the invention.
  • Another category of tissue regeneration activity that may involve the polypeptide of the present invention is tendon/ligament formation.
  • tendon/ligament-like tissue or other tissue formation in circumstances where such tissue is not normally formed, has application in the healing of tendon or ligament tears, deformities and other tendon or ligament defects in humans and other animals.
  • a preparation employing a tendon/ligament-like tissue inducing protein may have prophylactic use in preventing damage to tendon or ligament tissue, as well as use in the improved fixation of tendon or ligament to bone or other tissues, and in repairing defects to tendon or ligament tissue.
  • De novo tendon/ligament-like tissue formation induced by a composition of the present invention contributes to the repair of congenital, trauma induced, or other tendon or ligament defects of other origin, and is also useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments.
  • compositions of the present invention may provide environment to attract tendon- or ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, induce differentiation of progenitors of tendon- or ligament- forming cells, or induce growth of tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue repair.
  • the compositions of the invention may also be useful in the treatment of tendinitis, carpal tunnel syndrome and other tendon or ligament defects.
  • the compositions may also include an appropriate matrix and/or sequestering agent as a carrier as is well known in the art.
  • compositions of the present invention may also be useful for proliferation of neural cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a composition may be used in the treatment of diseases of the peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager syndrome. Further conditions that may be treated in accordance with the present invention include mechanical and traumatic disorders, such as spinal cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting from chemotherapy or other medical therapies may also be treatable using a composition of the invention.
  • compositions of the invention may also be useful to promote better or faster closure of non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular insufficiency, surgical and traumatic wounds, and the like.
  • compositions of the present invention may also be involved in the generation or regeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular endothelium) tissue, or for promoting the growth of cells comprising such tissues. Part of the desired effects may be by inhibition or modulation of fibrotic scarring may allow normal tissue to regenerate.
  • a polypeptide of the present invention may also exhibit angiogenic activity.
  • a composition of the present invention may also be useful for gut protection or regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and conditions resulting from systemic cytokine damage.
  • a composition of the present invention may also be useful for promoting or inhibiting differentiation of tissues described above from precursor tissues or cells; or for inhibiting the growth of tissues described above.
  • compositions of the invention can be used in the following: Assays for tissue generation activity include, without limitation, those described in: International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International Patent Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. WO91/07491 (skin, endothelium).
  • Assays for wound healing activity include, without limitation, those described in: Winter, Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T., eds.), Year Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol 71 :382-84 (1978).
  • a polypeptide of the present invention may also exhibit immune stimulating or immune suppressing activity, including without limitation the activities for which assays are described herein.
  • a polynucleotide of the invention can encode a polypeptide exhibiting such activities.
  • a protein may be useful in the treatment of various immune deficiencies and disorders (including severe combined immunodeficiency (SOD)), e.g., in regulating (up or down) growth and proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells and other cell populations.
  • SOD severe combined immunodeficiency
  • These immune deficiencies may be genetic or be caused by viral (e.g., HIV) as well as bacterial or fungal infections, or may result from autoimmune disorders.
  • infectious diseases causes by viral, bacterial, fungal or other infection may be treatable using a protein of the present invention, including infections by HIV, hepatitis viruses, herpes viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such as candidiasis.
  • proteins of the present invention may also be useful where a boost to the immune system generally may be desirable, i.e., in the treatment of cancer.
  • Autoimmune disorders which may be treated using a protein of the present invention include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, graft-versus-host disease and autoimmune inflammatory eye disease.
  • Such a protein (or antagonists thereof, including antibodies) of the present invention may also to be useful in the treatment of allergic reactions and conditions (e.g., anaphylaxis, serum sickness, drug reactions, food allergies, insect venom allergies, mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria, angioedema, eczema, atopic dermatitis, allergic contact dermatitis, erythema multiforme, Stevens- Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, venereal keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma (particularly allergic asthma) or other respiratory problems.
  • allergic reactions and conditions e.g., anaphylaxis, serum sickness, drug reactions, food allergies, insect venom allergies, mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria, angioedema,
  • a protein (or antagonists thereof) of the present invention may also be treatable using a protein (or antagonists thereof) of the present invention.
  • the therapeutic effects of the polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo animals models such as the cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66, 1998), skin prick test (Hoffmann et al., Allergy 54: 446-54, 1999), guinea pig skin sensitization test (Vohr et al., Arch. Toxocol. 73: 501-9), and murine local lymph node assay (Kimber et al., J. Toxicol.
  • T cells may be inhibited by suppressing T cell responses or by inducing specific tolerance in T cells, or both.
  • Immunosuppression of T cell responses is generally an active, non-antigen-specific, process which requires continuous exposure of the T cells to the suppressive agent.
  • Tolerance which involves inducing non-responsiveness or anergy in T cells, is distinguishable from immunosuppression in that it is generally antigen-specific and persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be demonstrated by the lack of a T cell response upon reexposure to specific antigen in the absence of the tolerizing agent.
  • Down regulating or preventing one or more antigen functions (including without limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and organ transplantation and in graft-versus-host disease (GVHD).
  • B lymphocyte antigen functions such as, for example, B7
  • GVHD graft-versus-host disease
  • blockage of T cell function should result in reduced tissue destruction in tissue transplantation.
  • rejection of the transplant is initiated through its recognition as foreign by T cells, followed by an immune reaction that destroys the transplant.
  • the administration of a therapeutic composition of the invention may prevent cytokine synthesis by immune cells, such as T cells, and thus acts as an immunosuppressant.
  • a lack of costimulation may also be sufficient to anergize the T cells, thereby inducing tolerance in a subject.
  • Induction of long-term tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration of these blocking reagents.
  • To achieve sufficient immunosuppression or tolerance in a subject it may also be necessary to block the function of a combination of B lymphocyte antigens.
  • the efficacy of particular therapeutic compositions in preventing organ transplant rejection or GVHD can be assessed using animal models that are predictive of efficacy in humans.
  • Examples of appropriate systems which can be used include allogeneic cardiac grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et al., Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. Sci USA, 89:1 1102-11 105 (1992).
  • murine models of GVHD see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to determine the effect of therapeutic compositions of the invention on the development of that disease.
  • Blocking antigen function may also be therapeutically useful for treating autoimmune diseases.
  • Many autoimmune disorders are the result of inappropriate activation of T cells that are reactive against self-tissue and which promote the production of cytokines and autoanti bodies involved in the pathology of the diseases.
  • Preventing the activation of autoreactive T cells may reduce or eliminate disease symptoms.
  • Administration of reagents which block stimulation of T cells can be used to inhibit T cell activation and prevent production of autoantibodies or T cell-derived cytokines which may be involved in the disease process. Additionally, blocking reagents may induce antigen-specific tolerance of autoreactive T cells which could lead to long-term relief from the disease.
  • the efficacy of blocking reagents in preventing or alleviating autoimmune disorders can be determined using a number of well-characterized animal models of human autoimmune diseases. Examples include murine experimental autoimmune encephalitis, systemic lupus erythmatosis in MRL/lpr/lpr mice or
  • NZB hybrid mice murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental myasthenia gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 840-856).
  • Upregulation of an antigen function may also be useful in therapy. Upregulation of immune responses may be in the form of enhancing an existing immune response or eliciting an initial immune response. For example, enhancing an immune response may be useful in cases of viral infection, including systemic viral diseases such as influenza, the common cold, and encephalitis.
  • anti-viral immune responses may be enhanced in an infected patient by removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed APCs either expressing a peptide of the present invention or together with a stimulatory form of a soluble peptide of the present invention and reintroducing the in vitro activated T cells into the patient.
  • Another method of enhancing anti-viral immune responses would be to isolate infected cells from a patient, transfect them with a nucleic acid encoding a protein of the present invention as described herein such that the cells express all or a portion of the protein on their surface, and reintroduce the transfected cells into the patient.
  • the infected cells would now be capable of delivering a costimulatory signal to, and thereby activate, T cells in vivo.
  • a polypeptide of the present invention may provide the necessary stimulation signal to T cells to induce a T cell mediated immune response against the transfected tumor cells.
  • tumor cells which lack MHC class I or MHC class II molecules, or which fail to reexpress sufficient mounts of MHC class I or MHC class II molecules, can be transfected with nucleic acid encoding all or a portion of (e.g., a cytoplasmic-domain truncated portion) of an MHC class I alpha chain protein and ⁇ microglobulin protein or an MHC class II alpha chain protein and an MHC class II beta chain protein to thereby express MHC class I or MHC class II proteins on the cell surface.
  • a gene encoding an antisense construct which blocks expression of an MHC class II associated protein, such as the invariant chain can also be cotransfected with a DNA encoding a peptide having the activity of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce tumor specific immunity.
  • a T cell mediated immune response in a human subject may be sufficient to overcome tumor-specific tolerance in the subject.
  • the activity of a protein of the invention may, among other means, be measured by the following methods:
  • Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and WileyTnterscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488-2492, 1981 ; Herrmann et al., J. Immunol. 128: 1968-1974, 1982; Handa et al., J.
  • T-cell-dependent immunoglobulin responses and isotype switching (which will identify, among others, proteins that modulate T-cell dependent antibody responses and that affect Thl/Th2 profiles) include, without limitation, those described in: Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994.
  • MLR Mixed lymphocyte reaction
  • Dendritic cell-dependent assays (which will identify, among others, proteins expressed by dendritic cells that activate naive T-cells) include, without limitation, those described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of Experimental Medicine
  • lymphocyte survival/apoptosis (which will identify, among others, proteins that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry 13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Research 53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991 ; Zacharchuk, Journal of Immunology 145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al., International Journal of Oncology 1 :639-648, 1992.
  • Assays for proteins that influence early steps of T-cell commitment and development include, without limitation, those described in: Antica et al., Blood 84:1 11-1 17, 1994; Fine et al., Cellular Immunology 155:111-122, 1994; Galy et al., Blood 85:2770-2778, 1995; Toki et al.,
  • a polypeptide of the present invention may also exhibit activin- or inhibin-related activities.
  • a polynucleotide of the invention may encode a polypeptide exhibiting such characteristics.
  • Inhibins are characterized by their ability to inhibit the release of follicle stimulating hormone (FSH), while activins and are characterized by their ability to stimulate the release of follicle stimulating hormone (FSH).
  • FSH follicle stimulating hormone
  • a polypeptide of the present invention alone or in heterodimers with a member of the inhibin family, may be useful as a contraceptive based on the ability of inhibins to decrease fertility in female mammals and decrease spermatogenesis in male mammals.
  • the polypeptide of the invention may be useful as a fertility inducing therapeutic, based upon the ability of activin molecules in stimulating FSH release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885.
  • a polypeptide of the invention may also be useful for advancement of the onset of fertility in sexually immature mammals, so as to increase the lifetime reproductive performance of domestic animals such as, but not limited to, cows, sheep and pigs.
  • the activity of a polypeptide of the invention may, among other means, be measured by the following methods.
  • Assays for activin/inhibin activity include, without limitation, those described in: Vale et al., Endocrinology 91 :562-572, 1972; Ling et al., Nature 321 :779-782, 1986; Vale et al., Nature 321 :776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. Natl. Acad. Sci. USA 83:3091-3095, 1986.
  • a polypeptide of the present invention may be involved in chemotactic or chemokinetic activity for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial cells.
  • a polynucleotide of the invention can encode a polypeptide exhibiting such attributes.
  • Chemotactic and chemokinetic receptor activation can be used to mobilize or attract a desired cell population to a desired site of action.
  • Chemotactic or chemokinetic compositions e.g. proteins, antibodies, binding partners, or modulators of the invention
  • a protein or peptide has chemotactic activity for a particular cell population if it can stimulate, directly or indirectly, the directed orientation or movement of such cell population.
  • the protein or peptide has the ability to directly stimulate directed movement of cells.
  • Therapeutic compositions of the invention can be used in the following: Assays for chemotactic activity (which will identify proteins that induce or prevent chemotaxis) consist of assays that measure the ability of a protein to induce the migration of cells across a membrane as well as the ability of a protein to induce the adhesion of one cell population to another cell population.
  • Suitable assays for movement and adhesion include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Marguiles, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta Chemokines
  • a polypeptide of the invention may also be involved in hemostatis or thrombolysis or thrombosis.
  • a polynucleotide of the invention can encode a polypeptide exhibiting such attributes.
  • Compositions may be useful in treatment of various coagulation disorders (including hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events in treating wounds resulting from trauma, surgery or other causes.
  • a composition of the invention may also be useful for dissolving or inhibiting formation of thromboses and for treatment and prevention of conditions resulting therefrom (such as, for example, infarction of cardiac and central nervous system vessels (e.g., stroke).
  • compositions of the invention can be used in the following: Assay for hemostatic and thrombolytic activity include, without limitation, those described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 35:467-474, 1988.
  • Polypeptides of the invention may be involved in cancer cell generation, proliferation or metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. For example, the presence or increased expression of a polynucleotide/polypeptide of the invention may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing malignancy. Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancer condition. Identification of single nucleotide polymorphisms associated with cancer or a predisposition to cancer may also be useful for diagnosis or prognosis.
  • compositions of the invention may be effective in adult and pediatric oncology including in solid phase tumors/malignancies, locally advanced tumors, human soft tissue sarcomas, metastatic cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma, acute and chronic leukemias, and lymphomas, head and neck cancers including mouth cancer, larynx cancer and thyroid cancer, lung cancers including small cell carcinoma and non-small cell cancers, breast cancers including small cell carcinoma and ductal carcinoma, gastrointestinal cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps associated with colorectal neoplasia, pancreatic cancers, liver cancer, urologic cancers including bladder cancer and prostate cancer, malignancies of the female genital tract including ovarian
  • Polypeptides, polynucleotides, or modulators of polypeptides of the invention may be administered to treat cancer.
  • Therapeutic compositions can be administered in therapeutically effective dosages alone or in combination with adjuvant cancer therapy such as surgery, chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial effect, e.g. reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or otherwise improving overall clinical condition, without necessarily eradicating the cancer.
  • composition can also be administered in therapeutically effective amounts as a portion of an anti-cancer cocktail.
  • An anti-cancer cocktail is a mixture of the polypeptide or modulator of the invention with one or more anti-cancer drugs in addition to a pharmaceutically acceptable carrier for delivery.
  • the use of anti-cancer cocktails as a cancer treatment is routine.
  • Anti-cancer drugs that are well known in the art and can be used as a treatment in combination with the polypeptide or modulator of the invention include: Actinomycin D, Aminoglutethimide, Asparaginase, Bleomycin, Busulfan, Carboplatin, Carmustine,
  • compositions of the invention may be used for prophylactic treatment of cancer.
  • hereditary conditions and/or environmental situations e.g. exposure to carcinogens
  • In vitro models can be used to determine the effective doses of the polypeptide of the invention as a potential cancer treatment.
  • Suitable tumor cells lines are available, e.g. from American Type Tissue Culture Collection catalogs.
  • a polypeptide of the present invention may also demonstrate activity as receptor, receptor ligand or inhibitor or agonist of receptor/ligand interactions.
  • a polynucleotide of the invention can encode a polypeptide exhibiting such characteristics.
  • receptors and ligands include, without limitation, cytokine receptors and their ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions and their ligands (including without limitation, cellular adhesion molecules (such as selectins, integrins and their ligands) and receptor/ligand pairs involved in antigen presentation, antigen recognition and development of cellular and humoral immune responses.
  • Receptors and ligands are also useful for screening of potential peptide or small molecule inhibitors of the relevant receptor/ligand interaction.
  • a protein of the present invention (including, without limitation, fragments of receptors and ligands) may themselves be useful as inhibitors of receptor/ligand interactions.
  • the activity of a polypeptide of the invention may, among other means, be measured by the following methods:
  • Suitable assays for receptor-ligand activity include without limitation those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static conditions 7.28.1- 7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., J. Exp. Med. 168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995.
  • polypeptides of the invention may be used as a receptor for a ligand(s) thereby transmitting the biological activity of that ligand(s).
  • Ligands may be identified through binding assays, affinity chromatography, dihybrid screening assays, BIAcore assays, gel overlay assays, or other methods known in the art.
  • polypeptides of the present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes, colorimetric molecules or toxin molecules by conventional methods.
  • radioisotopes include, but are not limited to, tritium and carbon- 14.
  • colorimetric molecules include, but are not limited to, fluorescent molecules such as fluorescamine, or rhodamine or other colorimetric molecules.
  • toxins include, but are not limited, to ricin.
  • This invention is particularly useful for screening chemical compounds by using the novel polypeptides or binding fragments thereof in any of a variety of drug screening techniques.
  • the polypeptides or fragments employed in such a test may either be free in solution, affixed to a solid support, borne on a cell surface or located intracellularly.
  • One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant nucleic acids expressing the polypeptide or a fragment thereof. Drugs are screened against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be used for standard binding assays.
  • Sources for test compounds that may be screened for ability to bind to or modulate (i.e., increase or decrease) the activity of polypeptides of the invention include (1) inorganic and organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries comprised of either random or mimetic peptides, oligonucleotides or organic molecules.
  • Chemical libraries may be readily synthesized or purchased from a number of commercial sources, and may include structural analogs of known compounds or compounds that are identified as "hits" or "leads” via natural product screening.
  • the sources of natural product libraries are microorganisms (including bacteria and fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for screening may be created by: (1) fermentation and extraction of broths from soil, plant or marine microorganisms or (2) extraction of the organisms themselves.
  • Natural product libraries include polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a review, see Science 282:63-68 (1998).
  • Combinatorial libraries are composed of large numbers of peptides, oligonucleotides or organic compounds and can be readily prepared by traditional automated synthesis methods, PCR, cloning or proprietary synthetic methods.
  • peptide and oligonucleotide combinatorial libraries are peptide and oligonucleotide combinatorial libraries.
  • Still other libraries of interest include peptide, protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide libraries.
  • combinatorial chemistry and libraries created therefrom see Myers, Curr. Opin. Biotechnol. 8:701-707 (1997).
  • peptidomimetic libraries see Al-Obeidi et al., Mol. Biotechnol, 9(3):205-23 (1998); Hruby et al., Curr Opin Chem Biol,
  • the binding molecules thus identified may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to cells such as radioisotopes.
  • toxins e.g., ricin or cholera
  • the toxin-binding molecule complex is then targeted to a tumor or other cell by the specificity of the binding molecule for a polypeptide of the invention.
  • the binding molecules may be complexed with imaging agents for targeting and imaging purposes.
  • the invention also provides methods to detect specific binding of a polypeptide e.g. a ligand or a receptor.
  • a polypeptide e.g. a ligand or a receptor.
  • the art provides numerous assays particularly useful for identifying previously unknown binding partners for receptor polypeptides of the invention. For example, expression cloning using mammalian or bacterial cells, or dihybrid screening assays can be used to identify polynucleotides encoding binding partners. As another example, affinity chromatography with the appropriate immobilized polypeptide of the invention can be used to isolate polypeptides that recognize and bind polypeptides of the invention.
  • Ligands for receptor polypeptides of the invention can also be identified by adding exogenous ligands, or cocktails of ligands to two cells populations that are genetically identical except for the expression of the receptor of the invention: one cell population expresses the receptor of the invention whereas the other does not. The responses of the two cell populations to the addition of ligands(s) are then compared.
  • an expression library can be co-expressed with the polypeptide of the invention in cells and assayed for an autocrine response to identify potential ligand(s).
  • BIAcore assays can be used to identify binding partner polypeptides, including, (1) organic and inorganic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries comprised of random peptides, oligonucleotides or organic molecules.
  • downstream intracellular signaling molecules in the signaling cascade of the polypeptide of the invention can be determined.
  • a chimeric protein in which the cytoplasmic domain of the polypeptide of the invention is fused to the extracellular portion of a protein, whose ligand has been identified is produced in a host cell.
  • the cell is then incubated with the ligand specific for the extracellular portion of the chimeric protein, thereby activating the chimeric receptor.
  • Known downstream proteins involved in intracellular signaling can then be assayed for expected modifications i.e. phosphorylation.
  • Other methods known to those in the art can also be used to identify signaling molecules involved in receptor activity.
  • compositions of the present invention may also exhibit anti-inflammatory activity.
  • the anti-inflammatory activity may be achieved by providing a stimulus to cells involved in the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production of other factors which more directly inhibit or promote an inflammatory response.
  • compositions with such activities can be used to treat inflammatory conditions including chronic or acute conditions), including without limitation intimation associated with infection (such as septic shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting from over production of cytokines such as TNF or IL-1.
  • Compositions of the invention may also be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material.
  • compositions of this invention may be utilized to prevent or treat conditions such as, but not limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, graft versus host disease, inflammatory bowel disease, inflamation associated with pulmonary disease, other autoimmune disease or inflammatory disease, an antiproliferative agent such as for acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to intrauterine infections.
  • conditions such as, but not limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, graft versus host disease, inflammatory bowel disease, inflamation associated with pulmonary disease, other autoimmune disease or inflammatory disease, an antiproliferative agent such as for acute or chronic mylegen
  • Leukemias and related disorders may be treated or prevented by administration of a therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the invention.
  • leukemias and related disorders include but are not limited to acute leukemia, acute lymphocytic leukemia, acute myelocytic leukemia, myeloblastic, promyelocytic, myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic myelocytic (granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see Fishman et al., 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia).
  • Nervous system disorders involving cell types which can be tested for efficacy of intervention with compounds that modulate the activity of the polynucleotides and/or polypeptides of the invention, and which can be treated upon thus observing an indication of therapeutic utility, include but are not limited to nervous system injuries, and diseases or disorders which result in either a disconnection of axons, a diminution or degeneration of neurons, or demyelination.
  • Nervous system lesions which may be treated in a patient (including human and non-human mammalian patients) according to the invention include but are not limited to the following lesions of either the central (including spinal cord, brain) or peripheral nervous systems: (i) traumatic lesions, including lesions caused by physical injury or associated with surgery, for example, lesions which sever a portion of the nervous system, or compression injuries;
  • ischemic lesions in which a lack of oxygen in a portion of the nervous system results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord infarction or ischemia;
  • infectious lesions in which a portion of the nervous system is destroyed or injured as a result of infection, for example, by an abscess or associated with infection by human immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, tuberculosis, syphilis;
  • degenerative lesions in which a portion of the nervous system is destroyed or injured as a result of a degenerative process including but not limited to degeneration associated with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral sclerosis;
  • neurological lesions associated with systemic diseases including but not limited to diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, carcinoma, or sarcoidosis;
  • demyelinated lesions in which a portion of the nervous system is destroyed or injured by a demyelinating disease including but not limited to multiple sclerosis, human immunodeficiency virus-associated myelopathy, transverse myelopathy or various etiologies, progressive multifocal leukoencephalopathy, and central pontine myelinolysis.
  • Therapeutics which are useful according to the invention for treatment of a nervous system disorder may be selected by testing for biological activity in promoting the survival or differentiation of neurons.
  • therapeutics which elicit any of the following effects may be useful according to the invention: (i) increased survival time of neurons in culture; (ii) increased sprouting of neurons in culture or in vivo;
  • neuron-associated molecule in culture or in vivo, e.g., choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or (iv) decreased symptoms of neuron dysfunction in vivo.
  • increased survival of neurons may be measured by the method set forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or Brown et al. (1981, Ann. Rev. Neurosci.
  • neuron-associated molecules may be measured by bioassay, enzymatic assay, antibody binding, Northern blot assay, etc., depending on the molecule to be measured; and motor neuron dysfunction may be measured by assessing the physical manifestation of motor neuron disorder, e.g., weakness, motor neuron conduction velocity, or functional disability.
  • motor neuron disorders that may be treated according to the invention include but are not limited to disorders such as infarction, infection, exposure to toxin, trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as well as other components of the nervous system, as well as disorders that selectively affect neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile muscular atrophy, progressive bulbar paralysis of childhood (Fazio-Londe syndrome), poliomyelitis and the post polio syndrome, and Hereditary Motorsensory Neuropathy (Charcot-Marie-Tooth Disease).
  • disorders such as infarction, infection, exposure to toxin, trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as well as other components of the nervous system, as well as disorders that selectively affect neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal muscular atrophy, progressive bulbar palsy, primary
  • a polypeptide of the invention may also exhibit one or more of the following additional activities or effects: inhibiting the growth, infection or function of, or killing, infectious agents, including, without limitation, bacteria, viruses, fungi and other parasites; effecting (suppressing or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape
  • polymorphisms makes possible the identification of such polymorphisms in human subjects and the pharmacogenetic use of this information for diagnosis and treatment.
  • Such polymorphisms may be associated with, e.g., differential predisposition or susceptibility to various disease states (such as disorders involving inflammation or immune response) or a differential response to drug administration, and this genetic information can be used to tailor preventive or therapeutic treatment appropriately.
  • the existence of a polymorphism associated with a predisposition to inflammation or autoimmune disease makes possible the diagnosis of this condition in humans by identifying the presence of the polymorphism.
  • Polymorphisms can be identified in a variety of ways known in the art which all generally involve obtaining a sample from a patient, analyzing DNA from the sample, optionally involving isolation or amplification of the DNA, and identifying the presence of the polymorphism in the DNA. For example, PCR may be used to amplify an appropriate fragment of genomic DNA which may then be sequenced.
  • the DNA may be subjected to allele-specific oligonucleotide hybridization (in which appropriate oligonucleotides are hybridized to the DNA under conditions permitting detection of a single base mismatch) or to a single nucleotide extension assay (in which an oligonucleotide that hybridizes immediately adjacent to the position of the polymorphism is extended with one or more labeled nucleotides).
  • allele-specific oligonucleotide hybridization in which appropriate oligonucleotides are hybridized to the DNA under conditions permitting detection of a single base mismatch
  • a single nucleotide extension assay in which an oligonucleotide that hybridizes immediately adjacent to the position of the polymorphism is extended with one or more labeled nucleotides.
  • traditional restriction fragment length polymorphism analysis using restriction enzymes that provide differential digestion of the genomic DNA depending on the presence or absence of the polymorphism
  • the array can comprise modified nucleotide sequences of the present invention in order to detect the nucleotide sequences of the present invention.
  • any one of the nucleotide sequences of the present invention can be placed on the array to detect changes from those sequences.
  • polymorphism resulting in a change in the amino acid sequence could also be detected by detecting a corresponding change in amino acid sequence of the protein, e.g., by an antibody specific to the variant sequence.
  • the immunosuppressive effects of the compositions of the invention against rheumatoid arthritis are determined in an experimental animal model system.
  • the experimental model system is adjuvant induced arthritis in rats, and the protocol is described by J. Holoshitz, et at., 1983, Science, 219:56, or by B. Waksman et al., 1963, Int. Arch. Allergy Appl. Immunol., 23:129.
  • Induction of the disease can be caused by a single injection, generally intradermally, of a suspension of killed Mycobacterium tuberculosis in complete Freund's adjuvant (CFA).
  • CFA complete Freund's adjuvant
  • the route of injection can vary, but rats may be injected at the base of the tail with an adjuvant mixture.
  • the polypeptide is administered in phosphate buffered solution (PBS) at a dose of about 1-5 mg/kg.
  • the control consists of administering PBS only.
  • the procedure for testing the effects of the test compound would consist of intradermally injecting killed Mycobacterium tuberculosis in CFA followed by immediately administering the test compound and subsequent treatment every other day until day 24.
  • an overall arthritis score may be obtained as described by J. Holoskitz above. An analysis of the data would reveal that the test compound would have a dramatic affect on the swelling of the joints as measured by a decrease of the arthritis score.
  • compositions including polypeptide fragments, analogs, variants and antibodies or other binding partners or modulators including antisense polynucleotides
  • therapeutic applications include, but are not limited to, those exemplified herein.
  • One embodiment of the invention is the administration of an effective amount of the polypeptides or other composition of the invention to individuals affected by a disease or disorder that can be modulated by regulating the peptides of the invention. While the mode of administration is not particularly important, parenteral administration is preferred. An exemplary mode of administration is to deliver an intravenous bolus.
  • the dosage of the polypeptides or other composition of the invention will normally be determined by the prescribing physician. It is to be expected that the dosage will vary according to the age, weight, condition and response of the individual patient. Typically, the amount of polypeptide administered per dose will be in the range of about 0.01 ⁇ g/kg to 100 mg/kg of body weight, with the preferred dose being about 0.1 ⁇ g/kg to 10 mg/kg of patient body weight.
  • polypeptides of the invention will be formulated in an injectable form combined with a pharmaceutically acceptable parenteral vehicle.
  • a pharmaceutically acceptable parenteral vehicle include water, saline, Ringer's solution, dextrose solution, and solutions consisting of small amounts of the human serum albumin.
  • the vehicle may contain minor amounts of additives that maintain the isotonicity and stability of the polypeptide or other active ingredient. The preparation of such solutions is within the skill of the art.
  • a protein or other composition of the present invention may be administered to a patient in need, by itself, or in pharmaceutical compositions where it is mixed with suitable carriers or excipient(s) at doses to treat or ameliorate a variety of disorders.
  • a composition may optionally contain (in addition to protein or other active ingredient and a carrier) diluents, fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art.
  • pharmaceutically acceptable means a non-toxic material that does not interfere with the effectiveness of the biological activity of the active ingredient(s).
  • the pharmaceutical composition of the invention may also contain cytokines, lymphokines, or other hematopoietic factors such as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IFN, TNF0, TNF1 , TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell factor, and erythropoietin.
  • proteins of the invention may be combined with other agents beneficial to the treatment of the disease or disorder in question.
  • agents include various growth factors such as epidermal growth factor (EGF), platelet-derived growth factor (PDGF), transforming growth factors (TGF- ⁇ and TGF- ⁇ ), insulin-like growth factor (IGF), as well as cytokines described herein.
  • EGF epidermal growth factor
  • PDGF platelet-derived growth factor
  • TGF- ⁇ and TGF- ⁇ transforming growth factors
  • IGF insulin-like growth factor
  • the pharmaceutical composition may further contain other agents which either enhance the activity of the protein or other active ingredient or complement its activity or use in treatment. Such additional factors and/or agents may be included in the pharmaceutical composition to produce a synergistic effect with protein or other active ingredient of the invention, or to minimize side effects.
  • protein or other active ingredient of the present invention may be included in formulations of the particular clotting factor, cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti- inflammatory agent to minimize side effects of the clotting factor, cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-inflammatory agent (such as IL-lRa, IL-1 Hyl, IL-1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents).
  • a protein of the present invention may be active in multimers (e.g., heterodimers or homodimers) or complexes with itself or other proteins.
  • pharmaceutical compositions of the invention may comprise a protein of the invention in such multimeric or complexed form.
  • a second protein or a therapeutic agent may be concurrently administered with the first protein (e.g., at the same time, or at differing times provided that therapeutic concentrations of the combination of agents is achieved at the treatment site).
  • a therapeutically effective dose further refers to that amount of the compound sufficient to result in amelioration of symptoms, e.g., treatment, healing, prevention or amelioration of the relevant medical condition, or an increase in rate of treatment, healing, prevention or amelioration of such conditions.
  • a therapeutically effective dose refers to that ingredient alone.
  • a therapeutically effective dose refers to combined amounts of the active ingredients that result in the therapeutic effect, whether administered in combination, serially or simultaneously.
  • a therapeutically effective amount of protein or other active ingredient of the present invention is administered to a mammal having a condition to be treated.
  • Protein or other active ingredient of the present invention may be administered in accordance with the method of the invention either alone or in combination with other therapies such as treatments employing cytokines, lymphokines or other hematopoietic factors.
  • protein or other active ingredient of the present invention may be administered either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic factors, or sequentially.
  • cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic factors are administered sequentially, the attending physician will decide on the appropriate sequence of administering protein or other active ingredient of the present invention in combination with cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic factors.
  • Suitable routes of administration may, for example, include oral, rectal, transmucosal, or intestinal administration; parenteral delivery, including intramuscular, subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal, or intraocular injections.
  • Administration of protein or other active ingredient of the present invention used in the pharmaceutical composition or to practice the method of the present invention can be carried out in a variety of conventional ways, such as oral ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. Intravenous administration to the patient is preferred.
  • the compounds may be administered topically, for example, as eye drops.
  • a targeted drug delivery system for example, in a liposome coated with a specific antibody, targeting, for example, arthritic or fibrotic tissue. The liposomes will be targeted to and taken up selectively by the afflicted tissue.
  • the polypeptides of the invention are administered by any route that delivers an effective dosage to the desired site of action.
  • a suitable route of administration and an effective dosage for a particular indication is within the level of skill in the art.
  • Suitable dosage ranges for the polypeptides of the invention can be extrapolated from these dosages or from similar studies in appropriate animal models. Dosages can then be adjusted as necessary by the clinician to provide maximal therapeutic benefit.
  • compositions for use in accordance with the present invention thus may be formulated in a conventional manner using one or more physiologically acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active compounds into preparations which can be used pharmaceutically.
  • physiologically acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active compounds into preparations which can be used pharmaceutically.
  • These pharmaceutical compositions may be manufactured in a manner that is itself known, e.g. , by means of conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or lyophilizing processes. Proper formulation is dependent upon the route of administration chosen.
  • protein or other active ingredient of the present invention When a therapeutically effective amount of protein or other active ingredient of the present invention is administered orally, protein or other active ingredient of the present invention will be in the form of a tablet, capsule, powder, solution or elixir.
  • the pharmaceutical composition of the invention may additionally contain a solid carrier such as a gelatin or an adjuvant.
  • the tablet, capsule, and powder contain from about 5 to 95% protein or other active ingredient of the present invention, and preferably from about 25 to 90% protein or other active ingredient of the present invention.
  • a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil, mineral oil, soybean oil, or sesame oil, or synthetic oils may be added.
  • the liquid form of the pharmaceutical composition may further contain physiological saline solution, dextrose or other saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol.
  • the pharmaceutical composition When administered in liquid form, contains from about 0.5 to 90% by weight of protein or other active ingredient of the present invention, and preferably from about 1 to 50% protein or other active ingredient of the present invention.
  • protein or other active ingredient of the present invention When a therapeutically effective amount of protein or other active ingredient of the present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or other active ingredient of the present invention will be in the form of a pyrogen-free, parenterally acceptable aqueous solution.
  • parenterally acceptable protein or other active ingredient solutions having due regard to pH, isotonicity, stability, and the like, is within the skill in the art.
  • a preferred pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection should contain, in addition to protein or other active ingredient of the present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or other vehicle as known in the art.
  • the pharmaceutical composition of the present invention may also contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of skill in the art.
  • the agents of the invention may be formulated in aqueous solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's solution, or physiological saline buffer.
  • penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally known in the art.
  • the compounds can be formulated readily by combining the active compounds with pharmaceutically acceptable carriers well known in the art.
  • Such carriers enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be treated.
  • Pharmaceutical preparations for oral use can be obtained from a solid excipient, optionally grinding a resulting mixture, and processing the mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores.
  • Suitable excipients are, in particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP).
  • disintegrating agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such as sodium alginate.
  • Dragee cores are provided with suitable coatings.
  • concentrated sugar solutions may be used, which may optionally contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures.
  • Dyestuffs or pigments may be added to the tablets or dragee coatings for identification or to characterize different combinations of active compound doses.
  • compositions which can be used orally include push-fit capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or sorbitol.
  • the push-fit capsules can contain the active ingredients in admixture with filler such as lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, optionally, stabilizers.
  • the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols.
  • stabilizers may be added. All formulations for oral administration should be in dosages suitable for such administration.
  • compositions may take the form of tablets or lozenges formulated in conventional manner.
  • the compounds for use according to the present invention are conveniently delivered in the form of an aerosol spray presentation from pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas.
  • a suitable propellant e.g., dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas.
  • the dosage unit may be determined by providing a valve to deliver a metered amount.
  • Capsules and cartridges of, e.g., gelatin for use in an inhaler or insufflator may be formulated containing a powder mix of the compound and a suitable powder base such as lactose or starch.
  • the compounds may be formulated for parenteral administration by injection, e.g., by bolus injection or continuous infusion.
  • Formulations for injection may be presented in unit dosage form, e.g., in ampules or in multi-dose containers, with an added preservative.
  • the compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing and/or dispersing agents.
  • compositions for parenteral administration include aqueous solutions of the active compounds in water-soluble form.
  • suspensions of the active compounds may be prepared as appropriate oily injection suspensions.
  • Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes.
  • Aqueous injection suspensions may contain substances which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran.
  • the suspension may also contain suitable stabilizers or agents which increase the solubility of the compounds to allow for the preparation of highly concentrated solutions.
  • the active ingredient may be in powder form for constitution with a suitable vehicle, e.g., sterile pyrogen-free water, before use.
  • the compounds may also be formulated in rectal compositions such as suppositories or retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other glycerides.
  • the compounds may also be formulated as a depot preparation.
  • Such long acting formulations may be administered by implantation (for example subcutaneously or intramuscularly) or by intramuscular injection.
  • the compounds may be formulated with suitable polymeric or hydrophobic materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt.
  • a pharmaceutical carrier for the hydrophobic compounds of the invention is a co-solvent system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, and an aqueous phase.
  • the co-solvent system may be the VPD co-solvent system.
  • VPD is a solution of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, and 65% w/v polyethylene glycol 300, made up to volume in absolute ethanol.
  • the VPD co-solvent system (VPD:5W) consists of VPD diluted 1 :1 with a 5% dextrose in water solution.
  • This co-solvent system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic administration.
  • the proportions of a co-solvent system may be varied considerably without destroying its solubility and toxicity characteristics.
  • identity of the co-solvent components may be varied: for example, other low-toxicity nonpolar surfactants may be used instead of polysorbate 80; the fraction size of polyethylene glycol may be varied; other biocompatible polymers may replace polyethylene glycol, e.g. polyvinyl pyrrolidone; and other sugars or polysaccharides may substitute for dextrose.
  • other delivery systems for hydrophobic pharmaceutical compounds may be employed. Liposomes and emulsions are well known examples of delivery vehicles or carriers for hydrophobic drugs.
  • Certain organic solvents such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity.
  • the compounds may be delivered using a sustained-release system, such as semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent.
  • sustained-release materials have been established and are well known by those skilled in the art.
  • Sustained-release capsules may, depending on their chemical nature, release the compounds for a few weeks up to over 100 days.
  • additional strategies for protein or other active ingredient stabilization may be employed.
  • the pharmaceutical compositions also may comprise suitable solid or gel phase carriers or excipients.
  • suitable solid or gel phase carriers or excipients include but are not limited to calcium carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and polymers such as polyethylene glycols.
  • Many of the active ingredients of the invention may be provided as salts with pharmaceutically compatible counter ions.
  • Such pharmaceutically acceptable base addition salts are those salts which retain the biological effectiveness and properties of the free acids and which are obtained by reaction with inorganic or organic bases such as sodium hydroxide, magnesium hydroxide, ammonia, trialkylamine, dialkylamine, monoalkylamine, dibasic amino acids, sodium acetate, potassium benzoate, triethanol amine and the like.
  • the pharmaceutical composition of the invention may be in the form of a complex of the protein(s) or other active ingredient(s) of present invention along with protein or peptide antigens.
  • the protein and/or peptide antigen will deliver a stimulatory signal to both B and T lymphocytes.
  • B-lymphocytes will respond to antigen through their surface immunoglobulin receptor.
  • T lymphocytes will respond to antigen through the T cell receptor (TCR) following presentation of the antigen by MHC proteins.
  • TCR T cell receptor
  • the antigen components could also be supplied as purified MHC-peptide complexes alone or with co-stimulatory molecules that can directly signal T cells.
  • antibodies able to bind surface immunoglobulin and other molecules on B cells as well as antibodies able to bind the TCR and other molecules on T cells can be combined with the pharmaceutical composition of the invention.
  • the pharmaceutical composition of the invention may be in the form of a liposome in which protein of the present invention is combined, in addition to other pharmaceutically acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution.
  • Suitable lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides, sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like. Preparation of such liposomal formulations is within the level of skill in the art, as disclosed, for example, in U.S. Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of which are incorporated herein by reference.
  • the amount of protein or other active ingredient of the present invention in the pharmaceutical composition of the present invention will depend upon the nature and severity of the condition being treated, and on the nature of prior treatments which the patient has undergone.
  • the attending physician will decide the amount of protein or other active ingredient of the present invention with which to treat each individual patient. Initially, the attending physician will administer low doses of protein or other active ingredient of the present invention and observe the patient's response. Larger doses of protein or other active ingredient of the present invention may be administered until the optimal therapeutic effect is obtained for the patient, and at that point the dosage is not increased further. It is contemplated that the various pharmaceutical compositions used to practice the method of the present invention should contain about 0.01 ⁇ g to about 100 mg (preferably about 0.1 ⁇ g to about 10 mg, more preferably about 0.1 ⁇ g to about 1 mg) of protein or other active ingredient of the present invention per kg body weight.
  • the therapeutic method includes administering the composition topically, systematically, or locally as an implant or device.
  • the therapeutic composition for use in this invention is, of course, in a pyrogen-free, physiologically acceptable form.
  • the composition may desirably be encapsulated or injected in a viscous form for delivery to the site of bone, cartilage or tissue damage.
  • Topical administration may be suitable for wound healing and tissue repair.
  • Therapeutically useful agents other than a protein or other active ingredient of the invention which may also optionally be included in the composition as described above, may alternatively or additionally, be administered simultaneously or sequentially with the composition in the methods of the invention.
  • the composition would include a matrix capable of delivering the protein-containing or other active ingredient-containing composition to the site of bone and/or cartilage damage, providing a structure for the developing bone and cartilage and optimally capable of being resorbed into the body.
  • matrices may be formed of materials presently in use for other implanted medical applications.
  • compositions may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate, hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides.
  • potential materials are biodegradable and biologically well-defined, such as bone or dermal collagen.
  • Further matrices are comprised of pure proteins or extracellular matrix components.
  • Other potential matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, aluminates, or other ceramics.
  • Matrices may be comprised of combinations of any of the above- mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and tricalcium phosphate.
  • the bioceramics may be altered in composition, such as in calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and biodegradability.
  • a 50:50 (mole weight) copolymer of lactic acid and glycolic acid in the form of porous particles having diameters ranging from 150 to 800 microns.
  • a sequestering agent such as carboxymethyl cellulose or autologous blood clot, to prevent the protein compositions from disassociating from the matrix.
  • a preferred family of sequestering agents is cellulosic materials such as alkylcelluloses
  • CMC carboxymethylcellulose
  • Other preferred sequestering agents include hyaluronic acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and poly(vinyl alcohol).
  • the amount of sequestering agent useful herein is 0.5-20 wt %, preferably 1-10 wt % based on total formulation weight, which represents the amount necessary to prevent desorption of the protein from the polymer matrix and to provide appropriate handling of the composition, yet not so much that the progenitor cells are prevented from infiltrating the matrix, thereby providing the protein the opportunity to assist the osteogenic activity of the progenitor cells.
  • proteins or other active ingredients of the invention may be combined with other agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in question.
  • EGF epidermal growth factor
  • PDGF platelet derived growth factor
  • TGF- and TGF- ⁇ transforming growth factors
  • IGF insulin-like growth factor
  • a protein-containing pharmaceutical composition to be used in tissue regeneration will be determined by the attending physician considering various factors which modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site of damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (e.g., bone), the patient's age, sex, and diet, the severity of any infection, time of administration and other clinical factors.
  • the dosage may vary with the type of matrix used in the reconstitution and with inclusion of other proteins in the pharmaceutical composition.
  • polynucleotides of the present invention can also be used for gene therapy. Such polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a mammalian subject. Polynucleotides of the invention may also be administered by other known methods for introduction of nucleic acid into a cell or organism (including, without limitation, in the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in the presence of proteins of the present invention in order to proliferate or to produce a desired effect on or activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes.
  • IGF I insulin like growth factor I
  • compositions suitable for use in the present invention include compositions wherein the active ingredients are contained in an effective amount to achieve its intended purpose. More specifically, a therapeutically effective amount means an amount effective to prevent development of or to alleviate the existing symptoms of the subject being treated. Determination of the effective amount is well within the capability of those skilled in the art, especially in light of the detailed disclosure provided herein.
  • the therapeutically effective dose can be estimated initially from appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a circulating concentration range that can be used to more accurately determine useful doses in humans.
  • a dose can be formulated in animal models to achieve a circulating concentration range that includes the IC JO as determined in cell culture (i.e., the concentration of the test compound which achieves a half-maximal inhibition of the protein's biological activity).
  • Such information can be used to more accurately determine useful doses in humans.
  • a therapeutically effective dose refers to that amount of the compound that results in amelioration of symptoms or a prolongation of survival in a patient.
  • Toxicity and therapeutic efficacy of such compounds can be determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the LD 0 (the dose lethal to 50% of the population) and the ED 50 (the dose therapeutically effective in 50% of the population).
  • the dose ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the ratio between LD 50 and ED 50 .
  • Compounds which exhibit high therapeutic indices are preferred. The data obtained from these cell culture assays and animal studies can be used in formulating a range of dosage for use in human.
  • the dosage of such compounds lies preferably within a range of circulating concentrations that include the ED 50 with little or no toxicity.
  • the dosage may vary within this range depending upon the dosage form employed and the route of administration utilized.
  • the exact formulation, route of administration and dosage can be chosen by the individual physician in view of the patient's condition. See, e.g., Fingl et al., 1975, in "The
  • Dosage amount and interval may be adjusted individually to provide plasma levels of the active moiety which are sufficient to maintain the desired effects, or minimal effective concentration (MEC).
  • MEC minimal effective concentration
  • the MEC will vary for each compound but can be estimated from in vitro data. Dosages necessary to achieve the MEC will depend on individual characteristics and route of administration. However, HPLC assays or bioassays can be used to determine plasma concentrations.
  • Dosage intervals can also be determined using MEC value.
  • Compounds should be administered using a regimen that maintains plasma levels above the MEC for 10-90% of the time, preferably between 30-90% and most preferably between 50-90%.
  • the effective local concentration of the drug may not be related to plasma concentration.
  • An exemplary dosage regimen for polypeptides or other compositions of the invention will be in the range of about 0.01 ⁇ g/kg to 100 mg/kg of body weight daily, with the preferred dose being about 0.1 ⁇ g/kg to 25 mg/kg of patient body weight daily, varying in adults and children. Dosing may be once daily, or equivalent doses may be delivered at longer or shorter intervals.
  • composition administered will, of course, be dependent on the subject being treated, on the subject's age and weight, the severity of the affliction, the manner of administration and the judgment of the prescribing physician.
  • compositions may, if desired, be presented in a pack or dispenser device which may contain one or more unit dosage forms containing the active ingredient.
  • the pack may, for example, comprise metal or plastic foil, such as a blister pack.
  • the pack or dispenser device may be accompanied by instructions for administration.
  • Compositions comprising a compound of the invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an appropriate container, and labeled for treatment of an indicated condition.
  • antibody refers to immunoglobulin molecules and immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that contain an antigen-binding site that specifically binds (immunoreacts with) an antigen.
  • immunoglobulin immunoglobulin
  • Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, F ab , F at ,' and F (a ')2 fragments, and an F ab expression library.
  • an antibody molecule obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ from one another by the nature of the heavy chain present in the molecule. Certain classes have subclasses as well, such as IgGi, IgG 2 , and others. Furthermore, in humans, the light chain may be a kappa chain or a lambda chain. Reference herein to antibodies includes a reference to all such classes, subclasses and types of human antibody species.
  • An isolated related protein of the invention may be intended to serve as an antigen, or a portion or fragment thereof, and additionally can be used as an immunogen to generate antibodies that immunospecifically bind the antigen, using standard techniques for polyclonal and monoclonal antibody preparation.
  • the full-length protein can be used or, alternatively, the invention provides antigenic peptide fragments of the antigen for use as immunogens.
  • An antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid sequence of the full length protein, (for example the amino acid sequence shown in SEQ ID NO: 30369), and encompasses an epitope thereof such that an antibody raised against the peptide forms a specific immune complex with the full length protein or with any fragment that contains the epitope.
  • the antigenic peptide comprises at least 10 amino acid residues, or at least 15 amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues.
  • Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are located on its surface; commonly these are hydrophilic regions.
  • At least one epitope encompassed by the antigenic peptide is a region on the surface of the protein of the invention that is located on the surface of the protein, e.g., a hydrophilic region.
  • a hydrophobicity analysis of the human related protein sequence will indicate which regions of a related protein are particularly hydrophilic and, therefore, are likely to encode surface residues useful for targeting antibody production.
  • hydropathy plots showing regions of hydrophilicity and hydrophobicity may be generated by any method well known in the art, including, for example, the Kyte Doolittle or the Hopp Woods methods, either with or without Fourier transformation. See, e.g., Hopp and Woods, 1981, Proc. Nat.
  • Antibodies that are specific for one or more domains within an antigenic protein, or derivatives, fragments, analogs or homologs thereof, are also provided herein.
  • a protein of the invention may be utilized as an immunogen in the generation of antibodies that immunospecifically bind these protein components.
  • an appropriate immunogenic preparation can contain, for example, the naturally occurring immunogenic protein, a chemically synthesized polypeptide representing the immunogenic protein, or a recombinantly expressed immunogenic protein.
  • the protein may be conjugated to a second protein known to be immunogenic in the mammal being immunized.
  • immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor.
  • the preparation can further include an adjuvant.
  • adjuvants used to increase the immunological response include, but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, etc.), adjuvants usable in humans such as Bacille Calmette-Guerin and Corynebacterium parvum, or similar immunostimulatory agents.
  • Additional examples of adjuvants which can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate).
  • the polyclonal antibody molecules directed against the immunogenic protein can be isolated from the mammal (e.g., from the blood) and further purified by well known techniques, such as affinity chromatography using protein A or protein G, which provide primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be immobilized on a column to purify the immune specific antibody by immunoaffinity chromatography. Purification of immunoglobulins is discussed, for example, by D. Wilkinson (The Engineer, published by The Engineer, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 2000), pp. 25-28).
  • MAb monoclonal antibody
  • CDRs complementarity determining regions
  • MAbs thus contain an antigen- binding site capable of immunoreacting with a particular epitope of the antigen characterized by a unique binding affinity for it.
  • Monoclonal antibodies can be prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature. 256:495 (1975).
  • a mouse, hamster, or other appropriate host animal is typically immunized with an immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind to the immunizing agent.
  • the lymphocytes can be immunized in vitro.
  • the immunizing agent will typically include the protein antigen, a fragment thereof or a fusion protein thereof.
  • peripheral blood lymphocytes are used if cells of human origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources are desired.
  • the lymphocytes are then fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice. Academic Press, (1986) pp. 59-103).
  • Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are employed.
  • the hybridoma cells can be cultured in a suitable culture medium that preferably contains one or more substances that inhibit the growth or survival of the unfused, immortalized cells.
  • the culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium”), which substances prevent the growth of HGPRT-deficient cells.
  • HGPRT hypoxanthine guanine phosphoribosyl transferase
  • Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, California and the American Type Culture Collection, Manassas, Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the production of human monoclonal antibodies (Kozbor, J. Immunol., 133:3001 (1984); Brodeur et al., Monoclonal Antibody Production Techniques and Applications, Marcel Dekker, Inc., New York, (1987) pp. 51-63).
  • the culture medium in which the hybridoma cells are cultured can then be assayed for the presence of monoclonal antibodies directed against the antigen.
  • the binding specificity of monoclonal antibodies produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the art.
  • the binding affinity of the monoclonal antibody can, for example, be determined by the Scatchard analysis of Munson and Pollard, Anal. Biochem., 107:220 (1980).
  • antibodies having a high degree of specificity and a high binding affinity for the target antigen are isolated.
  • the clones can be subcloned by limiting dilution procedures and grown by standard methods. Suitable culture media for this purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal.
  • the monoclonal antibodies secreted by the subclones can be isolated or purified from the culture medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography.
  • the monoclonal antibodies can also be made by recombinant DNA methods, such as those described in U.S. Patent No. 4,816,567.
  • DNA encoding the monoclonal antibodies of the invention can be readily isolated and sequenced using conventional procedures (e.g., by using oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and light chains of murine antibodies).
  • the hybridoma cells of the invention serve as a preferred source of such DNA.
  • the DNA can be placed into expression vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells.
  • host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells.
  • the DNA also can be modified, for example, by substituting the coding sequence for human heavy and light chain constant domains in place of the homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368, 812-13 (1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the coding sequence for a non-immunoglobulin polypeptide.
  • non-immunoglobulin polypeptide can be substituted for the constant domains of an antibody of the invention, or can be substituted for the variable domains of one antigen-combining site of an antibody of the invention to create a chimeric bivalent antibody.
  • the antibodies directed against the protein antigens of the invention can further comprise humanized antibodies or human antibodies. These antibodies are suitable for administration to humans without engendering an immune response by the human against the administered immunoglobulin.
  • Humanized forms of antibodies are chimeric immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab') 2 or other antigen- binding subsequences of antibodies) that are principally comprised of the sequence of a human immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin.
  • Humanization can be performed following the method of Winter and co-workers (Jones et al., Nature, 321 :522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al., Science. 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies can also comprise residues which are found neither in the recipient antibody nor in the imported CDR or framework sequences.
  • the humanized antibody will comprise substantially all of at least one, and typically two, variable domains, in which all or substantially all of the CDR regions correspond to those of a non-human immunoglobulin and all or substantially all of the framework regions are those of a human immunoglobulin consensus sequence.
  • the humanized antibody optimally also will comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. Biol., 2:593-596 (1992)).
  • Fc immunoglobulin constant region
  • Fully human antibodies relate to antibody molecules in which essentially the entire sequences of both the light chain and the heavy chain, including the CDRs, arise from human genes. Such antibodies are termed "human antibodies", or “fully human antibodies” herein.
  • Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96).
  • Human monoclonal antibodies may be utilized in the practice of the present invention and may be produced by using human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or by transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1985 In: MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96).
  • human antibodies can also be produced using additional techniques, including phage display libraries (Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991); Marks et al., J. Mol. Biol., 222:581 (1991)).
  • human antibodies can be made by introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous immunoglobulin genes have been partially or completely inactivated. Upon challenge, human antibody production is observed, which closely resembles that seen in humans in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach is described, for example, in U.S. Patent Nos.
  • Human antibodies may additionally be produced using transgenic nonhuman animals which are modified so as to produce fully human antibodies rather than the animal's endogenous antibodies in response to challenge by an antigen.
  • transgenic nonhuman animals which are modified so as to produce fully human antibodies rather than the animal's endogenous antibodies in response to challenge by an antigen.
  • the endogenous genes encoding the heavy and light immunoglobulin chains in the nonhuman host have been incapacitated, and active loci encoding human heavy and light chain immunoglobulins are inserted into the host's genome.
  • the human genes are incorporated, for example, using yeast artificial chromosomes containing the requisite human DNA segments. An animal which provides all the desired modifications is then obtained as progeny by crossbreeding intermediate transgenic animals containing fewer than the full complement of the modifications.
  • nonhuman animal is a mouse, and is termed the Xenomouse as disclosed in PCT publications WO 96/33735 and WO 96/34096.
  • This animal produces B cells which secrete fully human immunoglobulins.
  • the antibodies can be obtained directly from the animal after immunization with an immunogen of interest, as, for example, a preparation of a polyclonal antibody, or alternatively from immortalized B cells derived from the animal, such as hybridomas producing monoclonal antibodies.
  • the genes encoding the immunoglobulins with human variable regions can be recovered and expressed to obtain the antibodies directly, or can be further modified to obtain analogs of antibodies such as, for example, single chain Fv molecules.
  • U.S. Patent No. 5,939,598 An example of a method of producing a nonhuman host, exemplified as a mouse, lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent No. 5,939,598. It can be obtained by a method including deleting the J segment genes from at least one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of the locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy chain locus, the deletion being effected by a targeting vector containing a gene encoding a selectable marker; and producing from the embryonic stem cell a transgenic mouse whose somatic and germ cells contain the gene encoding the selectable marker.
  • a method for producing an antibody of interest, such as a human antibody is disclosed in
  • U.S. Patent No. 5,916,771 It includes introducing an expression vector that contains a nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, introducing an expression vector containing a nucleotide sequence encoding a light chain into another mammalian host cell, and fusing the two cells to form a hybrid cell.
  • the hybrid cell expresses an antibody containing the heavy chain and the light chain.
  • techniques can be adapted for the production of single-chain antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent No. 4,946,778).
  • methods can be adapted for the construction of F ab expression libraries (see e.g., Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective identification of monoclonal F ab fragments with the desired specificity for a protein or derivatives, fragments, analogs or homologs thereof.
  • Antibody fragments that contain the idiotypes to a protein antigen may be produced by techniques known in the art including, but not limited to: (i) an F (ab ') 2 fragment produced by pepsin digestion of an antibody molecule; (ii) an F ab fragment generated by reducing the disulfide bridges of an F( a b')2 fragment; (iii) an F a b fragment generated by the treatment of the antibody molecule with papain and a reducing agent and (iv) F v fragments.
  • Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding specificities for at least two different antigens. In the present case, one of the binding specificities is for an antigenic protein of the invention.
  • the second binding target is any other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit.
  • Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant production of bispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different specificities (Milstein and Cuello, Nature, 305:537-539 (1983)).
  • Antibody variable domains with the desired binding specificities can be fused to immunoglobulin constant domain sequences.
  • the fusion preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region (CHI) containing the site necessary for light-chain binding present in at least one of the fusions.
  • CHI first heavy-chain constant region
  • the interface between a pair of antibody molecules can be engineered to maximize the percentage of heterodimers which are recovered from recombinant cell culture.
  • the preferred interface comprises at least a part of the CH3 region of an antibody constant domain.
  • one or more small amino acid side chains from the interface of the first antibody molecule are replaced with larger side chains (e.g. tyrosine or tryptophan).
  • Compensatory "cavities" of identical or similar size to the large side chain(s) are created on the interface of the second antibody molecule by replacing large amino acid side chains with smaller ones (e.g. alanine or threonine).
  • This provides a mechanism for increasing the yield of the heterodimer over other unwanted end-products such as homodimers.
  • Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g.
  • bispecific antibodies F(ab') 2 bispecific antibodies.
  • Techniques for generating bispecific antibodies from antibody fragments have been described in the literature. For example, bispecific antibodies can be prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a procedure wherein intact antibodies are proteolytically cleaved to generate F(ab') 2 fragments. These fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments generated are then converted to thionitrobenzoate (TNB) derivatives.
  • TAB thionitrobenzoate
  • One of the Fab'-TNB derivatives is then reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific antibody.
  • the bispecific antibodies produced can be used as agents for the selective immobilization of enzymes.
  • Fab' fragments can be directly recovered from E. coli and chemically coupled to form bispecific antibodies.
  • Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe the production of a fully humanized bispecific antibody F(ab') 2 molecule.
  • Each Fab' fragment was separately secreted from E. coli and subjected to directed chemical coupling in vitro to form the bispecific antibody.
  • the bispecific antibody thus formed was able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor targets.
  • Various techniques for making and isolating bispecific antibody fragments directly from recombinant cell culture have also been described.
  • bispecific antibodies have been produced using leucine zippers.
  • the leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two different antibodies by gene fusion.
  • the antibody homodimers were reduced at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers.
  • This method can also be utilized for the production of antibody homodimers.
  • the "diabody” technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an alternative mechanism for making bispecific antibody fragments.
  • the fragments comprise a heavy-chain variable domain (Vn) connected to a light-chain variable domain (V L ) by a linker which is too short to allow pairing between the two domains on the same chain. Accordingly, the V H and V domains of one fragment are forced to pair with the complementary V L and V H domains of another fragment, thereby forming two antigen-binding sites.
  • Vn heavy-chain variable domain
  • V L light-chain variable domain
  • Another strategy for making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been reported. See, Gruber et al., J. Immunol. 152:5368 (1994). Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be prepared. Tutt et al., J. Immunol.
  • bispecific antibodies can bind to two different epitopes, at least one of which originates in the protein antigen of the invention.
  • an anti-antigenic arm of an immunoglobulin molecule can be combined with an arm which binds to a triggering molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc receptors for IgG (Fc ⁇ R), such as Fc ⁇ RI (CD64), Fc ⁇ RII (CD32) and Fc ⁇ RIII (CD 16) so as to focus cellular defense mechanisms to the cell expressing the particular antigen.
  • Bispecific antibodies can also be used to direct cytotoxic agents to cells which express a particular antigen.
  • antibodies possess an antigen-binding arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA.
  • a cytotoxic agent or a radionuclide chelator such as EOTUBE, DPTA, DOTA, or TETA.
  • Another bispecific antibody of interest binds the protein antigen described herein and further binds tissue factor (TF).
  • Heteroconjugate antibodies are also within the scope of the present invention.
  • Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been proposed to target immune system cells to unwanted cells (U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro using known methods in synthetic protein chemistry, including those involving crosslinking agents. For example, immunotoxins can be constructed using a disulfide exchange reaction or by forming a thioether bond. Examples of suitable reagents for this purpose include iminothiolate and methyl-4- mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980.
  • cysteine residue(s) can be introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region.
  • the homodimeric antibody thus generated can have improved internalization capability and/or increased complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et al., J. Exp Med., 176: 1191- 1 195 (1992) and Shopes, J. Immunol., 148: 2918-2922 (1992).
  • Homodimeric antibodies with enhanced anti-tumor activity can also be prepared using heterobifunctional cross-linkers as described in Wolff et al. Cancer Research, 53: 2560-2565 (1993).
  • an antibody can be engineered that has dual Fc regions and can thereby have enhanced complement lysis and
  • ADCC capabilities See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989).
  • the invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., a radioconjugate).
  • a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., a radioconjugate).
  • Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes.
  • a variety of radionuclides are available for the production of radioconjugated antibodies. Examples include 212 Bi, 13 l I, 131 In, 90 Y, and l86 Re.
  • Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), and bis-active fluorine compounds (such as 1 ,5-difluoro-2,4-dinitrobenzene).
  • SPDP N-succinimidyl
  • a ricin immunotoxin can be prepared as described in Vitetta et al., Science, 238: 1098 (1987).
  • Carbon- 14-labeled l-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX- DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See WO94/11026.
  • the antibody in another embodiment, can be conjugated to a "receptor" (such streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is administered to the patient, followed by removal of unbound conjugate from the circulation using a clearing agent and then administration of a "ligand” (e.g., avidin) that is in turn conjugated to a cytotoxic agent.
  • a "receptor” such streptavidin
  • a "ligand” e.g., avidin
  • a nucleotide sequence of the present invention can be recorded on computer readable media.
  • computer readable media refers to any medium which can be read and accessed directly by a computer. Such media include, but are not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these categories such as magnetic/optical storage media.
  • magnetic storage media such as floppy discs, hard disc storage medium, and magnetic tape
  • optical storage media such as CD-ROM
  • electrical storage media such as RAM and ROM
  • hybrids of these categories such as magnetic/optical storage media.
  • recorded refers to a process for storing information on computer readable medium.
  • a skilled artisan can readily adopt any of the presently known methods for recording information on computer readable medium to generate manufactures comprising the nucleotide sequence information of the present invention.
  • a variety of data storage structures are available to a skilled artisan for creating a computer readable medium having recorded thereon a nucleotide sequence of the present invention.
  • the choice of the data storage structure will generally be based on the means chosen to access the stored information.
  • a variety of data processor programs and formats can be used to store the nucleotide sequence information of the present invention on computer readable medium.
  • the sequence information can be represented in a word processing text file, formatted in commercially-available software such as WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, Oracle, or the like.
  • a skilled artisan can readily adapt any number of data processor structuring formats (e.g. text file or database) in order to obtain computer readable medium having recorded thereon the nucleotide sequence information of the present invention.
  • nucleotide sequences SEQ ID NO: 1-30368 or a representative fragment thereof; or a nucleotide sequence at least 95% identical to any of the nucleotide sequences of SEQ ID NO: 1-30368 in computer readable form a skilled artisan can routinely access the sequence information for a variety of purposes.
  • Computer software is publicly available which allows a skilled artisan to access sequence information provided in a computer readable medium.
  • the examples which follow demonstrate how software which implements the BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al., Comp.
  • a computer-based system refers to the hardware means, software means, and data storage means used to analyze the nucleotide sequence information of the present invention.
  • the minimum hardware means of the computer-based systems of the present invention comprises a central processing unit (CPU), input means, output means, and data storage means.
  • the computer-based systems of the present invention comprise a data storage means having stored therein a nucleotide sequence of the present invention and the necessary hardware means and software means for supporting and implementing a search means.
  • data storage means refers to memory which can store nucleotide sequence information of the present invention, or a memory access means which can access manufactures having recorded thereon the nucleotide sequence information of the present invention.
  • search means refers to one or more programs which are implemented on the computer-based system to compare a target sequence or target structural motif with the sequence information stored within the data storage means. Search means are used to identify fragments or regions of a known sequence which match a particular target sequence or target motif.
  • a variety of known algorithms are disclosed publicly and a variety of commercially available software for conducting search means are and can be used in the computer-based systems of the present invention. Examples of such software includes, but is not limited to, Smith- Waterman, MacPattern (EMBL), BLASTN and BLASTA (NPOLYPEPTIDEIA).
  • EMBL Smith- Waterman
  • BLASTN BLASTN
  • BLASTA NPOLYPEPTIDEIA
  • a "target sequence” can be any nucleic acid or amino acid sequence of six or more nucleotides or two or more amino acids.
  • a skilled artisan can readily recognize that the longer a target sequence is, the less likely a target sequence will be present as a random occurrence in the database.
  • the most preferred sequence length of a target sequence is from about 10 to 300 amino acids, more preferably from about 30 to 100 nucleotide residues.
  • searches for commercially important fragments, such as sequence fragments involved in gene expression and protein processing may be of shorter length.
  • a target structural motif refers to any rationally selected sequence or combination of sequences in which the sequence(s) are chosen based on a three-dimensional configuration which is formed upon the folding of the target motif.
  • target motifs include, but are not limited to, enzyme active sites and signal sequences.
  • Nucleic acid target motifs include, but are not limited to, promoter sequences, hairpin structures and inducible expression elements (protein binding sequences).
  • fragments of the present invention can be used to control gene expression through triple helix formation or antisense DNA or RNA, both of which methods are based on the binding of a polynucleotide sequence to DNA or RNA.
  • Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and are designed to be complementary to a region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res.
  • the present invention further provides methods to identify the presence or expression of one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic acid probe or antibodies of the present invention, optionally conjugated or otherwise associated with a suitable label.
  • methods for detecting a polynucleotide of the invention can comprise contacting a sample with a compound that binds to and forms a complex with the polynucleotide for a period sufficient to form the complex, and detecting the complex, so that if a complex is detected, a polynucleotide of the invention is detected in the sample.
  • Such methods can also comprise contacting a sample under stringent hybridization conditions with nucleic acid primers that anneal to a polynucleotide of the invention under such conditions, and amplifying annealed polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is detected in the sample.
  • methods for detecting a polypeptide of the invention can comprise contacting a sample with a compound that binds to and forms a complex with the polypeptide for a period sufficient to form the complex, and detecting the complex, so that if a complex is detected, a polypeptide of the invention is detected in the sample.
  • methods comprise incubating a test sample with one or more of the antibodies or one or more of the nucleic acid probes of the present invention and assaying for binding of the nucleic acid probes or antibodies to components within the test sample.
  • Conditions for incubating a nucleic acid probe or antibody with a test sample vary. Incubation conditions depend on the format employed in the assay, the detection methods employed, and the type and nature of the nucleic acid probe or antibody used in the assay.
  • One skilled in the art will recognize that any one of the commonly available hybridization, amplification or immunological assay formats can readily be adapted to employ the nucleic acid probes or antibodies of the present invention. Examples of such assays can be found in Chard, T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in Immunocytochemistry, Academic Press, Orlando, FL Vol. 1 (1982), Vol.
  • test samples of the present invention include cells, protein or membrane extracts of cells, or biological fluids such as sputum, blood, serum, plasma, or urine.
  • the test sample used in the above-described method will vary based on the assay format, nature of the detection method and the tissues, cells or extracts used as the sample to be assayed. Methods for preparing protein extracts or membrane extracts of cells are well known in the art and can be readily be adapted in order to obtain a sample which is compatible with the system utilized.
  • kits which contain the necessary reagents to carry out the assays of the present invention.
  • the invention provides a compartment kit to receive, in close confinement, one or more containers which comprises: (a) a first container comprising one of the probes or antibodies of the present invention; and (b) one or more other containers comprising one or more of the following: wash reagents, reagents capable of detecting presence of a bound probe or antibody.
  • a compartment kit includes any kit in which reagents are contained in separate containers.
  • Such containers include small glass containers, plastic containers or strips of plastic or paper.
  • Such containers allows one to efficiently transfer reagents from one compartment to another compartment such that the samples and reagents are not cross-contaminated, and the agents or solutions of each container can be added in a quantitative fashion from one compartment to another.
  • Such containers will include a container which will accept the test sample, a container which contains the antibodies used in the assay, containers which contain wash reagents (such as phosphate buffered saline, Tris-buffers, etc.), and containers which contain the reagents used to detect the bound antibody or probe.
  • Types of detection reagents include labeled nucleic acid probes, labeled secondary antibodies, or in the alternative, if the primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of reacting with the labeled antibody.
  • labeled nucleic acid probes labeled secondary antibodies, or in the alternative, if the primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of reacting with the labeled antibody.
  • the disclosed probes and antibodies of the present invention can be readily incorporated into one of the established kit formats which are well known in the art.
  • novel polypeptides and binding partners of the invention are useful in medical imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the invention is involved in the immune response, for imaging sites of inflammation or infection). See, e.g., Kunkel et al., U.S. Pat. NO. 5,413,778.
  • Such methods involve chemical attachment of a labeling or imaging agent, administration of the labeled polypeptide to a subject in a pharmaceutically acceptable carrier, and imaging the labeled polypeptide in vivo at the target site.
  • the present invention further provides methods of obtaining and identifying agents which bind to a polypeptide encoded by an ORF corresponding to any of the nucleotide sequences set forth in SEQ ID NO: 1-30368, or bind to a specific domain of the polypeptide encoded by the nucleic acid.
  • said method comprises the steps of:
  • such methods for identifying compounds that bind to a polynucleotide of the invention can comprise contacting a compound with a polynucleotide of the invention for a time sufficient to form a polynucleotide/compound complex, and detecting the complex, so that if a polynucleotide/compound complex is detected, a compound that binds to a polynucleotide of the invention is identified.
  • such methods for identifying compounds that bind to a polypeptide of the invention can comprise contacting a compound with a polypeptide of the invention for a time sufficient to form a polypeptide/compound complex, and detecting the complex, so that if a polypeptide/compound complex is detected, a compound that binds to a polynucleotide of the invention is identified.
  • Methods for identifying compounds that bind to a polypeptide of the invention can also comprise contacting a compound with a polypeptide of the invention in a cell for a time sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a receptor gene sequence in the cell, and detecting the complex by detecting reporter gene sequence expression, so that if a polypeptide/compound complex is detected, a compound that binds a polypeptide of the invention is identified.
  • Compounds identified via such methods can include compounds which modulate the activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to activity observed in the absence of the compound).
  • compounds identified via such methods can include compounds which modulate the expression of a polynucleotide of the invention (that is, increase or decrease expression relative to expression levels observed in the absence of the compound).
  • Compounds, such as compounds identified via the methods of the invention can be tested using standard assays well known to those of skill in the art for their ability to modulate activity/expression.
  • the agents screened in the above assay can be, but are not limited to, peptides, carbohydrates, vitamin derivatives, or other pharmaceutical agents.
  • the agents can be selected and screened at random or rationally selected or designed using protein modeling techniques.
  • agents such as peptides, carbohydrates, pharmaceutical agents and the like are selected at random and are assayed for their ability to bind to the protein encoded by the ORF of the present invention.
  • agents may be rationally selected or designed.
  • an agent is said to be "rationally selected or designed" when the agent is chosen based on the configuration of the particular protein.
  • one skilled in the art can readily adapt currently available procedures to generate peptides, pharmaceutical agents and the like, capable of binding to a specific peptide sequence, in order to generate rationally designed antipeptide peptides, for example see Hurby et al., Application of Synthetic Peptides: Antisense Peptides," In Synthetic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and Kaspczak et al., Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like.
  • one class of agents of the present invention can be used to control gene expression through binding to one of the ORFs or EMFs of the present invention. As described above, such agents can be randomly screened or rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design sequence specific or element specific agents, modulating the expression of either a single ORF or multiple ORFs which rely on the same EMF for expression control.
  • One class of DNA binding agents are agents which contain base residues which hybridize or form a triple helix formation by binding to DNA or RNA. Such agents can be based on the classic phosphodiester, ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric derivatives which have base attachment capacity.
  • Agents suitable for use in these methods preferably contain 20 to 40 bases and are designed to be complementary to a region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res. 3:173 (1979); Cooney et al., Science 241 :456 (1988); and Dervan et al., Science 251 :1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560
  • Agents that bind to a protein encoded by one of the ORFs of the present invention can be used as a diagnostic agent. Agents which bind to a protein encoded by one of the ORFs of the present invention can be formulated using known techniques to generate a pharmaceutical composition.
  • Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid hybridization probes capable of hybridizing with naturally occurring nucleotide sequences.
  • the hybridization probes of the subject invention may be derived from any of the nucleotide sequences SEQ ID NO: 1-30368. Because the corresponding gene is only expressed in a limited number of tissues, a hybridization probe derived from of any of the nucleotide sequences SEQ ID NO: 1-30368 can be used as an indicator of the presence of RNA of cell type of such a tissue in a sample.
  • any suitable hybridization technique can be employed, such as, for example, in situ hybridization.
  • PCR as described in US Patents Nos. 4,683,195 and 4,965,188 provides additional uses for oligonucleotides based upon the nucleotide sequences.
  • probes used in PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both.
  • the probe will comprise a discrete nucleotide sequence for the detection of identical sequences or a degenerate pool of possible sequences for identification of closely related genomic sequences.
  • Other means for producing specific hybridization probes for nucleic acids include the cloning of nucleic acid sequences into vectors for the production of mRNA probes.
  • RNA polymerase as T7 or SP6 RNA polymerase and the appropriate radioactively labeled nucleotides.
  • the nucleotide sequences may be used to construct hybridization probes for mapping their respective genomic sequences.
  • the nucleotide sequence provided herein may be mapped to a chromosome or specific regions of a chromosome using well known genetic and/or chromosomal mapping techniques. These techniques include in situ hybridization, linkage analysis against known chromosomal markers, hybridization screening with libraries or flow-sorted chromosomal preparations specific to known chromosomes, and the like.
  • Oligonucleotides i.e., small nucleic acid segments, may be readily prepared by, for example, directly synthesizing the oligonucleotide by chemical means, as is commonly practiced using an automated oligonucleotide synthesizer.
  • Support bound oligonucleotides may be prepared by any of the methods known to those of skill in the art using any suitable support such as glass, polystyrene or Teflon.
  • One strategy is to precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can be achieved using passive adsorption (Inouye & Hondo, (1990) J. Clin. Microbiol.28(6) 1469-72); using UV light (Nagata et al, 1985; Dahlen et al., 1987; Morrissey & Collins, (1989) Mol. Cell Probes 3(2) 189-207) or by covalent binding of base modified DNA (Keller etal, 1988; 1989); all references being specifically incorporated herein.
  • CovaLink NH is a polystyrene surface grafted with secondary amino groups (>NH) that serve as bridge-heads for further covalent coupling.
  • CovaLink Modules may be purchased from Nunc Laboratories. DNA molecules may be bound to CovaLink exclusively at the 5'-end by a phosphoramidatebond, allowing immobilization of more than 1 pmol of DNA (Rasmussenet ⁇ /., (1991) Anal. Biochem. 198(1) 138-42).
  • CovaLink NH strips for covalent binding of DNA molecules at the 5'-end has been described (Rasmussen et al., (1991). In this technology, a phosphoramidatebond is employed (Chu et al., (1983) Nucleic Acids Res. 11(8) 6513-29). This is beneficial as immobilization using only a single covalent bond is preferred.
  • the phosphoramidatebond joins the DNA to the CovaLink NH secondary amino groups that are positioned at the end of spacer arms covalently grafted onto the polystyrene surface through a 2 nm long spacer arm.
  • the oligonucleotide terminus must have a 5'-end phosphate group. It is, perhaps, even possible for biotin to be covalently bound to CovaLink and then streptavidin used to bind the probes.
  • the linkage method includes dissolving DNA in water (7.5 ng/ul) and denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1 M 1 -methylimidazole, pH 7.0 ( 1 -Melm ), is then added to a final concentration of 10 mM 1 -Melm 7 . A ss DNA solution is then dispensed into CovaLink NH strips (75 ul/well) standing on ice.
  • a further suitable method for use with the present invention is that described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated herein by reference.
  • This method of preparing an oligonucleotide bound to a support involves attaching a nucleoside 3'-reagent through the phosphate group by a covalent phosphodiester link to aliphatic hydroxyl groups carried by the support.
  • the oligonucleotide is then synthesized on the supported nucleoside and protecting groups removed from the synthetic oligonucleotide chain under standard conditions that do not cleave the oligonucleotide from the support.
  • Suitable reagents include nucleoside phosphoramidite and nucleoside hydrogen phosphorate.
  • An on-chip strategy for the preparation of DNA probe for the preparation of DNA probe arrays may be employed.
  • addressable laser-activated photodeprotection may be employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described by
  • Probes may also be immobilized on nylon supports as described by Van Ness et al. ( 1991 ) Nucleic Acids Res. 19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1988) Anal. Biochem. 169( 1 ) 104-8; all references being specifically incorporated herein.
  • the nucleic acids may be obtained from any appropriate source, such as cDNAs, genomic DNA
  • DNA DNA, chromosomal DNA, mi crodissected chromosome bands, cosmid or YAC inserts, and RNA, including mRNA without any amplification steps.
  • Sambrook et al. 1989 describes three protocols for the isolation of high molecular weight DNA from mammalian cells (p.
  • DNA fragments may be prepared as clones in Ml 3, plasmid or lambda vectors and/or prepared directly from genomic DNA or cDNA by PCR or other amplification methods. Samples may be prepared or dispensed in multiwell plates. About 100- 1000 ng of DNA samples may be prepared in 2-500 ml of final volume.
  • nucleic acids would then be fragmented by any of the methods known to those of skill in the art including, for example, using restriction enzymes as described at 9.24-9.28 of Sambrook et al. (1989), shearing by ultrasound and NaOH treatment. Low pressure shearing is also appropriate, as described by Schriefer et al. (1990) Nucleic acids
  • One particularly suitable way for fragmenting DNA is contemplated to be that using the two base recognition endonuclease, Cv/JI, described by Fitzgerald et al. (1992) Nucleic Acids Res. 20(14) 3753-62. These authors described an approach for the rapid fragmentation and fractionation of DNA into particular sizes that they contemplated to be suitable for shotgun cloning and sequencing.
  • the restriction endonuclease Cv/JI normally cleaves the recognition sequence PuGCPy between the G and C to leave blunt ends.
  • Atypical reaction conditions which alter the specificity of this enzyme (Cv/JI* *), yield a quasi-random distribution of DNA fragments form the small molecule pUC 19 (2688 base pairs).
  • Fitzgerald et al. (1992) quantitatively evaluated the randomness of this fragmentation strategy, using a Cv/JI* * digest of pUC 19 that was size fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z minus M 13 cloning vector. Sequence analysis of 76 clones showed that Cv/JI* * restricts pyGCPy and PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated at a rate consistent with random fragmentation.
  • advantages of this approach compared to sonication and agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 ug instead of 2-5 ug); and fewer steps are involved (no preligation, end repair, chemical extraction, or agarose gel electrophoresis and elution are needed.
  • Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane. Spotting may be performed by using arrays of metal pins (the positions of which correspond to an array of wells in a microtiter plate) to repeated by transfer of about 20 nl of a DNA solution to a nylon membrane. By offset printing, a density of dots higher than the density of the wells is achieved. One to 25 dots may be accommodated in 1 mm , depending on the type of label used. By avoiding spotting in some preselected number of rows and columns, separate subsets (subarrays) may be formed. Samples in one subarray may be the same genomic segment of DNA (or the same gene) from different individuals, or may be different, overlapped genomic clones.
  • Each of the subarrays may represent replica spotting of the same samples.
  • a selected gene segment may be amplified from 64 patients.
  • the amplified gene segment may be in one 96-well plate (all 96 wells containing the same sample). A plate for each of the 64 patients is prepared. By using a 96-pin device, all samples may be spotted on one 8 x 12 cm membrane.
  • Subarrays may contain 64 samples, one from each patient. Where the 96 subarrays are identical, the dot span may be 1 mm and there may be a 1 mm space between subarrays.
  • membranes or plates available from NUNC, Naperville, Illinois
  • physical spacers e.g. a plastic grid molded over the membrane, the grid being similar to the sort of membrane applied to the bottom of multiwell plates, or hydrophobic strips.
  • a fixed physical spacer is not preferred for imaging by exposure to flat phosphor-storage screens or x-ray films.
  • a plurality of novel nucleic acids were obtained from cDNA libraries prepared from various human tissues and in some cases isolated from a genomic library derived from human chromosome using standard PCR, SBH sequence signature analysis and Sanger sequencing techniques.
  • the inserts of the library were amplified with PCR using primers specific for the vector sequences which flank the inserts.
  • Clones from cDNA libraries were spotted on nylon membrane filters and screened with oligonucleotide probes (e.g. , 7-mers) to obtain signature sequences. The clones were clustered into groups of similar or identical sequences. Representative clones were selected for sequencing. In some cases, the 5' sequence of the amplified inserts was then deduced using a typical
  • the novel contigs of the invention were assembled from sequences that were obtained from a cDNA library by methods described in Example 1 above, and in some cases sequences obtained from one or more public databases.
  • the sequences for the resulting nucleic acid contigs are designated as SEQ ID NO: 1 -30368 and are provided in the attached Sequence Listing.
  • the contigs were assembled using an EST sequence as a seed.
  • a recursive algorithm was used to extend the seed EST into an extended assemblage, by pulling additional sequences from different databases (i.e., Hyseq's database containing EST sequences, dbEST version 115, gb pri 115, and UniGene version 103, and exons from public domain genomic sequences predicted by GenScan) that belong to this assemblage.
  • the algorithm terminated when there was no additional sequences from the above databases that would extend the assemblage.
  • the inclusion of component sequences into the assemblage was based on a BLASTN hit to the extending assemblage with BLAST score greater than 300 and percent identity greater than 95%.
  • novel predicted polypeptides (including proteins) encoded by the novel polynucleotides (SEQ ID NO: 1 -30368) of the present invention are incorporated in the attached Sequence Listing.
  • a subset the predicted polypeptide sequences contain an unknown amino acid, a stop codon, a possible nucleotide deletion or a possible nucleotide insertion. These sequences have been shown in their entirety with the special characters in Table 2.
  • Table 2 also shows the corresponding start and stop nucleotide locations to each of SEQ ID NO: 1 -30368. Table 2 also indicates the method by which the polypeptide was predicted.
  • Method A refers to a polypeptide obtained by using a software program called FASTY (available from http://fasta.bioch.virginia,edu which selects a polypeptide based on a comparison of the translated novel polynucleotide to known polynucleotides (W.R. Pearson, Methods in Enzymology, 183:63-98 (1990), herein incorporated by reference).
  • Method B refers to a polypeptide obtained by using a software program called GenScan for human/vertebrate sequences (available from Stanford University, Office of Technology Licensing) that predicts the polypeptide based on a probabilistic model of gene structure/compositional properties (C. Burge and S. Karlin, J. Mol.
  • Method C refers to a polypeptide obtained by using a Hyseq proprietary software program that translates the novel polynucleotide and its complementary strand into six possible amino acid sequences (forward and reverse frames) and chooses the polypeptide with the longest open reading frame.
  • the nearest neighbor results for SEQ ID NO: 1 -30368 were obtained by a BLASTP version 2.0al 19MP-WashU search against Genpept release 121 and Geneseq release 200103 (Derwent), using BLAST algorithm. The nearest neighbor result showed the closest homologue for SEQ ID NO: 1-30368.
  • the nearest neighbor results for SEQ ID NO: 1-30368 are incorporated in the attached Sequence Listing. Using eMatrix software package (Stanford University, Stanford, CA) (Wu et al., J.
  • Tables 1 and 2 follow.
  • Table 1 shows the various tissue sources of SEQ ID NO: 1-30368.
  • Table 2 shows the start and stop nucleotides for the translated amino acid sequence for which each assemblage encodes.
  • Table 2 also provides a correlation between the amino acid sequences set forth in the Sequence Listing, the nucleotide sequences set forth in the Sequence Listing and the SEQ ID NO: in USSN 09/540,217
  • 2115421272- -2127421277-21283 21410-2141421434- 2143921485--2149121495-2150021647-2165521729- 2173321929--2193521958-2196121973-2197421978 22000-2200622026-2202922040--2204122087-22088 22101-2210722141-221432216022250-2225222284- 222892230922314-2231722336-2234222347-22348 22358-223592237222405-224082249522534-22539 22634-2264322653-2265422661--2266222665-22667 22671-2267422700-2270122794--2279622805-22809 22887-2289122899-2290022948-2295022952-22953 22982-2298622991-2299423059--230602307123141 232492325123329-2333723412-2341423489-23490 23492-2349323508-23543---
  • Tissue RNA Library SEQ ID NOS origin Source Name cultured Stratagen ADP001 180-181221333-335373-374395398-399461-463534- preadi- e 535576-578619-620716-7171097-10981235-1237 pocytes 1240-1241 1453-145414741705-17141912-19132015- 2018205825132599-26032724-27262814-28152923- 29242975-29773374-33763414-341734333481-3482 3518-352036243789-37943916-39253949-39534515- 45164555-45564562-45684581-45824785-47894854- 48625190-51915572-55735580-55865768-57725802 5899-59025980-59846403-640468517209-72127447- 74497534-75357604-
  • Genomic Genomic EPM003 1604-16074008-40364049-40514180-41904923-4928 clones DNA 52425326-53307067-708770937175-71777426-7442 from the from 74758807-88138822-88358839-88468850-88538855- short arm Genetic 885888628870-887388768892-88989088-90999152- of Research 91819205-92089226-92299239-925792599294-9296 chromoso 9301-93041034310347-1035010443-1044410726- me 8 1072710759-1076410775-1077710842-1086713000- 1300413307-133081334113422-1342313425-13495 13730-1373513737-137501382314981-1498315110- 1511115120-1512315718-1572515815-1581715827- 1583315939-1594315966-1596815991-1599616016

Abstract

The present invention provides novel nucleic acids, novel polypeptide sequences encoded by these nucleic acids and uses thereof.

Description

NOVEL NUCLEIC ACIDS AND POLYPEPTIDES
1. TECHNICAL FIELD
The present invention provides novel polynucleotides and proteins encoded by such polynucleotides, along with uses for these polynucleotides and proteins, for example in therapeutic, diagnostic and research methods.
2. BACKGROUND
Technology aimed at the discovery of protein factors (including e.g., cytokines, such as lymphokines, interferons, CSFs, chemokines, and interleukins) has matured rapidly over the past decade. The now routine hybridization cloning and expression cloning techniques clone novel polynucleotides "directly" in the sense that they rely on information directly related to the discovered protein (i.e., partial DNA/amino acid sequence of the protein in the case of hybridization cloning; activity of the protein in the case of expression cloning). More recent "indirect" cloning techniques such as signal sequence cloning, which isolates DNA sequences based on the presence of a now well-recognized secretory leader sequence motif, as well as various PCR-based or low stringency hybridization-based cloning techniques, have advanced the state of the art by making available large numbers of DNA/amino acid sequences for proteins that are known to have biological activity, for example, by virtue of their secreted nature in the case of leader sequence cloning, by virtue of their cell or tissue source in the case of PCR-based techniques, or by virtue of structural similarity to other genes of known biological activity.
Identified polynucleotide and polypeptide sequences have numerous applications in, for example, diagnostics, forensics, gene mapping; identification of mutations responsible for genetic disorders or other traits, to assess biodiversity, and to produce many other types of data and products dependent on DNA and amino acid sequences.
3. SUMMARY OF THE INVENTION
The compositions of the present invention include novel isolated polypeptides, novel isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules, cloned genes or degenerate variants thereof, especially naturally occurring variants such as allelic variants, antisense polynucleotide molecules, and antibodies that specifically recognize one or more epitopes present on such polypeptides, as well as hybridomas producing such antibodies.
The compositions of the present invention additionally include vectors, including expression vectors, containing the polynucleotides of the invention, cells genetically engineered to contain such polynucleotidesand cells genetically engineered to express such polynucleotides. The present invention relates to a collection or library of at least one novel nucleic acid sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by hybridization (SBH), and in some cases, sequences obtained from one or more public databases. The invention relates also to the proteins encoded by such polynucleotides, along with therapeutic, diagnostic and research utilities for these polynucleotides and proteins. These nucleic acid sequences are designated as SEQ ID NO: 1 -30368. The polypeptides sequences are designated SEQ ID NO: 30369-60736. The nucleic acids and polypeptides are provided in the Sequence Listing. In the nucleic acids provided in the Sequence Listing, A is adenosine; C is cytosine; G is guanine; T is thymine; and N is any of the four bases. In the amino acids provided in the Sequence Listing, * corresponds to the stop codon.
The nucleic acid sequences of the present invention also include, nucleic acid sequences that hybridize to the complement of SEQ ID NO: 1-30368 under stringent hybridization conditions; nucleic acid sequences which are allelic variants or species homologues of any of the nucleic acid sequences recited above, or nucleic acid sequences that encode a peptide comprising a specific domain or truncation of the peptides encoded by SEQ ID NO: 1-30368. A polynucleotide comprising a nucleotide sequence having at least 90% identity to an identifying sequence of SEQ ID NO: 1-30368 or a degenerate variant or fragment thereof. The identifying sequence can be 100 base pairs in length.
The nucleic acid sequences of the present invention also include the sequence information from the nucleic acid sequences of SEQ ID NO: 1 -30368. The sequence information can be a segment of any one of SEQ ID NO: 1 -30368 that uniquely identifies or represents the sequence information of SEQ ID NO: 1-30368.
A collection as used in this application can be a collection of only one polynucleotide. The collection of sequence information or identifying information of each sequence can be provided on a nucleic acid array. In one embodiment, segments of sequence information is provided on a nucleic acid array to detect the polynucleotide that contains the segment. The array can be designed to detect full-match or mismatch to the polynucleotide that contains the segment. The collection can also be provided in a computer-readable format.
This invention also includes the reverse or direct complement of any of the nucleic acid sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and host cells or organisms transformed with these expression vectors. Nucleic acid sequences (or their reverse or direct complements) according to the invention have numerous applications in a variety of techniques known to those skilled in the art of molecular biology, such as use as hybridization probes, use as primers for PCR, use in an array, use in computer-readable media, use in sequencing full-length genes, use for chromosome and gene mapping, use in the recombinant production of protein, and use in the generation of anti-sense DNA or RNA, their chemical analogs and the like.
In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1 -30368 or novel segments or parts of the nucleic acids of the invention are used as primers in expression assays that are well known in the art. In a particularly preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1 -30368 or novel segments or parts of the nucleic acids provided herein are used in diagnostics for identifying expressed genes or, as well known in the art and exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for physical mapping of the human genome. The isolated polynucleotides of the invention include, but are not limited to, a polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1-30368; a polynucleotide comprising any of the full length protein coding sequences of SEQ ID NO: 1 -30368; and a polynucleotide comprising any of the nucleotide sequences of the mature protein coding sequences of SEQ ID NO: 1-30368. The polynucleotides of the present invention also include, but are not limited to, a polynucleotide that hybridizes under stringent hybridization conditions to (a) the complement of any one of the nucleotide sequences set forth in SEQ ID NO: 1 -30368; (b) a nucleotide sequence encoding any one of the amino acid sequences set forth in the Sequence Listing (e.g., SEQ ID NO: 30369-60736); (c) a polynucleotide which is an allelic variant of any polynucleotides recited above; (d) a polynucleotide which encodes a species homolog (e.g. orthologs) of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide comprising a specific domain or truncation of any of the polypeptides comprising an amino acid sequence set forth in the Sequence Listing.
The isolated polypeptides of the invention include, but are not limited to, a polypeptide comprising any of the amino acid sequences set forth in the Sequence Listing; or the corresponding full length or mature protein. Polypeptides of the invention also include polypeptides with biological activity that are encoded by (a) any of the polynucleotides having a nucleotide sequence set forth in SEQ ID NO: 1-30368; or (b) polynucleotides that hybridize to the complement of the polynucleotides of (a) under stringent hybridization conditions. Biologically or immunologically active variants of any of the polypeptide sequences in the Sequence Listing, and "substantial equivalents" thereof (e.g. , with at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% amino acid sequence identity) that preferably retain biological activity are also contemplated. The polypeptides of the invention may be wholly or partially chemically synthesized but are preferably produced by recombinant means using the genetically engineered cells (e.g. host cells) of the invention. The invention also provides compositions comprising a polypeptide of the invention.
Polypeptide compositions of the invention may further comprise an acceptable carrier, such as a hydrophilic, e.g., pharmaceutically acceptable, carrier.
The invention also provides host cells transformed or transfected with a polynucleotide of the invention.
The invention also relates to methods for producing a polypeptide of the invention comprising growing a culture of the host cells of the invention in a suitable culture medium under conditions permitting expression of the desired polypeptide, and purifying the polypeptide from the culture or from the host cells. Preferred embodiments include those in which the protein produced by such process is a mature form of the protein.
Polynucleotides according to the invention have numerous applications in a variety of techniques known to those skilled in the art of molecular biology. These techniques include use as hybridization probes, use as oligomers, or primers, for PCR, use for chromosome and gene mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA or RNA, their chemical analogs and the like. For example, when the expression of an mRNA is largely restricted to a particular cell or tissue type, polynucleotides of the invention can be used as hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample using, e.g., in situ hybridization.
In other exemplary embodiments, the polynucleotides are used in diagnostics as expressed sequence tags for identifying expressed genes or, as well known in the art and exemplified by Vollrath et al., Science 258:52-59 (1992), as expressed sequence tags for physical mapping of the human genome.
The polypeptides according to the invention can be used in a variety of conventional procedures and methods that are currently applied to other proteins. For example, a polypeptide of the invention can be used to generate an antibody that specifically binds the polypeptide. Such antibodies, particularly monoclonal antibodies, are useful for detecting or quantitating the polypeptide in tissue. The polypeptides of the invention can also be used as molecular weight markers, and as a food supplement.
Methods are also provided for preventing, treating, or ameliorating a medical condition which comprises the step of administering to a mammalian subject a therapeutically effective amount of a composition comprising a polypeptide of the present invention and a pharmaceutically acceptable carrier.
In particular, the polypeptides and polynucleotides of the invention can be utilized, for example, in methods for the prevention and/or treatment of disorders involving aberrant protein expression or biological activity. The present invention further relates to methods for detecting the presence of the polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, be utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the identification of subjects exhibiting a predisposition to such conditions. The invention provides a method for detecting the polynucleotides of the invention in a sample, comprising contacting the sample with a compound that binds to and forms a complex with the polynucleotide of interest for a period sufficient to form the complex and under conditions sufficient to form a complex and detecting the complex such that if a complex is detected, the polynucleotide of interest is detected. The invention also provides a method for detecting the polypeptides of the invention in a sample comprising contacting the sample with a compound that binds to and forms a complex with the polypeptide under conditions and for a period sufficient to form the complex and detecting the formation of the complex such that if a complex is formed, the polypeptide is detected.
The invention also provides kits comprising polynucleotide probes and/or monoclonal antibodies, and optionally quantitative standards, for carrying out methods of the invention. Furthermore, the invention provides methods for evaluating the efficacy of drugs, and monitoring the progress of patients, involved in clinical trials for the treatment of disorders as recited above.
The invention also provides methods for the identification of compounds that modulate (t.e., increase or decrease) the expression or activity of the polynucleotides and/or polypeptides of the invention. Such methods can be utilized, for example, for the identification of compounds that can ameliorate symptoms of disorders as recited herein. Such methods can include, but are not limited to, assays for identifying compounds and other substances that interact with (e.g., bind to) the polypeptides of the invention. The invention provides a method for identifying a compound that binds to the polypeptides of the invention comprising contacting the compound with a polypeptide of the invention in a cell for a time sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a reporter gene sequence in the cell; and detecting the complex by detecting the reporter gene sequence expression such that if expression of the reporter gene is detected the compound that binds to a polypeptide of the invention is identified.
The methods of the invention also provides methods for treatment which involve the administration of the polynucleotides or polypeptides of the invention to individuals exhibiting symptoms or tendencies. In addition, the invention encompasses methods for treating diseases or disorders as recited herein comprising administering compounds and other substances that modulate the overall activity of the target gene products. Compounds and other substances can effect such modulation either on the level of target gene/protein expression or target protein activity.
The polypeptides of the present invention and the polynucleotides encoding them are also useful for the same functions known to one of skill in the art as the polypeptides and polynucleotides to which they have homology (set forth in the sequence listing). If no homology is set forth for a sequence, then the polypeptides and polynucleotides of the present invention are useful for a variety of applications, as described herein, including use in arrays for detection.
4. DETAILED DESCRIPTION OF THE INVENTION
4.1 DEFINITIONS
It must be noted that as used herein and in the appended claims, the singular forms "a", "an" and "the" include plural references unless the context clearly dictates otherwise. The term "active" refers to those forms of the polypeptide which retain the biologic and/or immunologic activities of any naturally occurring polypeptide. According to the invention, the terms "biologically active" or "biological activity" refer to a protein or peptide having structural, regulatory or biochemical functions of a naturally occurring molecule. Likewise "immunologically active" or "immunological activity" refers to the capability of the natural, recombinant or synthetic polypeptide to induce a specific immune response in appropriate animals or cells and to bind with specific antibodies.
The term "activated cells" as used in this application are those cells which are engaged in extracellular or intracellular membrane trafficking, including the export of secretory or enzymatic molecules as part of a normal or disease process. The terms "complementary" or "complementarity" refer to the natural binding of polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the complementary sequence 3'-TCA-5'. Complementarity between two single-stranded molecules may be "partial" such that only some of the nucleic acids bind or it may be "complete" such that total complementarity exists between the single stranded molecules. The degree of complementarity between the nucleic acid strands has significant effects on the efficiency and strength of the hybridization between the nucleic acid strands.
The term "embryonic stem cells (ES)" refers to a cell that can give rise to many differentiated cell types in an embryo or an adult, including the germ cells. The term "germ line stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a steady and continuous source of germ cells for the production of gametes. The term "primordial germ cells (PGCs)" refers to a small population of cells set aside from other cell lineages particularly from the yolk sac, mesenteries, or gonadal ridges during embryogenesis that have the potential to differentiate into germ cells and other cells. PGCs are the source from which GSCs and ES cells are derived The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells not only populate the germ line and give rise to a plurality of terminally differentiated cells that comprise the adult specialized organs, but are able to regenerate themselves.
The term "expression modulating fragment," EMF, means a series of nucleotides which modulates the expression of an operably linked ORF or another EMF.
As used herein, a sequence is said to "modulate the expression of an operably linked sequence" when the expression of the sequence is altered by the presence of the EMF. EMFs include, but are not limited to, promoters, and promoter modulating sequences (inducible elements). One class of EMFs are nucleic acid fragments which induce the expression of an operably linked ORF in response to a specific regulatory factor or physiological event. The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or "oligonucleotide" are used interchangeably and refer to a heteropolymer of nucleotides or the sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or synthetic origin which may be single-stranded or double-stranded and may represent the sense or the antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like material. In the sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and N is A, C, G or T (U). It is contemplated that where the polynucleotide is RNA, the T (thymine) in the sequences provided herein is substituted with U (uracil). Generally, nucleic acid segments provided by this invention may be assembled from fragments of the genome and short oligonucleotide linkers, or from a series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic acid which is capable of being expressed in a recombinant transcriptional unit comprising regulatory elements derived from a microbial or viral operon, or a eukaryotic gene.
The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or "segment" or "probe" or "primer" are used interchangeably and refer to a sequence of nucleotide residues which are at least about 5 nucleotides, more preferably at least about 7 nucleotides, more preferably at least about 9 nucleotides, more preferably at least about 1 1 nucleotides and most preferably at least about 17 nucleotides. The fragment is preferably less than about 500 nucleotides, preferably less than about 200 nucleotides, more preferably less than about 100 nucleotides, more preferably less than about 50 nucleotides and most preferably less than 30 nucleotides. Preferably the probe is from about 6 nucleotides to about 200 nucleotides, preferably from about 15 to about 50 nucleotides, more preferably from about 17 to 30 nucleotides and most preferably from about 20 to 25 nucleotides. Preferably the fragments can be used in polymerase chain reaction (PCR), various hybridization procedures or microarray procedures to identify or amplify identical or related parts of mRNA or DNA molecules. A fragment or segment may uniquely identify each polynucleotide sequence of the present invention. Preferably the fragment comprises a sequence substantially similar to any one of SEQ ID NO: 1-30368.
Probes may, for example, be used to determine whether specific mRNA molecules are present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as described by Walsh et al. (Walsh, P.S. et al., 1992, PCR Methods Appl 1 :241-250). They may be labeled by nick translation, Klenow fill-in reaction, PCR, or other methods well known in the art. Probes of the present invention, their preparation and/or labeling are elaborated in
Sambrook, J. et al., 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY; or Ausubel, F.M. et al., 1989, Current Protocols in Molecular Biology, John Wiley & Sons, New York NY, both of which are incorporated herein by reference in their entirety. The nucleic acid sequences of the present invention also include the sequence information from the nucleic acid sequences of SEQ ID NO: 1-30368. The sequence information can be a segment of any one of SEQ ID NO: 1-30368 that uniquely identifies or represents the sequence information of that sequence of SEQ ID NO: 1-30368. One such segment can be a twenty-mer nucleic acid sequence because the probability that a twenty-mer is fully matched in the human genome is 1 in 300. In the human genome, there are three billion base pairs in one set of chromosomes. Because 4 possible twenty-mers exist, there are 300 times more twenty-mers than there are base pairs in a set of human chromosomes. Using the same analysis, the probability for a seventeen-mer to be fully matched in the human genome is approximately 1 in 5. When these segments are used in arrays for expression studies, fifteen-mer segments can be used. The probability that the fifteen-mer is fully matched in the expressed sequences is also approximately one in five because expressed sequences comprise less than approximately 5% of the entire genome sequence.
Similarly, when using sequence information for detecting a single mismatch, a segment can be a twenty-five mer. The probability that the twenty-five mer would appear in a human genome with a single mismatch is calculated by multiplying the probability for a full match (1 ÷42~) times the increased probability for mismatch at each nucleotide position (3 x 25). The probability that an eighteen mer with a single mismatch can be detected in an array for expression studies is approximately one in five. The probability that a twenty-mer with a single mismatch can be detected in a human genome is approximately one in five. The term "open reading frame," ORF, means a series of nucleotide triplets coding for amino acids without any termination codons and is a sequence translatable into protein.
The terms "operably linked" or "operably associated" refer to functionally related nucleic acid sequences. For example, a promoter is operably associated or operably linked with a coding sequence if the promoter controls the transcription of the coding sequence. While operably linked nucleic acid sequences can be contiguous and in the same reading frame, certain genetic elements e.g. repressor genes are not contiguously linked to the coding sequence but still control transcription/translation of the coding sequence.
The term "pluripotent" refers to the capability of a cell to differentiate into a number of differentiated cell types that are present in an adult organism. A pluripotent cell is restricted in its differentiation capability in comparison to a totipotent cell.
The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an oligopeptide, peptide, polypeptide or protein sequence or fragment thereof and to naturally occurring or synthetic molecules. A polypeptide "fragment," "portion," or "segment" is a stretch of amino acid residues of at least about 5 amino acids, preferably at least about 7 amino acids, more preferably at least about 9 amino acids and most preferably at least about 17 or more amino acids. The peptide preferably is not greater than about 200 amino acids, more preferably less than 150 amino acids and most preferably less than 100 amino acids. Preferably the peptide is from about 5 to about 200 amino acids. To be active, any polypeptide must have sufficient length to display biological and/or immunological activity.
The term "naturally occurring polypeptide" refers to polypeptides produced by cells that have not been genetically engineered and specifically contemplates various polypeptides arising from post-translational modifications of the polypeptide including, but not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. The term "translated protein coding portion" means a sequence which encodes for the full length protein which may include any leader sequence or any processing sequence.
The term "mature protein coding sequence" means a sequence which encodes a peptide or protein without a signal or leader sequence. The "mature protein portion" means that portion of the protein which does not include a signal or leader sequence. The peptide may have been produced by processing in the cell which removes any leader/signal sequence. The mature protein portion may or may not include an initial methionine residue. The methionine residue may be removed from the protein during processing in the cell. The peptide may be produced synthetically or the protein may have been produced using a polynucleotide only encoding for the mature protein coding sequence. The term "derivative" refers to polypeptides chemically modified by such techniques as ubiquitination, labeling (e.g., with radionuclides or various enzymes), covalent polymer attachment such as pegylation (derivatization with polyethylene glycol) and insertion or substitution by chemical synthesis of amino acids such as ornithine, which do not normally occur in human proteins.
The term "variant"(or "analog") refers to any polypeptide differing from naturally occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, e g., recombinant DNA techniques. Guidance in determining which amino acid residues may be replaced, added or deleted without abolishing activities of interest, may be found by comparing the sequence of the particular polypeptide with that of homologous peptides and minimizing the number of amino acid sequence changes made in regions of high homology (conserved regions) or by replacing amino acids with consensus sequence.
Alternatively, recombinant variants encoding these same or similar polypeptides may be synthesized or selected by making use of the "redundancy" in the genetic code. Various codon substitutions, such as the silent changes which produce various restriction sites, may be introduced to optimize cloning into a plasmid or viral vector or expression in a particular prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be reflected in the polypeptide or domains of other peptides added to the polypeptide to modify the properties of any part of the polypeptide, to change characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover rate.
Preferably, amino acid "substitutions" are the result of replacing one amino acid with another amino acid having similar structural and/or chemical properties, i.e., conservative amino acid replacements. "Conservative" amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues involved. For example, nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; and negatively charged (acidic) amino acids include aspartic acid and glutamic acid. "Insertions" or "deletions" are preferably in the range of about 1 to 20 amino acids, more preferably 1 to 10 amino acids. The variation allowed may be experimentally determined by systematically making insertions, deletions, or substitutions of amino acids in a polypeptide molecule using recombinant DNA techniques and assaying the resulting recombinant variants for activity. Alternatively, where alteration of function is desired, insertions, deletions or non-conservative alterations can be engineered to produce altered polypeptides. Such alterations can, for example, alter one or more of the biological functions or biochemical characteristics of the polypeptides of the invention. For example, such alterations may change polypeptide characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover rate. Further, such alterations can be selected so as to generate polypeptides that are better suited for expression, scale up and the like in the host cells chosen for expression. For example, cysteine residues can be deleted or substituted with another amino acid residue in order to eliminate disulfide bridges.
The terms "purified" or "substantially purified" as used herein denotes that the indicated nucleic acid or polypeptide is present in the substantial absence of other biological macromolecules, e.g., polynucleotides, proteins, and the like. In one embodiment, the polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more preferably at least 99% by weight, of the indicated biological macromolecules present (but water, buffers, and other small molecules, especially molecules having a molecular weight of less than 1000 daltons, can be present). The term "isolated" as used herein refers to a nucleic acid or polypeptide separated from at least one other component (e.g., nucleic acid or polypeptide) present with the nucleic acid or polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide is found in the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a solution of the same. The terms "isolated" and "purified" do not encompass nucleic acids or polypeptides present in their natural source.
The term "recombinant," when used herein to refer to a polypeptide or protein, means that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or mammalian) expression systems. "Microbial" refers to recombinant polypeptides or proteins made in bacterial or fungal (e.g., yeast) expression systems. As a product, "recombinant microbial" defines a polypeptide or protein essentially free of native endogenous substances and unaccompanied by associated native glycosylation. Polypeptides or proteins expressed in most bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; polypeptides or proteins expressed in yeast will have a glycosylation pattern in general different from those expressed in mammalian cells. The term "recombinant expression vehicle or vector" refers to a plasmid or phage or virus or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression vehicle can comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural or coding sequence which is transcribed into mRNA and translated into protein, and (3) appropriate transcription initiation and termination sequences. Structural units intended for use in yeast or eukaryotic expression systems preferably include a leader sequence enabling extracellular secretion of translated protein by a host cell. Alternatively, where recombinant protein is expressed without a leader or transport sequence, it may include an amino terminal methionine residue. This residue may or may not be subsequently cleaved from the expressed recombinant protein to provide a final product.
The term "recombinant expression system" means host cells which have stably integrated a recombinant transcriptional unit into chromosomal DNA or carry the recombinant transcriptional unit extrachromosomally. Recombinant expression systems as defined herein will express heterologous polypeptides or proteins upon induction of the regulatory elements linked to the DNA segment or synthetic gene to be expressed. This term also means host cells which have stably integrated a recombinant genetic element or elements having a regulatory role in gene expression, for example, promoters or enhancers. Recombinant expression systems as defined herein will express polypeptides or proteins endogenous to the cell upon induction of the regulatory elements linked to the endogenous DNA segment or gene to be expressed. The cells can be prokaryotic or eukaryotic.
The term "secreted" includes a protein that is transported across or through a membrane, including transport as a result of signal sequences in its amino acid sequence when it is expressed in a suitable host cell. "Secreted" proteins include without limitation proteins secreted wholly (e.g., soluble proteins) or partially (e.g., receptors) from the cell in which they are expressed. "Secreted" proteins also include without limitation proteins that are transported across the membrane of the endoplasmic reticulum. "Secreted" proteins are also intended to include proteins containing non-typical signal sequences (e.g. Interleukin-1 Beta, see Krasney, P.A. and Young, P.R. (1992) Cytokine 4(2):134 -143) and factors released from damaged cells (e.g. Interleukin-1 Receptor Antagonist, see Arend, W.P. et. al. (1998) Annu. Rev. Immunol. 16:27-55)
Where desired, an expression vector may be designed to contain a "signal or leader sequence" which will direct the polypeptide through the membrane of a cell. Such a sequence may be naturally present on the polypeptides of the present invention or provided from heterologous protein sources by recombinant DNA techniques. The term "stringent" is used to refer to conditions that are commonly understood in the art as stringent. Stringent conditions can include highly stringent conditions (i.e., hybridization to filter-bound DNA in 0.5 M NaHPO4, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65°C, and washing in 0.1 X SSC/0.1% SDS at 68°C), and moderately stringent conditions (i.e., washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary hybridization conditions are described herein in the examples. In instances of hybridization of deoxyoligonucleotides, additional exemplary stringent hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate at 37°C (for 14-base oligonucleotides), 48°C (for 17-base oligos), 55°C (for 20-base oligonucleotides), and 60°C (for 23-base oligonucleotides). As used herein, "substantially equivalent" can refer both to nucleotide and amino acid sequences, for example a mutant sequence, that varies from a reference sequence by one or more substitutions, deletions, or additions, the net effect of which does not result in an adverse functional dissimilarity between the reference and subject sequences. Typically, such a substantially equivalent sequence varies from one of those listed herein by no more than about 35% (i.e., the number of individual residue substitutions, additions, and/or deletions in a substantially equivalent sequence, as compared to the corresponding reference sequence, divided by the total number of residues in the substantially equivalent sequence is about 0.35 or less). Such a sequence is said to have 65% sequence identity to the listed sequence. In one embodiment, a substantially equivalent, e.g., mutant, sequence of the invention varies from a listed sequence by no more than 30% (70% sequence identity); in a variation of this embodiment, by no more than 25% (75% sequence identity); and in a further variation of this embodiment, by no more than 20% (80% sequence identity) and in a further variation of this embodiment, by no more than 10% (90% sequence identity) and in a further variation of this embodiment, by no more that 5% (95% sequence identity). Substantially equivalent, e.g., mutant, amino acid sequences according to the invention preferably have at least 80% sequence identity with a listed amino acid sequence, more preferably at least 85% sequence identity, more preferably at least 90% sequence identity, more preferably at least 95% identity, more preferably at least 98% identity, and most preferably at least 99% identity. Substantially equivalent nucleotide sequences of the invention can have lower percent sequence identities, taking into account, for example, the redundancy or degeneracy of the genetic code. Preferably, nucleotide sequence has at least about 65% identity, more preferably at least about 75% identity, more preferably at least about 80% sequence identity, more preferably at least about 85% sequence identity, more preferably at least about 90% sequence identity, and most preferably at least about 95% identity, more preferably at least about 98% sequence identity, and most preferably at least about 99% sequence identity. For the purposes of the present invention, sequences having substantially equivalent biological activity and substantially equivalent expression characteristics are considered substantially equivalent. For the purposes of determining equivalence, truncation of the mature sequence (e.g., via a mutation which creates a spurious stop codon) should be disregarded. Sequence identity may be determined, e.g., using the Jotun Hein method (Hein, J. (1990) Methods Enzymol. 183:626-645). Identity between sequences can also be determined by other methods known in the art, e.g. by varying hybridization conditions.
The term "totipotent" refers to the capability of a cell to differentiate into all of the cell types of an adult organism. The term "transformation" means introducing DNA into a suitable host cell so that the
DNA is replicable, either as an extrachromosomal element, or by chromosomal integration. The term "transfection" refers to the taking up of an expression vector by a suitable host cell, whether or not any coding sequences are in fact expressed. The term "infection" refers to the introduction of nucleic acids into a suitable host cell by use of a virus or viral vector. As used herein, an "uptake modulating fragment," UMF, means a series of nucleotides which mediate the uptake of a linked DNA fragment into a cell. UMFs can be readily identified using known UMFs as a target sequence or target motif with the computer-based systems described below. The presence and activity of a UMF can be confirmed by attaching the suspected UMF to a marker sequence. The resulting nucleic acid molecule is then incubated with an appropriate host under appropriate conditions and the uptake of the marker sequence is determined. As described above, a UMF will increase the frequency of uptake of a linked marker sequence.
Each of the above terms is meant to encompass all that is described for each, unless the context dictates otherwise.
4.2 NUCLEIC ACIDS OF THE INVENTION
Nucleotide sequences of the invention are set forth in the Sequence Listing. The isolated polynucleotides of the invention include a polynucleotide comprising the nucleotide sequences of SEQ ID NO: 1-30368; a polynucleotide encoding any one of the peptide sequences of SEQ ID NO: 30369-60736; and a polynucleotide comprising the nucleotide sequence encoding the mature protein coding sequence of the polypeptides of any one of SEQ ID NO: 30369-60736. The polynucleotides of the present invention also include, but are not limited to, a polynucleotide that hybridizes under stringent conditions to (a) the complement of any of the nucleotides sequences of SEQ ID NO: 1-30368; (b) nucleotide sequences encoding any one of the amino acid sequences set forth in the Sequence Listing; (c) a polynucleotide which is an allelic variant of any polynucleotide recited above; (d) a polynucleotide which encodes a species homolog of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide comprising a specific domain or truncation of the polypeptides of SEQ ID NO: 30369-60736. Domains of interest may depend on the nature of the encoded polypeptide; e.g., domains in receptor-like polypeptides include ligand-binding, extracellular, transmembrane, or cytoplasmic domains, or combinations thereof; domains in immunoglobulin-like proteins include the variable immunoglobulin-like domains; domains in enzyme-like polypeptides include catalytic and substrate binding domains; and domains in ligand polypeptides include receptor-binding domains. The polynucleotides of the invention include naturally occurring or wholly or partially synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g., mRNA. The polynucleotides may include all of the coding region of the cDNA or may represent a portion of the coding region of the cDNA.
The present invention also provides genes corresponding to the cDNA sequences disclosed herein. The corresponding genes can be isolated in accordance with known methods using the sequence information disclosed herein. Such methods include the preparation of probes or primers from the disclosed sequence information for identification and/or amplification of genes in appropriate genomic libraries or other sources of genomic materials. Further 5' and 3' sequence can be obtained using methods known in the art. For example, full length cDNA or genomic DNA that corresponds to any of the polynucleotides of SEQ ID NO: 1-30368 can be obtained by screening appropriate cDNA or genomic DNA libraries under suitable hybridization conditions using any of the polynucleotides of SEQ ID NO: 1-30368 or a portion thereof as a probe. Alternatively, the polynucleotides of SEQ ID NO: 1-30368 may be used as the basis for suitable primer(s) that allow identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries. The nucleic acid sequences of the invention can be assembled from ESTs and sequences
(including cDNA and genomic sequences) obtained from one or more public databases, such as dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence information, representative fragment or segment information, or novel segment information for the full-length gene. The polynucleotides of the invention also provide polynucleotides including nucleotide sequences that are substantially equivalent to the polynucleotides recited above. Polynucleotides according to the invention can have, e.g., at least about 65%, at least about 70%, at least about 75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least about 85%, 86%, 87%, 88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, and even more typically at least about 95%, 96%, 97%, 98%, 99%, sequence identity to a polynucleotide recited above.
Included within the scope of the nucleic acid sequences of the invention are nucleic acid sequence fragments that hybridize under stringent conditions to any of the nucleotide sequences of SEQ ID NO: 1-30368, or complements thereof, which fragment is greater than about 5 nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and most preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, or 20 nucleotides or more that are selective for (i.e. specifically hybridize to any one of the polynucleotides of the invention) are contemplated. Probes capable of specifically hybridizing to a polynucleotide can differentiate polynucleotide sequences of the invention from other polynucleotide sequences in the same family of genes or can differentiate human genes from genes of other species, and are preferably based on unique nucleotide sequences.
The sequences falling within the scope of the present invention are not limited to these specific sequences, but also include allelic and species variations thereof. Allelic and species variations can be routinely determined by comparing the sequence provided in SEQ ID NO: 1 - 30368, a representative fragment thereof, or a nucleotide sequence at least 90% identical, preferably 95% identical, to SEQ ID NO: 1 -30368 with a sequence from another isolate of the same species. Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules coding for the same amino acid sequences as do the specific ORFs disclosed herein. In other words, in the coding region of an ORF, substitution of one codon for another codon that encodes the same amino acid is expressly contemplated. The nearest neighbor or homology result for the nucleic acids of the present invention, including SEQ ID NO: 1 -30368 can be obtained by searching a database using an algorithm or a program. Preferably, a BLAST which stands for Basic Local Alignment Search Tool is used to search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 (1993) and Altschul S.F. et al. J. Mol. Biol. 21 :403-410(1990)). Alternatively a FASTA version 3 search against Genpept, using Fastxy algorithm.
Species homologs (or orthologs) of the disclosed polynucleotides and proteins are also provided by the present invention. Species homologs may be isolated and identified by making suitable probes or primers from the sequences provided herein and screening a suitable nucleic acid source from the desired species. The invention also encompasses allelic variants of the disclosed polynucleotides or proteins; that is, naturally occurring alternative forms of the isolated polynucleotide which also encode proteins which are identical, homologous or related to that encoded by the polynucleotides.
The nucleic acid sequences of the invention are further directed to sequences which encode variants of the described nucleic acids. These amino acid sequence variants may be prepared by methods known in the art by introducing appropriate nucleotide changes into a native or variant polynucleotide. There are two variables in the construction of amino acid sequence variants: the location of the mutation and the nature of the mutation. Nucleic acids encoding the amino acid sequence variants are preferably constructed by mutating the polynucleotide to encode an amino acid sequence that does not occur in nature. These nucleic acid alterations can be made at sites that differ in the nucleic acids from different species
(variable positions) or in highly conserved regions (constant regions). Sites at such locations will typically be modified in series, e.g., by substituting first with conservative choices (e.g., hydrophobic amino acid to a different hydrophobic amino acid) and then with more distant choices (e.g., hydrophobic amino acid to a charged amino acid), and then deletions or insertions may be made at the target site. Amino acid sequence deletions generally range from about 1 to 30 residues, preferably about 1 to 10 residues, and are typically contiguous. Amino acid insertions include amino- and/or carboxyl-terminal fusions ranging in length from one to one hundred or more residues, as well as intrasequence insertions of single or multiple amino acid residues. Intrasequence insertions may range generally from about 1 to 10 amino residues, preferably from 1 to 5 residues. Examples of terminal insertions include the heterologous signal sequences necessary for secretion or for intracellular targeting in different host cells and sequences such as FLAG or poly-histidine sequences useful for purifying the expressed protein. In a preferred method, polynucleotides encoding the novel amino acid sequences are changed via site-directed mutagenesis. This method uses oligonucleotide sequences to alter a polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent nucleotides on both sides of the changed amino acid to form a stable duplex on either side of the site of being changed. In general, the techniques of site-directed mutagenesis are well known to those of skill in the art and this technique is exemplified by publications such as, Edelman et al., DNA 2: 183 (1983). A versatile and efficient method for producing site-specific changes in a polynucleotide sequence was published by Zoller and Smith, Nucleic Acids Res. 10:6487-6500 (1982). PCR may also be used to create amino acid sequence variants of the novel nucleic acids. When small amounts of template DNA are used as starting material, primer(s) that differs slightly in sequence from the corresponding region in the template DNA can generate the desired amino acid variant. PCR amplification results in a population of product DNA fragments that differ from the polynucleotide template encoding the polypeptide at the position specified by the primer. The product DNA fragments replace the corresponding region in the plasmid and this gives a polynucleotide encoding the desired amino acid variant.
A further technique for generating amino acid variants is the cassette mutagenesis technique described in Wells et al., Gene 34:315 (1985); and other mutagenesis techniques well known in the art, such as, for example, the techniques in Sambrook et al., supra, and Current Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent amino acid sequence may be used in the practice of the invention for the cloning and expression of these novel nucleic acids. Such DNA sequences include those which are capable of hybridizing to the appropriate novel nucleic acid sequence under stringent conditions.
Polynucleotides encoding preferred polypeptide truncations of the invention can be used to generate polynucleotides encoding chimeric or fusion proteins comprising one or more domains of the invention and heterologous protein sequences.
The polynucleotides of the invention additionally include the complement of any of the polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, amplified, or synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well known to those of skill in the art and can include, for example, methods for determining hybridization conditions that can routinely isolate polynucleotides of the desired sequence identities.
In accordance with the invention, polynucleotide sequences comprising the mature protein coding sequences corresponding to any one of SEQ ID NO: 1-30368, or functional equivalents thereof, may be used to generate recombinant DNA molecules that direct the expression of that nucleic acid, or a functional equivalent thereof, in appropriate host cells. Also included are the cDNA inserts of any of the clones identified herein.
A polynucleotide according to the invention can be joined to any of a variety of other nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al. (1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). Useful nucleotide sequences for joining to polynucleotides include an assortment of vectors, e.g., plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the art. Accordingly, the invention also provides a vector including a polynucleotide of the invention and a host cell containing the polynucleotide. In general, the vector contains an origin of replication functional in at least one organism, convenient restriction endonuclease sites, and a selectable marker for the host cell. Vectors according to the invention include expression vectors, replication vectors, probe generation vectors, and sequencing vectors. A host cell according to the invention can be a prokaryotic or eukaryotic cell and can be a unicellular organism or part of a multicellular organism.
The present invention further provides recombinant constructs comprising a nucleic acid having any of the nucleotide sequences of SEQ ID NO: 1-30368 or a fragment thereof or any other polynucleotides of the invention. In one embodiment, the recombinant constructs of the present invention comprise a vector, such as a plasmid or viral vector, into which a nucleic acid having any of the nucleotide sequences of SEQ ID NO: 1-30368 or a fragment thereof is inserted, in a forward or reverse orientation. In the case of a vector comprising one of the ORFs of the present invention, the vector may further comprise regulatory sequences, including for example, a promoter, operably linked to the ORF. Large numbers of suitable vectors and promoters are known to those of skill in the art and are commercially available for generating the recombinant constructs of the present invention. The following vectors are provided by way of example. Bacterial: pBs, phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, pNH16a, pNH18a, pNH46a (Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic: pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL (Pharmacia).
The isolated polynucleotide of the invention may be operably linked to an expression control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al, Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein recombinantly. Many suitable expression control sequences are known in the art. General methods of expressing recombinant proteins are also known and are exemplified in R. Kaufman, Methods in Enzymology 185, 537-566 (1990). As defined herein "operably linked" means that the isolated polynucleotide of the invention and an expression control sequence are situated within a vector or cell in such a way that the protein is expressed by a host cell which has been transformed (transfected) with the ligated polynucleotide/expression control sequence.
Promoter regions can be selected from any desired gene using CAT (chloramphenicol transferase) vectors or other vectors with selectable markers. Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters include lad, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of the appropriate vector and promoter is well within the level of ordinary skill in the art. Generally, recombinant expression vectors will include origins of replication and selectable markers permitting transformation of the host cell, e.g., the ampicillin resistance gene ofE coli and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to direct transcription of a downstream structural sequence. Such promoters can be derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), a-factor, acid phosphatase, or heat shock proteins, among others. The heterologous structural sequence is assembled in appropriate phase with translation initiation and termination sequences, and preferably, a leader sequence capable of directing secretion of translated protein into the periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a fusion protein including an amino terminal identification peptide imparting desired characteristics, e.g., stabilization or simplified purification of expressed recombinant product. Useful expression vectors for bacterial use are constructed by inserting a structural DNA sequence encoding a desired protein together with suitable translation initiation and termination signals in operable reading phase with a functional promoter. The vector will comprise one or more phenotypic selectable markers and an origin of replication to ensure maintenance of the vector and to, if desirable, provide amplification within the host. Suitable prokaryotic hosts for transformation include E. coli, Bacillus subtilis, Salmonella typhimuήum and various species within the genera Pseudomonas, Streptomyces, and Staphylococcus, although others may also be employed as a matter of choice.
As a representative but non-limiting example, useful expression vectors for bacterial use can comprise a selectable marker and bacterial origin of replication derived from commercially available plasmids comprising genetic elements of the well known cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, USA). These pBR322 "backbone" sections are combined with an appropriate promoter and the structural sequence to be expressed. Following transformation of a suitable host strain and growth of the host strain to an appropriate cell density, the selected promoter is induced or derepressed by appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an additional period. Cells are typically harvested by centrifugation, disrupted by physical or chemical means, and the resulting crude extract retained for further purification.
Polynucleotides of the invention can also be used to induce immune responses. For example, as described in Fan et al., Nat. Biotech. 17:870-872 (1999), incorporated herein by reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies against the encoded polypeptide following topical administration of naked plasmid DΝA or following injection, and preferably intramuscular injection of the DΝA. The nucleic acid sequences are preferably inserted in a recombinant expression vector and may be in the form of naked DΝA.
4.3 ANTISENSE
Another aspect of the invention pertains to isolated antisense nucleic acid molecules that are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide sequence of SEQ ID NO: 1-30368, or fragments, analogs or derivatives thereof. An "antisense" nucleic acid comprises a nucleotide sequence that is complementary to a "sense" nucleic acid encoding a protein, e.g. , complementary to the coding strand of a double-stranded cDNA molecule or complementary to an mRNA sequence. In specific aspects, antisense nucleic acid molecules are provided that comprise a sequence complementary to at least about 10, 25, 50, 100, 250 or 500 nucleotides or an entire coding strand, or to only a portion thereof. Nucleic acid molecules encoding fragments, homologs, derivatives and analogs of a protein of any of SEQ ID NO: 30369-60736 or antisense nucleic acids complementary to a nucleic acid sequence of SEQ
ID NO: 1-30368 are additionally provided.
In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of the coding strand of a nucleotide sequence of the invention. The term "coding region" refers to the region of the nucleotide sequence comprising codons which are translated into amino acid residues. In another embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" of the coding strand of a nucleotide sequence of the invention. The term "noncoding region" refers to 5' and 3' sequences which flank the coding region that are not translated into amino acids (i.e., also referred to as 5' and 3' untranslated regions). Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g., SEQ
ID NO: 1-30368), antisense nucleic acids of the invention can be designed according to the rules of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be complementary to the entire coding region of a mRNA, but more preferably is an oligonucleotide that is antisense to only a portion of the coding or noncoding region of a mRNA. For example, the antisense oligonucleotide can be complementary to the region surrounding the translation start site of a mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be constructed using chemical synthesis or enzymatic ligation reactions using procedures known in the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or variously modified nucleotides designed to increase the biological stability of the molecules or to increase the physical stability of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine substituted nucleotides can be used.
Examples of modified nucleotides that can be used to generate the antisense nucleic acid include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl- 2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine, 1-methylguanine, 1 -methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the antisense nucleic acid can be produced biologically using an expression vector into which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described further in the following subsection). The antisense nucleic acid molecules of the invention are typically administered to a subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or genomic DNA encoding a protein according to the invention to thereby inhibit expression of the protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in the major groove of the double helix. An example of a route of administration of antisense nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, antisense nucleic acid molecules can be modified to target selected cells and then administered systemically. For example, for systemic administration, antisense molecules can be modified such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g. , by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using the vectors described herein. To achieve sufficient intracellular concentrations of antisense molecules, vector constructs in which the antisense nucleic acid molecule is placed under the control of a strong pol II or pol III promoter are preferred.
In yet another embodiment, the antisense nucleic acid molecule of the invention is an -a n omeric nucleic acid molecule. An -a nomeric nucleic acid molecule forms specific double-stranded hybrids with complementary RNA in which, contrary to the usual -uni ts, the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids Res 15: 6625-6641). The antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide (Inoue et al.
(1987) Nucleic Acids Res 15: 6131-6148) or a chimeric RNA -DNA analogue (Inoue et al. (1987) FEBS Lett 215: 321-330).
4.4 RIBOZYMES AND PNA MOIETIES In still another embodiment, an antisense nucleic acid of the invention is a ribozyme.
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a single-stranded nucleic acid, such as a mRNA, to which they have a complementary region. Thus, ribozymes (e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988) Nature 334:585-591)) can be used to catalytically cleave a mRNA transcripts to thereby inhibit translation of a mRNA. A ribozyme having specificity for a nucleic acid of the invention can be designed based upon the nucleotide sequence of a DNA disclosed herein (i.e., SEQ ID NO: 1 -
30368). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the active site is complementary to the nucleotide sequence to be cleaved in an mRNA of SEQ ID NO: 1-30368 (see, e.g., Cech et al. U.S. Pat. No. 4,987,071; and Cech et al. U.S. Pat. No. 5,116,742). Alternatively, polynucleotides of the invention can be used to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules. See, e.g. , Bartel et al., (1993) Science 261 :141 1-1418.
Alternatively, gene expression can be inhibited by targeting nucleotide sequences complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple helical structures that prevent transcription of the gene in target cells. See generally, Helene. (1991) Anticancer Drug Des. 6: 569-84; Helene. et al. (1992) Ann. N Y. Acad. Sci. 660:27-36; and Maher (1992) Bioassays 14: 807-15.
In various embodiments, the nucleic acids of the invention can be modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic acids can be modified to generate peptide nucleic acids (see Hyrup et al. (1996) Bioorg Med Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases are retained. The neutral backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under conditions of low ionic strength. The synthesis of PNA oligomers can be performed using standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996) above; Perry-O'Keefe et al. (1996) PNAS 93: 14670-675.
PNAs of the invention can be used in therapeutic and diagnostic applications. For example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene expression by, e.g., inducing transcription or translation arrest or inhibiting replication. PNAs of the invention can also be used, e.g., in the analysis of single base pair mutations in a gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in combination with other enzymes, e.g., SI nucleases (Hyrup B. (1996) above); or as probes or primers for DNA sequence and hybridization (Hyrup et al. (1996), above; Perry-O'Keefe (1996), above).
In another embodiment, PNAs of the invention can be modified, e.g., to enhance their stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery known in the art. For example, PNA-DNA chimeras can be generated that may combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA portion would provide high binding affinity and specificity. PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms of base stacking, number of bonds between the nucleobases, and orientation (Hyrup (1996) above). The synthesis of PNA-DNA chimeras can be performed as described in Hyrup (1996) above and Finn et al. (1996) Nucl Acids Res 24: 3357-63. For example, a DNA chain can be synthesized on a solid support using standard phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 5'-(4-methoxytrityl)amino-5'-deoxy-thymidine phosphoramidite, can be used between the PNA and the 5' end of DNA (Mag et al. (1989) Nucl Acid Res 17: 5973-88). PNA monomers are then coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al. (1996) above). Alternatively, chimeric molecules can be synthesized with a 5' DNA segment and a 3' PNA segment. See, Petersen et al. (1975) Bioorg Med Chem Lett 5: 11 19-11124. In other embodiments, the oligonucleotide may include other appended groups such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the cell membrane (see, e.g., Letsinger et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. 84:648-652; PCT Publication No. W088/09810) or the blood-brain barrier (see, e.g., PCT Publication No. W089/10134). In addition, oligonucleotides can be modified with hybridization triggered cleavage agents (See, e.g. , Krol et al., 1988, BioTechniques 6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered cleavage agent, etc.
4.5 HOSTS
The present invention further provides host cells genetically engineered to contain the polynucleotides of the invention. For example, such host cells may contain nucleic acids of the invention introduced into the host cell using known transformation, transfection or infection methods. The present invention still further provides host cells genetically engineered to express the polynucleotides of the invention, wherein such polynucleotides are in operative association with a regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in the cell.
Knowledge of nucleic acid sequences allows for modification of cells to permit, or increase, expression of endogenous polypeptide. Cells can be modified (e.g., by homologous recombination) to provide increased polypeptide expression by replacing, in whole or in part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells express the polypeptide at higher levels. The heterologous promoter is inserted in such a manner that it is operatively linked to the encoding sequences. See, for example, PCT International Publication No. WO94/12650, PCT International Publication No. WO92/20808, and PCT International
Publication No. WO91/09955. It is also contemplated that, in addition to heterologous promoter DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron DNA may be inserted along with the heterologous promoter DNA. If linked to the coding sequence, amplification of the marker DNA by standard selection methods results in co- amplification of the desired protein coding sequences in the cells.
The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. Introduction of the recombinant construct into the host cell can be effected by calcium phosphate transfection, DEAE, dextran mediated transfection, or electroporation (Davis, L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one of the polynucleotides of the invention, can be used in conventional manners to produce the gene product encoded by the isolated fragment (in the case of an ORF) or can be used to produce a heterologous protein under the control of the EMF. Any host/vector system can be used to express one or more of the ORFs of the present invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, Cv-1 cell, COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. coli and B. subtilis. The most preferred cells are those which do not normally express the particular polypeptide or protein or which expresses the polypeptide or protein at low natural level. Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate promoters. Cell-free translation systems can also be employed to produce such proteins using RNAs derived from the DNA constructs of the present invention. Appropriate cloning and expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et al., in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New York (1989), the disclosure of which is hereby incorporated by reference.
Various mammalian cell culture systems can also be employed to express recombinant protein. Examples of mammalian expression systems include the COS-7 lines of monkey kidney fibroblasts, described by Gluzman, Cell 23:175 (1981). Other cell lines capable of expressing a compatible vector are, for example, the C127, monkey COS cells, Chinese Hamster Ovary (CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell strains derived from in vitro culture of primary tissue, primary explants, HeLa cells, mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. Mammalian expression vectors will comprise an origin of replication, a suitable promoter and also any necessary ribosome binding sites, polyadenylation site, splice donor and acceptor sites, transcriptional termination sequences, and 5 ' flanking nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for example, SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide the required nontranscribed genetic elements. Recombinant polypeptides and proteins produced in bacterial culture are usually isolated by initial extraction from cell pellets, followed by one or more salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein refolding steps can be used, as necessary, in completing configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) can be employed for final purification steps. Microbial cells employed in expression of proteins can be disrupted by any convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing agents.
Alternatively, it may be possible to produce the protein in lower eukaryotes such as yeast or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial strain capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, it may be necessary to modify the protein produced therein, for example by phosphorylation or glycosylation of the appropriate sites, in order to obtain the functional protein. Such covalent attachments may be accomplished using known chemical or enzymatic methods. In another embodiment of the present invention, cells and tissues may be engineered to express an endogenous gene comprising the polynucleotides of the invention under the control of inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may be replaced by homologous recombination. As described herein, gene targeting can be used to replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene or a novel regulatory sequence synthesized by genetic engineering methods. Such regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations of said sequences. Alternatively, sequences which affect the structure or stability of the RNA or protein produced may be replaced, removed, added, or otherwise modified by targeting. These sequence include polyadenylation signals, mRNA stability elements, splice sites, leader sequences for enhancing or modifying transport or secretion properties of the protein, or other sequences which alter or improve the function or stability of protein or RNA molecules.
The targeting event may be a simple insertion of the regulatory sequence, placing the gene under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the targeting event may replace an existing element; for example, a tissue-specific enhancer can be replaced by an enhancer that has broader or different cell-type specificity than the naturally occurring elements. Here, the naturally occurring sequences are deleted and new sequences are added. In all cases, the identification of the targeting event may be facilitated by the use of one or more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection of cells in which the exogenous DNA has integrated into the host cell genome. The identification of the targeting event may also be facilitated by the use of one or more marker genes exhibiting the property of negative selection, such that the negatively selectable marker is linked to the exogenous DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and such that a correct homologous recombination event with sequences in the host cell genome does not result in the stable integration of the negatively selectable marker. Markers useful for this purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine phosphoribosyl-transferase (gpt) gene.
The gene targeting or gene activation techniques which can be used in accordance with this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No.
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by reference herein in its entirety.
4.6 POLYPEPTIDES OF THE INVENTION The isolated polypeptides of the invention include, but are not limited to, a polypeptide comprising: the amino acid sequences set forth as any one of SEQ ID NO: 30369-60736 or an amino acid sequence encoded by any one of the nucleotide sequences SEQ ID NO: 1-30368 or the corresponding full length or mature protein. Polypeptides of the invention also include polypeptides preferably with biological or immunological activity that are encoded by: (a) a polynucleotide having any one of the nucleotide sequences set forth in SEQ ID NO: 1-30368 or (b) polynucleotides encoding any one of the amino acid sequences set forth as SEQ ID NO:
30369-60736 or (c) polynucleotides that hybridize to the complement of the polynucleotides of either (a) or (b) under stringent hybridization conditions. The invention also provides biologically active or immunologically active variants of any of the amino acid sequences set forth as SEQ ID NO: 30369-60736 or the corresponding full length or mature protein; and
"substantial equivalents" thereof (e.g., with at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, 86%, 87%, 88%, 89%, at least about 90%, 91%, 92%, 93%, 94%, typically at least about 95%, 96%, 97%, more typically at least about 98%, or most typically at least about 99% amino acid identity) that retain biological activity. Polypeptides encoded by allelic variants may have a similar, increased, or decreased activity compared to polypeptides comprising SEQ ID NO: 30369-60736.
Fragments of the proteins of the present invention which are capable of exhibiting biological activity are also encompassed by the present invention. Fragments of the protein may be in linear form or they may be cyclized using known methods, for example, as described in H. U. Saragovi, et al., Bio/Technology 10, 773-778 (1992) and in R. S. McDowell, et al., J. Amer. Chem. Soc. 114, 9245-9253 (1992), both of which are incorporated herein by reference. Such fragments may be fused to carrier molecules such as immunoglobulins for many purposes, including increasing the valency of protein binding sites.
The present invention also provides both full-length and mature forms (for example, without a signal sequence or precursor sequence) of the disclosed proteins. The protein coding sequence is identified in the sequence listing by translation of the disclosed nucleotide sequences. The mature form of such protein may be obtained by expression of a full-length polynucleotide in a suitable mammalian cell or other host cell. The sequence of the mature form of the protein is also determinable from the amino acid sequence of the full-length form. Where proteins of the present invention are membrane bound, soluble forms of the proteins are also provided. In such forms, part or all of the regions causing the proteins to be membrane bound are deleted so that the proteins are fully secreted from the cell in which they are expressed.
Protein compositions of the present invention may further comprise an acceptable carrier, such as a hydrophilic, e.g., pharmaceutically acceptable, carrier. The present invention further provides isolated polypeptides encoded by the nucleic acid fragments of the present invention or by degenerate variants of the nucleic acid fragments of the present invention. By "degenerate variant" is intended nucleotide fragments which differ from a nucleic acid fragment of the present invention (e.g., an ORF) by nucleotide sequence but, due to the degeneracy of the genetic code, encode an identical polypeptide sequence. Preferred nucleic acid fragments of the present invention are the ORFs that encode proteins. A variety of methodologies known in the art can be utilized to obtain any one of the isolated polypeptides or proteins of the present invention. At the simplest level, the amino acid sequence can be synthesized using commercially available peptide synthesizers. The synthetically-constructed protein sequences, by virtue of sharing primary, secondary or tertiary structural and/or conformational characteristics with proteins may possess biological properties in common therewith, including protein activity. This technique is particularly useful in producing small peptides and fragments of larger polypeptides. Fragments are useful, for example, in generating antibodies against the native polypeptide. Thus, they may be employed as biologically active or immunological substitutes for natural, purified proteins in screening of therapeutic compounds and in immunological processes for the development of antibodies.
The polypeptides and proteins of the present invention can alternatively be purified from cells which have been altered to express the desired polypeptide or protein. As used herein, a cell is said to be altered to express a desired polypeptide or protein when the cell, through genetic manipulation, is made to produce a polypeptide or protein which it normally does not produce or which the cell normally produces at a lower level. One skilled in the art can readily adapt procedures for introducing and expressing either recombinant or synthetic sequences into eukaryotic or prokaryotic cells in order to generate a cell which produces one of the polypeptides or proteins of the present invention.
The invention also relates to methods for producing a polypeptide comprising growing a culture of host cells of the invention in a suitable culture medium, and purifying the protein from the cells or the culture in which the cells are grown. For example, the methods of the invention include a process for producing a polypeptide in which a host cell containing a suitable expression vector that includes a polynucleotide of the invention is cultured under conditions that allow expression of the encoded polypeptide. The polypeptide can be. recovered from the culture, conveniently from the culture medium, or from a lysate prepared from the host cells and further purified. Preferred embodiments include those in which the protein produced by such process is a full length or mature form of the protein.
In an alternative method, the polypeptide or protein is purified from bacterial cells which naturally produce the polypeptide or protein. One skilled in the art can readily follow known methods for isolating polypeptides and proteins in order to obtain one of the isolated polypeptides or proteins of the present invention. These include, but are not limited to, immunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, and immuno-affinity chromatography. See, e.g., Scopes, Protein Purification: Principles and Practice, Springer-Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory Manual; Ausubel et al., Current Protocols in Molecular Biology. Polypeptide fragments that retain biological/immunological activity include fragments comprising greater than about 100 amino acids, or greater than about 200 amino acids, and fragments that encode specific protein domains.
The purified polypeptides can be used in in vitro binding assays which are well known in the art to identify molecules which bind to the polypeptides. These molecules include but are not limited to, for e.g., small molecules, molecules from combinatorial libraries, antibodies or other proteins. The molecules identified in the binding assay are then tested for antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the art. In brief, the molecules are titrated into a plurality of cell cultures or animals and then tested for either cell/animal death or prolonged survival of the animal/cells.
In addition, the peptides of the invention or molecules capable of binding to the peptides may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to cells. The toxin-binding molecule complex is then targeted to a tumor or other cell by the specificity of the binding molecule for SEQ ID NO: 30369-60736. The protein of the invention may also be expressed as a product of transgenic animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized by somatic or germ cells containing a nucleotide sequence encoding the protein.
The proteins provided herein also include proteins characterized by amino acid sequences similar to those of purified proteins but into which modification are naturally provided or deliberately engineered. For example, modifications in the peptide or DNA sequence can be made by those skilled in the art using known techniques. Modifications of interest in the protein sequences may include the alteration, substitution, replacement, insertion or deletion of a selected amino acid residue in the coding sequence. For example, one or more of the cysteine residues may be deleted or replaced with another amino acid to alter the conformation of the molecule. Techniques for such alteration, substitution, replacement, insertion or deletion are well known to those skilled in the art (see, e.g., U.S. Pat. No. 4,518,584). Preferably, such alteration, substitution, replacement, insertion or deletion retains the desired activity of the protein. Regions of the protein that are important for the protein function can be determined by various methods known in the art including the alanine-scanning method which involved systematic substitution of single or strings of amino acids with alanine, followed by testing the resulting alanine-containing variant for biological activity. This type of analysis determines the importance of the substituted amino acid(s) in biological activity. Regions of the protein that are important for protein function may be determined by the eMATRIX program.
Other fragments and derivatives of the sequences of proteins which would be expected to retain protein activity in whole or in part and are useful for screening or other immunological methodologies may also be easily made by those skilled in the art given the disclosures herein.
Such modifications are encompassed by the present invention.
The protein may also be produced by operably linking the isolated polynucleotide of the invention to suitable control sequences in one or more insect expression vectors, and employing an insect expression system. Materials and methods for baculovirus/insect cell expression systems are commercially available in kit form from, e.g., Invitrogen, San Diego, Calif, U.S.A. (the MaxBat™ kit), and such methods are well known in the art, as described in Summers and Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), incorporated herein by reference. As used herein, an insect cell capable of expressing a polynucleotide of the present invention is "transformed."
The protein of the invention may be prepared by culturing transformed host cells under culture conditions suitable to express the recombinant protein. The resulting expressed protein may then be purified from such culture (i.e., from culture medium or cell extracts) using known purification processes, such as gel filtration and ion exchange chromatography. The purification of the protein may also include an affinity column containing agents which will bind to the protein; one or more column steps over such affinity resins as concanavalin A-agarose, heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™; one or more steps involving hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl ether; or immunoaffinity chromatography. Alternatively, the protein of the invention may also be expressed in a form that will facilitate purification. For example, it may be expressed as a fusion protein, such as those of maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as a His-tag. Kits for expression and purification of such fusion proteins are commercially available from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and Invitrogen, respectively. The protein can also be tagged with an epitope and subsequently purified by using a specific antibody directed to such epitope. One such epitope ("FLAG®") is commercially available from Kodak (New Haven, Conn.).
Finally, one or more reverse-phase high performance liquid chromatography (RP- HPLC) steps employing hydrophobic RP-HPLC media, e.g. , silica gel having pendant methyl or other aliphatic groups, can be employed to further purify the protein. Some or all of the foregoing purification steps, in various combinations, can also be employed to provide a substantially homogeneous isolated recombinant protein. The protein thus purified is substantially free of other mammalian proteins and is defined in accordance with the present invention as an "isolated protein." The polypeptides of the invention include analogs (variants). This embraces fragments, as well as peptides in which one or more amino acids has been deleted, inserted, or substituted. Also, analogs of the polypeptides of the invention embrace fusions of the polypeptides or modifications of the polypeptides of the invention, wherein the polypeptide or analog is fused to another moiety or moieties, e.g. , targeting moiety or another therapeutic agent. Such analogs may exhibit improved properties such as activity and/or stability. Examples of moieties which may be fused to the polypeptide or an analog include, for example, targeting moieties which provide for the delivery of polypeptide to pancreatic cells, e.g., antibodies to pancreatic cells, antibodies to immune cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well as receptor and ligands expressed on pancreatic or immune cells. Other moieties which may be fused to the polypeptide include therapeutic agents which are used for treatment, for example, immunosuppressive drugs such as cyclosporin, SK506, azathioprine, CD3 antibodies and steroids. Also, polypeptides may be fused to immune modulators, and other cytokines such as alpha or beta interferon.
4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE IDENTITY AND SIMILARITY
Preferred identity and/or similarity are designed to give the largest match between the sequences tested. Methods to determine identity and similarity are codified in computer programs including, but are not limited to, the GCG program package, including GAP
(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, University of Wisconsin, Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, S.F. et al., J. Molec. Biol. 215:403-410 (1990), PSI-BLAST (Altschul S.F. et al., Nucleic Acids Res. vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu et al., J. Comp. Biol., Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif software (Nevill- Manning et al, ISMB-97, Vol. 4, pp. 202-209, herein incorporated by reference), pFam software (Sonnhammer et al., Nucleic Acids Res., Vol. 26(1), pp. 320-322 (1998), herein incorporated by reference) and the Kyte-Doolittle hydrophobocity prediction algorithm (J. Mol Biol, 157, pp. 105-31 (1982), incorporated herein by reference). The BLAST programs are publicly available from the National Center for Biotechnology Information (NCBI) and other sources (BLAST Manual, Altschul, S., et al. NCB NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. Biol. 215:403-410 (1990).
4.7 CHIMERIC AND FUSION PROTEINS
The invention also provides chimeric or fusion proteins. As used herein, a "chimeric protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to another polypeptide. Within a fusion protein the polypeptide according to the invention can correspond to all or a portion of a protein according to the invention. In one embodiment, a fusion protein comprises at least one biologically active portion of a protein according to the invention. In another embodiment, a fusion protein comprises at least two biologically active portions of a protein according to the invention. Within the fusion protein, the term "operatively linked" is intended to indicate that the polypeptide according to the invention and the other polypeptide are fused in-frame to each other. The polypeptide can be fused to the N-terminus or C-terminus.
For example, in one embodiment a fusion protein comprises a polypeptide according to the invention operably linked to the extracellular domain of a second protein.
In another embodiment, the fusion protein is a GST-fusion protein in which the polypeptide sequences of the invention are fused to the C-terminus of the GST (i.e., glutathione S-transferase) sequences.
In another embodiment, the fusion protein is an immunoglobulin fusion protein in which the polypeptide sequences according to the invention comprises one or more domains are fused to sequences derived from a member of the immunoglobulin protein family. The immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical compositions and administered to a subject to inhibit an interaction between a ligand and a protein of the invention on the surface of a cell, to thereby suppress signal transduction in vivo. The immunoglobulin fusion proteins can be used to affect the bioavailability of a cognate ligand. Inhibition of the ligand/protein interaction may be useful therapeutically for both the treatment of proliferative and differentiative disorders, e,g., cancer as well as modulating (e.g., promoting or inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of the invention can be used as immunogens to produce antibodies in a subject, to purify ligands, and in screening assays to identify molecules that inhibit the interaction of a polypeptide of the invention with a ligand. A chimeric or fusion protein of the invention can be produced by standard recombinant DNA techniques. For example, DNA fragments coding for the different polypeptide sequences are ligated together in-frame in accordance with conventional techniques, e.g., by employing blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can be synthesized by conventional techniques including automated DNA synthesizers. Alternatively, PCR amplification of gene fragments can be carried out using anchor primers that give rise to complementary overhangs between two consecutive gene fragments that can subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for example, Ausubel et al. (eds.) CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, John Wiley & Sons, 1992). Moreover, many expression vectors are commercially available that already encode a fusion moiety (e.g., a GST polypeptide). A nucleic acid encoding a polypeptide of the invention can be cloned into such an expression vector such that the fusion moiety is linked in-frame to the protein of the invention.
4.8 GENE THERAPY
Mutations in the polynucleotides of the invention gene may result in loss of normal function of the encoded protein. The invention thus provides gene therapy to restore normal activity of the polypeptides of the invention; or to treat disease states involving polypeptides of the invention. Delivery of a functional gene encoding polypeptides of the invention to appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for example, Anderson, Nature, supplement to vol. 392, no. 6679, pp.25-20 (1998). For additional reviews of gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989); Verma, Scientific American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any one of the nucleotides of the present invention or a gene encoding the polypeptides of the present invention can also be accomplished with extrachromosomal substrates (transient expression) or artificial chromosomes (stable expression). Cells may also be cultured ex vivo in the presence of proteins of the present invention in order to proliferate or to produce a desired effect on or activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. Alternatively, it is contemplated that in other human disease states, preventing the expression of or inhibiting the activity of polypeptides of the invention will be useful in treating the disease states. It is contemplated that antisense therapy or gene therapy could be applied to negatively regulate the expression of polypeptides of the invention.
Other methods inhibiting expression of a protein include the introduction of antisense molecules to the nucleic acids of the present invention, their complements, or their translated RNA sequences, by methods known in the art. Further, the polypeptides of the present invention can be inhibited by using targeted deletion methods, or the insertion of a negative regulatory element such as a silencer, which is tissue specific.
The present invention still further provides cells genetically engineered in vivo to express the polynucleotides of the invention, wherein such polynucleotides are in operative association with a regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in the cell. These methods can be used to increase or decrease the expression of the polynucleotides of the present invention.
Knowledge of DNA sequences provided by the invention allows for modification of cells to permit, increase, or decrease, expression of endogenous polypeptide. Cells can be modified (e.g. , by homologous recombination) to provide increased polypeptide expression by replacing, in whole or in part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells express the protein at higher levels. The heterologous promoter is inserted in such a manner that it is operatively linked to the desired protein encoding sequences. See, for example, PCT International Publication No. WO 94/12650, PCT International Publication No. WO 92/20808, and PCT International Publication No. WO 91/09955. It is also contemplated that, in addition to heterologous promoter DNA, amplifiable marker DNA (e.g. , ada, dhfr, and the multifunctional CAD gene which encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or intron DNA may be inserted along with the heterologous promoter DNA. If linked to the desired protein coding sequence, amplification of the marker DNA by standard selection methods results in co-amplification of the desired protein coding sequences in the cells. In another embodiment of the present invention, cells and tissues may be engineered to express an endogenous gene comprising the polynucleotides of the invention under the control of inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may be replaced by homologous recombination. As described herein, gene targeting can be used to replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene or a novel regulatory sequence synthesized by genetic engineering methods. Such regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations of said sequences. Alternatively, sequences which affect the structure or stability of the RNA or protein produced may be replaced, removed, added, or otherwise modified by targeting. These sequences include polyadenylation signals, mRNA stability elements, splice sites, leader sequences for enhancing or modifying transport or secretion properties of the protein, or other sequences which alter or improve the function or stability of protein or RNA molecules.
The targeting event may be a simple insertion of the regulatory sequence, placing the gene under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the targeting event may replace an existing element; for example, a tissue-specific enhancer can be replaced by an enhancer that has broader or different cell-type specificity than the naturally occurring elements. Here, the naturally occurring sequences are deleted and new sequences are
i_> added. In all cases, the identification of the targeting event may be facilitated by the use of one or more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection of cells in which the exogenous DNA has integrated into the cell genome. The identification of the targeting event may also be facilitated by the use of one or more marker genes exhibiting the property of negative selection, such that the negatively selectable marker is linked to the exogenous DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and such that a correct homologous recombination event with sequences in the host cell genome does not result in the stable integration of the negatively selectable marker. Markers useful for this purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine phosphoribosyl-transferase (gpt) gene.
The gene targeting or gene activation techniques which can be used in accordance with this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to Chappel; U.S. Patent No. 5,578,461 to Sherwinet al.; International Application No. PCT/US92/09627 (WO93/09222)by Selden et al.; and International Application No. PCT/US90/06436 ( WO91 /06667) by Skoultchi et al . , each of which is incorporated by reference herein in its entirety.
4.9 TRANSGENIC ANIMALS
In preferred methods to determine biological functions of the polypeptides of the invention in vivo, one or more genes provided by the invention are either over expressed or inactivated in the germ line of animals using homologous recombination [Capecchi, Science 244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory control of exogenous or endogenous promoter elements, are known as transgenic animals. Animals in which an endogenous gene has been inactivated by homologous recombination are referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference.
Transgenic animals are useful to determine the roles polypeptides of the invention play in biological processes, and preferably in disease states. Transgenic animals are useful as model systems to identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT Publication No. WO94/28122, incorporated herein by reference.
Transgenic animals can be prepared wherein all or part of a promoter of the polynucleotides of the invention is either activated or inactivated to alter the level of expression of the polypeptides of the invention. Inactivation can be carried out using homologous recombination methods described above. Activation can be achieved by supplementing or even replacing the homologous promoter to provide for increased protein expression. The homologous promoter can be supplemented by insertion of one or more heterologous enhancer elements known to confer promoter activation in a particular tissue.
The polynucleotides of the present invention also make possible the development, through, e.g., homologous recombination or knock out strategies, of animals that fail to express polypeptides of the invention or that express a variant polypeptide. Such animals are useful as models for studying the in vivo activities of polypeptide as well as for studying modulators of the polypeptides of the invention.
In preferred methods to determine biological functions of the polypeptides of the invention in vivo, one or more genes provided by the invention are either over expressed or inactivated in the germ line of animals using homologous recombination [Capecchi, Science 244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory control of exogenous or endogenous promoter elements, are known as transgenic animals. Animals in which an endogenous gene has been inactivated by homologous recombination are referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference.
Transgenic animals are useful to determine the roles polypeptides of the invention play in biological processes, and preferably in disease states. Transgenic animals are useful as model systems to identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT Publication No. WO94/28122, incorporated herein by reference.
Transgenic animals can be prepared wherein all or part of the polynucleotides of the invention promoter is either activated or inactivated to alter the level of expression of the polypeptides of the invention. Inactivation can be carried out using homologous recombination methods described above. Activation can be achieved by supplementing or even replacing the homologous promoter to provide for increased protein expression. The homologous promoter can be supplemented by insertion of one or more heterologous enhancer elements known to confer promoter activation in a particular tissue.
4.10 USES AND BIOLOGICAL ACTIVITY The polynucleotides and proteins of the present invention are expected to exhibit one or more of the uses or biological activities (including those associated with assays cited herein) identified herein. Uses or activities described for proteins of the present invention may be provided by administration or use of such proteins or of polynucleotides encoding such proteins (such as, for example, in gene therapies or vectors suitable for introduction of DNA). The mechanism underlying the particular condition or pathology will dictate whether the polypeptides of the invention, the polynucleotides of the invention or modulators (activators or inhibitors) thereof would be beneficial to the subject in need of treatment. Thus, "therapeutic compositions of the invention" include compositions comprising isolated polynucleotides (including recombinant DNA molecules, cloned genes and degenerate variants thereof) or polypeptides of the invention (including full length protein, mature protein and truncations or domains thereof), or compounds and other substances that modulate the overall activity of the target gene products, either at the level of target gene/protein expression or target protein activity. Such modulators include polypeptides, analogs, (variants), including fragments and fusion proteins, antibodies and other binding proteins; chemical compounds that directly or indirectly activate or inhibit the polypeptides of the invention (identified, e.g., via drug screening assays as described herein); antisense polynucleotides and polynucleotides suitable for triple helix formation; and in particular antibodies or other binding partners that specifically recognize one or more epitopes of the polypeptides of the invention.
The polypeptides of the present invention may likewise be involved in cellular activation or in one of the other physiological pathways described herein.
4.10.1 RESEARCH USES AND UTILITIES
The polynucleotides provided by the present invention can be used by the research community for various purposes. The polynucleotides can be used to express recombinant protein for analysis, characterization or therapeutic use; as markers for tissues in which the corresponding protein is preferentially expressed (either constitutively or at a particular stage of tissue differentiation or development or in disease states); as molecular weight markers on gels; as chromosome markers or tags (when labeled) to identify chromosomes or to map related gene positions; to compare with endogenous DNA sequences in patients to identify potential genetic disorders; as probes to hybridize and thus discover novel, related DNA sequences; as a source of information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known sequences in the process of discovering other novel polynucleotides; for selecting and making oligomers for attachment to a "gene chip" or other support, including for examination of expression patterns; to raise anti-protein antibodies using DNA immunization techniques; and as an antigen to raise anti-DNA antibodies or elicit another immune response. Where the polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for example, in a receptor-ligand interaction), the polynucleotide can also be used in interaction trap assays (such as, for example, that described in Gyuris et al., Cell 75:791-803 (1993)) to identify polynucleotides encoding the other protein with which binding occurs or to identify inhibitors of the binding interaction. The polypeptides provided by the present invention can similarly be used in assays to determine biological activity, including in a panel of multiple proteins for high-throughput screening; to raise antibodies or to elicit another immune response; as a reagent (including the labeled reagent) in assays designed to quantitatively determine levels of the protein (or its receptor) in biological fluids; as markers for tissues in which the corresponding polypeptide is preferentially expressed (either constitutively or at a particular stage of tissue differentiation or development or in a disease state); and, of course, to isolate correlative receptors or ligands. Proteins involved in these binding interactions can also be used to screen for peptide or small molecule inhibitors or agonists of the binding interaction. Any or all of these research utilities are capable of being developed into reagent grade or kit format for commercialization as research products.
Methods for performing the uses listed above are well known to those skilled in the art. References disclosing such methods include without limitation "Molecular Cloning: A Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987.
4.10.2 NUTRITIONAL USES
Polynucleotides and polypeptides of the present invention can also be used as nutritional sources or supplements. Such uses include without limitation use as a protein or amino acid supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In such cases the polypeptide or polynucleotide of the invention can be added to the feed of a particular organism or can be administered as a separate solid or liquid preparation, such as in the form of powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the polypeptide or polynucleotide of the invention can be added to the medium in or on which the microorganism is cultured.
4.10.3 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION ACTIVITY A polypeptide of the present invention may exhibit activity relating to cytokine, cell proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting) activity or may induce production of other cytokines in certain cell populations. A polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Many protein factors discovered to date, including all known cytokines, have exhibited activity in one or more factor-dependent cell proliferation assays, and hence the assays serve as a convenient confirmation of cytokine activity. The activity of therapeutic compositions of the present invention is evidenced by any one of a number of routine factor dependent cell proliferation assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3, MC9/G, M+(preB M+), 2E8, RB5, DAI, 123, Tl 165, HT2, CTLL2, TF-1, Mo7e, CMK, HUVEC, and Caco. Therapeutic compositions of the invention can be used in the following: Assays for T-cell or thymocyte proliferation include without limitation those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol.
145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Bertagnolli, et al., I. Immunol. 149:3778-3783, 1992; Bowman et al., I. Immunol. 152:1756-1761, 1994.
Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of mouse and human interleukin-γ, Schreiber, R. D. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994.
Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991; deVries et al., J. Exp. Med. 173:1205-121 1, 1991; Moreau et al., Nature 336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse and human interleukin 6~Nordan, R. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991 ; Smith et al., Proc. Natl. Aced. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin 1 1-Bennett, F., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991 ; Measurement of mouse and human Interleukin 9— Ciarletta, A., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991.
Assays for T-cell clone responses to antigens (which will identify, among others, proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring proliferation and cytokine production) include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse
Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et al., Proc. Natl. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et al., Eur. J. Immun. 11 :405-411, 1981; Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988.
4.10.4 STEM CELL GROWTH FACTOR ACTIVITY
A polypeptide of the present invention may exhibit stem cell growth factor activity and be involved in the proliferation, differentiation and survival of pluripotent and totipotent stem cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells and/or germ line stem cells. Administration of the polypeptide of the invention to stem cells in vivo or ex vivo is expected to maintain and expand cell populations in a totipotential or pluripotential state which would be useful for re-engineering damaged or diseased tissues, transplantation, manufacture of bio-pharmaceuticals and the development of bio-sensors. The ability to produce large quantities of human cells has important working applications for the production of human proteins which currently must be obtained from non-human sources or donors, implantation of cells to treat diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases; tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs for transplantation such as kidney, liver, pancreas (including islet cells), heart and lung.
It is contemplated that multiple different exogenous growth factors and/or cytokines may be administered in combination with the polypeptide of the invention to achieve the desired effect, including any of the growth factors listed herein, other stem cell maintenance factors, and specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), Flt-3 ligand (Flt- 3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, macrophage inflammatory protein 1 -alpha (MIP-1 -alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors and basic fibroblast growth factor (bFGF).
Since totipotent stem cells can give rise to virtually any mature cell type, expansion of these cells in culture will facilitate the production of large quantities of mature cells. Techniques for culturing stem cells are known in the art and administration of polypeptides of the invention, optionally with other growth factors and/or cytokines, is expected to enhance the survival and proliferation of the stem cell populations. This can be accomplished by direct administration of the polypeptide of the invention to the culture medium. Alternatively, stroma cells transfected with a polynucleotide that encodes for the polypeptide of the invention can be used as a feeder layer for the stem cell populations in culture or in vivo. Stromal support cells for feeder layers may include embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926).
Stem cells themselves can be transfected with a polynucleotide of the invention to induce autocrine expression of the polypeptide of the invention. This will allow for generation of undifferentiated totipotential/pluripotential stem cell lines that are useful as is or that can then be differentiated into the desired mature cell types. These stable cell lines can also serve as a source of undifferentiated totipotential/pluripotential mRNA to create cDNA libraries and templates for polymerase chain reaction experiments. These studies would allow for the isolation and identification of differentially expressed genes in stem cell populations that regulate stem cell proliferation and/or maintenance.
Expansion and maintenance of totipotent stem cell populations will be useful in the treatment of many pathological conditions. For example, polypeptides of the present invention may be used to manipulate stem cells in culture to give rise to neuroepithelial cells that can be used to augment or replace cells damaged by illness, autoimmune disease, accidental damage or genetic disorders. The polypeptide of the invention may be useful for inducing the proliferation of neural cells and for the regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In addition, the expanded stem cell populations can also be genetically altered for gene therapy purposes and to decrease host rejection of replacement tissues after grafting or implantation. Expression of the polypeptide of the invention and its effect on stem cells can also be manipulated to achieve controlled differentiation of the stem cells into more differentiated cell types. A broadly applicable method of obtaining pure populations of a specific differentiated cell type from undifferentiated stem cell populations involves the use of a cell-type specific promoter driving a selectable marker. The selectable marker allows only cells of the desired type to survive. For example, stem cells can be induced to differentiate into cardiomyocytes (Wobus et al., Differentiation, 48: 173-182, (1991); Klug et al., J. Clin. Invest., 98(1): 216-224, (1998)) or skeletal muscle cells (Browder, L. W. In: Principles of Tissue Engineering eds. Lanza et al., Academic Press (1997)). Alternatively, directed differentiation of stem cells can be accomplished by culturing the stem cells in the presence of a differentiation factor such as retinoic acid and an antagonist of the polypeptide of the invention which would inhibit the effects of endogenous stem cell factor activity and allow differentiation to proceed.
In vitro cultures of stem cells can be used to determine if the polypeptide of the invention exhibits stem cell growth factor activity. Stem cells are isolated from any one of various cell sources (including hematopoietic stem cells and embryonic stem cells) and cultured on a feeder layer, as described by Thompson et al. Proc. Natl. Acad. Sci, U.S.A., 92: 7844-7848 (1995), in the presence of the polypeptide of the invention alone or in combination with other growth factors or cytokines. The ability of the polypeptide of the invention to induce stem cells proliferation is determined by colony formation on semi-solid support e.g. as described by Bernstein et al., Blood, 77: 2316-2321 (1991).
4.10.5 HEMATOPOIESIS REGULATING ACTIVITY
A polypeptide of the present invention may be involved in regulation of hematopoiesis and, consequently, in the treatment of myeloid or lymphoid cell disorders. Even marginal biological activity in support of colony forming cells or of factor-dependent cell lines indicates involvement in regulating hematopoiesis, e.g. in supporting the growth and proliferation of erythroid progenitor cells alone or in combination with other cytokines, thereby indicating utility, for example, in treating various anemias or for use in conjunction with irradiation/chemotherapy to stimulate the production of erythroid precursors and/or erythroid cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and monocytes/macrophages (i.e., traditional CSF activity) useful, for example, in conjunction with chemotherapy to prevent or treat consequent myelo-suppression; in supporting the growth and proliferation of megakaryocytes and consequently of platelets thereby allowing prevention or treatment of various platelet disorders such as thrombocytopenia, and generally for use in place of or complimentary to platelet transfusions; and/or in supporting the growth and proliferation of hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned hematopoietic cells and therefore find therapeutic utility in various stem cell disorders (such as those usually treated with transplantation, including, without limitation, aplastic anemia and paroxysmal nocturnal hemoglobinuria), as well as in repopulating the stem cell compartment post irradiation/chemotherapy, either in-vivo or ex-vivo (i.e., in conjunction with bone marrow transplantation or with peripheral progenitor cell transplantation (homologous or heterologous)) as normal cells or genetically manipulated for gene therapy.
Therapeutic compositions of the invention can be used in the following: Suitable assays for proliferation and differentiation of various hematopoietic lines are cited above.
Assays for embryonic stem cell differentiation (which will identify, among others, proteins that influence embryonic differentiation hematopoiesis) include, without limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 81 :2903-2915, 1993. Assays for stem cell survival and differentiation (which will identify, among others, proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; Hirayama et al., Proc. Natl. Acad. Sci. USA 89:5907-5911, 1992; Primitive hematopoietic colony forming cells with high proliferative potential, McNiece, I. K. and Briddell, R. A. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et al., Experimental Hematology 22:353-359, 1994; Cobblestone area forming cell assay, Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 1-21, Wiley-Liss, Inc., New York, N.Y. 1994; Long term bone marrow cultures in the presence of stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, N.Y. 1994; Long term culture initiating cell assay, Sutherland, H. J. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, N.Y. 1994.
4.10.6 TISSUE GROWTH ACTIVITY
A polypeptide of the present invention also may be involved in bone, cartilage, tendon, ligament and/or nerve tissue growth or regeneration, as well as in wound healing and tissue repair and replacement, and in healing of burns, incisions and ulcers. A polypeptide of the present invention which induces cartilage and/or bone growth in circumstances where bone is not normally formed, has application in the healing of bone fractures and cartilage damage or defects in humans and other animals. Compositions of a polypeptide, antibody, binding partner, or other modulator of the invention may have prophylactic use in closed as well as open fracture reduction and also in the improved fixation of artificial joints. De novo bone formation induced by an osteogenic agent contributes to the repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and also is useful in cosmetic plastic surgery.
A polypeptide of this invention may also be involved in attracting bone-forming cells, stimulating growth of bone-forming cells, or inducing differentiation of progenitors of bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking inflammation or processes of tissue destruction (collagenase activity, osteoclast activity, etc.) mediated by inflammatory processes may also be possible using the composition of the invention. Another category of tissue regeneration activity that may involve the polypeptide of the present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue or other tissue formation in circumstances where such tissue is not normally formed, has application in the healing of tendon or ligament tears, deformities and other tendon or ligament defects in humans and other animals. Such a preparation employing a tendon/ligament-like tissue inducing protein may have prophylactic use in preventing damage to tendon or ligament tissue, as well as use in the improved fixation of tendon or ligament to bone or other tissues, and in repairing defects to tendon or ligament tissue. De novo tendon/ligament-like tissue formation induced by a composition of the present invention contributes to the repair of congenital, trauma induced, or other tendon or ligament defects of other origin, and is also useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments. The compositions of the present invention may provide environment to attract tendon- or ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, induce differentiation of progenitors of tendon- or ligament- forming cells, or induce growth of tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue repair. The compositions of the invention may also be useful in the treatment of tendinitis, carpal tunnel syndrome and other tendon or ligament defects. The compositions may also include an appropriate matrix and/or sequestering agent as a carrier as is well known in the art.
The compositions of the present invention may also be useful for proliferation of neural cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a composition may be used in the treatment of diseases of the peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager syndrome. Further conditions that may be treated in accordance with the present invention include mechanical and traumatic disorders, such as spinal cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting from chemotherapy or other medical therapies may also be treatable using a composition of the invention.
Compositions of the invention may also be useful to promote better or faster closure of non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular insufficiency, surgical and traumatic wounds, and the like.
Compositions of the present invention may also be involved in the generation or regeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular endothelium) tissue, or for promoting the growth of cells comprising such tissues. Part of the desired effects may be by inhibition or modulation of fibrotic scarring may allow normal tissue to regenerate. A polypeptide of the present invention may also exhibit angiogenic activity. A composition of the present invention may also be useful for gut protection or regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and conditions resulting from systemic cytokine damage.
A composition of the present invention may also be useful for promoting or inhibiting differentiation of tissues described above from precursor tissues or cells; or for inhibiting the growth of tissues described above.
Therapeutic compositions of the invention can be used in the following: Assays for tissue generation activity include, without limitation, those described in: International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International Patent Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. WO91/07491 (skin, endothelium).
Assays for wound healing activity include, without limitation, those described in: Winter, Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T., eds.), Year Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol 71 :382-84 (1978).
4.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY A polypeptide of the present invention may also exhibit immune stimulating or immune suppressing activity, including without limitation the activities for which assays are described herein. A polynucleotide of the invention can encode a polypeptide exhibiting such activities. A protein may be useful in the treatment of various immune deficiencies and disorders (including severe combined immunodeficiency (SOD)), e.g., in regulating (up or down) growth and proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells and other cell populations. These immune deficiencies may be genetic or be caused by viral (e.g., HIV) as well as bacterial or fungal infections, or may result from autoimmune disorders. More specifically, infectious diseases causes by viral, bacterial, fungal or other infection may be treatable using a protein of the present invention, including infections by HIV, hepatitis viruses, herpes viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such as candidiasis. Of course, in this regard, proteins of the present invention may also be useful where a boost to the immune system generally may be desirable, i.e., in the treatment of cancer. Autoimmune disorders which may be treated using a protein of the present invention include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, graft-versus-host disease and autoimmune inflammatory eye disease. Such a protein (or antagonists thereof, including antibodies) of the present invention may also to be useful in the treatment of allergic reactions and conditions (e.g., anaphylaxis, serum sickness, drug reactions, food allergies, insect venom allergies, mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria, angioedema, eczema, atopic dermatitis, allergic contact dermatitis, erythema multiforme, Stevens- Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, venereal keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma (particularly allergic asthma) or other respiratory problems. Other conditions, in which immune suppression is desired (including, for example, organ transplantation), may also be treatable using a protein (or antagonists thereof) of the present invention. The therapeutic effects of the polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo animals models such as the cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66, 1998), skin prick test (Hoffmann et al., Allergy 54: 446-54, 1999), guinea pig skin sensitization test (Vohr et al., Arch. Toxocol. 73: 501-9), and murine local lymph node assay (Kimber et al., J. Toxicol. Environ. Health 53: 563-79). Using the proteins of the invention it may also be possible to modulate immune responses, in a number of ways. Down regulation may be in the form of inhibiting or blocking an immune response already in progress or may involve preventing the induction of an immune response. The functions of activated T cells may be inhibited by suppressing T cell responses or by inducing specific tolerance in T cells, or both. Immunosuppression of T cell responses is generally an active, non-antigen-specific, process which requires continuous exposure of the T cells to the suppressive agent. Tolerance, which involves inducing non-responsiveness or anergy in T cells, is distinguishable from immunosuppression in that it is generally antigen-specific and persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be demonstrated by the lack of a T cell response upon reexposure to specific antigen in the absence of the tolerizing agent.
Down regulating or preventing one or more antigen functions (including without limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and organ transplantation and in graft-versus-host disease (GVHD). For example, blockage of T cell function should result in reduced tissue destruction in tissue transplantation. Typically, in tissue transplants, rejection of the transplant is initiated through its recognition as foreign by T cells, followed by an immune reaction that destroys the transplant. The administration of a therapeutic composition of the invention may prevent cytokine synthesis by immune cells, such as T cells, and thus acts as an immunosuppressant. Moreover, a lack of costimulation may also be sufficient to anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration of these blocking reagents. To achieve sufficient immunosuppression or tolerance in a subject, it may also be necessary to block the function of a combination of B lymphocyte antigens. The efficacy of particular therapeutic compositions in preventing organ transplant rejection or GVHD can be assessed using animal models that are predictive of efficacy in humans. Examples of appropriate systems which can be used include allogeneic cardiac grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et al., Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. Sci USA, 89:1 1102-11 105 (1992). In addition, murine models of GVHD (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to determine the effect of therapeutic compositions of the invention on the development of that disease.
Blocking antigen function may also be therapeutically useful for treating autoimmune diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that are reactive against self-tissue and which promote the production of cytokines and autoanti bodies involved in the pathology of the diseases. Preventing the activation of autoreactive T cells may reduce or eliminate disease symptoms. Administration of reagents which block stimulation of T cells can be used to inhibit T cell activation and prevent production of autoantibodies or T cell-derived cytokines which may be involved in the disease process. Additionally, blocking reagents may induce antigen-specific tolerance of autoreactive T cells which could lead to long-term relief from the disease. The efficacy of blocking reagents in preventing or alleviating autoimmune disorders can be determined using a number of well-characterized animal models of human autoimmune diseases. Examples include murine experimental autoimmune encephalitis, systemic lupus erythmatosis in MRL/lpr/lpr mice or
NZB hybrid mice, murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental myasthenia gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 840-856).
Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a means of up regulating immune responses, may also be useful in therapy. Upregulation of immune responses may be in the form of enhancing an existing immune response or eliciting an initial immune response. For example, enhancing an immune response may be useful in cases of viral infection, including systemic viral diseases such as influenza, the common cold, and encephalitis. Alternatively, anti-viral immune responses may be enhanced in an infected patient by removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed APCs either expressing a peptide of the present invention or together with a stimulatory form of a soluble peptide of the present invention and reintroducing the in vitro activated T cells into the patient. Another method of enhancing anti-viral immune responses would be to isolate infected cells from a patient, transfect them with a nucleic acid encoding a protein of the present invention as described herein such that the cells express all or a portion of the protein on their surface, and reintroduce the transfected cells into the patient. The infected cells would now be capable of delivering a costimulatory signal to, and thereby activate, T cells in vivo.
A polypeptide of the present invention may provide the necessary stimulation signal to T cells to induce a T cell mediated immune response against the transfected tumor cells. In addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to reexpress sufficient mounts of MHC class I or MHC class II molecules, can be transfected with nucleic acid encoding all or a portion of (e.g., a cytoplasmic-domain truncated portion) of an MHC class I alpha chain protein and β microglobulin protein or an MHC class II alpha chain protein and an MHC class II beta chain protein to thereby express MHC class I or MHC class II proteins on the cell surface. Expression of the appropriate class I or class II MHC in conjunction with a peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T cell mediated immune response against the transfected tumor cell. Optionally, a gene encoding an antisense construct which blocks expression of an MHC class II associated protein, such as the invariant chain, can also be cotransfected with a DNA encoding a peptide having the activity of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce tumor specific immunity. Thus, the induction of a T cell mediated immune response in a human subject may be sufficient to overcome tumor-specific tolerance in the subject.
The activity of a protein of the invention may, among other means, be measured by the following methods:
Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and WileyTnterscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488-2492, 1981 ; Herrmann et al., J. Immunol. 128: 1968-1974, 1982; Handa et al., J.
Immunol. 135:1564-1572, 1985; Takai et al., I. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988; Bowman et al., J. Virology 61 :1992-1998; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 1994. Assays for T-cell-dependent immunoglobulin responses and isotype switching (which will identify, among others, proteins that modulate T-cell dependent antibody responses and that affect Thl/Th2 profiles) include, without limitation, those described in: Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994.
Mixed lymphocyte reaction (MLR) assays (which will identify, among others, proteins that generate predominantly Thl and CTL responses) include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992.
Dendritic cell-dependent assays (which will identify, among others, proteins expressed by dendritic cells that activate naive T-cells) include, without limitation, those described in: Guery et al., J. Immunol. 134:536-544, 1995; Inaba et al., Journal of Experimental Medicine
173:549-559, 1991; Macatonia et al., Journal of Immunology 154:5071-5079, 1995; Porgador et al.. Journal of Experimental Medicine 182:255-260, 1995; Nair et al., Journal of Virology 67:4062-4069, 1993; Huang et al., Science 264:961-965, 1994; Macatonia et al., Journal of Experimental Medicine 169:1255-1264, 1989; Bhardwaj et al., Journal of Clinical Investigation 94:797-807, 1994; and Inaba et al., Journal of Experimental Medicine 172:631-640, 1990.
Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry 13:795-808, 1992; Gorczyca et al., Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Research 53:1945-1951, 1993; Itoh et al., Cell 66:233-243, 1991 ; Zacharchuk, Journal of Immunology 145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et al., International Journal of Oncology 1 :639-648, 1992.
Assays for proteins that influence early steps of T-cell commitment and development include, without limitation, those described in: Antica et al., Blood 84:1 11-1 17, 1994; Fine et al., Cellular Immunology 155:111-122, 1994; Galy et al., Blood 85:2770-2778, 1995; Toki et al.,
Proc. Nat. Acad Sci. USA 88:7548-7551, 1991.
4.10.8 ACTIVIN/INHIBIN ACTIVITY A polypeptide of the present invention may also exhibit activin- or inhibin-related activities. A polynucleotide of the invention may encode a polypeptide exhibiting such characteristics. Inhibins are characterized by their ability to inhibit the release of follicle stimulating hormone (FSH), while activins and are characterized by their ability to stimulate the release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present invention, alone or in heterodimers with a member of the inhibin family, may be useful as a contraceptive based on the ability of inhibins to decrease fertility in female mammals and decrease spermatogenesis in male mammals. Administration of sufficient amounts of other inhibins can induce infertility in these mammals. Alternatively, the polypeptide of the invention, as a homodimer or as a heterodimer with other protein subunits of the inhibin group, may be useful as a fertility inducing therapeutic, based upon the ability of activin molecules in stimulating FSH release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A polypeptide of the invention may also be useful for advancement of the onset of fertility in sexually immature mammals, so as to increase the lifetime reproductive performance of domestic animals such as, but not limited to, cows, sheep and pigs. The activity of a polypeptide of the invention may, among other means, be measured by the following methods.
Assays for activin/inhibin activity include, without limitation, those described in: Vale et al., Endocrinology 91 :562-572, 1972; Ling et al., Nature 321 :779-782, 1986; Vale et al., Nature 321 :776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. Natl. Acad. Sci. USA 83:3091-3095, 1986.
4.10.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY
A polypeptide of the present invention may be involved in chemotactic or chemokinetic activity for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Chemotactic and chemokinetic receptor activation can be used to mobilize or attract a desired cell population to a desired site of action. Chemotactic or chemokinetic compositions (e.g. proteins, antibodies, binding partners, or modulators of the invention) provide particular advantages in treatment of wounds and other trauma to tissues, as well as in treatment of localized infections. For example, attraction of lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in improved immune responses against the tumor or infecting agent.
A protein or peptide has chemotactic activity for a particular cell population if it can stimulate, directly or indirectly, the directed orientation or movement of such cell population. Preferably, the protein or peptide has the ability to directly stimulate directed movement of cells. Whether a particular protein has chemotactic activity for a population of cells can be readily determined by employing such protein or peptide in any known assay for cell chemotaxis. Therapeutic compositions of the invention can be used in the following: Assays for chemotactic activity (which will identify proteins that induce or prevent chemotaxis) consist of assays that measure the ability of a protein to induce the migration of cells across a membrane as well as the ability of a protein to induce the adhesion of one cell population to another cell population. Suitable assays for movement and adhesion include, without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Marguiles, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta Chemokines
6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. APMIS 103:140-146, 1995; Muller et al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860-5867, 1994; Johnston et al. J. of Immunol. 153:1762-1768, 1994.
4.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY
A polypeptide of the invention may also be involved in hemostatis or thrombolysis or thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Compositions may be useful in treatment of various coagulation disorders (including hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events in treating wounds resulting from trauma, surgery or other causes. A composition of the invention may also be useful for dissolving or inhibiting formation of thromboses and for treatment and prevention of conditions resulting therefrom (such as, for example, infarction of cardiac and central nervous system vessels (e.g., stroke).
Therapeutic compositions of the invention can be used in the following: Assay for hemostatic and thrombolytic activity include, without limitation, those described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis Res. 45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 35:467-474, 1988.
4.10.11 CANCER DIAGNOSIS AND THERAPY Polypeptides of the invention may be involved in cancer cell generation, proliferation or metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. For example, the presence or increased expression of a polynucleotide/polypeptide of the invention may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing malignancy. Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancer condition. Identification of single nucleotide polymorphisms associated with cancer or a predisposition to cancer may also be useful for diagnosis or prognosis.
Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor growth) and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. Therapeutic compositions of the invention may be effective in adult and pediatric oncology including in solid phase tumors/malignancies, locally advanced tumors, human soft tissue sarcomas, metastatic cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma, acute and chronic leukemias, and lymphomas, head and neck cancers including mouth cancer, larynx cancer and thyroid cancer, lung cancers including small cell carcinoma and non-small cell cancers, breast cancers including small cell carcinoma and ductal carcinoma, gastrointestinal cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps associated with colorectal neoplasia, pancreatic cancers, liver cancer, urologic cancers including bladder cancer and prostate cancer, malignancies of the female genital tract including ovarian carcinoma, uterine (including endometrial) cancers, and solid tumor in the ovarian follicle, kidney cancers including renal cell carcinoma, brain cancers including intrinsic brain tumors, neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central nervous system, bone cancers including osteomas, skin cancers including malignant melanoma, tumor progression of human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, hemangiopericytoma and Karposi's sarcoma.
Polypeptides, polynucleotides, or modulators of polypeptides of the invention (including inhibitors and stimulators of the biological activity of the polypeptide of the invention) may be administered to treat cancer. Therapeutic compositions can be administered in therapeutically effective dosages alone or in combination with adjuvant cancer therapy such as surgery, chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial effect, e.g. reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or otherwise improving overall clinical condition, without necessarily eradicating the cancer.
The composition can also be administered in therapeutically effective amounts as a portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or modulator of the invention with one or more anti-cancer drugs in addition to a pharmaceutically acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer treatment is routine. Anti-cancer drugs that are well known in the art and can be used as a treatment in combination with the polypeptide or modulator of the invention include: Actinomycin D, Aminoglutethimide, Asparaginase, Bleomycin, Busulfan, Carboplatin, Carmustine,
Chlorambucil, Cisplatin (cis-DDP), Cyclophosphamide, Cytarabine HCl (Cytosine arabinoside), Dacarbazine, Dactinomycin, Daunorubicin HCl, Doxorubicin HCl, Estramustine phosphate sodium, Etoposide (VI 6-213), Floxuridine, 5-Fluorouracil (5-Fu), Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, Interferon Alpha-2a, Interferon Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), Lomustine, Mechlorethamine HCl (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, Methotrexate (MTX), Mitomycin, Mitoxantrone HCl, Octreotide, Plicamycin, Procarbazine HCl, Streptozocin, Tamoxifen citrate, Thioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate, Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin, Semustine, Teniposide, and Vindesine sulfate. In addition, therapeutic compositions of the invention may be used for prophylactic treatment of cancer. There are hereditary conditions and/or environmental situations (e.g. exposure to carcinogens) known in the art that predispose an individual to developing cancers. Under these circumstances, it may be beneficial to treat these individuals with therapeutically effective doses of the polypeptide of the invention to reduce the risk of developing cancers. In vitro models can be used to determine the effective doses of the polypeptide of the invention as a potential cancer treatment. These in vitro models include proliferation assays of cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1987) Culture of Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, NY Ch 18 and Ch 21), tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst., 52: 921-30 (1974), mobility and invasive potential of tumor cells in Boyden Chamber assays as described in Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays such as induction of vascularization of the chick chorioallantoic membrane or induction of vascular endothelial cell migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1189-97 (1999) and Li et al., Clin. Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor cells lines are available, e.g. from American Type Tissue Culture Collection catalogs.
4.10.12 RECEPTOR/LIGAND ACTIVITY
A polypeptide of the present invention may also demonstrate activity as receptor, receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of the invention can encode a polypeptide exhibiting such characteristics. Examples of such receptors and ligands include, without limitation, cytokine receptors and their ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions and their ligands (including without limitation, cellular adhesion molecules (such as selectins, integrins and their ligands) and receptor/ligand pairs involved in antigen presentation, antigen recognition and development of cellular and humoral immune responses. Receptors and ligands are also useful for screening of potential peptide or small molecule inhibitors of the relevant receptor/ligand interaction. A protein of the present invention (including, without limitation, fragments of receptors and ligands) may themselves be useful as inhibitors of receptor/ligand interactions. The activity of a polypeptide of the invention may, among other means, be measured by the following methods:
Suitable assays for receptor-ligand activity include without limitation those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static conditions 7.28.1- 7.28.22), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et al., J. Exp. Med. 168:1145-1156, 1988; Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995.
By way of example, the polypeptides of the invention may be used as a receptor for a ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be identified through binding assays, affinity chromatography, dihybrid screening assays, BIAcore assays, gel overlay assays, or other methods known in the art.
Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or a partial antagonist require the use of other proteins as competing ligands. The polypeptides of the present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes, colorimetric molecules or toxin molecules by conventional methods. ("Guide to Protein Purification" Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 (1990) Academic Press, Inc. San Diego). Examples of radioisotopes include, but are not limited to, tritium and carbon- 14. Examples of colorimetric molecules include, but are not limited to, fluorescent molecules such as fluorescamine, or rhodamine or other colorimetric molecules. Examples of toxins include, but are not limited, to ricin.
4.10.13 DRUG SCREENING
This invention is particularly useful for screening chemical compounds by using the novel polypeptides or binding fragments thereof in any of a variety of drug screening techniques. The polypeptides or fragments employed in such a test may either be free in solution, affixed to a solid support, borne on a cell surface or located intracellularly. One method of drug screening utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant nucleic acids expressing the polypeptide or a fragment thereof. Drugs are screened against such transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can be used for standard binding assays. One may measure, for example, the formation of complexes between polypeptides of the invention or fragments and the agent being tested or examine the diminution in complex formation between the novel polypeptides and an appropriate cell line, which are well known in the art. Sources for test compounds that may be screened for ability to bind to or modulate (i.e., increase or decrease) the activity of polypeptides of the invention include (1) inorganic and organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries comprised of either random or mimetic peptides, oligonucleotides or organic molecules. Chemical libraries may be readily synthesized or purchased from a number of commercial sources, and may include structural analogs of known compounds or compounds that are identified as "hits" or "leads" via natural product screening.
The sources of natural product libraries are microorganisms (including bacteria and fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for screening may be created by: (1) fermentation and extraction of broths from soil, plant or marine microorganisms or (2) extraction of the organisms themselves. Natural product libraries include polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a review, see Science 282:63-68 (1998).
Combinatorial libraries are composed of large numbers of peptides, oligonucleotides or organic compounds and can be readily prepared by traditional automated synthesis methods, PCR, cloning or proprietary synthetic methods. Of particular interest are peptide and oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, protein, peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide libraries. For a review of combinatorial chemistry and libraries created therefrom, see Myers, Curr. Opin. Biotechnol. 8:701-707 (1997). For reviews and examples of peptidomimetic libraries, see Al-Obeidi et al., Mol. Biotechnol, 9(3):205-23 (1998); Hruby et al., Curr Opin Chem Biol,
1(1):1 14-19 (1997); Dorner et al., Bioorg Med Chem, 4(5):709-15 (1996) (alkylated dipeptides). Identification of modulators through use of the various libraries described herein permits modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to bind a polypeptide of the invention. The molecules identified in the binding assay are then tested for antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the art. In brief, the molecules are titrated into a plurality of cell cultures or animals and then tested for either cell/animal death or prolonged survival of the animal/cells.
The binding molecules thus identified may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to cells such as radioisotopes. The toxin-binding molecule complex is then targeted to a tumor or other cell by the specificity of the binding molecule for a polypeptide of the invention. Alternatively, the binding molecules may be complexed with imaging agents for targeting and imaging purposes.
4.10.14 ASSAY FOR RECEPTOR ACTIVITY The invention also provides methods to detect specific binding of a polypeptide e.g. a ligand or a receptor. The art provides numerous assays particularly useful for identifying previously unknown binding partners for receptor polypeptides of the invention. For example, expression cloning using mammalian or bacterial cells, or dihybrid screening assays can be used to identify polynucleotides encoding binding partners. As another example, affinity chromatography with the appropriate immobilized polypeptide of the invention can be used to isolate polypeptides that recognize and bind polypeptides of the invention. There are a number of different libraries used for the identification of compounds, and in particular small molecules, that modulate (i.e., increase or decrease) biological activity of a polypeptide of the invention. Ligands for receptor polypeptides of the invention can also be identified by adding exogenous ligands, or cocktails of ligands to two cells populations that are genetically identical except for the expression of the receptor of the invention: one cell population expresses the receptor of the invention whereas the other does not. The responses of the two cell populations to the addition of ligands(s) are then compared. Alternatively, an expression library can be co-expressed with the polypeptide of the invention in cells and assayed for an autocrine response to identify potential ligand(s). As still another example, BIAcore assays, gel overlay assays, or other methods known in the art can be used to identify binding partner polypeptides, including, (1) organic and inorganic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries comprised of random peptides, oligonucleotides or organic molecules.
The role of downstream intracellular signaling molecules in the signaling cascade of the polypeptide of the invention can be determined. For example, a chimeric protein in which the cytoplasmic domain of the polypeptide of the invention is fused to the extracellular portion of a protein, whose ligand has been identified, is produced in a host cell. The cell is then incubated with the ligand specific for the extracellular portion of the chimeric protein, thereby activating the chimeric receptor. Known downstream proteins involved in intracellular signaling can then be assayed for expected modifications i.e. phosphorylation. Other methods known to those in the art can also be used to identify signaling molecules involved in receptor activity.
4.10.15 ANTI-INFLAMMATORY ACTIVITY Compositions of the present invention may also exhibit anti-inflammatory activity. The anti-inflammatory activity may be achieved by providing a stimulus to cells involved in the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production of other factors which more directly inhibit or promote an inflammatory response. Compositions with such activities can be used to treat inflammatory conditions including chronic or acute conditions), including without limitation intimation associated with infection (such as septic shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting from over production of cytokines such as TNF or IL-1. Compositions of the invention may also be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. Compositions of this invention may be utilized to prevent or treat conditions such as, but not limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, graft versus host disease, inflammatory bowel disease, inflamation associated with pulmonary disease, other autoimmune disease or inflammatory disease, an antiproliferative agent such as for acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to intrauterine infections.
4.10.16 LEUKEMIAS
Leukemias and related disorders may be treated or prevented by administration of a therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the invention. Such leukemias and related disorders include but are not limited to acute leukemia, acute lymphocytic leukemia, acute myelocytic leukemia, myeloblastic, promyelocytic, myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic myelocytic (granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see Fishman et al., 1985, Medicine, 2d Ed., J.B. Lippincott Co., Philadelphia).
4.10.17 NERVOUS SYSTEM DISORDERS Nervous system disorders, involving cell types which can be tested for efficacy of intervention with compounds that modulate the activity of the polynucleotides and/or polypeptides of the invention, and which can be treated upon thus observing an indication of therapeutic utility, include but are not limited to nervous system injuries, and diseases or disorders which result in either a disconnection of axons, a diminution or degeneration of neurons, or demyelination. Nervous system lesions which may be treated in a patient (including human and non-human mammalian patients) according to the invention include but are not limited to the following lesions of either the central (including spinal cord, brain) or peripheral nervous systems: (i) traumatic lesions, including lesions caused by physical injury or associated with surgery, for example, lesions which sever a portion of the nervous system, or compression injuries;
(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord infarction or ischemia;
(iii) infectious lesions, in which a portion of the nervous system is destroyed or injured as a result of infection, for example, by an abscess or associated with infection by human immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, tuberculosis, syphilis; (iv) degenerative lesions, in which a portion of the nervous system is destroyed or injured as a result of a degenerative process including but not limited to degeneration associated with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral sclerosis;
(v) lesions associated with nutritional diseases or disorders, in which a portion of the nervous system is destroyed or injured by a nutritional disorder or disorder of metabolism including but not limited to, vitamin B12 deficiency, folic acid deficiency, Wernicke disease, tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary degeneration of the corpus callosum), and alcoholic cerebellar degeneration;
(vi) neurological lesions associated with systemic diseases including but not limited to diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, carcinoma, or sarcoidosis;
(vii) lesions caused by toxic substances including alcohol, lead, or particular neurotoxins; and
(viii) demyelinated lesions in which a portion of the nervous system is destroyed or injured by a demyelinating disease including but not limited to multiple sclerosis, human immunodeficiency virus-associated myelopathy, transverse myelopathy or various etiologies, progressive multifocal leukoencephalopathy, and central pontine myelinolysis.
Therapeutics which are useful according to the invention for treatment of a nervous system disorder may be selected by testing for biological activity in promoting the survival or differentiation of neurons. For example, and not by way of limitation, therapeutics which elicit any of the following effects may be useful according to the invention: (i) increased survival time of neurons in culture; (ii) increased sprouting of neurons in culture or in vivo;
(iii) increased production of a neuron-associated molecule in culture or in vivo, e.g., choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or (iv) decreased symptoms of neuron dysfunction in vivo. Such effects may be measured by any method known in the art. In preferred, non-limiting embodiments, increased survival of neurons may be measured by the method set forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or Brown et al. (1981, Ann. Rev. Neurosci. 4:17-42); increased production of neuron-associated molecules may be measured by bioassay, enzymatic assay, antibody binding, Northern blot assay, etc., depending on the molecule to be measured; and motor neuron dysfunction may be measured by assessing the physical manifestation of motor neuron disorder, e.g., weakness, motor neuron conduction velocity, or functional disability.
In specific embodiments, motor neuron disorders that may be treated according to the invention include but are not limited to disorders such as infarction, infection, exposure to toxin, trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as well as other components of the nervous system, as well as disorders that selectively affect neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile muscular atrophy, progressive bulbar paralysis of childhood (Fazio-Londe syndrome), poliomyelitis and the post polio syndrome, and Hereditary Motorsensory Neuropathy (Charcot-Marie-Tooth Disease).
4.10.18 OTHER ACTIVITIES
A polypeptide of the invention may also exhibit one or more of the following additional activities or effects: inhibiting the growth, infection or function of, or killing, infectious agents, including, without limitation, bacteria, viruses, fungi and other parasites; effecting (suppressing or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape
(such as, for example, breast augmentation or diminution, change in bone form or shape); effecting biorhythms or circadian cycles or rhythms; effecting the fertility of male or female subjects; effecting the metabolism, catabolism, anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other nutritional factors or component(s); effecting behavioral characteristics, including, without limitation, appetite, libido, stress, cognition (including cognitive disorders), depression (including depressive disorders) and violent behaviors; providing analgesic effects or other pain reducing effects; promoting differentiation and growth of embryonic stem cells in lineages other than hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies of the enzyme and treating deficiency-related diseases; treatment of hyperproliferative disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for example, the ability to bind antigens or complement); and the ability to act as an antigen in a vaccine composition to raise an immune response against such protein or another material or entity which is cross-reactive with such protein.
4.10.19 IDENTIFICATION OF POLYMORPHISMS
The demonstration of polymorphisms makes possible the identification of such polymorphisms in human subjects and the pharmacogenetic use of this information for diagnosis and treatment. Such polymorphisms may be associated with, e.g., differential predisposition or susceptibility to various disease states (such as disorders involving inflammation or immune response) or a differential response to drug administration, and this genetic information can be used to tailor preventive or therapeutic treatment appropriately. For example, the existence of a polymorphism associated with a predisposition to inflammation or autoimmune disease makes possible the diagnosis of this condition in humans by identifying the presence of the polymorphism.
Polymorphisms can be identified in a variety of ways known in the art which all generally involve obtaining a sample from a patient, analyzing DNA from the sample, optionally involving isolation or amplification of the DNA, and identifying the presence of the polymorphism in the DNA. For example, PCR may be used to amplify an appropriate fragment of genomic DNA which may then be sequenced. Alternatively, the DNA may be subjected to allele-specific oligonucleotide hybridization (in which appropriate oligonucleotides are hybridized to the DNA under conditions permitting detection of a single base mismatch) or to a single nucleotide extension assay (in which an oligonucleotide that hybridizes immediately adjacent to the position of the polymorphism is extended with one or more labeled nucleotides). In addition, traditional restriction fragment length polymorphism analysis (using restriction enzymes that provide differential digestion of the genomic DNA depending on the presence or absence of the polymorphism) may be performed. Arrays with nucleotide sequences of the present invention can be used to detect polymorphisms. The array can comprise modified nucleotide sequences of the present invention in order to detect the nucleotide sequences of the present invention. In the alternative, any one of the nucleotide sequences of the present invention can be placed on the array to detect changes from those sequences.
Alternatively a polymorphism resulting in a change in the amino acid sequence could also be detected by detecting a corresponding change in amino acid sequence of the protein, e.g., by an antibody specific to the variant sequence.
4.10.20 ARTHRITIS AND INFLAMMATION
The immunosuppressive effects of the compositions of the invention against rheumatoid arthritis are determined in an experimental animal model system. The experimental model system is adjuvant induced arthritis in rats, and the protocol is described by J. Holoshitz, et at., 1983, Science, 219:56, or by B. Waksman et al., 1963, Int. Arch. Allergy Appl. Immunol., 23:129. Induction of the disease can be caused by a single injection, generally intradermally, of a suspension of killed Mycobacterium tuberculosis in complete Freund's adjuvant (CFA). The route of injection can vary, but rats may be injected at the base of the tail with an adjuvant mixture. The polypeptide is administered in phosphate buffered solution (PBS) at a dose of about 1-5 mg/kg. The control consists of administering PBS only.
The procedure for testing the effects of the test compound would consist of intradermally injecting killed Mycobacterium tuberculosis in CFA followed by immediately administering the test compound and subsequent treatment every other day until day 24. At 14, 15, 18, 20, 22, and 24 days after injection of Mycobacterium CFA, an overall arthritis score may be obtained as described by J. Holoskitz above. An analysis of the data would reveal that the test compound would have a dramatic affect on the swelling of the joints as measured by a decrease of the arthritis score.
4.11 THERAPEUTIC METHODS
The compositions (including polypeptide fragments, analogs, variants and antibodies or other binding partners or modulators including antisense polynucleotides) of the invention have numerous applications in a variety of therapeutic methods. Examples of therapeutic applications include, but are not limited to, those exemplified herein. 4.11.1 EXAMPLE
One embodiment of the invention is the administration of an effective amount of the polypeptides or other composition of the invention to individuals affected by a disease or disorder that can be modulated by regulating the peptides of the invention. While the mode of administration is not particularly important, parenteral administration is preferred. An exemplary mode of administration is to deliver an intravenous bolus. The dosage of the polypeptides or other composition of the invention will normally be determined by the prescribing physician. It is to be expected that the dosage will vary according to the age, weight, condition and response of the individual patient. Typically, the amount of polypeptide administered per dose will be in the range of about 0.01 μg/kg to 100 mg/kg of body weight, with the preferred dose being about 0.1 μg/kg to 10 mg/kg of patient body weight. For parenteral administration, polypeptides of the invention will be formulated in an injectable form combined with a pharmaceutically acceptable parenteral vehicle. Such vehicles are well known in the art and examples include water, saline, Ringer's solution, dextrose solution, and solutions consisting of small amounts of the human serum albumin. The vehicle may contain minor amounts of additives that maintain the isotonicity and stability of the polypeptide or other active ingredient. The preparation of such solutions is within the skill of the art.
4.12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF ADMINISTRATION
A protein or other composition of the present invention (from whatever source derived, including without limitation from recombinant and non-recombinant sources and including antibodies and other binding partners of the polypeptides of the invention) may be administered to a patient in need, by itself, or in pharmaceutical compositions where it is mixed with suitable carriers or excipient(s) at doses to treat or ameliorate a variety of disorders. Such a composition may optionally contain (in addition to protein or other active ingredient and a carrier) diluents, fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term "pharmaceutically acceptable" means a non-toxic material that does not interfere with the effectiveness of the biological activity of the active ingredient(s). The characteristics of the carrier will depend on the route of administration. The pharmaceutical composition of the invention may also contain cytokines, lymphokines, or other hematopoietic factors such as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-11, IL-12, IL-13, IL-14, IL-15, IFN, TNF0, TNF1 , TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell factor, and erythropoietin. In further compositions, proteins of the invention may be combined with other agents beneficial to the treatment of the disease or disorder in question. These agents include various growth factors such as epidermal growth factor (EGF), platelet-derived growth factor (PDGF), transforming growth factors (TGF-α and TGF-β), insulin-like growth factor (IGF), as well as cytokines described herein.
The pharmaceutical composition may further contain other agents which either enhance the activity of the protein or other active ingredient or complement its activity or use in treatment. Such additional factors and/or agents may be included in the pharmaceutical composition to produce a synergistic effect with protein or other active ingredient of the invention, or to minimize side effects. Conversely, protein or other active ingredient of the present invention may be included in formulations of the particular clotting factor, cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti- inflammatory agent to minimize side effects of the clotting factor, cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-inflammatory agent (such as IL-lRa, IL-1 Hyl, IL-1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents). A protein of the present invention may be active in multimers (e.g., heterodimers or homodimers) or complexes with itself or other proteins. As a result, pharmaceutical compositions of the invention may comprise a protein of the invention in such multimeric or complexed form.
As an alternative to being included in a pharmaceutical composition of the invention including a first protein, a second protein or a therapeutic agent may be concurrently administered with the first protein (e.g., at the same time, or at differing times provided that therapeutic concentrations of the combination of agents is achieved at the treatment site).
Techniques for formulation and administration of the compounds of the instant application may be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest edition. A therapeutically effective dose further refers to that amount of the compound sufficient to result in amelioration of symptoms, e.g., treatment, healing, prevention or amelioration of the relevant medical condition, or an increase in rate of treatment, healing, prevention or amelioration of such conditions. When applied to an individual active ingredient, administered alone, a therapeutically effective dose refers to that ingredient alone. When applied to a combination, a therapeutically effective dose refers to combined amounts of the active ingredients that result in the therapeutic effect, whether administered in combination, serially or simultaneously.
In practicing the method of treatment or use of the present invention, a therapeutically effective amount of protein or other active ingredient of the present invention is administered to a mammal having a condition to be treated. Protein or other active ingredient of the present invention may be administered in accordance with the method of the invention either alone or in combination with other therapies such as treatments employing cytokines, lymphokines or other hematopoietic factors. When co- administered with one or more cytokines, lymphokines or other hematopoietic factors, protein or other active ingredient of the present invention may be administered either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If administered sequentially, the attending physician will decide on the appropriate sequence of administering protein or other active ingredient of the present invention in combination with cytokine(s), lymphokine(s), other hematopoietic factor(s), thrombolytic or anti-thrombotic factors.
4.12.1 ROUTES OF ADMINISTRATION Suitable routes of administration may, for example, include oral, rectal, transmucosal, or intestinal administration; parenteral delivery, including intramuscular, subcutaneous, intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal, or intraocular injections. Administration of protein or other active ingredient of the present invention used in the pharmaceutical composition or to practice the method of the present invention can be carried out in a variety of conventional ways, such as oral ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. Intravenous administration to the patient is preferred.
Alternately, one may administer the compound in a local rather than systemic manner, for example, via injection of the compound directly into a arthritic joints or in fibrotic tissue, often in a depot or sustained release formulation. In order to prevent the scarring process frequently occurring as complication of glaucoma surgery, the compounds may be administered topically, for example, as eye drops. Furthermore, one may administer the drug in a targeted drug delivery system, for example, in a liposome coated with a specific antibody, targeting, for example, arthritic or fibrotic tissue. The liposomes will be targeted to and taken up selectively by the afflicted tissue.
The polypeptides of the invention are administered by any route that delivers an effective dosage to the desired site of action. The determination of a suitable route of administration and an effective dosage for a particular indication is within the level of skill in the art. Preferably for wound treatment, one administers the therapeutic compound directly to the site. Suitable dosage ranges for the polypeptides of the invention can be extrapolated from these dosages or from similar studies in appropriate animal models. Dosages can then be adjusted as necessary by the clinician to provide maximal therapeutic benefit.
4.12.2 COMPOSITIONS/FORMULATIONS Pharmaceutical compositions for use in accordance with the present invention thus may be formulated in a conventional manner using one or more physiologically acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active compounds into preparations which can be used pharmaceutically. These pharmaceutical compositions may be manufactured in a manner that is itself known, e.g. , by means of conventional mixing, dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or lyophilizing processes. Proper formulation is dependent upon the route of administration chosen. When a therapeutically effective amount of protein or other active ingredient of the present invention is administered orally, protein or other active ingredient of the present invention will be in the form of a tablet, capsule, powder, solution or elixir. When administered in tablet form, the pharmaceutical composition of the invention may additionally contain a solid carrier such as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to 95% protein or other active ingredient of the present invention, and preferably from about 25 to 90% protein or other active ingredient of the present invention. When administered in liquid form, a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil, mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the pharmaceutical composition may further contain physiological saline solution, dextrose or other saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. When administered in liquid form, the pharmaceutical composition contains from about 0.5 to 90% by weight of protein or other active ingredient of the present invention, and preferably from about 1 to 50% protein or other active ingredient of the present invention.
When a therapeutically effective amount of protein or other active ingredient of the present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or other active ingredient of the present invention will be in the form of a pyrogen-free, parenterally acceptable aqueous solution. The preparation of such parenterally acceptable protein or other active ingredient solutions, having due regard to pH, isotonicity, stability, and the like, is within the skill in the art. A preferred pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection should contain, in addition to protein or other active ingredient of the present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or other vehicle as known in the art. The pharmaceutical composition of the present invention may also contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of skill in the art. For injection, the agents of the invention may be formulated in aqueous solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's solution, or physiological saline buffer. For transmucosal administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally known in the art.
For oral administration, the compounds can be formulated readily by combining the active compounds with pharmaceutically acceptable carriers well known in the art. Such carriers enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be treated. Pharmaceutical preparations for oral use can be obtained from a solid excipient, optionally grinding a resulting mixture, and processing the mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such as sodium alginate. Dragee cores are provided with suitable coatings. For this purpose, concentrated sugar solutions may be used, which may optionally contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for identification or to characterize different combinations of active compound doses.
Pharmaceutical preparations which can be used orally include push-fit capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, stabilizers may be added. All formulations for oral administration should be in dosages suitable for such administration. For buccal administration, the compositions may take the form of tablets or lozenges formulated in conventional manner. For administration by inhalation, the compounds for use according to the present invention are conveniently delivered in the form of an aerosol spray presentation from pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined by providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin for use in an inhaler or insufflator may be formulated containing a powder mix of the compound and a suitable powder base such as lactose or starch. The compounds may be formulated for parenteral administration by injection, e.g., by bolus injection or continuous infusion. Formulations for injection may be presented in unit dosage form, e.g., in ampules or in multi-dose containers, with an added preservative. The compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing and/or dispersing agents.
Pharmaceutical formulations for parenteral administration include aqueous solutions of the active compounds in water-soluble form. Additionally, suspensions of the active compounds may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. Aqueous injection suspensions may contain substances which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. Optionally, the suspension may also contain suitable stabilizers or agents which increase the solubility of the compounds to allow for the preparation of highly concentrated solutions. Alternatively, the active ingredient may be in powder form for constitution with a suitable vehicle, e.g., sterile pyrogen-free water, before use.
The compounds may also be formulated in rectal compositions such as suppositories or retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other glycerides. In addition to the formulations described previously, the compounds may also be formulated as a depot preparation. Such long acting formulations may be administered by implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt.
A pharmaceutical carrier for the hydrophobic compounds of the invention is a co-solvent system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, and an aqueous phase. The co-solvent system may be the VPD co-solvent system. VPD is a solution of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, and 65% w/v polyethylene glycol 300, made up to volume in absolute ethanol. The VPD co-solvent system (VPD:5W) consists of VPD diluted 1 :1 with a 5% dextrose in water solution. This co-solvent system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic administration. Naturally, the proportions of a co-solvent system may be varied considerably without destroying its solubility and toxicity characteristics. Furthermore, the identity of the co-solvent components may be varied: for example, other low-toxicity nonpolar surfactants may be used instead of polysorbate 80; the fraction size of polyethylene glycol may be varied; other biocompatible polymers may replace polyethylene glycol, e.g. polyvinyl pyrrolidone; and other sugars or polysaccharides may substitute for dextrose. Alternatively, other delivery systems for hydrophobic pharmaceutical compounds may be employed. Liposomes and emulsions are well known examples of delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. Additionally, the compounds may be delivered using a sustained-release system, such as semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. Various types of sustained-release materials have been established and are well known by those skilled in the art. Sustained-release capsules may, depending on their chemical nature, release the compounds for a few weeks up to over 100 days. Depending on the chemical nature and the biological stability of the therapeutic reagent, additional strategies for protein or other active ingredient stabilization may be employed.
The pharmaceutical compositions also may comprise suitable solid or gel phase carriers or excipients. Examples of such carriers or excipients include but are not limited to calcium carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and polymers such as polyethylene glycols. Many of the active ingredients of the invention may be provided as salts with pharmaceutically compatible counter ions. Such pharmaceutically acceptable base addition salts are those salts which retain the biological effectiveness and properties of the free acids and which are obtained by reaction with inorganic or organic bases such as sodium hydroxide, magnesium hydroxide, ammonia, trialkylamine, dialkylamine, monoalkylamine, dibasic amino acids, sodium acetate, potassium benzoate, triethanol amine and the like.
The pharmaceutical composition of the invention may be in the form of a complex of the protein(s) or other active ingredient(s) of present invention along with protein or peptide antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T lymphocytes. B-lymphocytes will respond to antigen through their surface immunoglobulin receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) following presentation of the antigen by MHC proteins. MHC and structurally related proteins including those encoded by class I and class II MHC genes on host cells will serve to present the peptide antigen(s) to T lymphocytes. The antigen components could also be supplied as purified MHC-peptide complexes alone or with co-stimulatory molecules that can directly signal T cells. Alternatively antibodies able to bind surface immunoglobulin and other molecules on B cells as well as antibodies able to bind the TCR and other molecules on T cells can be combined with the pharmaceutical composition of the invention. The pharmaceutical composition of the invention may be in the form of a liposome in which protein of the present invention is combined, in addition to other pharmaceutically acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. Suitable lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides, sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like. Preparation of such liposomal formulations is within the level of skill in the art, as disclosed, for example, in U.S. Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of which are incorporated herein by reference. The amount of protein or other active ingredient of the present invention in the pharmaceutical composition of the present invention will depend upon the nature and severity of the condition being treated, and on the nature of prior treatments which the patient has undergone. Ultimately, the attending physician will decide the amount of protein or other active ingredient of the present invention with which to treat each individual patient. Initially, the attending physician will administer low doses of protein or other active ingredient of the present invention and observe the patient's response. Larger doses of protein or other active ingredient of the present invention may be administered until the optimal therapeutic effect is obtained for the patient, and at that point the dosage is not increased further. It is contemplated that the various pharmaceutical compositions used to practice the method of the present invention should contain about 0.01 μg to about 100 mg (preferably about 0.1 μg to about 10 mg, more preferably about 0.1 μg to about 1 mg) of protein or other active ingredient of the present invention per kg body weight. For compositions of the present invention which are useful for bone, cartilage, tendon or ligament regeneration, the therapeutic method includes administering the composition topically, systematically, or locally as an implant or device. When administered, the therapeutic composition for use in this invention is, of course, in a pyrogen-free, physiologically acceptable form. Further, the composition may desirably be encapsulated or injected in a viscous form for delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable for wound healing and tissue repair. Therapeutically useful agents other than a protein or other active ingredient of the invention which may also optionally be included in the composition as described above, may alternatively or additionally, be administered simultaneously or sequentially with the composition in the methods of the invention. Preferably for bone and/or cartilage formation, the composition would include a matrix capable of delivering the protein-containing or other active ingredient-containing composition to the site of bone and/or cartilage damage, providing a structure for the developing bone and cartilage and optimally capable of being resorbed into the body. Such matrices may be formed of materials presently in use for other implanted medical applications.
The choice of matrix material is based on biocompatibility, biodegradability, mechanical properties, cosmetic appearance and interface properties. The particular application of the compositions will define the appropriate formulation. Potential matrices for the compositions may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate, hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides. Other potential materials are biodegradable and biologically well-defined, such as bone or dermal collagen. Further matrices are comprised of pure proteins or extracellular matrix components. Other potential matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, aluminates, or other ceramics. Matrices may be comprised of combinations of any of the above- mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and tricalcium phosphate. The bioceramics may be altered in composition, such as in calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and glycolic acid in the form of porous particles having diameters ranging from 150 to 800 microns. In some applications, it will be useful to utilize a sequestering agent, such as carboxymethyl cellulose or autologous blood clot, to prevent the protein compositions from disassociating from the matrix. A preferred family of sequestering agents is cellulosic materials such as alkylcelluloses
(including hydroxyalkylcelluloses), including methylcellulose, ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl-methylcellulose, and carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose (CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and poly(vinyl alcohol). The amount of sequestering agent useful herein is 0.5-20 wt %, preferably 1-10 wt % based on total formulation weight, which represents the amount necessary to prevent desorption of the protein from the polymer matrix and to provide appropriate handling of the composition, yet not so much that the progenitor cells are prevented from infiltrating the matrix, thereby providing the protein the opportunity to assist the osteogenic activity of the progenitor cells. In further compositions, proteins or other active ingredients of the invention may be combined with other agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in question. These agents include various growth factors such as epidermal growth factor (EGF), platelet derived growth factor (PDGF), transforming growth factors (TGF- and TGF-β), and insulin-like growth factor (IGF). The therapeutic compositions are also presently valuable for veterinary applications.
Particularly domestic animals and thoroughbred horses, in addition to humans, are desired patients for such treatment with proteins or other active ingredients of the present invention. The dosage regimen of a protein-containing pharmaceutical composition to be used in tissue regeneration will be determined by the attending physician considering various factors which modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site of damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (e.g., bone), the patient's age, sex, and diet, the severity of any infection, time of administration and other clinical factors. The dosage may vary with the type of matrix used in the reconstitution and with inclusion of other proteins in the pharmaceutical composition. For example, the addition of other known growth factors, such as IGF I (insulin like growth factor I), to the final composition, may also effect the dosage. Progress can be monitored by periodic assessment of tissue/bone growth and/or repair, for example, X-rays, histomorphometric determinations and tetracycline labeling. Polynucleotides of the present invention can also be used for gene therapy. Such polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a mammalian subject. Polynucleotides of the invention may also be administered by other known methods for introduction of nucleic acid into a cell or organism (including, without limitation, in the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in the presence of proteins of the present invention in order to proliferate or to produce a desired effect on or activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes.
4.12.3 EFFECTIVE DOSAGE
Pharmaceutical compositions suitable for use in the present invention include compositions wherein the active ingredients are contained in an effective amount to achieve its intended purpose. More specifically, a therapeutically effective amount means an amount effective to prevent development of or to alleviate the existing symptoms of the subject being treated. Determination of the effective amount is well within the capability of those skilled in the art, especially in light of the detailed disclosure provided herein. For any compound used in the method of the invention, the therapeutically effective dose can be estimated initially from appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a circulating concentration range that can be used to more accurately determine useful doses in humans. For example, a dose can be formulated in animal models to achieve a circulating concentration range that includes the ICJO as determined in cell culture (i.e., the concentration of the test compound which achieves a half-maximal inhibition of the protein's biological activity).
Such information can be used to more accurately determine useful doses in humans.
A therapeutically effective dose refers to that amount of the compound that results in amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic efficacy of such compounds can be determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the LD 0 (the dose lethal to 50% of the population) and the ED50 (the dose therapeutically effective in 50% of the population). The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the ratio between LD50 and ED50. Compounds which exhibit high therapeutic indices are preferred. The data obtained from these cell culture assays and animal studies can be used in formulating a range of dosage for use in human. The dosage of such compounds lies preferably within a range of circulating concentrations that include the ED50 with little or no toxicity. The dosage may vary within this range depending upon the dosage form employed and the route of administration utilized. The exact formulation, route of administration and dosage can be chosen by the individual physician in view of the patient's condition. See, e.g., Fingl et al., 1975, in "The
Pharmacological Basis of Therapeutics", Ch. 1 p.l. Dosage amount and interval may be adjusted individually to provide plasma levels of the active moiety which are sufficient to maintain the desired effects, or minimal effective concentration (MEC). The MEC will vary for each compound but can be estimated from in vitro data. Dosages necessary to achieve the MEC will depend on individual characteristics and route of administration. However, HPLC assays or bioassays can be used to determine plasma concentrations.
Dosage intervals can also be determined using MEC value. Compounds should be administered using a regimen that maintains plasma levels above the MEC for 10-90% of the time, preferably between 30-90% and most preferably between 50-90%. In cases of local administration or selective uptake, the effective local concentration of the drug may not be related to plasma concentration.
An exemplary dosage regimen for polypeptides or other compositions of the invention will be in the range of about 0.01 μg/kg to 100 mg/kg of body weight daily, with the preferred dose being about 0.1 μg/kg to 25 mg/kg of patient body weight daily, varying in adults and children. Dosing may be once daily, or equivalent doses may be delivered at longer or shorter intervals.
The amount of composition administered will, of course, be dependent on the subject being treated, on the subject's age and weight, the severity of the affliction, the manner of administration and the judgment of the prescribing physician. 4.12.4 PACKAGING
The compositions may, if desired, be presented in a pack or dispenser device which may contain one or more unit dosage forms containing the active ingredient. The pack may, for example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may be accompanied by instructions for administration. Compositions comprising a compound of the invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an appropriate container, and labeled for treatment of an indicated condition.
4.13 ANTIBODIES Also included in the invention are antibodies to proteins, or fragments of proteins of the invention. The term "antibody" as used herein refers to immunoglobulin molecules and immunologically active portions of immunoglobulin (Ig) molecules, i.e., molecules that contain an antigen-binding site that specifically binds (immunoreacts with) an antigen. Such antibodies include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, Fab, Fat,' and F(a ')2 fragments, and an Fab expression library. In general, an antibody molecule obtained from humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ from one another by the nature of the heavy chain present in the molecule. Certain classes have subclasses as well, such as IgGi, IgG2, and others. Furthermore, in humans, the light chain may be a kappa chain or a lambda chain. Reference herein to antibodies includes a reference to all such classes, subclasses and types of human antibody species.
An isolated related protein of the invention may be intended to serve as an antigen, or a portion or fragment thereof, and additionally can be used as an immunogen to generate antibodies that immunospecifically bind the antigen, using standard techniques for polyclonal and monoclonal antibody preparation. The full-length protein can be used or, alternatively, the invention provides antigenic peptide fragments of the antigen for use as immunogens. An antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid sequence of the full length protein, (for example the amino acid sequence shown in SEQ ID NO: 30369), and encompasses an epitope thereof such that an antibody raised against the peptide forms a specific immune complex with the full length protein or with any fragment that contains the epitope. Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least 15 amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are located on its surface; commonly these are hydrophilic regions.
In certain embodiments of the invention, at least one epitope encompassed by the antigenic peptide is a region on the surface of the protein of the invention that is located on the surface of the protein, e.g., a hydrophilic region. A hydrophobicity analysis of the human related protein sequence will indicate which regions of a related protein are particularly hydrophilic and, therefore, are likely to encode surface residues useful for targeting antibody production. As a means for targeting antibody production, hydropathy plots showing regions of hydrophilicity and hydrophobicity may be generated by any method well known in the art, including, for example, the Kyte Doolittle or the Hopp Woods methods, either with or without Fourier transformation. See, e.g., Hopp and Woods, 1981, Proc. Nat. Acad. Sci. USA 78: 3824-3828; Kyte and Doolittle 1982, J. Mol. Biol. 157: 105-142, each of which is incorporated herein by reference in its entirety. Antibodies that are specific for one or more domains within an antigenic protein, or derivatives, fragments, analogs or homologs thereof, are also provided herein.
A protein of the invention, or a derivative, fragment, analog, homolog or ortholog thereof, may be utilized as an immunogen in the generation of antibodies that immunospecifically bind these protein components.
Various procedures known within the art may be used for the production of polyclonal or monoclonal antibodies directed against a protein of the invention, or against derivatives, fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: A Laboratory Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below.
5.13.1 Polyclonal Antibodies
For the production of polyclonal antibodies, various suitable host animals (e.g., rabbit, goat, mouse or other mammal) may be immunized by one or more injections with the native protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate immunogenic preparation can contain, for example, the naturally occurring immunogenic protein, a chemically synthesized polypeptide representing the immunogenic protein, or a recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated to a second protein known to be immunogenic in the mammal being immunized. Examples of such immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can further include an adjuvant. Various adjuvants used to increase the immunological response include, but are not limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface active substances (e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, dinitrophenol, etc.), adjuvants usable in humans such as Bacille Calmette-Guerin and Corynebacterium parvum, or similar immunostimulatory agents. Additional examples of adjuvants which can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose dicorynomycolate).
The polyclonal antibody molecules directed against the immunogenic protein can be isolated from the mammal (e.g., from the blood) and further purified by well known techniques, such as affinity chromatography using protein A or protein G, which provide primarily the IgG fraction of immune serum. Subsequently, or alternatively, the specific antigen which is the target of the immunoglobulin sought, or an epitope thereof, may be immobilized on a column to purify the immune specific antibody by immunoaffinity chromatography. Purification of immunoglobulins is discussed, for example, by D. Wilkinson (The Scientist, published by The Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 2000), pp. 25-28).
5.13.2 Monoclonal Antibodies
The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as used herein, refers to a population of antibody molecules that contain only one molecular species of antibody molecule consisting of a unique light chain gene product and a unique heavy chain gene product. In particular, the complementarity determining regions (CDRs) of the monoclonal antibody are identical in all the molecules of the population. MAbs thus contain an antigen- binding site capable of immunoreacting with a particular epitope of the antigen characterized by a unique binding affinity for it. Monoclonal antibodies can be prepared using hybridoma methods, such as those described by Kohler and Milstein, Nature. 256:495 (1975). In a hybridoma method, a mouse, hamster, or other appropriate host animal, is typically immunized with an immunizing agent to elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind to the immunizing agent. Alternatively, the lymphocytes can be immunized in vitro. The immunizing agent will typically include the protein antigen, a fragment thereof or a fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of human origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources are desired. The lymphocytes are then fused with an immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice. Academic Press, (1986) pp. 59-103). Immortalized cell lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells can be cultured in a suitable culture medium that preferably contains one or more substances that inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells.
Preferred immortalized cell lines are those that fuse efficiently, support stable high level expression of antibody by the selected antibody-producing cells, and are sensitive to a medium such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, California and the American Type Culture Collection, Manassas, Virginia. Human myeloma and mouse-human heteromyeloma cell lines also have been described for the production of human monoclonal antibodies (Kozbor, J. Immunol., 133:3001 (1984); Brodeur et al., Monoclonal Antibody Production Techniques and Applications, Marcel Dekker, Inc., New York, (1987) pp. 51-63).
The culture medium in which the hybridoma cells are cultured can then be assayed for the presence of monoclonal antibodies directed against the antigen. Preferably, the binding specificity of monoclonal antibodies produced by the hybridoma cells is determined by immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the art. The binding affinity of the monoclonal antibody can, for example, be determined by the Scatchard analysis of Munson and Pollard, Anal. Biochem., 107:220 (1980). Preferably, antibodies having a high degree of specificity and a high binding affinity for the target antigen are isolated.
After the desired hybridoma cells are identified, the clones can be subcloned by limiting dilution procedures and grown by standard methods. Suitable culture media for this purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. The monoclonal antibodies secreted by the subclones can be isolated or purified from the culture medium or ascites fluid by conventional immunoglobulin purification procedures such as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity chromatography.
The monoclonal antibodies can also be made by recombinant DNA methods, such as those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the invention can be readily isolated and sequenced using conventional procedures (e.g., by using oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred source of such DNA. Once isolated, the DNA can be placed into expression vectors, which are then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of monoclonal antibodies in the recombinant host cells. The DNA also can be modified, for example, by substituting the coding sequence for human heavy and light chain constant domains in place of the homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368, 812-13 (1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin polypeptide can be substituted for the constant domains of an antibody of the invention, or can be substituted for the variable domains of one antigen-combining site of an antibody of the invention to create a chimeric bivalent antibody.
5.13.2 Humanized Antibodies
The antibodies directed against the protein antigens of the invention can further comprise humanized antibodies or human antibodies. These antibodies are suitable for administration to humans without engendering an immune response by the human against the administered immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab')2 or other antigen- binding subsequences of antibodies) that are principally comprised of the sequence of a human immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin. Humanization can be performed following the method of Winter and co-workers (Jones et al., Nature, 321 :522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al., Science. 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies can also comprise residues which are found neither in the recipient antibody nor in the imported CDR or framework sequences. In general, the humanized antibody will comprise substantially all of at least one, and typically two, variable domains, in which all or substantially all of the CDR regions correspond to those of a non-human immunoglobulin and all or substantially all of the framework regions are those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. Biol., 2:593-596 (1992)).
5.13.3 Human Antibodies Fully human antibodies relate to antibody molecules in which essentially the entire sequences of both the light chain and the heavy chain, including the CDRs, arise from human genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein. Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell hybridoma technique (see Kozbor, et al., 1983 Immunol Today 4: 72) and the EBV hybridoma technique to produce human monoclonal antibodies (see Cole, et al., 1985 In: MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96). Human monoclonal antibodies may be utilized in the practice of the present invention and may be produced by using human hybridomas (see Cote, et al., 1983. Proc Natl Acad Sci USA 80: 2026-2030) or by transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et al., 1985 In: MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96).
In addition, human antibodies can also be produced using additional techniques, including phage display libraries (Hoogenboom and Winter, J. Mol. Biol., 227:381 (1991); Marks et al., J. Mol. Biol., 222:581 (1991)). Similarly, human antibodies can be made by introducing human immunoglobulin loci into transgenic animals, e.g., mice in which the endogenous immunoglobulin genes have been partially or completely inactivated. Upon challenge, human antibody production is observed, which closely resembles that seen in humans in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 779-783 (1992)); Lonberg et al. (Nature 368 856-859 (1994)); Morrison ( Nature 368. 812-13 (1994)); Fishwild et al,( Nature Biotechnology F4, 845-51 (1996)); Neuberger (Nature Biotechnology H, 826 (1996)); and Lonberg and Huszar (Intern. Rev. Immunol. 13 65-93 (1995)).
Human antibodies may additionally be produced using transgenic nonhuman animals which are modified so as to produce fully human antibodies rather than the animal's endogenous antibodies in response to challenge by an antigen. (See PCT publication WO94/02602). The endogenous genes encoding the heavy and light immunoglobulin chains in the nonhuman host have been incapacitated, and active loci encoding human heavy and light chain immunoglobulins are inserted into the host's genome. The human genes are incorporated, for example, using yeast artificial chromosomes containing the requisite human DNA segments. An animal which provides all the desired modifications is then obtained as progeny by crossbreeding intermediate transgenic animals containing fewer than the full complement of the modifications. The preferred embodiment of such a nonhuman animal is a mouse, and is termed the Xenomouse as disclosed in PCT publications WO 96/33735 and WO 96/34096. This animal produces B cells which secrete fully human immunoglobulins. The antibodies can be obtained directly from the animal after immunization with an immunogen of interest, as, for example, a preparation of a polyclonal antibody, or alternatively from immortalized B cells derived from the animal, such as hybridomas producing monoclonal antibodies. Additionally, the genes encoding the immunoglobulins with human variable regions can be recovered and expressed to obtain the antibodies directly, or can be further modified to obtain analogs of antibodies such as, for example, single chain Fv molecules.
An example of a method of producing a nonhuman host, exemplified as a mouse, lacking expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent No. 5,939,598. It can be obtained by a method including deleting the J segment genes from at least one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of the locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy chain locus, the deletion being effected by a targeting vector containing a gene encoding a selectable marker; and producing from the embryonic stem cell a transgenic mouse whose somatic and germ cells contain the gene encoding the selectable marker. A method for producing an antibody of interest, such as a human antibody, is disclosed in
U.S. Patent No. 5,916,771. It includes introducing an expression vector that contains a nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, introducing an expression vector containing a nucleotide sequence encoding a light chain into another mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell expresses an antibody containing the heavy chain and the light chain.
In a further improvement on this procedure, a method for identifying a clinically relevant epitope on an immunogen, and a correlative method for selecting an antibody that binds immunospecifically to the relevant epitope with high affinity, are disclosed in PCT publication WO 99/53049.
5.13.4 F«b Fragments and Single Chain Antibodies
According to the invention, techniques can be adapted for the production of single-chain antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent No. 4,946,778). In addition, methods can be adapted for the construction of Fab expression libraries (see e.g., Huse, et al., 1989 Science 246: 1275-1281) to allow rapid and effective identification of monoclonal Fab fragments with the desired specificity for a protein or derivatives, fragments, analogs or homologs thereof. Antibody fragments that contain the idiotypes to a protein antigen may be produced by techniques known in the art including, but not limited to: (i) an F(ab')2 fragment produced by pepsin digestion of an antibody molecule; (ii) an Fab fragment generated by reducing the disulfide bridges of an F(ab')2 fragment; (iii) an Fab fragment generated by the treatment of the antibody molecule with papain and a reducing agent and (iv) Fv fragments.
5.13.5 Bispecific Antibodies Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have binding specificities for at least two different antigens. In the present case, one of the binding specificities is for an antigenic protein of the invention. The second binding target is any other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit. Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant production of bispecific antibodies is based on the co-expression of two immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different specificities (Milstein and Cuello, Nature, 305:537-539 (1983)). Because of the random assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a potential mixture often different antibody molecules, of which only one has the correct bispecific structure. The purification of the correct molecule is usually accomplished by affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 1993, and in Traunecker et al, 1991 EMBO J., 10:3655-3659.
Antibody variable domains with the desired binding specificities (antibody-antigen combining sites) can be fused to immunoglobulin constant domain sequences. The fusion preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region (CHI) containing the site necessary for light-chain binding present in at least one of the fusions. DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin light chain, are inserted into separate expression vectors, and are co-transfected into a suitable host organism. For further details of generating bispecific antibodies see, for example, Suresh et al., Methods in Enzymology, 121 :210 (1986).
According to another approach described in WO 96/27011, the interface between a pair of antibody molecules can be engineered to maximize the percentage of heterodimers which are recovered from recombinant cell culture. The preferred interface comprises at least a part of the CH3 region of an antibody constant domain. In this method, one or more small amino acid side chains from the interface of the first antibody molecule are replaced with larger side chains (e.g. tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side chain(s) are created on the interface of the second antibody molecule by replacing large amino acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for increasing the yield of the heterodimer over other unwanted end-products such as homodimers. Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g.
F(ab')2 bispecific antibodies). Techniques for generating bispecific antibodies from antibody fragments have been described in the literature. For example, bispecific antibodies can be prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a procedure wherein intact antibodies are proteolytically cleaved to generate F(ab')2 fragments. These fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB derivatives is then reconverted to the Fab'-thiol by reduction with mercaptoethylamine and is mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific antibody. The bispecific antibodies produced can be used as agents for the selective immobilization of enzymes.
Additionally, Fab' fragments can be directly recovered from E. coli and chemically coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe the production of a fully humanized bispecific antibody F(ab')2 molecule. Each Fab' fragment was separately secreted from E. coli and subjected to directed chemical coupling in vitro to form the bispecific antibody. The bispecific antibody thus formed was able to bind to cells overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity of human cytotoxic lymphocytes against human breast tumor targets. Various techniques for making and isolating bispecific antibody fragments directly from recombinant cell culture have also been described. For example, bispecific antibodies have been produced using leucine zippers. Kostelny et al., J. Immunol. 148(5): 1547-1553 (1992). The leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region to form monomers and then re-oxidized to form the antibody heterodimers. This method can also be utilized for the production of antibody homodimers. The "diabody" technology described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an alternative mechanism for making bispecific antibody fragments. The fragments comprise a heavy-chain variable domain (Vn) connected to a light-chain variable domain (VL) by a linker which is too short to allow pairing between the two domains on the same chain. Accordingly, the VH and V domains of one fragment are forced to pair with the complementary VL and VH domains of another fragment, thereby forming two antigen-binding sites. Another strategy for making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been reported. See, Gruber et al., J. Immunol. 152:5368 (1994). Antibodies with more than two valencies are contemplated. For example, trispecific antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1991). Exemplary bispecific antibodies can bind to two different epitopes, at least one of which originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm of an immunoglobulin molecule can be combined with an arm which binds to a triggering molecule on a leukocyte such as a T-cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc receptors for IgG (FcγR), such as FcγRI (CD64), FcγRII (CD32) and FcγRIII (CD 16) so as to focus cellular defense mechanisms to the cell expressing the particular antigen. Bispecific antibodies can also be used to direct cytotoxic agents to cells which express a particular antigen. These antibodies possess an antigen-binding arm and an arm which binds a cytotoxic agent or a radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody of interest binds the protein antigen described herein and further binds tissue factor (TF).
5.13.6 Heteroconjugate Antibodies Heteroconjugate antibodies are also within the scope of the present invention.
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies have, for example, been proposed to target immune system cells to unwanted cells (U.S. Patent No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 92/200373; EP 03089). It is contemplated that the antibodies can be prepared in vitro using known methods in synthetic protein chemistry, including those involving crosslinking agents. For example, immunotoxins can be constructed using a disulfide exchange reaction or by forming a thioether bond. Examples of suitable reagents for this purpose include iminothiolate and methyl-4- mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980.
5.13.7 Effector Function Engineering
It can be desirable to modify the antibody of the invention with respect to effector function, so as to enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine residue(s) can be introduced into the Fc region, thereby allowing interchain disulfide bond formation in this region. The homodimeric antibody thus generated can have improved internalization capability and/or increased complement-mediated cell killing and antibody-dependent cellular cytotoxicity (ADCC). See Caron et al., J. Exp Med., 176: 1191- 1 195 (1992) and Shopes, J. Immunol., 148: 2918-2922 (1992). Homodimeric antibodies with enhanced anti-tumor activity can also be prepared using heterobifunctional cross-linkers as described in Wolff et al. Cancer Research, 53: 2560-2565 (1993). Alternatively, an antibody can be engineered that has dual Fc regions and can thereby have enhanced complement lysis and
ADCC capabilities. See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989).
5.13.8 Immunoconjugates The invention also pertains to immunoconjugates comprising an antibody conjugated to a cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., a radioconjugate).
Chemotherapeutic agents useful in the generation of such immunoconjugates have been described above. Enzymatically active toxins and fragments thereof that can be used include diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of radionuclides are available for the production of radioconjugated antibodies. Examples include 212Bi, 13 lI, 131In, 90Y, and l86Re.
Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), active esters (such as disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), and bis-active fluorine compounds (such as 1 ,5-difluoro-2,4-dinitrobenzene). For example, a ricin immunotoxin can be prepared as described in Vitetta et al., Science, 238: 1098 (1987). Carbon- 14-labeled l-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX- DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See WO94/11026.
In another embodiment, the antibody can be conjugated to a "receptor" (such streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is administered to the patient, followed by removal of unbound conjugate from the circulation using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn conjugated to a cytotoxic agent.
4.14 COMPUTER READABLE SEQUENCES In one application of this embodiment, a nucleotide sequence of the present invention can be recorded on computer readable media. As used herein, "computer readable media" refers to any medium which can be read and accessed directly by a computer. Such media include, but are not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these categories such as magnetic/optical storage media. A skilled artisan can readily appreciate how any of the presently known computer readable mediums can be used to create a manufacture comprising computer readable medium having recorded thereon a nucleotide sequence of the present invention. As used herein, "recorded" refers to a process for storing information on computer readable medium. A skilled artisan can readily adopt any of the presently known methods for recording information on computer readable medium to generate manufactures comprising the nucleotide sequence information of the present invention.
A variety of data storage structures are available to a skilled artisan for creating a computer readable medium having recorded thereon a nucleotide sequence of the present invention. The choice of the data storage structure will generally be based on the means chosen to access the stored information. In addition, a variety of data processor programs and formats can be used to store the nucleotide sequence information of the present invention on computer readable medium. The sequence information can be represented in a word processing text file, formatted in commercially-available software such as WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, Oracle, or the like. A skilled artisan can readily adapt any number of data processor structuring formats (e.g. text file or database) in order to obtain computer readable medium having recorded thereon the nucleotide sequence information of the present invention.
By providing any of the nucleotide sequences SEQ ID NO: 1-30368 or a representative fragment thereof; or a nucleotide sequence at least 95% identical to any of the nucleotide sequences of SEQ ID NO: 1-30368 in computer readable form, a skilled artisan can routinely access the sequence information for a variety of purposes. Computer software is publicly available which allows a skilled artisan to access sequence information provided in a computer readable medium. The examples which follow demonstrate how software which implements the BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al., Comp.
Chem. 17:203-207 (1993)) search algorithms on a Sybase system is used to identify open reading frames (ORFs) within a nucleic acid sequence. Such ORFs may be protein encoding fragments and may be useful in producing commercially important proteins such as enzymes used in fermentation reactions and in the production of commercially useful metabolites. As used herein, "a computer-based system" refers to the hardware means, software means, and data storage means used to analyze the nucleotide sequence information of the present invention. The minimum hardware means of the computer-based systems of the present invention comprises a central processing unit (CPU), input means, output means, and data storage means. A skilled artisan can readily appreciate that any one of the currently available computer-based systems are suitable for use in the present invention. As stated above, the computer-based systems of the present invention comprise a data storage means having stored therein a nucleotide sequence of the present invention and the necessary hardware means and software means for supporting and implementing a search means. As used herein, "data storage means" refers to memory which can store nucleotide sequence information of the present invention, or a memory access means which can access manufactures having recorded thereon the nucleotide sequence information of the present invention.
As used herein, "search means" refers to one or more programs which are implemented on the computer-based system to compare a target sequence or target structural motif with the sequence information stored within the data storage means. Search means are used to identify fragments or regions of a known sequence which match a particular target sequence or target motif. A variety of known algorithms are disclosed publicly and a variety of commercially available software for conducting search means are and can be used in the computer-based systems of the present invention. Examples of such software includes, but is not limited to, Smith- Waterman, MacPattern (EMBL), BLASTN and BLASTA (NPOLYPEPTIDEIA). A skilled artisan can readily recognize that any one of the available algorithms or implementing software packages for conducting homology searches can be adapted for use in the present computer-based systems. As used herein, a "target sequence" can be any nucleic acid or amino acid sequence of six or more nucleotides or two or more amino acids. A skilled artisan can readily recognize that the longer a target sequence is, the less likely a target sequence will be present as a random occurrence in the database. The most preferred sequence length of a target sequence is from about 10 to 300 amino acids, more preferably from about 30 to 100 nucleotide residues. However, it is well recognized that searches for commercially important fragments, such as sequence fragments involved in gene expression and protein processing, may be of shorter length.
As used herein, "a target structural motif," or "target motif," refers to any rationally selected sequence or combination of sequences in which the sequence(s) are chosen based on a three-dimensional configuration which is formed upon the folding of the target motif. There are a variety of target motifs known in the art. Protein target motifs include, but are not limited to, enzyme active sites and signal sequences. Nucleic acid target motifs include, but are not limited to, promoter sequences, hairpin structures and inducible expression elements (protein binding sequences).
4.15 TRIPLE HELIX FORMATION In addition, the fragments of the present invention, as broadly described, can be used to control gene expression through triple helix formation or antisense DNA or RNA, both of which methods are based on the binding of a polynucleotide sequence to DNA or RNA. Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and are designed to be complementary to a region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 15241 :456 (1988); and Dervan et al., Science 251 :1360 (1991)) or to the mRNA itself (antisense - Olmno, J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into polypeptide. Both techniques have been demonstrated to be effective in model systems. Information contained in the sequences of the present invention is necessary for the design of an antisense or triple helix oligonucleotide.
4.16 DIAGNOSTIC ASSAYS AND KITS The present invention further provides methods to identify the presence or expression of one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic acid probe or antibodies of the present invention, optionally conjugated or otherwise associated with a suitable label.
In general, methods for detecting a polynucleotide of the invention can comprise contacting a sample with a compound that binds to and forms a complex with the polynucleotide for a period sufficient to form the complex, and detecting the complex, so that if a complex is detected, a polynucleotide of the invention is detected in the sample. Such methods can also comprise contacting a sample under stringent hybridization conditions with nucleic acid primers that anneal to a polynucleotide of the invention under such conditions, and amplifying annealed polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is detected in the sample.
In general, methods for detecting a polypeptide of the invention can comprise contacting a sample with a compound that binds to and forms a complex with the polypeptide for a period sufficient to form the complex, and detecting the complex, so that if a complex is detected, a polypeptide of the invention is detected in the sample. In detail, such methods comprise incubating a test sample with one or more of the antibodies or one or more of the nucleic acid probes of the present invention and assaying for binding of the nucleic acid probes or antibodies to components within the test sample.
Conditions for incubating a nucleic acid probe or antibody with a test sample vary. Incubation conditions depend on the format employed in the assay, the detection methods employed, and the type and nature of the nucleic acid probe or antibody used in the assay. One skilled in the art will recognize that any one of the commonly available hybridization, amplification or immunological assay formats can readily be adapted to employ the nucleic acid probes or antibodies of the present invention. Examples of such assays can be found in Chard, T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in Immunocytochemistry, Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice and Theory of immunoassay s: Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1985). The test samples of the present invention include cells, protein or membrane extracts of cells, or biological fluids such as sputum, blood, serum, plasma, or urine. The test sample used in the above-described method will vary based on the assay format, nature of the detection method and the tissues, cells or extracts used as the sample to be assayed. Methods for preparing protein extracts or membrane extracts of cells are well known in the art and can be readily be adapted in order to obtain a sample which is compatible with the system utilized.
In another embodiment of the present invention, kits are provided which contain the necessary reagents to carry out the assays of the present invention. Specifically, the invention provides a compartment kit to receive, in close confinement, one or more containers which comprises: (a) a first container comprising one of the probes or antibodies of the present invention; and (b) one or more other containers comprising one or more of the following: wash reagents, reagents capable of detecting presence of a bound probe or antibody.
In detail, a compartment kit includes any kit in which reagents are contained in separate containers. Such containers include small glass containers, plastic containers or strips of plastic or paper. Such containers allows one to efficiently transfer reagents from one compartment to another compartment such that the samples and reagents are not cross-contaminated, and the agents or solutions of each container can be added in a quantitative fashion from one compartment to another. Such containers will include a container which will accept the test sample, a container which contains the antibodies used in the assay, containers which contain wash reagents (such as phosphate buffered saline, Tris-buffers, etc.), and containers which contain the reagents used to detect the bound antibody or probe. Types of detection reagents include labeled nucleic acid probes, labeled secondary antibodies, or in the alternative, if the primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of reacting with the labeled antibody. One skilled in the art will readily recognize that the disclosed probes and antibodies of the present invention can be readily incorporated into one of the established kit formats which are well known in the art.
4.17 MEDICAL IMAGING
The novel polypeptides and binding partners of the invention are useful in medical imaging of sites expressing the molecules of the invention (e.g., where the polypeptide of the invention is involved in the immune response, for imaging sites of inflammation or infection). See, e.g., Kunkel et al., U.S. Pat. NO. 5,413,778. Such methods involve chemical attachment of a labeling or imaging agent, administration of the labeled polypeptide to a subject in a pharmaceutically acceptable carrier, and imaging the labeled polypeptide in vivo at the target site.
4.18 SCREENING ASSAYS
Using the isolated proteins and polynucleotides of the invention, the present invention further provides methods of obtaining and identifying agents which bind to a polypeptide encoded by an ORF corresponding to any of the nucleotide sequences set forth in SEQ ID NO: 1-30368, or bind to a specific domain of the polypeptide encoded by the nucleic acid. In detail, said method comprises the steps of:
(a) contacting an agent with an isolated protein encoded by an ORF of the present invention, or nucleic acid of the invention; and
(b) determining whether the agent binds to said protein or said nucleic acid. In general, therefore, such methods for identifying compounds that bind to a polynucleotide of the invention can comprise contacting a compound with a polynucleotide of the invention for a time sufficient to form a polynucleotide/compound complex, and detecting the complex, so that if a polynucleotide/compound complex is detected, a compound that binds to a polynucleotide of the invention is identified. Likewise, in general, therefore, such methods for identifying compounds that bind to a polypeptide of the invention can comprise contacting a compound with a polypeptide of the invention for a time sufficient to form a polypeptide/compound complex, and detecting the complex, so that if a polypeptide/compound complex is detected, a compound that binds to a polynucleotide of the invention is identified. Methods for identifying compounds that bind to a polypeptide of the invention can also comprise contacting a compound with a polypeptide of the invention in a cell for a time sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a receptor gene sequence in the cell, and detecting the complex by detecting reporter gene sequence expression, so that if a polypeptide/compound complex is detected, a compound that binds a polypeptide of the invention is identified.
Compounds identified via such methods can include compounds which modulate the activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to activity observed in the absence of the compound). Alternatively, compounds identified via such methods can include compounds which modulate the expression of a polynucleotide of the invention (that is, increase or decrease expression relative to expression levels observed in the absence of the compound). Compounds, such as compounds identified via the methods of the invention, can be tested using standard assays well known to those of skill in the art for their ability to modulate activity/expression. The agents screened in the above assay can be, but are not limited to, peptides, carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be selected and screened at random or rationally selected or designed using protein modeling techniques.
For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and the like are selected at random and are assayed for their ability to bind to the protein encoded by the ORF of the present invention. Alternatively, agents may be rationally selected or designed. As used herein, an agent is said to be "rationally selected or designed" when the agent is chosen based on the configuration of the particular protein. For example, one skilled in the art can readily adapt currently available procedures to generate peptides, pharmaceutical agents and the like, capable of binding to a specific peptide sequence, in order to generate rationally designed antipeptide peptides, for example see Hurby et al., Application of Synthetic Peptides: Antisense Peptides," In Synthetic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and Kaspczak et al., Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like.
In addition to the foregoing, one class of agents of the present invention, as broadly described, can be used to control gene expression through binding to one of the ORFs or EMFs of the present invention. As described above, such agents can be randomly screened or rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design sequence specific or element specific agents, modulating the expression of either a single ORF or multiple ORFs which rely on the same EMF for expression control. One class of DNA binding agents are agents which contain base residues which hybridize or form a triple helix formation by binding to DNA or RNA. Such agents can be based on the classic phosphodiester, ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric derivatives which have base attachment capacity.
Agents suitable for use in these methods preferably contain 20 to 40 bases and are designed to be complementary to a region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241 :456 (1988); and Dervan et al., Science 251 :1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca
Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into polypeptide. Both techniques have been demonstrated to be effective in model systems.
Information contained in the sequences of the present invention is necessary for the design of an antisense or triple helix oligonucleotide and other DNA binding agents.
Agents that bind to a protein encoded by one of the ORFs of the present invention can be used as a diagnostic agent. Agents which bind to a protein encoded by one of the ORFs of the present invention can be formulated using known techniques to generate a pharmaceutical composition.
4.19 USE OF NUCLEIC ACIDS AS PROBES
Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid hybridization probes capable of hybridizing with naturally occurring nucleotide sequences. The hybridization probes of the subject invention may be derived from any of the nucleotide sequences SEQ ID NO: 1-30368. Because the corresponding gene is only expressed in a limited number of tissues, a hybridization probe derived from of any of the nucleotide sequences SEQ ID NO: 1-30368 can be used as an indicator of the presence of RNA of cell type of such a tissue in a sample.
Any suitable hybridization technique can be employed, such as, for example, in situ hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,188 provides additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used in PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both. The probe will comprise a discrete nucleotide sequence for the detection of identical sequences or a degenerate pool of possible sequences for identification of closely related genomic sequences. Other means for producing specific hybridization probes for nucleic acids include the cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such vectors are known in the art and are commercially available and may be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA polymerase as T7 or SP6 RNA polymerase and the appropriate radioactively labeled nucleotides. The nucleotide sequences may be used to construct hybridization probes for mapping their respective genomic sequences. The nucleotide sequence provided herein may be mapped to a chromosome or specific regions of a chromosome using well known genetic and/or chromosomal mapping techniques. These techniques include in situ hybridization, linkage analysis against known chromosomal markers, hybridization screening with libraries or flow-sorted chromosomal preparations specific to known chromosomes, and the like. The technique of fluorescent in situ hybridization of chromosome spreads has been described, among other places, in Verma et al (1988) Human Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York NY. Fluorescent in situ hybridization of chromosomal preparations and other physical chromosome mapping techniques may be correlated with additional genetic map data. Examples of genetic map data can be found in the 1994 Genome Issue of Science (265 : 1981 f). Correlation between the location of a nucleic acid on a physical chromosomal map and a specific disease (or predisposition to a specific disease) may help delimit the region of DNA associated with that genetic disease. The nucleotide sequences of the subject invention may be used to detect differences in gene sequences between normal, carrier or affected individuals.
4.20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES
Oligonucleotides, i.e., small nucleic acid segments, may be readily prepared by, for example, directly synthesizing the oligonucleotide by chemical means, as is commonly practiced using an automated oligonucleotide synthesizer.
Support bound oligonucleotides may be prepared by any of the methods known to those of skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy is to precisely spot oligonucleotides synthesized by standard synthesizers. Immobilization can be achieved using passive adsorption (Inouye & Hondo, (1990) J. Clin. Microbiol.28(6) 1469-72); using UV light (Nagata et al, 1985; Dahlen et al., 1987; Morrissey & Collins, (1989) Mol. Cell Probes 3(2) 189-207) or by covalent binding of base modified DNA (Keller etal, 1988; 1989); all references being specifically incorporated herein.
Another strategy that may be employed is the use of the strong biotin-streptavidin interaction as a linker. For example, Broude et al. (1994) Proc. Natl. Acad. Sci. USA 91 (8) 3072-6, describe the use of biotinylated probes, although these are duplex probes, that are immobilized on streptavidin-coated magnetic beads. Streptavidin-coated beads may be purchased from Dynal, Oslo. Of course, this same linking chemistry is applicable to coating any surface with streptavidin. Biotinylated probes may be purchased from various sources, such as, e.g., Operon Technologies (Alameda, CA). Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used. Nunc
Laboratories have developed a method by which DNA can be covalently bound to the microwell surface termed Covalink NH. CovaLink NH is a polystyrene surface grafted with secondary amino groups (>NH) that serve as bridge-heads for further covalent coupling. CovaLink Modules may be purchased from Nunc Laboratories. DNA molecules may be bound to CovaLink exclusively at the 5'-end by a phosphoramidatebond, allowing immobilization of more than 1 pmol of DNA (Rasmussenet α/., (1991) Anal. Biochem. 198(1) 138-42).
The use of CovaLink NH strips for covalent binding of DNA molecules at the 5'-end has been described (Rasmussen et al., (1991). In this technology, a phosphoramidatebond is employed (Chu et al., (1983) Nucleic Acids Res. 11(8) 6513-29). This is beneficial as immobilization using only a single covalent bond is preferred. The phosphoramidatebond joins the DNA to the CovaLink NH secondary amino groups that are positioned at the end of spacer arms covalently grafted onto the polystyrene surface through a 2 nm long spacer arm. To link an oligonucleotide to CovaLink NH via an phosphoramidatebond, the oligonucleotide terminus must have a 5'-end phosphate group. It is, perhaps, even possible for biotin to be covalently bound to CovaLink and then streptavidin used to bind the probes.
More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) and denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1 M 1 -methylimidazole, pH 7.0 ( 1 -Melm ), is then added to a final concentration of 10 mM 1 -Melm7. A ss DNA solution is then dispensed into CovaLink NH strips (75 ul/well) standing on ice.
Carbodiimide 0.2 M 1 -ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), dissolved in 10 mM 1 -Melm7, is made fresh and 25 ul added per well. The strips are incubated for 5 hours at 50°C. After incubation the strips are washed using, e.g., Nunc-Immuno Wash; first the wells are washed 3 times, then they are soaked with washing solution for 5 min., and finally they are washed 3 times (where in the washing solution is 0.4 N NaOH, 0.25% SDS heated to 50°C).
It is contemplated that a further suitable method for use with the present invention is that described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated herein by reference. This method of preparing an oligonucleotide bound to a support involves attaching a nucleoside 3'-reagent through the phosphate group by a covalent phosphodiester link to aliphatic hydroxyl groups carried by the support. The oligonucleotide is then synthesized on the supported nucleoside and protecting groups removed from the synthetic oligonucleotide chain under standard conditions that do not cleave the oligonucleotide from the support. Suitable reagents include nucleoside phosphoramidite and nucleoside hydrogen phosphorate.
An on-chip strategy for the preparation of DNA probe for the preparation of DNA probe arrays may be employed. For example, addressable laser-activated photodeprotection may be employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described by
Fodor et al. (1991) Science 251 (4995) 767-73, incorporated herein by reference. Probes may also be immobilized on nylon supports as described by Van Ness et al. ( 1991 ) Nucleic Acids Res. 19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1988) Anal. Biochem. 169( 1 ) 104-8; all references being specifically incorporated herein.
To link an oligonucleotide to a nylon support, as described by Van Ness et al. ( 1991 ), requires activation of the nylon surface via alkylation and selective activation of the 5'-amine of oligonucleotides with cyanuric chloride.
One particular way to prepare support bound oligonucleotides is to utilize the light-generated synthesis described by Pease et al, (1994) PNAS USA 91(1 1) 5022-6, incorporated herein by reference). These authors used current photolithographic techniques to generate arrays of immobilized oligonucleotide probes (DNA chips). These methods, in which light is used to direct the synthesis of oligonucleotide probes in high-density, miniaturized arrays, utilize photolabile 5'-protected N-acyl-deoxynucleosidephosphoramidites, surface linker chemistry and versatile combinatorial synthesis strategies. A matrix of 256 spatially defined oligonucleotide probes may be generated in this manner.
4.21 PREPARATION OF NUCLEIC ACID FRAGMENTS
The nucleic acids may be obtained from any appropriate source, such as cDNAs, genomic
DNA, chromosomal DNA, mi crodissected chromosome bands, cosmid or YAC inserts, and RNA, including mRNA without any amplification steps. For example, Sambrook et al. ( 1989) describes three protocols for the isolation of high molecular weight DNA from mammalian cells (p.
9.14-9.23).
DNA fragments may be prepared as clones in Ml 3, plasmid or lambda vectors and/or prepared directly from genomic DNA or cDNA by PCR or other amplification methods. Samples may be prepared or dispensed in multiwell plates. About 100- 1000 ng of DNA samples may be prepared in 2-500 ml of final volume.
The nucleic acids would then be fragmented by any of the methods known to those of skill in the art including, for example, using restriction enzymes as described at 9.24-9.28 of Sambrook et al. (1989), shearing by ultrasound and NaOH treatment. Low pressure shearing is also appropriate, as described by Schriefer et al. (1990) Nucleic
Acids Res. 18(24) 7455-6, incorporated herein by reference). In this method, DNA samples are passed through a small French pressure cell at a variety of low to intermediate pressures. A lever device allows controlled application of low to intermediate pressures to the cell. The results of these studies indicate that low-pressure shearing is a useful alternative to sonic and enzymatic DNA fragmentation methods.
One particularly suitable way for fragmenting DNA is contemplated to be that using the two base recognition endonuclease, Cv/JI, described by Fitzgerald et al. (1992) Nucleic Acids Res. 20(14) 3753-62. These authors described an approach for the rapid fragmentation and fractionation of DNA into particular sizes that they contemplated to be suitable for shotgun cloning and sequencing.
The restriction endonuclease Cv/JI normally cleaves the recognition sequence PuGCPy between the G and C to leave blunt ends. Atypical reaction conditions, which alter the specificity of this enzyme (Cv/JI* *), yield a quasi-random distribution of DNA fragments form the small molecule pUC 19 (2688 base pairs). Fitzgerald et al. (1992) quantitatively evaluated the randomness of this fragmentation strategy, using a Cv/JI* * digest of pUC 19 that was size fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z minus M 13 cloning vector. Sequence analysis of 76 clones showed that Cv/JI* * restricts pyGCPy and PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated at a rate consistent with random fragmentation.
As reported in the literature, advantages of this approach compared to sonication and agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 ug instead of 2-5 ug); and fewer steps are involved (no preligation, end repair, chemical extraction, or agarose gel electrophoresis and elution are needed.
Irrespective of the manner in which the nucleic acid fragments are obtained or prepared, it is important to denature the DNA to give single stranded pieces available for hybridization. This is achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is then cooled quickly to 2°C to prevent renaturation of the DNA fragments before they are contacted with the chip. Phosphate groups must also be removed from genomic DNA by methods known in the art.
4.22 PREPARATION OF DNA ARRAYS
Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane. Spotting may be performed by using arrays of metal pins (the positions of which correspond to an array of wells in a microtiter plate) to repeated by transfer of about 20 nl of a DNA solution to a nylon membrane. By offset printing, a density of dots higher than the density of the wells is achieved. One to 25 dots may be accommodated in 1 mm , depending on the type of label used. By avoiding spotting in some preselected number of rows and columns, separate subsets (subarrays) may be formed. Samples in one subarray may be the same genomic segment of DNA (or the same gene) from different individuals, or may be different, overlapped genomic clones. Each of the subarrays may represent replica spotting of the same samples. In one example, a selected gene segment may be amplified from 64 patients. For each patient, the amplified gene segment may be in one 96-well plate (all 96 wells containing the same sample). A plate for each of the 64 patients is prepared. By using a 96-pin device, all samples may be spotted on one 8 x 12 cm membrane. Subarrays may contain 64 samples, one from each patient. Where the 96 subarrays are identical, the dot span may be 1 mm and there may be a 1 mm space between subarrays.
Another approach is to use membranes or plates (available from NUNC, Naperville, Illinois) which may be partitioned by physical spacers e.g. a plastic grid molded over the membrane, the grid being similar to the sort of membrane applied to the bottom of multiwell plates, or hydrophobic strips. A fixed physical spacer is not preferred for imaging by exposure to flat phosphor-storage screens or x-ray films.
The present invention is illustrated in the following examples. Upon consideration of the present disclosure, one of skill in the art will appreciate that many other embodiments and variations may be made in the scope of the present invention. Accordingly, it is intended that the broader aspects of the present invention not be limited to the disclosure of the following examples. The present invention is not to be limited in scope by the exemplified embodiments wliich are intended as illustrations of single aspects of the invention, and compositions and methods which are functionally equivalent are within the scope of the invention. Indeed, numerous modifications and variations in the practice of the invention are expected to occur to those skilled in the art upon consideration of the present preferred embodiments. Consequently, the only limitations which should be placed upon the scope of the invention are those which appear in the appended claims.
All references cited within the body of the instant specification are hereby incorporated by reference in their entirety.
5.0 EXAMPLES
5.1 EXAMPLE 1
Novel Nucleic Acid Sequences Obtained From Various Libraries
A plurality of novel nucleic acids were obtained from cDNA libraries prepared from various human tissues and in some cases isolated from a genomic library derived from human chromosome using standard PCR, SBH sequence signature analysis and Sanger sequencing techniques. The inserts of the library were amplified with PCR using primers specific for the vector sequences which flank the inserts. Clones from cDNA libraries were spotted on nylon membrane filters and screened with oligonucleotide probes (e.g. , 7-mers) to obtain signature sequences. The clones were clustered into groups of similar or identical sequences. Representative clones were selected for sequencing. In some cases, the 5' sequence of the amplified inserts was then deduced using a typical
Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied Biosystems (ABI) sequencer to obtain the novel nucleic acid sequences. In some cases RACE (Rapid Amplification of cDNA Ends) was performed to further extend the sequence in the 5' direction.
5.2 EXAMPLE 2
Novel Contigs
The novel contigs of the invention were assembled from sequences that were obtained from a cDNA library by methods described in Example 1 above, and in some cases sequences obtained from one or more public databases. The sequences for the resulting nucleic acid contigs are designated as SEQ ID NO: 1 -30368 and are provided in the attached Sequence Listing. The contigs were assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend the seed EST into an extended assemblage, by pulling additional sequences from different databases (i.e., Hyseq's database containing EST sequences, dbEST version 115, gb pri 115, and UniGene version 103, and exons from public domain genomic sequences predicted by GenScan) that belong to this assemblage. The algorithm terminated when there was no additional sequences from the above databases that would extend the assemblage. Further, the inclusion of component sequences into the assemblage was based on a BLASTN hit to the extending assemblage with BLAST score greater than 300 and percent identity greater than 95%.
The novel predicted polypeptides (including proteins) encoded by the novel polynucleotides (SEQ ID NO: 1 -30368) of the present invention are incorporated in the attached Sequence Listing. A subset the predicted polypeptide sequences contain an unknown amino acid, a stop codon, a possible nucleotide deletion or a possible nucleotide insertion. These sequences have been shown in their entirety with the special characters in Table 2. Table 2 also shows the corresponding start and stop nucleotide locations to each of SEQ ID NO: 1 -30368. Table 2 also indicates the method by which the polypeptide was predicted. Method A refers to a polypeptide obtained by using a software program called FASTY (available from http://fasta.bioch.virginia,edu which selects a polypeptide based on a comparison of the translated novel polynucleotide to known polynucleotides (W.R. Pearson, Methods in Enzymology, 183:63-98 (1990), herein incorporated by reference). Method B refers to a polypeptide obtained by using a software program called GenScan for human/vertebrate sequences (available from Stanford University, Office of Technology Licensing) that predicts the polypeptide based on a probabilistic model of gene structure/compositional properties (C. Burge and S. Karlin, J. Mol. Biol., 268:78-94 (1997), incorporated herein by reference). Method C refers to a polypeptide obtained by using a Hyseq proprietary software program that translates the novel polynucleotide and its complementary strand into six possible amino acid sequences (forward and reverse frames) and chooses the polypeptide with the longest open reading frame. The nearest neighbor results for SEQ ID NO: 1 -30368 were obtained by a BLASTP version 2.0al 19MP-WashU search against Genpept release 121 and Geneseq release 200103 (Derwent), using BLAST algorithm. The nearest neighbor result showed the closest homologue for SEQ ID NO: 1-30368. The nearest neighbor results for SEQ ID NO: 1-30368 are incorporated in the attached Sequence Listing. Using eMatrix software package (Stanford University, Stanford, CA) (Wu et al., J.
Comp. Biol., Vol. 6 pp. 219-235 (1999) herein incorporated by reference), all the sequences were examined to determine whether they had identifiable signature regions. The attached Sequence Listing provodes the results obtained by eMatrix analysis for each polypeptide as follows: the signature region found in the indicated polypeptide sequences, the description of the signature, the eMatrix p-value(s) and the position(s) of the signature within the polypeptide sequence.
Using the pFam software program (Sonnhammer et al., Nucleic Acids Res., Vol. 26(1) pp. 320-322 (1998) herein incorporated by reference) all the polypeptide sequences were examined for domains with homology to certain peptide domains. The attached Sequence Listing provides the results obtained by PFAM analysis for each peptide, namely: the name of the domain found, the description, the p-value and the pFam score for the identified domain within the sequence.
Tables 1 and 2 follow. Table 1 shows the various tissue sources of SEQ ID NO: 1-30368. Table 2 shows the start and stop nucleotides for the translated amino acid sequence for which each assemblage encodes. Table 2 also provides a correlation between the amino acid sequences set forth in the Sequence Listing, the nucleotide sequences set forth in the Sequence Listing and the SEQ ID NO: in USSN 09/540,217
Table 1
Tissue RNA Library SEQ ID NOS: origin Source Name adult brain GIBCO AB3001 39-41 192197-200315-316540-542576-580608-622 63510041185-11871273-1279143114741721-1722 20362136-213724572471-247425132599-26032988- 29893105-310632123276-32773306-330833523365 3374-337634333448-34503555-355836933949-3953 4067-40724160-41624558-45604581-45824612-4614 4837-48405483-54845603-5606570058025980-5984 6135-61366403-64046452-64537209-72127447-7449 7452-74607536-75417554-75557622-76237630-7636 7660-76657701-770377717778-77837798-78017921- 792379948010-80128025-80268145-81518227-8229 84158497-84998936-89388986-89919002-90049013- 90179337-93389366-93689375-93769391-93929395- 93969431-943694439475-94769517-95189522-9525 9586-95899603-96049851-98529854-98559874-9895 9905-99089947-99529969-99809986-999210025- 1002610033-1003710167-101721027710480-10482 10488-1048910498-1050310520-1052210537-10538 10592-1059410628-1063011226-1122711339-11344 11406-1140711431-1143211731-1173412150-12151 1223912241-1224412555-1255912615-1261812785- 1278712978-1298112984-1298512997-1299913567- 1356813592-1359513606-1360813873-1387513999- 1400414360-1436914650-1465114684-1468515013- 150181509615174-1518115209-1521015250-15251 1525715323-1532415548-1555215568-1557215576- 1557715588-1558915699-1570015881-1588316438- 1643916473-1647816496-1649716609-1661116686- 1669316700-1670116727-1672916836-1684216934- 1693716949-1695317455-1745617857-1786117958- 1796318029-1803018136-1813818423-1842518516- 1851818535-1853718624-1862618668-1867218719- 1872218750-1875618790-1879318802-1880418836- 1883818899-1890318919-1892118943-1894518947- 1895018964-1896918989-1899019013-1901719045- 1904819057-1906519142-1914719154-1915519224 19316-1931719345-1934919355-193601936219370 19385-1938919415-1941719422-1943119442-19444 1950319560-195621956619604-196071969319709- 1971019727-1973219736-197421977219804-19808 19921-1992919933-1993819943-1994619969-19981 20015-2001720029-2004320087-2009420099-20102 20111-2011220122-2012720161-2016420167-20171 20180-2018120189-2019420198-2019920215-20218 20281-202822028920321-2032420349-2035420361 20393-2040020415-2041720437-2044020524-20535 20542-2054520554-2055820607-2061220614-20615 20646-2065220698-2070720718-2072520727-20732 20789-2079120806-2081220844-2084920888-20889 2092620938-2094220999-2100421027-2103121062- 2106621072-2107521137-2114021145-2114821153- Tissue RNA Library SEQ ID NOS: origin Source Name
2115421272- -2127421277-21283 21410-2141421434- 2143921485--2149121495-2150021647-2165521729- 2173321929--2193521958-2196121973-2197421978 22000-2200622026-2202922040--2204122087-22088 22101-2210722141-221432216022250-2225222284- 222892230922314-2231722336-2234222347-22348 22358-223592237222405-224082249522534-22539 22634-2264322653-2265422661--2266222665-22667 22671-2267422700-2270122794--2279622805-22809 22887-2289122899-2290022948-2295022952-22953 22982-2298622991-2299423059--230602307123141 232492325123329-2333723412-2341423489-23490 23492-2349323508-2350923543--235442370423834- 2383523890-•238922395924014-2401825289-25290 25319-2532125374-2537525966-2596826205-26206 26258-262592630326316-26321263272633726373- 2637426596-■2660126788-267892684326850-26852 2689727067-•2707027100-2710227150-2715127247- '2725127304-■2730527439-2744027493-2749527636- 2763927750-■2775427814-2781827861-2786427890- 2789227989-2799028099-2810028311-2831328424 28426-2842829278-2928329409-•294162944429718- 2972130141-30142 adult brain GIBCO ABD003 3051-52144-145170-171180-181202-203255-256302 315-316319-323326-327395406-407412-413464-465 540-542549553-554576-578626-637652-653656697- 699716-717721-725785-786825847-855952967969- 9711001-100410471067-10711092-10941097-1098 1123-11271169-11791259-12621269-127913071431 1453-14541471-147714831490-14921602-16031644- 16451667-1668184018601887-1889193119671986- 198720582275-22762383-23882455-245724692471- 247425132540-25412577-258425912599-26032663- 26652692-269427102814-281529262937-29382975- 297730013006-300830903151-31543205-32073366 343334513472-34733531-35343555-355835903624 36353671-367236853705-37063735-37363949-3953 4053-40544080-40824124-41264406-44074489-4493 4517-45224562-45734623-46394785-47894845-4851 4857-48614874-48894897-48984971-49745092-5094 5267-52685291-52945335-53365480-54825608-5609 5700576258025808-58125919-59215956-59575980- 59845986-59886011-60216140-61436293-62966400 6403-64046406-64106651 -66536662-66646791-6794 6877-68786932-69387214-72157245-72487447-7449 7536-75417611-76127624-76297640-76427668-7670 7695-76967757-775977717778-77867789-77937798- 780178907898-790079767986-79877995-79988002 8017-80198049-805680698117-81218152-81568162 8174-81778182-81838242-824682508301-83068343- 83448351-83608363-83668368-83708409-84148497- 84998512-85138543-85508607-86098612-86168754- 87598762-87668768-87708777-87798917-89189013- 90179031-90359038-90459063-90679072-90749306- 93139321-93239375-93769391 -93929406-94079437- Tissue RNA Library SEQ ID NOS: origin Source Name
94409444-94459454-94559517- 951895319561-9562 9603-96049729-97319733-97389757-97589763-9767 98269828-98329851-98529854-98559864-98669874- 98979923-99249947-99539956-99579969-998010011 10015-1001610033-1004010167-1017210265-10272 102771030610449-1045010470--1047310498-10503 10537-1054210592-1059410607-1060810612-10614 10624-1062610628-1063010638-1063910870-10875 10881-1088310886-1088910891 -1089310895-10898 10904-1090510913-1091410980-1098511035-11037 11066-110681108111123-1112411274-1127511295- 1129911339-1135011419-1142211465-1146611582- 1158311586-1160211607-116081167911693-11695 11731-1173411749-1175011775--1177611780-11782 11803-1180411835-118361184011842-1185511901- 1190511937-1193812042-1204412121-1212612131- 1213212150-1215112186-1218912194-1219612206- 1220812283-122841236112555-■1255912573-12574 12581-1258212615-126181263712653-1265412673- 126751272312760-1276212785--1278712796-12798 12805-1280613077-1307913083-1308613576-13579 13592-1359513603-1361213638-13641 1366413865- 1386613885-1388713903-1390513994-1399714008- 1402014023-1402614044-1404514130-1413114141- 1414214187-1419514264-1426514268-1426914299- 1430114313-143171434614360- 143691460414607- 1460914640-1464214650-1465114684-1468514789- 1479115019-1502415093-1509515182-1518315218- 1521915257-1525915290-1529115406-1540715486- 1548915532-1553515543-1554615553-1555615576- 1557715588-1558915631-1563215699-1570015988- 1599016006-1601516044-1604616075-1607916086- 1608816107-161091617216397-1639816422-16429 16451-1645216470-1647816498-•1650016609-16611 16636-16637166421665216698-1670516836-16842 16934-1693917010-1701417284-■1728517330-17332 1796318015-1801618029-1803018136-1813818400- 1840218419-1842018423-1842518492-1849418516- 185181852718533-185371861718625-1862618633- 1863718671-1867218689-1869218717-1872218750- 1875618759-18761 18771-1877218778-1879318796 18802-1880418811-1881318822- 1882418856-18880 18882-1888818899-1890318919- 18921 18934-18939 18941 18947-18951 18955-1895918975-1897718989- 1899018993-1899619005-1900919013-1901819045- 1904819057-1905819062-1906519074-1908019102- 1910519142-1914719154-191551915919209-19210 19213-1922019251-192521925719260-1926219266- 1926719306-1930919316-1931719355-1936019362- 193641937019373-1937419380-1938419387-19389 19395-1940019415-1941719422- 1943419442-19444 19446-1944819461 1948719526-195291953619560- 195621956619604-196071962619656-1965719667- 19668196931969819709-1971019727-1974219759- 1976319800-1980819813-1981519921-1992919933- Tissue RNA Library SEQ ID NOS: origin Source Name
1994619953-1995719962-1996319972-1998120029- 2004320072-2007920099-2010220106-2011220114- 201202013020146-2014820151-2015420161-20164 20167-2017120180-2019420198-2020720222-20225 20235-2024020257-2026220265-202782028920316 20321-2032420328-2033020345-203542036020393- 2040020415-2042020425-2043120441-2044620469- 2047420476-2047920485-2049020502-2050320505- 2051020514-2052020542-2054520548-2055720559- 205622056820607-2061220614-2061920644-20645 20649-206712068120683-2068520689-2069220698- 2070720712-2071320718-207252075320758-20767 20789-2079720806-2081220824-208492086320897- 2090020927-2092820938-2094220952-2095420999- 2100421027-2104521062-2106621069-2107521105- 2111121141-2114221153-2115421171-2117421197- 2119821202-2120721225-2122621229-2123521237- 2124721256-2126221272-2127421277-2128021297- 2129821301-2130321351-2135221434-2143921446- 2145021467-2146921485-2149121647-2165521712- 2171721729-2173321881-2188521899-2190221905- 2191021917-219212192421929-2193821948-21950 21955-2195721971-2197221978-2198222000-22015 22020-2202922042-2204622080-2208822090-22094 22101-2210722117-2211922141-221512216022169- 2217022187-2219222208-222262223022251-22252 22261-2226422277-2228922300-223022231822329- 2233222343-2235022358-2235922365-2237122373 22381-2238822399-2240422409-2241022434-22435 22440-22448224952255922571-2258122607-22609 22644-2265122653-2265422661-2266222665-22667 22671-2267422703-2270622760-2276222794-22796 22823-2282922857-2285822870-2287422881-22893 22923-2292422948-229542297322982-2298623007- 2302123047-230522307023080-2308323112-23116 23212-2321523229-2323323237-232392324923251 23329-2333723343-2334423382-233902339923412- 2342123486-2348723489-2349023492-2349323495- 2349623508-235092370423718-2372123726-23730 23761-2376323771-237802380023802-2380923816- 2381923827-2383323836-2383723843-2384423878- 2388023890-2389223941-239562395924005-24011 24014-2401824021-2402424772-2477425085-25090 25279-2529025307-2530825319-2532225373-25375 25403-2540725598-2560326196-2619926209-26213 26217-2621826221-2622326237-2623926258-26259 2626626304263272633726347-2634826350-26352 26359-2636526373-2637426376-263772639526423 26469-2647026596-2660126665-2666626681-26683 26691-266942673626755-2675626788-2678926844- 2684526876-2687927044-2704727053-2705727067- 2707027100-271022710527133-2713427193-27200 2720627209-272132721827254-2726027269-27270 27281-2728227299-2730127304-273052733427340- 2734227493-2749527501-2750327544-2754527574- Tissue RNA Library SEQ ID NOS: origin Source Name
27577 27600- -27606 27700-27701 27814-27815 27823- 2782427861- -2786427890-27892 27940-27942 27970 28040-28041 28099--2810028142- -2814528186 28263- 2826828286 28311-2831328324- 2834528361- 28362 2842429301- •2930329328-29337 29339-29340 29343- 2934529409- 2941629594-29604 29718-29721 29940- 2994929960- 2996130141-30142 30150-30156 30218- 3022030233- 3023530240-30242 adult brain Clontech ABR001 15227-230329-330414-418716-717934-9351136- 11391436-14371472-14731505-15061593-15942058 2132-21372139-21422378-238124072550-25522577- 25842587-25883094-30963221-322233773414-3417 3526-352938613949-39534340-43414515-45164574- 45764857-48614986-49875092-5094565457005864- 586659926140-61436540-65416570-65716814-6831 7668-76707802-780479948008-80098017-80198111 8129-8131816081628242-82468368-83698453-8454 8512-85138762-87668982-898393399391-93929510- 9516953196669682-9683982810167-1017210312 10520-1052210913-1091410959-109621106411071- 1107511345-1135011805-1180811835-1183611900 11937-1193812050-1205612194-1219712796-12798 13925-139261460414714-1471714785-1478615182- 1518315400-1540315462-1546315545-1554615563- 1556416123-1612816174-1617616570-1657316601 166231664216851-168531692416934-1693717963 18015-1801618046-1804818500-1850118516-18518 1853518655-1865818671-1867218762-1876618955- 189591901819045-1904819207-192081925719350 19380-1938419447-1944919484-1948619526-19529 19659-19661 19670-1967119706-1971019764-19767 19804-1980819924-1992519962-1996319965-19967 2012020189-2019420231-2023420271-2027320412- 2041320441-2044620456-2046820485-2049420521- 2052320676-2068020710-2071120718-2072520733- 2073420747-2075120824-2082620836-2084320926 21060-2106121069-2107521105-2111121153-21154 21353-2135421410-2141421454-2145721554-21556 21647-216552192421929-2193522003-2201522040- 2204122045-2204622077-2208322108-2211622165- 2216822246-2224922284-2228922373-2237422411- 2243222625-2262822637-2264322671-2267423080- 2308323112-231192314123201-2320223358-23360 23412-2341823526-235312376123793-2379723802- 2380523878-2388024014-2401824105-2411324116 25403-2540726232-2623326270-2627226285-26290 26685-2668627012-2701427028-2702927098-27099 27377-2737827493-2749527544-275452762327640- 2764127729-2773927840-278442797028361-28362 2842429427-2943830233-30235 adult brain Clontech ABR006 21360-361510-511546-547579-580792-795969-971 1165-11681228-1231 1252-12561453-14541472-1473 1681-16871975-19772044-20452214-22192231-2232 2270-227123062396-240024582826-28272951-2955 31583274-32753313-33143326-33313483-34843686 Tissue RNA Library SEQ ID NOS: origin Source Name
3856-38573939-39414080-40824091-409541734441- 44434484-44854765-47734785-47894795-48094988 499253645930-59315971-59726155-61596450-6451 7353-736773747580-75977701-77037784-77867972- 797580208101-810481068152-815681628184-8185 8342837084898758-87598936-89388978-89819009- 90179029-90309063-90679069-90789306-93119339 9479-94859517-951898289874-98959909-991210033- 1003710254-1026110498-1050310523-1053210624- 1063910923-1092610936-1093711328-1133111708- 1171012573-1257412743-1275012785-1278713556- 1355913660-1367513847-1384914054-1405614170- 1417214347-14348143901460414666-1466814996 150971525715283-1528615336-1534015397-15398 15576-1557715886-1588715896-1590516022-16023 1623716461-1646416496-164971662316643-16648 1665216752-1675317238-1724117369-1737017937- 1793917956-1795718029-1803018044-1804518049- 1806418393-1839418411-1841818423-1842518527 18536-1853718719-1872218762-1876618778-18780 18829-1883418899-1890318934-1893519001-19004 1915919207-1920819211-1921219221-1922219273 1935019484-1948619670-196711969819727-19732 19736-1974219800-1980819814-1981519855-19856 1993919943-1994619951-1995219972-1998020069- 2007120087-2009420099-2010220122-2012720157- 2018120200-2020720274-202782031620321-20324 2041420441-2044620456-2046820491-2049420524- 2053520542-205452054720554-2055720607-20612 20676-2068020718-2072520747-2075120897-20900 21005-2100821069-2107121088-2109621145-21148 21157-2116921171-2117421176-2118021213-21215 21241-2124721263-2126621272-2127421277-21280 21343-2135021377-2139721463-2146521554-21556 21911-2191221929-2193521955-2195721973-21974 2197822007-220152201922045-2204622070-22076 22080-2208822090-2209122108-2211422120-22127 22135-2213822144-2215122158-2215922246-22249 22284-2228922347-2234822375-2238222389-22432 22625-2262822653-2265422671-2267422833-22834 22881-2288622916-2292223007-230212307123080- 2308323223-2322523229-2323323358-2336023379- 2338123412-2341423434-234382348623506-23507 23543-2354423555-2356523771-2378023827-23833 23841-2384423941-2395624035-2404025213-25219 25279-252902531525383-2541526235-2623626280 26310-263142633726361-2636526409-2641026425- 26427266032668726860-2686227067-2707027100- 271022710527193-2720027209-2721327441-27443 27562-2756327589-2759227821-2782227825-27838 28146-28147282962846429328-2933129343-29345 29409-2941629439-2944529940-299493020530221- 30223 adult brain Clontech ABR008 3032-3342-44101 167180-181 197-200233-234307- 308319-327329-330332414-420540-545549576-580 Tissue RNA Library SEQ ID NOS: origin Source Name
612652656667-668694-695706-711716-717763845- 848852-855864-867895-898900910912927949-950 953-954981-9851001-100510291044-10461097-1098 11071123-11271131-11331158-1161 1169-11791185- 118711941196-11981203-12041210-12111232-1237 1273-1279130413071409-14101432-143514701483 1508-1510151215141566-15751577-15891591-1594 15961599-16101618-16191622-162616291632-1636 1640-1641 1643-164816751678-16871692-16931705- 17141717-17191721-17231828183818511853-1854 1885-18861905-19071923-19271931 1956-19591961- 19621967-198819902144-21502194-22032206-2219 2231-22322270-22742284-229823062379-23802383- 2415245724752570-25902592-260326072663-2665 2675-2678270827102724-27262730-27322781-2782 28062808-28112826-28272871-28762886-28872889- 289129202923-29242934-29352948-29492951 -2955 2983-29862988-298930013016-3017303930903098- 31003105-310631303147-31693182-31873192-3193 3195-32123214-32153219-32293237-32413244-3246 3250-32723276-3277329033093313-33143322-3323 3326-333133773414-34203426-34333477-34783507 3512-35143518-35203526-352935483555-35583593 3595-359936043618-36193635-36363657-36603671- 36723683-36843687-3691369337043735-37363752- 37533780-381738623866-38673873-387539113949- 39534053-40544062-408540904121-412641314141- 41434151-415541744209-42114484-44854515-4516 45344536-45394541-454245464574-45784593-4594 4660-466247004720-47254751-47584762-47734777- 47794785-47894795-48094819-48234825-48264854- 48564874-48894946-49484950-49564958-49634970 4984-49875100-510851225124-51265131-51325146 5154-51795190-51915246-52525272-52755331-5332 5485-54865525-55265532-55335557-55605567-5568 56555695-56985744-57475803-58045856-58585885- 58865899-59025973-59755977-59786005-60066010- 60216026-60426154-61616166-61686184-61856209- 62116219-62226235-62376297-629863416350-6351 636763786406-64106439-64406450-64516542-6543 6549-65776582-6584662266246640-66446662-6664 6669-66706773-67786832-68506884-68856898-6899 6943-69587187-71917198-72037206-72087213-7215 7236-72377249-72667275-72787350-73527364-7365 75517561-75627580-76007615-76177630-76367668- 76737681-76877693-769676987734-77367745-7746 7763-77717787-77887798-78017805-78087815-7818 7898-79017904-79127917-79187921-79237947-7948 7978-79837986-79877993-799480028006-80078010- 80168043-80458047-80488059-806880978107-8108 8112-81168132-81448152-81568160-81778184-8185 8187-82008204-82108227-82328253-825782698297- 830783428347-83488351-83608363-83648391-8399 8409-84158420-84218430-84528457-84588461-8477 8484-850585098512-85138543-85508554-85558588- Tissue RNA Library SEQ ID NOS: origin Source Name
85978612-86168688-869087098724-87378758-8759 8771-87768914-89458998-90019009-902090369049- 90679069-90719075-90789317-93189337-93399375- 937693829427-94289431-94339437-94409456-9458 9479-94819510-95189529-95319552-95549563-9566 95859590-95929594-96029617-9626966796729732 9745-97549786-97909792-97959808-98129828-9832 9857-98619867-98689874-98959899-99049909-9912 9925-99539965-99809984-99929997-1000910033- 1003710164-1017210265-1026710320-1032310326- 1032810454-1046610470-1048210498-1050510513- 1051510520-1052210543-1054910595-1059910603- 1060610609-1061110615-1063010640-1064410780- 10782108341087610881-1088310895-1089810906 10930-1093310936-109371094010965-1096610972 10976-1097810980-1098511024-1102711042-11044 11064-11070110801116011204-1120711212-11216 11228-1122911317-1132111345-1135011397-11398 11476-1147811711-1171211743-1174411757-11760 11805-11809118401184211878-1187911890-11891 11901-1190511910-11911119161193411937-11938 11947-1195011971-1197312011-1201312050-12057 12102-1211012112-1211512117-1211812144-12146 12190-1219112194-1219612202-1221012215-12217 1222512241-1225312255-1225812374-1237712398- 1240112403-1240512408-1240912412-1241912436 12545-1254612571-1257412593-1259712610-12612 12625-126311263712641-1264512648-1265412659- 1266212668-126691272312785-127871286912898- 1289913100-1310613499-1352813531-1353913545- 1355913571-1357313576-1358013584-1358813632- 1363313638-1364113645-1365113668-1367213757- 1375913850-1385313885-138871389813918-13919 1393013994-1399713999-1400314044-1404514058- 1405914061-1406314092-1409414105-1410614170- 1417214192-1419514199-14201 14209-1421514227 14236-1423814261-1426314283-1429114307-14317 14344-1434514360-1436914389-1439214452-14455 14570-145741460414607-1460914650-1465114681- 1468214684-1470414709-1471914789-1479114822- 1483814846-1484714871-1487414991-1499214994- 1499614998-1500215009-1502415093-1509515151- 1515715187-1518915212-1521515218-1521915222- 1522415243-1524815250-1525115257-1525915283- 1528615288-1528915292-1529315300-1530115329- 1534015343-1534415347-1535915381-1538515397- 1539815404-1540715443-1545315460-1546115464- 1546515468-1547215476-154781548515491-15525 15543-1554415563-1556415567-1557715591-15592 15606-1560715623-1562815661-1566315688-15694 15699-1570015705-1570815766-1577115780-15788 15855-1587915890-1589515902-1590315961-15963 15981-1598215988-1599016030-1604316053-16058 16060-16061 16096-1609716118-1612016123-16128 16130-161321613516138-1613916167-1617216174- Tissue RNA Library SEQ ID NOS: origin Source Name
1617616200-16201 1621816224-1622516233-16235 1623716290-1629516342-1634416350-1635116359 16422-1642916451-1645216456-1646016470-16472 16479-16481 16496-1649716501-1650416553-16556 16574-1657716614-166151662316628-1663116635 16643-1665516710-1671216714-1672316730-16733 16738-16739167421674516747-1674816752-16753 16765-1676716812-1682116836-1684216854-16858 16860-1687016878-1689816903-1690916911-16912 16925-1693216946-1694816967-1697416996-16999 17018-1702317067-1706917074-170791711517117- 1713217137-1714217145-1714617149-1716717202- 172031724217244-1724817263-1726717284-17306 17330-1733217359-1736017366-173701739617457- 1746217489-1749017571-1757417857-1786117922- 1792617930-1793117933-1793917949-1795017953- 1795517958-1796217995-1800418020-1802218029- 1803018038-1804218055-1806018097-1813418136- 1813818269-182701840318421-1842218500-18501 18516-1851818533-1853518621-1862418651-18658 1866118668-1867218717-1872618745-1874918759- 1876018762-1876618787-1879318806-1880718829- 1883418846-1885318857-1888018894-1889618904- 1890718915-1893318947-1895018964-1896918993- 1899519001-190091901219029-1903519045-19048 1905519057-1906519068-1907019074-1908019091- 1909219096-1910119142-1914719159-1916119211- 192201922419254-192551925719266-1926719274 19295-1929619306-1930719310-1931219316-19317 19331-1934019345-1935419375-1937919385-19389 19395-1940219422-1943119446-1944819458-19461 19464-1948619510-1952419526-1953719539-19554 19560-1956219564-1956619598-1960119617-19619 1962719659-1966819670-1967819683-1968719698- 1969919706-1971019727-1973519743-1974819757- 19771 19773-1980819813-1981519819-1982219855- 1985619921-1992319926-1992919938-1993919943- 199461995019953-1995719962-1996319965-19967 19972-19981 19995-2001720026-2002720053-20079 20087-2009420099-2010520111-2011220120-20154 20157-2018120189-2019420198-2022120231-20241 20244-2025220257-2026220265-2027020274-20278 20281-2028220286-2028720289-202962029920328- 2033020336-2033820345-2035420357-2035920401- 2040520414-2041720425-2043120437-2047120475- 2047920491-2049620505-2051020514-2053620542- 205452054720554-2056820575-2057820607-20612 20614-2061520625-2062820631-2063620642-20643 20646-2068720689-2069220708-2073420739-20757 2076720789-2079720801-2081220822-2082320827- 2084920853-20854208632086520879-2088020890- 2090120927-209322094320948-2094920952-20954 20977-2098420999-2100821016-2102021032-21045 21062-2106821076-2108721101-2111121122-21127 21133-2113521153-2115421157-2116921171-21174 Tissue RNA Library SEQ ID NOS: origin Source Name
21176-2119321225-2122621237-2124721253-21271 21277-2128021284-2129421297-2129821326-21334 21343-2135421377-2139721410-2141421434-21439 21454-2146121480-2148221485-2150021532-21533 21554-2155621568-2157221647-2165521712-21724 21877-2189421905-2195021955-2196621971-21972 21979-2198721989-2200222007-2201822020-22029 22040-2204122045-2204922056-2205922074-22083 22087-2212722135-2213822144-2215622158-22159 22161-2217522195-2219822208-2221122218-22224 22243-2224522261-2227022277-222992230922312- 2231322329-2233222343-2234622349-2235122358- 2235922365-2237122381-2238222389-2239222394- 2239822434-224352249522531-2253322545-22559 22561-2256822571-2258122599-2260222607-22609 22625-2265222671-2267422700-227012270722712- 2271522730-2273122743-227482276722805-22816 22823-2282722833-2283422843-2284522854-22856 22881-2288622916-229242293922941-2294722952- 22953229712297722980-2298623007-2302823047- 2305223064-2306723071-2307323080-2308323085- 2308623098-2310123125-2313623206-2320923223- 2322523229-2323323237-232392324523329-23339 23353-2335423356-2336023373-2337423379-23381 23392-2339523400-2340123415-234192342223425- 2343023434-2344723473-2348623489-2349023492- 2349823506-235072351423543-2354423549-23565 23587-2359823682-236862370423718-2372123731- 237362374223762-2376323768-2380523810-23819 23827-2383523839-2384223845-238482388223941- 239562395923999-2400124005-2401124014-24018 24021-2402424029-2403324035-2404024130-24144 24495-2449825085-2509025114-2512625258-25297 25306-2530825314-2531825322-2533025334-25337 25359-2536325376-2537725382-2540125403-25407 25410-2541325849-2585226029-2603126196-26199 26217-2621826258-2625926265262802628426294- 2630226310-263142632726332-2633726354-26355 26359-263652636926373-2637426378-2638526395 26406-2640726409-264112642326425-2642726429 26448-2645226465-2646626469-2654626604-26606 26657-2665926667-2666826678-2668026705-26707 26732267372674226748-26754268312684226844- 2684526860-2686226873-268752687926888-26890 27012-2701427028-270292709127100-2710227122- 2712827147-2714927193-272002721827247-27251 27261-2726527269-2727027304-2731427411-27431 27435-274402745527463-2746627476-2747727498- 2750027504-2750727520-2752127544-2754527548- 2755427557-2756427574-2757727602-2760627624- 2763427649-2765427662-2768927702-2770627729- 2773927750-2775627784-2778827790-2779627799- 2780227809-2781327816-2782027830-2783827840- 2784727853-2785627865-2786827890-2789227938- 2793928001-280042801828050-280862809228095-
Figure imgf000110_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
14098-1409914117-1412214141-1414214166-14168
1422714264-1426514360-143691460414714-14717
14981-1498315019-15024151121525715292-15297
15329-15331 15381-1538515397-1539815408-15414
15543-1554415869-1587015884-1588515896-15900
16121-1612816168-1617116174-1617616233-16235
16408-1640916498-1650016570-1657316609-16613
16624-1662516702-1670316716-1672316738-16739
16851-1685317268-1727417292-172961745417956-
1796218536-185371862418717-1872218730-18732
18750-187561879618842-1884518897-1889818925-
1893318964-1896918989-189901901219074-19080
1914819207-1920819213-19220192241925319260-
192621927419306-1931219316-1931719351-19354
19363-1936419407-1941119432-194341944119511
19522-1952419526-1952919574-1957819663-19666
19700-1970519709-1971019736-1974219800-19803
19814-1981519921-1992319933-199371993919950
19972-1998120099-2010220104-2010520137-20148
20151-2015420157-2016420167-2017120200-20207
20229-202302024120257-2026220271-2027820281-
202822028920305-2030720437-2044020447-20451
20497-2049820514-2052020524-2053520537-20541
20559-2056220575-20578-20625-2062820649-20652
2068120683-2068520698-2070720712-2071320806-
2081220824-208432086520868-2087120879-20880
20890-2090020927-2092820957-209622102621101-
2111121123-2112721153-2115421171-2117421197-
2119821229-2123221241-2124721263-2126621284-
2129621377-2139721454-2145721495-2150021729-
2173321788-2179921899-2190221911-2191221917-
219212192421936-2193821958-2196622007-22015
2201922030-2203422037-2203922045-2204622080-
2208822092-220942209722101-2210722135-22138
22165-2216822200-2220322243-2224522261-22264
22277-2228922303-2230922381-2238222405-22408
22553-2255822566-2256822607-2261722661-22662
22730-2273122823-2282722907-2290822941-22947
22952-2295323059-2306023080-2308323085-23090
23212-2321523229-232332325123321-2332723399
23412-2341423425-2342723473-2347423483-23485
23692-2369323762-2376323845-238482388223890-
2390223999-2400124005-240112405624481-24490
25085-2509025319-2532225331-2533525366-25369
25374-2537525403-2540726140-2614326189-26194
26205-2620626258-26259263272633726349-26352
26354-26355263572639426406-2640726469-26470
26476-2647726604-266062673726860-2686227100-
27102272182722527261-2726527304-2730527497
27510-2751127544-2754527555-2755627562-27563
2758827602-2760627649-2765427726-2772827805-
2780627819-2782028099-2810028189-2819028199-
2820328468-2848428511-2851328580-2859528910-
2891329288-292892936329718-2972130085-30087
30135-3014030224-3023230236-30239 Tissue RNA Library SEQ ID NOS: origin Source Name cultured Stratagen ADP001 180-181221333-335373-374395398-399461-463534- preadi- e 535576-578619-620716-7171097-10981235-1237 pocytes 1240-1241 1453-145414741705-17141912-19132015- 2018205825132599-26032724-27262814-28152923- 29242975-29773374-33763414-341734333481-3482 3518-352036243789-37943916-39253949-39534515- 45164555-45564562-45684581-45824785-47894854- 48625190-51915572-55735580-55865768-57725802 5899-59025980-59846403-640468517209-72127447- 74497534-75357604-76057615-76177630-76367695- 76967745-77467757-775977717806-780878218113- 81148137-81418347-83488351-83608368-83698466- 84768497-84998607-86098612-86168723-87328762- 87668995-89969007-90089013-90179040-90429267- 92689287-92909306-93119482-94859522-95259531 9735-9742982610253-102611027710490-10493 10516-1051810899-1090010907-109101113511339- 1134411399-1140211416-1141811431-1143211465- 1146611561-1156211568-1157111711-1171211731- 1173411739-1174011780-1178211835-1183611873- 1187512150-1215112171-1217212194-1219612361 12432-1243312470-1247112545-1254612601-12602 1264012642-1264412755-1275912939-1294312997- 1299913077-1307913556-1355913592-1359513629- 1363113638-1364113873-1387513885-1388714058- 1405914261-1426314274-1427614283-1428814694- 147011478414818-148201507015100-1510915182- 151831525715302-1530315326-1532815439-15442 15545-1554615588-1558915600-1560115658-15660 15869-158701619016345-163491654516558-16562 166421665216836-1684216851-168531724217286- 1729117330-1733217335-1733917454-1746417958- 1796218015-1801618029-1803018061 18069-18072 18230-1823918500-1850118625-1862618644-18650 18655-1865818673-1867718771-1877218778-18780 187891879618894-1889618925-1893318964-18969 1897518989-1899019045-1904819096-1910119207- 192081922319251-1925219266-192671927319316- 1931719351-1935419362-193641937019372-19374 19387-1938919395-1940019407-1941119422-19434 19442-1944419515-1952119564-1956619670-19671 19736-1974219804-1980819813-1981519819-19822 19921-1992319933-1993819953-1995719972-19980 20099-201032013020161-2016420189-2019420226- 2022820231-2023420257-2027020286-2028720289 20328-203302036020415-2041720432-2044020447- 2045120456-2046820537-2054520554-2055720563- 2056720625-2062820631-2063420676-2068020686- 2068720733-2073420747-207512076720824-20826 20836-2084320858-208622086520927-2092820957- 20962209912101521062-2106621076-2108721122 21157-2116921181-2119321211-2121521225-21226 21272-2127421277-2128021297-2130321463-21465 21780-2178721911-2191221958-219612197822003- 2200622020-2202522040-2204122048-2204922062-
II Tissue RNA Library SEQ ID NOS: origin Source Name
22073 22080-22083 22092-22094 22120-2212722141-
2214322169-2217022208-2221122225-2222622250
22265-2227022277-2228922314-2231722323-22328
22349-2235022358-2235922362--2236422661-22664
22669-2267022698-2270122726--2272722852-22853
22870-2287422969-2297022978--2297923046-23050
23066-230672307023098-2310123218-2321923229-
232332323623252--2325623358--2336023379-23385
23448-2345023508-235092370423726-2373023760-
23761237642377023798-2379923802-2380523845-
2384823890-239022395923990-2399223999-24001
24313-2431425085-2509025336--2533725340-25341
25374-2537525416-2541725680--2568625986-26016
26024-260282626626310-26314263272633726409-
2641026421-2642226607-266092668426705-26707
26755-2675626860-2686226873--2687827150-27151
27261 -2726527269-2727027729--2773927750-27756
27825--2782627861 -2786427890--2789227989-27990
28099-281002818628315-2831628365-2837028407-
284152842428426--2842828450-■284532846428617-
2862728787-2878929096-2910129304-2930829328-
2933129354-2935729508-2951829663-2967229926-
2993529960-2996130141-3014230218-30220 adrenal Clontech ADR002 48-4951-52329-330332546-547635652656694-695 gland 864-867901-904100510581158-11611273-12791453- 14541516-151915511608-16101651-16521654-1656 1661-16751705-1714191419902015-201820582306 2383-23882393-23952417-24222517-25182540-2541 2599-26032675-26782695-26982911-29122923-2926 2951-29552983-2986300030043107-31083322-3323 33653441-34423605-36073649-365336933781-3784 3871-38723939-39414053-40544156-41574165-4170 4413-44144515-451645424550-45514581-45825057- 50595109-51155285-52875376-53785480-54825538- 55405695-56985766-5767580259466011-60216081- 60826166-61686223-62256299-63006339-63406403- 64046454-645868526857-68587209-72127228-7237 7258-72637407-74147461-74647536-75417611-7612 7630-76367657-76597681-76877693-76947745-7746 7757-775977947806-78087823789079577984-7985 7990-799280208025-80268069-807080798145-8156 82338301-83098370-83748430-84338461-84648504- 85058539-85428607-860986888736-87378758-8759 8762-87668982-89839002-90049072-90749456-9458 94609472-94749526-95309535-95379617-96269675 9733-97389743-97449770-97719780-97819794-9795 98289851-98539923-992499459965-99689984-9985 10015-1001610263-1026410273-102741027710279- 1028010311 10320-1032310513-1051510531-10532 10543-1054910592-1059410603-106061062710640- 1064410780-1078210870-1087210881-1088310891- 108931090610913-1091411081 11297-1129911405- 1140711465-114661154611582-1158311612-11639 11669-11671 1180911901-1190711918-1192012194- 1219612202-1220512237-1223812256-1225712259- Tissue RNA Library SEQ ID NOS: origin Source Name
1226212455-1245912470-1247112486-1254012545- 12561 12590-1259212601-126021272312785-12787 1294312968-1296912997-1299913077-1307913581- 1358313592-1359513638-13641 136651367613931- 1393314170-1417214257-1426314304-1430514360- 1436914456-144581460414661-1466214684-14685 14789-1479115025-1506915098-1509915218-15219 15243-1524815250-1525115258-1525915290-15291 15329-1533115392-1539615406-1540715412-15414 15526-1552715536-1553715578-1558015688-15694 15906-1590715961-1596315988-1599016118-16120 16174-1617616226-1622716350-1635116365-16366 16461-1646416508-16511166001662316636-16637 16643-1664816749-1675316851-1685316894-16896 16911-1691216933-1693717042-1705717067-17069 17255-1725617264-1726717292-1729617372-17374 17441-174501745417569-1757017583-1758417958- 1796218005-1800618029-180301806518412-18418 18421-1842518500-1850118577-1857818587-18616 18619-1862018644-1865018691-1869218723-18726 18738-1875818762-18766187891879618836-18838 1885618899-189031891018934-189351897518983- 1898518989-1899019001-1900419062-1906519071- 1908019211-1921219228-192291925719271-19272 19350-1935419363-193641937019401-1940219407- 19411 19415-1941719422-19431 194871951419560- 1956519604-1960719620-1962219656-1965719659- 19661 19667-1966819683-19687196931969819700- 1970819727-1973219759-1976319800-1980819814- 1981519855-1985619915-1992019933-1993719943- 1994619965-1996719972-1998020029-2004320072- 2007320106-2011020146-2015020161-2016420167- 2017120189-2019720208-2021420219-2022120235- 202402024320250-2025220265-2027320305-20307 20321-2032420328-203302036120366-2036820401- 2040520415-2041720437-2044020485-2049020505- 2051020514-2052320547-2055320569-2057420607- 2061220676-2068020710-2071120727-2073220758- 2076620788-2079120819-2082120836-2084920855- 2085720866-208672092620938-2094220948-20949 20963-2097221005-2100821072-2108721141-21142 21157-2116921171-2117421208-2121021241-21247 21377-2139721410-2141421451-2145321480-21482 21485-2149421647-2165521881-2188521899-21902 21958-2196121973-2197421983-2198722007-22015 22026-2202922035-2203622070-2207322115-22119 22128-2213422141-2214322152-2215622169-22170 22187-2219222195-2219822204-2220722218-22224 22261-2227622284-2228922292-2229922303-22308 22312-223132231822358-2235922365-2237122373 22375-2237622409-2241022433-2243522455-22465 2249522561-2256822625-2262822671-2267422690 22696-2269722700-2270122737-2273922760-22762 22810-2281622823-2282722857-2285822887-22891 229542304623051 -230522307123252-2325623329- Tissue RNA Library SEQ ID NOS: origin Source Name
2333723356-2335723439-23447 2348823499-23505 235992370423731 -2373623827-■2383323845-23848 23871-2387523878-238802388223890-2390224014- 2401824021-240242405624130-.2414424194-24196 24446-2444825085-2509025289-2529025293-25297 25323-2532525705-2571526024--2602826033-26049 26237-262392625226258-2625926361-2636526383- 2638526428-264292643126469-2647026479-26545 26559-2656026563-2656626596-2660126665-26666 26676-266772674626748-2675426804-2680526853- 2685427158-2716327209-2721327379-2738027383 27401-2741027441-2744527520--2752127544-27545 27600-2760127649-276542769727812-2781327825- 2782627890-278922792928050-2805928095-28096 2842428446-2844828458-2846029328-2933129343- 2934529940-2994929954-2995530135-3014030150- 3015630189-30194 adult heart GIBCO AHR001 3042-4451-52124-125156180-181192197-200202- 203221227-230260294-301315-316324-325329-330 332-335373-374395-397408-411414-421425-427505 515-517529576-578605-606660685-688716-722738- 740825845-846849-851891895-898901-904914919- 931934-9361001-10031005102810471071 1092-1094 1097-10981158-11611185-11871235-12371240-1241 1243-12451249-12621280-12821302-130313061312- 13131418-14191424-142914311453-14541467-1470 14741476-147714831488-14891569-15701608-1610 161816261667-16681692-16931792-179418281866- 18721905-19071912-19131925-192719431961-1962 1986-19871990-199220362222-22262272-22762341- 23432393-23952404-24062440-24442471-24742565- 25662570-25742599-260326252675-26782692-2694 2706286128642867-286829122923-292429262975- 29772988-2989299330013006-300830903126-3127 3151-31543161-31633205-32073223-322532283276- 32773281-32873326-33313374-33773392-33963433 3435-34423452-345435073531-35343620-36213661- 366336933729-37303770-37723789-37943834-3835 39303939-39413949-39534067-40724091-40964124- 41264141-41434338-43414354-43554420-44214469- 44704515-451645254531-45334555-45564562-4573 45794581-45824600-46014623-464346514663-4665 4675-46804698-46994785-47894825-48264833-4836 4857-48614971-49744989-49915057-50595074-5081 5084-5085513852465253-52665335-53365362-5363 5474-54775519-55215530-55315557-55605567-5568 5580-558357005743-57475754-575758025805-5807 5827-58285875-58795919-59215939-59415965-5966 5973-59755977-59785980-598460846135-61366166- 61686209-621162846389-639264006403-64046450- 64516523-65266551-656766226674-66756773-6778 7209-72127236-72407256-72637283-728474817536- 75417557-75597580-75977611-76127615-76177620- 76217630-76367671-76737678-76897695-76967701- 77037726-77277729-77417745-77467757-77597778- Tissue RNA Library SEQ ID NOS: origin Source Name
77837787-77947798-78017806-78087815-78187822 78907906-79127921-79257972-79757977-79837994 7999-80018008-80098013-801680208069-80708077- 80798084-80858090-80938098-81008107-81088112- 81148117-81218137-81418145-81518187-82008247- 824882508297-830083108342-83608368-83698409- 84148434-84528466-84768497-84998512-85138539- 85428612-861686888698-87008751-87538758-8759 8762-87668768-87798919-89338978-89819013-9017 9031-90369040-90459049-90679196-92009306-9311 9321-93239331-93339391-93929395-93969427-9428 94439472-94769478-94819487-94889491-94939506- 95099517-95189529-95319535-95379581-95849593 9603-96049617-962696599726-97279729-97319735- 97449794-97959799-98009803-980798269828-9832 9851-98529857-98619874-98959916-99209923-9924 9969-99809984-99859997-100091001210017-10019 10025-1002610033-1004010161-1016310167-10172 1026210268-10274102771030610326-1032810451- 1045310470-1047310484-1048910491-1049310498- 105031051910537-1053810551-1055410592-10594 10600-1060610612-1061410628-1063010870-10875 10877-1087810881-108831089410901-1090210913- 1091410931-1093310976-109781102111066-11068 11123-1112411202-1120711291-1129911308-11310 11314-113151132111326-1132711345-1135011372- 1139611468-1147411476-114781149511515-11517 11538-115401154611565-1156711579-1158111609 11644-1164811658-1165911711-1171211714-11720 11731-1173411745-1174811780-1178211803-11804 1180911835-1183711901-1190511921-1193011941- 119441195111977-1197812023-1204112050-12056 12117-1211812131-1213212135-121371214012150- 121511217312192-1219612202-1220812256-12258 1236112374-12401 12455-1245912470-1247112475- 1247612555-1255912573-1257412590-1259612637 12642-1264412655-1265812668-126691272312725- 1272712755-1275912785-1278712796-1279812805- 1280612872-1288112970-1297212978-1298112997- 1299913077-1307913545-1354813563-1356413567- 1356813576-1357913592-1359713652-1365913674- 1367513873-138751389813916-1391913954-13956 14023-1402614036-1403714044-1404514048-14053 14058-140601412914137-1413814166-1416814192- 1419514209-1421514236-1423814264-1426514283- 1428814304-1430514355-1435614360-1436914498- 14501 1460414640-1464414650-14651 14684-14685 14702-1470414709-1471714846-1484714991-14992 15013-1502415172-1517315182-1518315190-15191 15218-1521915243-1525115253-1525515257-15259 15283-1528615290-1529115298-1529915323-15324 15336-15340153441539915412-1541415530-15531 15568-1557215576-1557715588-1558915614-15616 15631-1563215699-1570015766-15771 15855-15857 15863-1587015881-1588515925-15931 15988-15990
15 Tissue RNA Library SEQ ID NOS: origin Source Name
16075-1607916110-161121613716141-1614316226- 1622716350-16351 16365-1636616403-1640716465- 1646616473-1648116496-1649716508-1651116520- 1652916536-165391660016602-1661116619-16621 16623-1662516636-16637166421665216700-16701 16704-1670516716-1672316836-1684216875-16877 16894-1689616911-1691216934-1693916949-16953 17036-17041 17105-1710617116-1711817124-17126 17131-17132172321724217292-1729617330-17332 17335-1733917451-1745617571-1757417857-17861 17958-1796218001-1800318015-1801618029-18030 18136-1813818400-1840218405-1840618421-18425 18490-18491 18495-1850118533-1853718583-18584 18625-1862618629-1863218644-1865818668-18670 18673-1867718689-1869318717-1872618729-18732 18745-1874918759-1876118771-1878318789-18793 1879618802-1880418806-1880718811-1881318822- 1883418839-1884518856-1888018882-1888818897- 1890318915-1892118923-1893918941-1894218944- 1895018955-1895918964-1896918975-1897718993- 1899519005-1900919013-1901819029-1903919049- 190531905519057-1905819062-190831909019096- 19101191341915319207-1921019213-1922019226- 1922719251-1925219254-1925519260-1926219266- 1926719295-1929619306-1930719316-1931719345- 193641937019373-1937419380-1940019407-19411 19415-1941719422-1943419438-1944419454-19455 19462-1946319467-194861950319526-1952919536 19560-1956219564-195661958619604-1960719656- 1965719663-1966819672-19673196881969319727- 1974819759-1976719772-1980819814-1981519819- 1982219921-1992919933-1993919943-1994619965- 1996719972-1998020029-2004320048-2006320069- 2007120087-2009420106-2011020114-2012020130 20133-2014820151-2015420161-2016420167-20171 20182-2019720208-2021420235-2024120244-20253 20257-2026220265-2027020274-2027820281-20282 202892029920305-203072031620321-2032420336- 2034120345-2035420393-204052041120414-20417 20425-2043120437-2047420476-2047920485-20494 20505-2050920514-2053520554-2055720559-20578 20614-2061920625-2063020646-2064820676-20681 20683-2068520698-2070720710-2071320718-20725 20727-2073420747-20751207672078820792-20797 20806-2081220836-2084920863208652087120879- 2088020897-209002092620929-2093220938-20942 20948-209492095120963-2097220977-2098820999- 2100421015-2102021027-2104521060-2106621072- 2107521101-2110421123-2112721137-2114221145- 2115121157-2116921171-2117421181-2119321202- 2120721227-2122821233-2123521237-2126621277- 2129421301-2130321351-2135221377-2139721408- 2140921434-2143921446-2145321463-2146521467- 2146921480-2148221485-2149121529-2153021587- 2165521881-2188521892-2189421925-2193521955-
16 Tissue RNA Library SEQ ID NOS: origin Source Name
2196821971-2197421978-2198721995-2199922003- 2201522019-2202522030-2203422040-2204422047 22050-2205522060-2206122070-2207622098-22107 22120-2212722135-2213822141-2214322152-22156 2216022165-2217522187-2219222195-2219822208- 2221122218-2222622246-2224922251-2225222261- 2227622284-2228922300-2230222310-2231822323- 2232822343-2235022352-2235622358-2235922365 2237322383-2238822409-224102249522553-22559 22571-2258122607-2260922625-2262822634-22636 22644-2265122653-2265422665-226672269022700- 2270122730-2273122760-2276222768-2277022805- 2281622821-2282222843-2284522857-2285822881- 2289122899-229002291122923-2292422941-22950 22952-2295422969-22970229732297722980-22989 22991-229942299823046-2305523059-2306023063- 2306723070-2307123094-2311123138-2314023201- 2320223206-2320923212-2321523220-2322123229- 2323323235-2323623251-2325623279-2328123356- 2336023373-2337723379-233852339923415-23421 2343323473-2347423486-2348723489-2349023492- 2349323495-2349623506-2351323519-2352023543- 2354423555-235652359923673-236742369123704 23726-2373623760-2376323771-2378023798-23799 23802-2380523827-2383323836-2383723849-23852 23878-2388023890-238922395923990-2399224014- 2401824021-240252405625213-2521925279-25290 25307-253082531325331-2533525340-2534325347- 2535425383-2540125403-2540725534-2553626024- 2602826195-2619926205-2620826237-2623926248- 2625226258-2626126266-2626726273-2628026285- 2629026305-2630626327-263282633726361-26365 2636926373-2637426389-26393263952641126423 26460-2646326469-2647026479-2654526563-26566 2660326665-2666626681-2668326698-2669926737 26748-2675426788-2678926804-268052683226843- 2684526850-2685726860-2686226876-2687926882- 2688426980-269842698827037-2703827040-27043 2705227067-2707027080-270902709227129-27132 27150-2715127154-2715527173-2717427193-27200 27209-2721327229-2723227247-2725127269-27270 27275-2727627283-2728727304-273052734327348- 2735327379-273802743827501-2750727510-27511 27544-2754527551-2755427560-2756427600-27601 27636-2763927649-2765427662-2767227700-27701 27729-2773927750-2775427807-2780827821-27822 27861-2786427896-279272798627989-2799028001- 2800328012-2801728095-2809628099-2810028108- 2812128142-2814528160281862828628290-28292 28311-2831328315-2831628324-2834528361-28362 28421-2842428426-2842829265-2926729278-29283 29328-2933729339-2934029343-2934529354-29357 29508-2951829594-2960429718-2972129919-29922 29940-2995329960-2996130141-3014230150-30156 30184-3019930218-3022030233-3023530243-30252 Tissue RNA Library SEQ ID NOS: origin Source Name
30361-30368 adult GIBCO AKD001 163042-4448-49156164167170-171180-181 197-200 kidney 202-203255-256260282-284302-318329-330332-335 339358-359373-375395398-399412-413419-420422- 429458461-467470-471505515-517524-525530-531 619-620635661664694-695706-707716-722779-782 785-786832-836847-848852-855880-882913934-935 938981-9851001-1003100510281044-104510471071 1092-10941097-10981123-11271169-11791203-1204 1240-12411280-12821295-12971302-130313071312- 13131346-13471414-14171431 1449-14501452-1454 1456-14601467-147014831488-148914991569-1570 1578-1585162016261705-17141721-17221792-1794 1912-19131915-19161918-19191925-19271947-1948 1980-19821986-198719902015-20182031-20322058 2231-22322275-227622992383-23882404-24062423- 24242471-24742570-25742599-26032649-26502695- 269828022808-28112820-28212871-28762883-2887 2923-292429262934-293529672975-29772988-2989 30013006-3008302030393078-3079309031123151- 31543205-320732283255-325932713274-32773326- 3331335233653374-33773418-3422343334663507 3515-35173525360636243671-3672368636933708 3729-37303768-37693789-379438093827-38283840- 38523949-39534091-40954124-41264213-42144220 43364342-43464413-44144424-44304441-44434469- 44704515-45164531-453345424555-45564562-4568 4581-45824612-46144640-46434785-47894857-4861 4868-48694971-49745092-50945109-511551925269- 52715311-53255335-534153705512-55215532-5533 5538-55605569-55735641-56495695-569857005802- 58075820-58225827-58285854-58555899-59025919- 59215930-59365939-59415943-59455956-59575973- 59755977-59785980-59846040-60426135-61366170 6209-62116235-62376300637663786403-64056450- 64536503-65106533-65356542-65436585-65926624 68837209-72127214-72197258-72667275-72767299- 73017447-74497576-75777604-760576097611-7612 7615-76217630-76367640-76427657-76597671-7687 7693-76967701-77037726-77287737-77437745-7746 7778-77867789-78047806-78087812-781878217891- 78957898-79007906-79127921-79257972-79757978- 79837990-799480028010-80128059-806380708074- 807680798107-81088113-81148132-81368145-8156 8163-81738187-82008216-82188227-822982348247- 824882508253-82578263-82658297-830083108342 8349-83648368-83698404-84068409-84158420-8421 8449-84528461-84648487-84888497-84998502-8505 8507-85088512-85158537-85508554-85558601-8603 8607-86098612-86168688-86908751-87538758-8759 8768-87708919-89338986-89949002-90049029-9036 9043-90459072-90749084-90859267-92689306-9311 9321-93239337-93389343-93469366-93719391-9392 9395-93969415-94199443-94449456-945894609472- 947694789482-94889506-95189522-95259531 -9534
11! Tissue RNA Library SEQ DD NOS: origin Source Name
9581-958395859594-96029617-962696599664-9665 96769682-968396889729-97319757-97679770-9771 9786-97909794-97959799-980098269828-98349848- 98529854-98559857-98619874-98959902-99089921- 99249927-99289954-99559965-99889997-10009 1001210025-1002610033-1003710161-1017210265- 1027710279-102801030610326-1032810449-10453 10480-1048210484-1048710498-1050310506-10507 10513-1051910531-1053210543-1054910588-10601 10603-1060610612-1062310638-1064410820-10821 10870-1087610881-1088310907-1091010913-10914 10931-10933109381101011024-1102511069-11070 11123-111241115811217-1121911230-1124911289- 1129011295-1129611300-113071131711326-11327 11339-1135011406-1140711423-1142611431-11432 11476-1147811493-1149511544-115461155311582- 1158311586-1160211607-1160911612-1163911644- 1164811658-1166111711-1171211715-1171611718- 1172011731-1173411739-1174011745-1174811775- 1177611803-1180811835-11836118401184211937- 1193812040-1204112127-121301214012149-12151 12175-1217612186-1218912202-1220812256-12262 12374-1237712470-124711247712486-1251812545- 1254612555-1256112565-1256812601-1260212615- 126181263712642-1264412653-1265812666-12667 12676-1267812714-1271512720-1272312725-12727 12755-1276212774-1277512782-1278712872-12881 12963-1296412997-1299913077-1307913556-13562 13592-1359513603-1361213638-1364113668-13672 13873-138751389813918-13921 1393013934-13936 13954-1395613999-1400414023-1402614044-14045 14058-1405914166-1416814170-141721418614192- 1419514236-1423814264-1426714314-1431714344- 1434514355-1435614459-144621460414607-14609 1462614640-1464214650-1465114661-1466214681- 1468214684-1468514705-1470614714-1471714723- 147241478414789-147911480915013-1501815093- 1509615182-1518315187-1519115218-1521915243- 1524815250-1525115257-1525915290-1529115336- 1534015412-1541415457-1545915530-1553115545- 1554615576-1557715588-1558915623-1562615699- 1570015867-1586815880-1588515890-1589515988- 1599016006-1601516044-1604616053-1605816115- 1611716121-1612216130-1613216141-1614316152- 1615316168-1617116174-1617616233-1623516238 1631516360-1636116364-1636616368-1637216408- 1640916434-1643716461-1646416470-1647816527- 165291654516558-1656216583-1658516602-16611 1662316636-166371664216652-1665516668-16674 16683-1669316700-1670316716-1672316800-16801 16823-1682616828-1683116836-1684216851-16853 16878-16881 16894-1689616913-1691516934-16939 17010-1701417026-1702817036-170371711417131- 1713217145-1714617153-1715417156-1715917238- 17241 17244-1724817251-1725417297-1730617330- Tissue RNA Library SEQ ID NOS: origin Source Name
1733217335-1733917356-1735817404-1740817454 17922-1792317951-1795217958-1796318008-18009 18015-1801618029-1803018069-1807218400-18402 18411-1841818421-1842518500-1850118516-18518 1852718533-1853418536-1853718577-1857818585- 1861718624-1862618629-1863718644-1865018655- 18661 18668-1868118689-1869318717-1872618730- 1873218738-1875618759-1876918771-1877218778- 1878318787-187931879618802-1880418806-18807 18811-1881318822-1883418836-188381885418856- 1888018882-1888818894-1889618899-1890318919- 1892118923-1893918947-1895018964-1896918975 18989-18990189961900019005-1900919012-19018 19025-1902819045-190481905419057-1905819062- 1906519071-1908519096-1910119106-1911819134 19138-1914019142-191471915319163-1920119207- 1922019226-1922719253-192551925719260-19262 1927319306-1930919316-1931719345-1935419361- 1936419368-1937019372-1937419385-1940219407- 1941119415-1941719422-1943419441-1944419458- 1946119494-194961950319512-1951319515-19521 19526-1953119560-1956219566-1957819580-19582 1958619617-196191962719656-1965719667-19668 19674-19678196931969819706-1971019727-19732 19736-19751 19759-197631977219800-1980819813- 1981519915-1991819921-1992919933-1994619953- 1995719962-1996319965-1996719972-1998120015- 2001720026-2002720029-2004320048-2005220069- 2007120074-2007920087-2010220106-2011220114- 2012020122-201272013020133-2015020161-20164 20167-2019420198-2020720215-2021820229-20241 20244-2025320257-2026220265-2027020274-20278 20289-2029620299-2030420317-2032420328-20330 20343-2034820366-2036820401-2040520415-20420 20425-2045520469-2047120482-204902050120510- 2053520537-2054520548-2055720559-2057820614- 2061920625-2063020635-2063620642-2066520681 2068820698-2070720710-2072620733-2073820747- 2075320758-2076720788-2079720806-2081220816- 2081820824-2084920855-208672087120879-20884 20897-2090020922-2092820933-2093520938-20943 20952-2095420963-2097220977-2098420999-21008 21021-2102321060-2106621069-2110021123-21127 21137-2114221145-2115121171-2117421176-21180 21194-2119821208-2121021233-2123521237-21240 21248-2125221256-2126621272-2129421297-21298 21326-2133421351-2135521377-213972140521428- 2145321458-2146121463-2146521467-2146921480- 2148221485-2149121495-2150021529-2153021532- 2153321554-2155621587-2165521729-2173321744- 2174721877-2188521892-2189421911 -2191221915- 219212192421929-219352193921948-2195021955- 2196821973-2197421978-2198822000-2201522019- 2202522030-2203422042-220442204722056-22076 22080-2208622101-2210722115-2211622135-22138 Tissue RNA Library SEQ ID NOS: origin Source Name
22141 •2214322160 22165-2217022187- 2219222195-
2219822208-2221122218-2222622231-2223522243-
2225022253-2225522261-2226422271-2228922300-
2230222312-2231822323-2232822333-•2233522343-
2235022358-2235922365-223712237322377-22380
22383-■2239222399-2240822434-2243522440-22448
22455--224652249522531-2253322551- 2255822560
22571-•2258122599-2260222607-2260922622-22628
22634-■2264322653--2265422661-2266722671-22674
2269022700-2270122726-2272722730--2273122760-
2276222768-2277022775-2277622781--2278222794-
2279622805-2281622821-2282222828--2282922838-
2284922852-2285322857-2285822899--2290022925-
2293822948-2295022952-2295322955--2295722973-
2297722991-2299422999-2302123026--2302823046-
2305223059-2306023063-2306723070--2307123098-
2310123117-2311923138-2314123201--2320223206-
2320923212-2321523218-232222323623240-23244
23247232492325123261-2326223328-■2333723343-
2334423356-2336023373-2337423382--2339023396-
2339923415-2342123425-2342723433--2343723439-
2345023470-2347423486-2348723489--2349023492-
2349323495-2349623508-2351323543--2354423555-
2356523718-2371923726-2373623761--2376323771-
2378023793-2379923802-2380923816--2381923827-
2383323839-2384223845-2385223871--2387523890-
2390223911-239402395923999-2400124014-24018
24021-240242405624120-241292466524772-24774
2528925290253062531425319-2532125331-25335
25342-2534325370--2537525383-2540125410-25413
25416-2541725534--2553625966-2596826024-26028
26196-2619926205--2620626237-262432625226261
2626626270-2627226305-2630926316-•2632126327
2633726341-263442635826361-2636526373-26374
26389-26393264112642326448-2645126460-26463
26469-2647026559--2656026596-2660126678-26680
26698-266992673726755-26756268062682826831
26842--2684326846--2684926853-2685726860-26862
26873--2687926980--2698427012-2701427028-27029
2703527044-2704727050-270572709127142-27146
27156--2715727173--2717427184-2718727193-27205
2721827247-2725127269-2727027275-■2727627283-
2728727299-2730527323-2732527331-2733327441-
2744427446-2744927498-2750727510-2751127544-
2754527562-2756327570-2757127574-2757727588
27593--2759427600--2761527636-2763927649-27654
27662-•2767227698--2770127726-2774327750-27756
27814--2781827821--2782227825-2782627861-27868
27896--2792827943--2794727989-2799028008-28011
28050--2805928097-■2810028105-2812228142-28145
2818628204-2820928263-2826828272-2827628286
28290-•2829228311 ■2831328361-283642842428426-
2842828446-2844829265-2926729278-2928329328-
2933129339-2934029354-29357293712937529432-
2943529451-2945229746-2976129940-2995529958- Tissue RNA Library SEQ ID NOS: origin Source Name
2996130085-3008730141-3014230150-3015630195- 3019930218-3022030224-3023230361-30368 adult Invitroge AKT002 136-138255-256333-335414-418543-545553-554576- kidney n 578665-666716-717847-848913919-922981-9851034 1071 1086-10871097-10981123-11271158-11611298 1312-13131346-13471461-14631569-15701692-1693 1980-19822015-20182209-22102255-22562471-2474 2540-25412675-26783006-300832713276-32773374- 33773435-34403452-345335073789-37943834-3835 3840-38543949-39534122-41234160-41624486-4487 4515-45164562-45684660-46624785-47894795-4809 4971-49745149-51505331-5332551057005932-5936 5980-59846140-61436166-61686235-62376335-6336 6403-64046533-65356570-65716656-66617008-7010 7209-72127216-72197536-75417611-76127643-7647 7657-76597742-77437784-77867795-77977815-7818 7924-79257972-79757978-79838008-80128017-8019 80468059-806380798142-81518187-82008211-8213 8216-8218831083428368-83698512-85138526-8535 8543-85508554-85558936-89388982-898390369072- 90749337-93389369-93719391-93929456-94589475- 94769487-948895589563-956695859594-96029617- 96269735-97429759-97629828-98329851-98529916- 992099469989-999210015-100191027710449-10457 10520-1053010588-1059410600-1060110603-10606 10886-108891101011217-1121911300-1130111476- 1147811544-1154511603-1160611658-1165911715- 1171611731-1173411772-1177411835-1183611842 11873-1187511937-1193812121-1212612131-12132 121401214912186-1218912374-1237712545-12546 12560-1256112578-1258012593-125961259812637 12723-1272712796-1279812968-1296912984-12985 12997-1299913585-1358813592-1359513627-13631 13638-136411366513925-1392613954-1395614023- 1402614299-143011460414684-1468514721-14724 1478415243-152481525715326-1532815545-15546 15576-1557715588-1559015593-1560115623-15626 15863-1586616075-1607916096-1609716345-16349 16392-16394165451664216833-1683516934-16937 17026-1702817238-1724117251-1726117335-17339 17454-174561841118423-184251852718533-18534 18585-1858618625-1862618633-1863718671-18672 18717-1871818730-1873218759-1876118771-18776 18778-1878018790-187931879618802-1880418822- 1882418839-1884118906-1890718919-1892118934- 1893518944-1894519045-1905319057-1906119134 19207-1920819254-1925519283-1929419351-19354 19362-1936419368-1937019390-1940219407-19411 19461 19494-194961950319560-1956219567-19578 19656-1965719667-1966819727-1973519814-19815 19924-199251993819953- 1995720015-2001720026- 2002720029-2004320087-2009420099-2010220122- 201272013020146-2014820161-2016620182-20194 20200-202072024120257-2026220265-2027020279- 2028020290-2029620328-2033020412-2041320421-
Figure imgf000124_0001
Figure imgf000125_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
2714627193-272002721827247-2725127269-27270 27493-2749527504-2750727544-2754527600-27601 27700-2770127755-2775627821-2782227825-27826 27861-2786427989-2799028008-2801128142-28145 2828628416-2841729278-2928329328-2933129343- 2934529352-2935329960-2996130141-3014230224- 3023230361-30368 lymph Clontech ALN001 167170-171180-181302332395425-427505635670- node 671849-851919-922973-987998-1000102810471086- 10871232-12341259-12621273-127913071456-1460 14991569-15701667-16681705-17141879-18801943 2063-20652126-21292272-22742404-240624092471- 247425333151-31543322-332333773461-34633512- 3514373738223834-38353879-388042554555-4556 4562-45685335-53365572-557355805650-56515802 5805-58075910-59116403-64046832-68507216-7219 7447-74497643-76477654-765576977744-77467757- 77597763-777078907972-79757978-79858010-8012 83428698-87009043-90459306-93119475-94769487- 948895319615-96169780-97819916-992010273- 102741030611123-1112411295-1129611339-11344 11431-1143211476-1147811538-1154011607-11608 1169611780-1178211835-118361214012150-12151 12374-1237712545-1254612615-1261812642-12644 1272312760-1276212782-1278412796-1279812872- 12881 13077-1308213592-135951390614304-14305 14546-1454915182-1518315190-1519115290-15291 15545-1554615548-1555015576-1557715699-15700 16044-1604616110-1611216284-1628916370-16374 16536-165381654516609-1661116628-1663116636- 16637166421665216954-1696117046-1705717131- 1713217330-1733217454-1745617958-1796218015- 1801618029-1803018405-184061841118533-18534 18579-18581 1861718719-1872618750-1875818771- 1877218778-18780187891880518811-1881318822- 1882418836-1883818846-1885318899-1890318975 189961901819062-190651909019122-1913119138- 1914019154-1915519211-192121927419283-19294 1937019380-1938419390-1939419401-1940219415- 1941719442-194441950319526-195291968819693 19706-1970819743-1975119813-1981519915-19918 19921-1992319948-1994919972-1998020048-20052 20066-2006820106-2011020161-2016420172-20179 20182-2018820235-2024020257-2026220437-20440 2055820563-205672076720924-2092520938-20942 21027-2103121233-2123521277-2128021377-21402 21480-2148221495-2150021529-2153021587-21646 21978-2198222115-2211622141-221432216022277- 2228322312-2231322336-2234222358-2235922373 22455-2246522644-22651226682269022700-22701 22732-2273522737-2273922823-2282722838-22842 230462307023074-2307623098-2310123237-23239 23379-233812343323448-2345023473-2347423486 23492-2349323495-2349623508-2350923526-23531 2376123802-2380523999-2400124014-2401824772- Tissue RNA Library SEQ ID NOS: origin Source Name
24774 25085- 25090 25319- ■2532125383- •2540126024- 2602826237-■26239262662633826341-2634426373- 2637426479-■26545 26755-2675626855-•2685727113 27147-■2714927158-2716327177-2717827203-27206 2721827245-2724627269-2727027283-•2728727304- 2730527510-2751127562-2756327600-2760127636- 2763927814-2781527825-2782627861-■2786427989- 2799028142-2814528157-281592844929328-29331 29339-■2934029451 •2945229940-2994930141-30142 30150-■30156 young GIBCO ALV001 10-14165-166170-171202-203218-219324-325332 liver 461-463534-535582-584674-676847-848900-904 1035-10371058-10601123-11271346-13471431 1470 14781527-15281705-17141778-17791908-19091967 1980-19822015-20182284-22982570-25742599-2603 3255-32593276-32773374-337635073512-35143518- 35204091-40954160-41624273-42744343-43464370- 43714515-45164562-45684600-46014640-46434857- 48615152-51535335-533653705512-55165519-5521 5572-55735899-59025997-60006011-602164006572- 65746585-65926791-67947209-72127214-72157258- 72637426-74427536-75417630-76367648-76537671- 76737681-76877701-77037742-77437778-77837798- 78057815-78187930-79327947-79487995-79978017- 801980798113-81148145-81518157-81598187-8200 8242-8246825083428349-83508377-83798434-8452 8461-84648497-84998502-85038512-85138543-8550 8554-85558607-86098689-86908758-87598762-8766 9317-93189391-93929395-939694609475-94769478 9522-9525953195589561-95629585-95899617-9626 9719-97229829-98329848-98509854-98559874-9895 9902-99049929-99359965-99689984-998510167- 1017210265-102671027710484-1048710520-10522 10588-1059110907-1091210936-109371096611314- 1131511406-1140711431-1143211468-1147211495 11741-H74211803-118041184011890-1189112140 1217312177-12181126371272312829-1283012948- 1296213077-1307913638-1364113865-1386613873- 1387513999-1400314054-1405614692-1469314784 14789-14791 15011-1501215019-150241509615182- 1518315218-1521915243-1524815250-1525115290- 15291 15381-1538515406-1540715412-1541415530- 1553115548-1555015576-1557715588-1558915591- 1559215863-1586616168-1617116177-1618116365- 1636616484-164861654516550-165521660116636- 1663716642-166481665216702-1670316710-16712 16752-1675316851-1685316878-1688116894-16896 16913-1691516934-1693717026-1702817284-17285 17330-1733217857-178611795518001-1800318029- 1803018069-1807218207-1821018421-1842218500- 18501 1852718536-1853718642-1864318668-18677 18719-1872218730-1873218745-1875818761 18773- 18777187891879618829-1883418857-1888018906- 1890718925-189331894218955-1895918976-18977 19029-1903519221-1922219224-1922519228-19229 Tissue RNA Library SEQ ID NOS: origin Source Name
19274-1927919306 1930719316-1931719345-19349 1937019387-1938919395-1940219422-1943119512- 1951319567-195731958619598-1960119617-19619 19663-19666196701967319706-1970819727-19732 19736-1974819759-1976319814-1981519819-19822 19919-1992019943-1994620029-2004320087-20094 20133-2014820151 -2015420161-2016420189-20194 20208-20218202412025320265-2027320286-20287 2029920349-203542041120425-2042620432-20446 20469-204712054720554-2055720559-2056220575- 2057820614-2061920635-2063620646-2065220666- 2067120676-206802068820712-2071320718-20725 2075320758-2076620792-2079720819-2082120827- 2084320868-2087020897-2090020938-2094220999- 2100421027-2103121062-2106621076-2109621170- 2117421202-2121021241-2124721256-2126221277- 2128321297-2129821326-2133421359-2136021434- 2143921447-2145021463-2146521470-2147621516- 2152821647-2165521877-2188021925-2192821962- 2196621973-2197421978-2198222000-2200222007- 2201522020-2202522030-2203422040-2204422084- 2208622101-221072216022165-2216822187-22192 22261-2227022284--2228922303-2230922333-22335 22343-2234622351223652249522534-2253922571- 2258122607-2260922655-2265722661-2266422675- 2267722680-226842269022794-2279622805-22816 22948-229502297122991-2299423057-2305823212- 2321523229-2323323379-2338123415-2341923433 23473-2347423483--2348523492-2349323508-23511 23519-2352023543--2354423673-2368623700-23701 2376123771-237802380023878-2388023893-23902 24014-2401824021--2402424481-2449025085-25090 2530625319-2532125347-2535425374-2537525416- 2541726024-2602826196-2619926205-2620626221- 2622326237-26239262602632726347-2634826373- 263742642326469-2647026788-267892684326853- 2685426860-2686526873-268752687926882-26884 27044-270472705227173-2717427180-2718827193- 2720027269-2727027340-2734227498-2750327544- 2754527649-2765427729-2773927750-2775427809- 2781127814-2781527989-2799028099-2810028186 282862842429328-2933129343-2934529940-29949 adult liver Invitroge ALV002 77170-171255-256318398-399553-554619-620716- n 717901-904912969-9711001-10031080-10811097- 10981120-11211123-11271136-11391205-12061273- 127913071432-14351438-1439149915201678-1680 1688-168916911705-171418381923-19242015-2018 2086-20872570-257426902871-287332033374-3377 3468-34713512-35143555-35583686-36913840-3852 3925-39303934-39353949-39534151-41554441-4443 4531-45334543-45454555-45564863-48644874-4889 4979-49815080-50815285-52875808-58125973-5975 5980-59846135-61366488-64896638-66396791 -6794 7275-72767279-728273747461-74647615-76177648- 76537671-76737681-768777717787-77887795-7797 Tissue RNA Library SEQ ID NOS: origin Source Name
7972-79758017-80198084-808580948107-81088113- 81148187-82008227-82328242-824683108323-8325 83428347-83488434-84488507-85088512-85138576- 85828733-87379072-90749264-92659306-93119317- 93189454-94559475-94769529-95319594-96029617- 96269757-97679794-979598289899-99019927-9928 99539956-99579969-99809984-998510015-10016 10262-1026710451-1045310498-105031055010595- 1059910603-1060610624-1062610835-1083910870- 1087210881-1088310913-1091411046-1104711212- 1121911544-1154511563-1156411574-1157611612- 1163911651-1165511669-1167111757-1175811835- 1183611901-1190511921-1193012150-1215112190- 1219112206-1220812486-1251812673-1267812707- 1271112725-1272712829-1283012944-1296212965- 1297213505-1351113531-1353513576-1357913603- 1360513632-1363313638-1364113668-1367213937- 1393814048-1405314061-1406314137-1413814264- 1426914283-142881460414650-146511468314720 14789-1479114981-1498315182-1518315243-15248 15412-1541415576-1557715588-1558915614-15616 15661-1566315688-1569415863-1586615988-15996 16365-1636616536-165381654516570-1657316602- 1660816642-16648166521679817026-1702817112 17131-1713217238-1724117335-173391745417487- 1748817791-1779918015-180161802318136-18138 18421-1842218642-1865018655-1865818750-18756 18773-187761879618806-1880718829-1883418839- 1884518904-1890518919-1892118925-1893319062- 1906519096-1910119207-1920819213-1922019271- 1927219274-1927919308-1930919316-1931719351- 1935419368-1937019385-1938619390-1939419422- 1943119442-1944419458-194601950319514-19521 195661958619617-1961919663-1966619670-19671 19709-1971019733-1973519757-197631977219915- 199181993919943-1994619953-1995720128-20130 20133-2013620146-2014820157-2016020200-20207 20231-202342024120263-2026420271-2027320286- 202872028920363-2036820406-2040720432-20446 20469-2047120485-2049020514-2052020524-20535 20559-2056820635-2063620646-2064820712-20717 20727-2073420747-2075120758-2076620789-20791 20816-2081820822-2082320836-2084320855-20857 2087120897-209002094320948-2094920999-21008 21027-2103121088-2109621105-2111121133-21135 21171-2117421194-2119621208-2121021248-21252 21281-2128321297-2129821326-2133421377-21397 21410-2141421447-2145321467-2147621485-21491 21718-2172421788-2179921911-2191221917-21921 22007-2201522030-220342204722074-2207622117- 221192216022265-2227622284-2228922303-22309 22323-2232822333-2233522343-223462235122357 2249522534-2253922551-225522255922571-22581 22607-2260922625-2262822632-2263322644-22651 22661-2266222680-2268422730-2273122833-22834 Tissue RNA Library SEQ ID NOS: origin Source Name
23125 -2313623279 -2328123415 -2341923475-23479
23492-2349323515--2351823543--2354423673-23674
23682-23686236902376123771-•2378023793-23797
23839-2384023845--2384823878-2388023890-23902
23999-2400124005--2401825085 -2509025279-25288
25319-2532125323--2532525340--2534125347-25354
25374-2537525416--2541726024--2602826196-26199
26248 -2625126273-•262792633826347-2634826350-
2635226610-266142723927384-273852743827446-
2745127498-2750027544-2754527600-2760127636-
2763927649-2765427861-278642798327995-27997
2800428050-2805928087-2809128099-2810028142-
2814728272-2827628416-2842428426-2842828882-
2888729332-2933729354-2935729427-2943129938-
2993929950-2995330085-30087 adult liver Clontech ALV003 53708434-844893829475-947611123-1112413638- 1364118829-188341914819221-1922219275-19279 19667-1966819921-1992323419263272684327269- 2727030141-30142 adult Invitroge AOV001 3-410-14163048-4951-52144-145156170-171177- ovary n 178180-181192195-200221227-230251-252255-256 273-280307-308315-316324-325329-330332360-361 367373-374395400404-407412-413419-420425-427 464-467470-471515-517524-525528534-535540-547 549573-575582-584635637669677680-684689-699 716-722785-786832-836845-848852-855860-863868- 875878-882901-904913919-922934-935964969-971 981-9851001-10041040-104110471072-10731092- 10941097-10981101-11061108-11391158-11611199- 120012461259-12651273-12791291-13061308-1313 13591371-13751388-14011408-14191422-14231430- 14371440-144814511453-14541467-147014741478 14831499-15021618162016261667-16681678-1687 1705-17141721-17221778-1779184018511866-1872 1899-19001912-19131915-19161925-192719671986- 19872015-201820302033-2034203620722197-2198 2202-22032222-22262234-22352272-227622992306 2404-24062428-24312469-2474251325162525-2528 2540-25412570-25742585-25862656-26582663-2665 2774-27752814-281528462852-28532871-28732888- 291029162918-29192923-292429262934-29352948- 29492951-29552975-29772988-29893006-30083023- 30253037-30383094-30963105-31063126-31273151- 31543182-31873195-31973205-32073255-32593268- 32693271-32723276-327932903306-33083310-3311 3322-33233326-33313336-333733453374-33763433- 343434543464-34663468-34713477-347835073512- 35203526-35293531-35343550-35603565-35883624 3675-36763686-369136933731-37323768-37693789- 38093834-38353840-385238603939-39413949-3953 4067-40764122-41264134-41364151-41624220-4238 4413-44154441-444344504483-44854512-45164531- 453345424555-45564562-45764581-45824600-4601 4608-46144623-46394683-468447004780-47814785- 47894795-48094819-48234857-48614868-48694950- Tissue RNA Library SEQ ID NOS: origin Source Name
49514958-49594971-49744980-49814984-49855057- 50595080-50825092-50945109-51155149-51505192 5253-52665269-52715285-52875335-533653645371 5384-53865480-54825495-54965499-55015517-5521 5525-55265557-55605572-55735581-55835591-5592 5652-56545695-569857005703-57045710-57115744- 57475765580258245827-584658495854-58585898- 59225939-59415943-59485953-59545973-59755980- 59845986-59896135-61366140-61436163-61686235- 62376282-62836335-633663866403-64056452-6453 6461-64836523-65266533-65356551-656766236657- 66616791-67946853-68546932-69387056-70577187- 71907192-71937209-72127216-72197245-72487256- 72717283-72987360-73617421-74237426-74427534- 75417554-75557576-75777580-75977604-76057608 7611-76127615-76177620-76217624-76297643-7647 7654-76567660-76657671-76737678-76897693-7703 7726-77277742-77437745-77517778-77867789-7797 7802-78057815-78187821-78227824-78257890-7901 7904-79127917-79187924-79257953-795579577968- 797179767978-79857988-799480028008-80128017- 80208025-80268041-80428049-805680698074-8076 80798084-80858090-80938132-81368142-81568162 8178-81858187-82008216-82188227-823282348247- 82488253-82578263-82658297-83008313-83148339- 83448347-83608363-83648368-83698383-83898404- 840684158420-84258430-84338449-84528461-8464 8487-84888497-84998504-850585098512-85138537- 85508601-86038607-86098612-86168736-87378751- 87668768-87708917-89338936-89458982-89838986- 89919002-90069013-90209022-90259031-90459063- 90679072-90749306-931193169331-93339337-9338 9343-934693829391-93929408-94149443-94459456- 94599472-94769479-94859506-95319546-95499559- 95669570-95789581-95839590-95929594-96049609- 961096139615-9626965996769714-97189726-9727 9729-97319733-97449782-97839794-97959815-9820 9828-98349848-98529854-98559857-98619899-9904 9916-99209923-99249927-99359947-99529956-9957 9960-99619965-99689981-99929997-1000910015- 1001910025-1002610038-1004010164-1016610173- 101741025310263-10277103061031110449-10453 10480-1048210488-1048910491-1049310498-10503 10508-1051210531-1053210537-1053810588-10594 10600-1060810612-1062310627-1063310638-10644 10820-10821 10873-1087510877-1087810881-10883 10899-1090310913-1091410927-1092810931-10933 10936-1093710963-109641096610980-1098511042- 1104411066-110681113511147-1115611212-11219 11292-1130311318-1132011339-1135011372-11396 11406-1140711419-1142611429-1143411460-11466 11473-1147411476-1147811489-1149411496-11498 11538-1154011544-1154611561-1156211586-11606 11609-1164811656-1165711669-11671 1169611715- 1171611731-1173411739-1174011745-1174811753- Tissue RNA Library SEQ ID NOS: origin Source Name
1176311766- -1176811777-11782 11791 11803-11804 1180911835 -118361184211856- 1185711886-11891 11901-1190511941-1194412023-1202612040-12041 12121-1212612131-1213212135-1213712150-12151 12186-1219112202-1220812215-1221712226-12228 12245-1225312256-1225712259-122621236112374- 1237712432-•1243312446-1245012455-1245912470- 1247112478-■1248212486-1254212545-1254612555- 125611256412573-1257412581- 1258212619-12624 1263712642-■126441264612655- 126581266512673- 1267512701-•1270512707-12711 12714-1272312755- 1276212785-•1278712796-1279812803-1280612824- 1282712829- 1283012872-1288112968-1296912977- 1298112997- 1299913068-1307013077-1307913425- 134951351213552-1355913571- 1357313576-13579 13585-1358813592-1359713603-1361213627-13633 13638-13651 13752-1375313865 ■1386613876-13881 13885-1388713908-1391013927 •1392913937-13938 13954-139561400414008-1402014044-1404514054- 1405614058- 1405914098-140991410414123-14124 14130-1413114137-1414014173 ■14181 14192-14195 14199-142011420614216-142261425714261-14265 14283-14291 14304-143051434614355-1435614546- 145491460414607-1460914628- 1463514650-14651 14666-1466814681-1468214684- 146891470714784 14789-1479114866-1487014991-■1499215011-15012 15075-1508415091-1509615182 ■1518315187-15189 15218-1521915233-1523615243-■1524815250-15251 1525715283- 1529115302-1532215329-1534015392- 1539615406- 1541415454-1545615460-1546115496- 1552515536- 1553715545-1554615563-1556415568- 1557215576- 1557715581-1558415588-1558915600- 1560115631- 1563215686-1568715699-1570015863- 15868158781588015888-1589515925-1593115988- 1599016041-160461605916075-1607916086-16088 16096-1609716107-1610916115-■1611716141-16143 16145-1614616152-1615516168-16171 16174-16176 16237162391637716408-1640916422-16431 16443- 1644516470-1647216479-1648816494-1650016508- 165111654516576-165981660016602-1660816623 16636-1665516663-16667167021670516714-16723 16752-1675316787-16788168001680316828-16831 16836-1684216851-16853168601687016894-16896 16934-169371694216949-1695316996-1699717018- 1702017038-1704117074-1707717119-1712217131- 1713217145- 1714617153-1715417242-1724317264- 1727417286- 1729617330-1733217335-1733917394- 1739517454-1745617791-1779917958-1796318015- 1801618024- 1803018038-1804218046-1804818097- 1813418136-1813818374-1837518400-1840218405- 1840618412-1841818421-1842518495-18501 18516- 185181852718536-1853718583- 1858618629-18637 18644-1865018655-186581866218668-1867718680- 1868118691- 1869318717-187261872918745-18761 18771-1877618784-1879318797- 1880418806-18807
13! Tissue RNA Library SEQ ID NOS: origin Source Name
18836-1884118846-1885418856-1888018882-18888 18894-1889618899-1890518908-1891018915-18918 18923-1893918943-1895018955-1895918964-18969 18975-1897718989-1899018993-1899619001-19004 1901219018-1902419029-1903919045-1904819055 19057-1906519071-1908319096-191181913419142- 1914719207-192101922419226-1922719251-19257 19260-192621926419266-1926719273-1927419283- 1929619306-1930919316-1931719341-1934219345- 1936019362-1936419368-1937019372-1937419380- 1938419387-1940219415-1943419438-1944419461 19484-1948619488-194931950319515-1952119530- 1953119560-1956219564-1956519567-1957819586- 1958719617-19622196251962719656-1965719659- 1966119663-1966819670-1967819683-1968719689- 1969319699-1970819727-1973519743-1974819759- 197671977219800-1981519855-1985619915-19929 19933-1993919943-1994619948-1995019965-19967 19969-1998120015-2001720029-2005220066-20073 20087-2010220106-201122012020122-2012720130 20133-2014820157-2016020167-2019720200-20214 20226-2023020235-2024020244-202492025320257- 2027020286-202892029920317-2032020328-20330 20345-203592036220366-2036820393-2040520412- 2041320415-2043120437-2045120456-2047520485- 2049420497-204982050120505-2052020524-20535 20537-2054520548-2057820607-2061220614-20619 20625-2063420644-2065220666-2067120676-20681 20683-2068520688-2069220698-2070720710-20711 20714-2072620752-2075320758-2076720788-20791 20801-2081520819-2084920853-2086320865-20870 2088120885-2090020922-2093220938-2094320948- 2094920957-2097220977-209882099120999-21004 21011-2101521021-2102321027-2103121060-21071 21076-2108721097-2111421122-2112721141-21142 21145-2115121153-2115421157-2116921171-21174 21197-2119821202-2120721221-2122221225-21226 21229-2123521237-2125221256-2126621272-21274 21277-2128021284-2129421297-2129821301-21303 2133421343-2135021353-2135421359-2136021377- 2139721410-2141421418-2142621428-2143921447- 2145321463-2146521467-2146921480-2148221485- 2149121495-2150021516-2153021554-2155621587- 2165521718-2172421729-2173321877-2188521892- 2189421911-2191221915-2192121924-2193921955- 2196821971-2197221975-2198721989-2199321995- 2202522047-2204922056-2207622080-2208322090- 2209422101-2210722115-2211922128-2213822141- 221432216022165-2216822171-2217722187-22192 22195-2219822204-2221122218-222262223022243- 2225222256-2226022265-2229922303-2230922312- 2233522343-2235622358-223712237322377-22380 22399-2241022433-2243522440-2244822455-22465 2249522534-2253922551-2255822560-2256522570- 2258122599-2260222607-2260922622-2262822632- Tissue RNA Library SEQ ID NOS: origin Source Name
2265122653-2265422661-2266722669-2267722690 22696-2269722700-2270122726-2272722730-22735 22741-2274222750-2276222768-2277022775-22776 22781-2278222794-2279622801-2281622821 -22822 22828-2282922846-2285322861-2286822870-22874 22881-2288622899-229062291122923-2292422955- 2295722962-2297022973-2297622978-2298622991- 229942299823007-2302123046-2305023059-23060 2306323070-230762308423098-2310423117-23119 23125-2313623138-2314123201-2320223206-23221 23229-2323323236-2323923242-232452324923251- 2325623261-2326323279-2328123321-2332723343- 2334423353-2335423356-2336023379-2339023392- 2339523399-2340123415-2341923428-2343223448- 2345023470-234792348623489-2349023492-23493 23495-2349823508-2351423543-235442354623599 23682-2368623692-236932370423718-2372123726- 2373623760-2376323765-2376723770-2378023793- 2379723806-2380923816-2381923827-2383323839- 2384023843-2384823867-2387023878-2388023882- 2390423906-2391023941-2395623990-2399223999- 2400124005-2401124014-2401824021-2402524029- 2403324035-240402405624092-2409724120-24125 24481-2449024772-2477424802-2481625085-25090 25114-2512625279-2529025319-2532125323-25325 25338-2534125355-2535825368-253692537325376- 2537725416-254172584826024-2602826196-26199 26205-2621326235-2624326258-262592626626280 26285-2629026292-2630226305-2630626316-26321 2632626328-263292633726349-2635226358-26365 26373-2637426376-2637726421-264242643126460- 2646326567-2658526596-2660626665-2666626678- 266802668426733-267352674626755-2675626788- 2678926804-2680526843-2684526860-2686226873- 2687926885-2688726980-2698426998-2701127044- 270472705227058-2705927067-2707027074-27077 2709127100-271022710527150-2715127154-27155 27173-2717427193-2720027207-2721527218-27221 27229-2723227247-2725127261-2726527269-27276 27283-2728727293-2730127304-2730527348-27353 27411-2741227414-2743127439-274442746827493- 2749527497-2750727510-2751127520-2752127544- 2754527557-2756327574-2758427589-2759227600- 2760627636-2763927649-2765627662-2767227698- 2770127719-2772027729-2773927807-2780827812- 2782227825-2782627840-2784427848-2785227861- 2786427890-2789227940-2794227989-2799027992- 2799728045-2804728050-2805928095-2809628099- 2810028105-2812228138-2814028142-2814528157- 281582818628199-2820328263-282682828628290- 2829228311-2831328315-2831628361-2836228424 28426-2842828468-2848428573-2857829050-29070 29265-2926729278-2928329301-2930329328-29331 29339-2934029343-2934529354-2935829378-29379 2940529409-2941629418-2941929432-2943529451-
I JJ
Figure imgf000135_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
22284 -2228922303 -223082249522760 -2276222768- 2277022904-22906229712307123358-2336023473- 234742376123890- 2389223941-2395624495-24498 25085-250902530625374-2537525866-2587526175- 2618426359-2636026973-2697927053 •2705727074- 2707727269-2727027401-2741027439-2744027504- 2750727510-2751127544-2754527649•2765427662- 2767227825-2782628192-2819828365 •2837028728- 2873129278-292832936729409-2941629594-29604 adult GIBCO ASP001 170-171180-181197-200221326-327329-330358-359 spleen 373-374395404-405412-413425-427528534-535543- 545553-554656708-711785-786847-848934-935981- 9851001-100310051026-103110471086-10871101- 11031235-12371295-1297130714311453-14541499 15211569-15701692-16931705-17141721-17221879- 18801947-19482015-2018219423062383-23882420- 242125162599-260329262929-29302939-29433090 311231153274-32753326-33313374-33773418-3420 352536243681-36823810-38113949-39534206-4208 4386-43874417-44194441-44434515-45164555-4556 45614581-45824819-48245285-528753335335-5336 5525-55265534-55365557-55605572-557357765802 58465919-59215973-59756103-610464006403-6404 6523-65266791-67946898-68997209-72127216-7219 7447-74497580-759776087620-76217630-76367681- 76897745-77467757-77597778-77837921-79237984- 7985799480028041-80428077-80788101-81048113- 81148157-815981628187-82008247-82488363-8364 8449-84528507-85088512-85138543-85508607-8609 8612-86168689-86908936-89388986-89919007-9008 9013-90179038-90429063-90679072-907493059337- 93389427-94289470-94769487-94889519-95219526- 95289531-95349561-956295849723-97259735-9738 9782-97859794-97959815-9820982698289851-9852 9854-98559874-98959986-998810167-1017210277 1030610320-1032310523-1053210595-1059910615- 1062310627-1063010638-1063910873-1087510904- 1090510913-1091410980-1098511006-1100811198- 1119911397-1139811476-1147811561-1156211612- 1163911731-1173411745-1174811835-1183611878- 1187911886-118891193412023-1202612040-12041 12150-1215112186-1218912226-1222812256-12257 12361124511247712545-1254612565-1256812637 12642-1264412666-12667126991272312785-12787 12824-1282712872-1288112997-1299913077-13079 13865-138661389813927-1392913999-1400314094 14170-1417214209-1421514452-1445314546-14549 1460414650-1465114684-1468515182-1518315187- 1518915195-1519615222-1522415290-1529115326- 1532815486-1548915576-1557715588-1558915600- 1560115699-1570015867-1586816044-1604616110- 1611216174-1617616233-162351643316451-16452 16508-16511 16535-1653916542-1654516558-16562 1662316636-16637166421665216799-16801 16851- 1685316894-1689616934-1693717026-1702817058- Tissue RNA Library SEQ D3 NOS: origin Source Name
1706417105-171061711517238-17241 17330-17332 17335-1733917394-1739517454-1745617571-17574 17857-17861 17958-1796217998-1799918015-18016 18029-1803018419-1842018492-18501 18536-18537 18633-1863718691-1869218719-1872218750-18760 18767-1876918771-1877218778-187801878918796 18802-1880418811-1881318822-1882418836-18838 18842-1884518857-1888018882-1888818894-18896 1891018915-1891818923-1893318936-1893918960- 1896918975-1897719005-190091901819045-19048 19071-19083190901913419138-191471915119207- 1921019260-1926219266-1926719283-1929419306- 1930719316-1931719345-1935019362-1936419370 19373-1937419385-1938919395-1940219415-19417 19422-1943119438-1944419458-1946019462-19463 19488-1949319515-1952119526-195291953619566 1958619620-196241962719667-1966819672-19673 19679-196821969319706-1970819736-1975119772 1981319855-1985619915-1991819921-1992519933- 199391995020029-2004320106-2011220122-20127 2013020146-2014820161-201642024120244-20249 202532028920299-2030420328-2033020345-20354 203602041120418-2042420437-2044620472-20474 20485-2049420497-204982050120511-2051320548- 2055320569-2057820599-2060120614-2061920629- 2063420649-2065220666-206712068120714-20717 20727-20732207522076720788-2079120813-20815 20819-2082120836-2084320855-208632086520868- 2087020882-2088420890-2090020927-2092820938- 2094220999-210082112221176-2118021194-21196 21202-2120721256-2126221272-2127421277-21280 21284-2129421301-213032133421398-2140221447- 2145021480-2148221492-2150021647-2165521881- 2188521892-218942192421955-2195721967-21968 21978-2198722003-220152204722117-2212722141- 221432216022187-2219222253-2225522261-22276 22284-2228922312-2231322343-2234822358-22359 2237322405-224082249522534-2253922571-22581 22607-2260922634-226362269022700-2270122703- 2270622750-2276222781-2278222794-2280022810- 2281622821-2282222838-22842229112297323046 23051-2305223059-2306023064-2306523070-23071 23074-2307623123-2312423138-2314023201-23202 23212-2321523229-232332325123261-2326223356- 2336023386-2339023400-2340123415-2341823434- 2343723448-234502348623495-2349623508-23511 23543-2354423555-2356523760-2376123843-23844 23903-2390423906-2391024002-240042405624126- 2412925085-250902531325331-2533525340-25341 25370-2537525410-2541325416-2541726024-26028 26033-2604926196-2619926237-262392626626327 26341-2634426389-2639326421-2642226469-26470 266022684326858-2686226880-2688127025-27027 270912713527139-2714627193-2720027209-27213 2721827229-2723227247-2725127254-2726027269- Tissue RNA Library SEQ ID NOS: origin Source Name
27270 27299- 27301 27379-27380 2749627510- 27511 27544-•2754527562-■2756327572--2757327633- -27634 27636 •2763927649■2765627698--2769927750- 27756 27830-•2783827861-■2786427890--2789227989- 27990 28099-•2810028138 •2814028142--2814528286 28290- 2829228311-283132842428426-2842829278- 29283 29339-•2934029343-•2934529364--2936629409- 29417 29508-•2951829718--2972129938-■2993929960- •29961 30141-■3014230206--3021730243-■3025030361- •30368 testis GIBCO ATS001 51-52156221332395425-427468-469477-482524- 525635656913969-9711001-100510471123-1127 1273-127914511667-166818121912-191319442072 2202-22032212-22132540-25412599-26032874-2876 2988-298930043195-31973281-32823374-33763479- 34803531-35343555-35583679-36804091-40954124- 41264343-43464435-44364515-45164562-45684581- 458246514785-47894865-48674984-49855335-5336 5572-55735581-55865656-56575766-576757735802 5919-59215943-59455973-59755980-59846140-6143 6235-62376403-64046762-67646791-679468527209- 72127216-72197283-72847447-74497534-75357604- 760576087611-76127618-76197654-76557671-7673 7745-77467763-77707787-77887806-780879948069 8137-81418145-81598187-82008242-82488368-8369 8383-83898409-84158507-85088539-85508607-8609 8738-87408762-87669007-90089040-90429337-9338 94449449-94509475-94769482-94859510-95259617- 96269829-98329851-98529874-989510025-10026 10038-1004010167-10172102771030610449-10453 10488-1048910531-1053410607-1060810615-10623 10628-1063010638-1063910873-108751093811011- 1101211123-1112411339-1134411372-1139611431- 1143211476-1147811541-1154611568-1157111582- 1158311674-1167811697-1169811711-1171211731- 1173411739-117401180911835-1183612150-12151 12202-1220512226-1222812374-1237712398-12401 1263712676-126781272312785-1278712984-12985 12997-1299913107-1311713603-1360513609-13612 13627-1362813865-1386613873-1387513954-13956 14048-1405614098-1410014105-1410614268-14269 14640-1464214650-1465115190-151911520615250- 1525115258-1525915290-15291 15300-1530115460- 154611546515545-1554615548-1555015576-15577 15588-1558915988-1599016168-1617116365-16366 16473-1647816609-1661116636-166371664216804- 1680716851-1685316913-1691516938-1693916949- 1695317024-1702817156-171591723217238-17242 17286-17291 17330-1733217455-174561758017958- 1796218015-1801618029-1803018412-1841818527 18644-1865018668-1867018673-18681 18691-18692 18729-1873218738-1874918761 18771-1877218778- 1878018784-18786187891879618802-1880418839- 18841 18846-1885318894-1889618899-1890318947- 1895018955-1895918964-1896918978-1898219013- 1901719029-1903519045-1904819059-1906119106- Tissue RNA Library SEQ ID NOS: origin Source Name
1911819132 1913419253 1926419345- 1934919351- 19354193621937019380- 1938419407- 1941119422- 1943419526--1952919604--196071962519656-19657 19667-196681969319699--19705197091971019733- 197481981319921-199231993919943- 1994619972- 1998120029--2004320064--2006520087-2009420106- 2011220146--2015020161-•2016420172- 2017920253 20288-2028920401-2040720412-2041320415-20417 20437-2044020485-204902051020514- 2052020547 20575-2057820616-2061920625-2063020683-20685 20698-2070720718-2072520747-2075120758-20767 20824-2083520929-2093220938-2094220948-20949 20999-2100821021-2102321072-210752112221157- 2116921171--2117421202--2120721237--2124721256- 2126621275--2127621377--2139721408--2140921485- 2149121529--2153021647--2165521718--2172421973- 219742197822000-2200222016-2201822020-22025 22030-2203422040-2204122060-2206122070-22073 22090-2209122141-221432216022187--2219222212- 222172225022265-2227022277-2228322310-22311 22314-2231722323-2233522343-2234822358-22361 22366-223712237322399-■224042249522534-22539 2254422561--2256522571-2258122599-•2260222644- 2265122661--2266222730-■227312274922833-22834 22850-2285122857-2285822861-2286822974-22976 2299523047-■230552307123201-232022325123343- 2334423379-•2338523425-2342723489--2349023508- 2350923720-2372123731-2373623762--2376323771- 2378023793-■2379923816-2381923839--2384023882 23893-2390223990-2399224014-2401824029-24033 24772-2477425319-2532125368-253692537325416- 2541726024-■2602826196-2619926237-■2623926266 26285-2629026305-26306263272673726748-26754 2684326860-•2686226876-2687826980-■2698427052 2709727114-•2712627147-2714927173-2717427193- 2720027261-2726527269-272722729427304-27305 27570-2757127600-2760127636-2763927726-27739 27861-2786427890-2789227943-2794727989-27990 28095-2809628142-281452828628308-•2831328424 29278-2928329328-2933729339-2934029409-29416 29718-2972130141-3014230150-3015630218-30220
Genomic Research BAC001 3338-33394122-41234175-41764180-41904911-4914 DNA from Genetics 5021-50235030-50355045-50465154-51797065-7066 BAC (CITB 7370-73747426-74428802-88068836-88388855-8859 63118 BAC 8877-88919087-90999152-91929194-91959213-9215
Library) 9217-92259239-925792599269-92769291-92939301- 93049312-931310443-1044410684-1068910691-10703 10723-1072410726-1072710835-1083910842-10867 10879-1088012218-1222013130-131321314513176- 1317813324-1332513422-1342313677-1368013683- 1368513693-1369413708-137181382313854-13860 14663-1466515718-1572515815-158171582715910- 1591615934-1595015978-1598015991-1600016024- 160271623217637-17641 1767317688-1769617738- 1773917748-1775117763-17781 17783-1778617842- Tissue RNA Library SEQ ID NOS: origin Source Name
1784717862-17863 17876-17879 17881-1788517888- 178901792118024--1802818066--1808218090-18096 1813518139-18156182261824218247-1825318255- 1825818275-1827718286-1828818293-1829518354 18365-1837018382-1839221304-2132521501-21515 21549-215532156321752-2177221863-2187622436- 2243922449-2245422525-2253022596-2259822605- 2260623532-2353923820-2382624187-2419324255- 2425924269-2427324333-2434024370-2437424402- 2442524449-2446224610-2462324686-2469324727- 2475424757-247702479424820-■2483724855-24862 24919-2492324928--249482495124964-2497424993- 2499925041-2504825093-2509525112-2512625158- 2517725254-2525625366-2536725418-2544025447- 2546025471-2549425496-2551425537-2554125583- 2559325598-25603256212562325651-2565625660- 2567225676-2567925687-2570225741-2578225821- 2584225853-2587525903-2590525944-2594625956 25969-2601626054-2605826077-2608626090-26099 26105-2611126114-2611526136-2613826150-26152 26165-2617126629-2665326691 -2669426700-26703 27677-2768827714-2771727943 -2794728352-28357 28504-2850728514-2851728531 -2854728565-28570 28596-2860628673 -2867828709-2871928732-28742 2874828754-2876028775-2878128809-2882128831- 2883628841-2884728901-2890928914-2892628933- 2893528944-289582901229031-•2903829071-29075 29096-2910129104--2910729154■2915629179-29186 2920329225-2923129446-2946629468-2948729492- 2951829529-2954329547-2956329569-2957729583- 2960529608-2961029627-2962929631-2963729645- 2965429660-2967229686-2970329708-2971029763- 2987529880-2988429991-2999430011-3001330038- 300473006130118-3013430165-3017730179-30182 30201-3020430236--3023930254-■3025530260-30270 30275-3027830352
Genomic Research BAC002 40074037-40424177-41904193-41994905-49075030 DNA from Genetics 5036-50465326-53307036-70377043-70557089-7091 BAC (CITB 7094-70997175-717773747392-73957407-74147421- 39316 BAC 74427444-74468466-84768710-87158802-88068839-
Library) 88418855-88598866-88689150-91519182-91939201- 92089213-92299231-92349258-92599277-92869291- 929693009312-931310324-1032510330-1033110349- 1035010443-1044410704-1071810723-1072410726- 1075010755-1076910773-1077710835-1083910842- 1086710879-1088011890-118911317613184-13188 13324-133251370113706-1373613751 13815-13818 1382313825-1383113854-138601496214981-14983 15110-1511415718-1572515838-15841 15847-15854 15910-15911 15939-1594315951-1596015964-15982 15991-1600016016-1601816024-1602717275-17277 17409-1743017637-1764117688-1769617699-17700 17748-17751 17771-1777617783-1778617805-17806 17813-1781717819-1782417828-1782917837-17841 17843-1784717871-1787317876-1787917888-17890 Tissue RNA Library SEQ ID NOS: origin Source Name
18010-18012 18024-1802818075 1808018090-18092 18139-1814418155-1815618161 1816818175-18190 18204-1820618211-1822318226-1825318255-18268 18275-1828418300-183221835418361-1837018382- 183921840421501-2151521563--2156721576-21646 21656-2165921718-217242172821752-2177222498- 2253322540--2254322603-2260622610-2261722658- 2266022698--226992270223029--2304523364-23370 23702-2370323820-2382624183-241932422724246- 2426824282-2428924300-2430324308-2431224319- 2432424345 -2435724375-2438324402-2443524449- 2446224610 ■2462324666-2468024686-2469324726- 2475424757 ■2477024794-2480124820-2483724846- 2486224871 •248882492424939-249482495124975- 24977250292504825052-2506025078-2508425091- 2509225096■2510025112-2512625158-2517725193- 2521225228 ■2523825247-2524825359-2536325380- 2538125436•2544025450-2547025476-2548225496- 2550525515 •2552125537-2556125572-2559325598- 2560325633 •2565625660-2567225676-2568925703- 2584225847 •2589625901-2596525969-2601826077- 2607926105 •2611526121-2612626136-2613826172- 2617426185 ■2619426629-2665326691-2669726700- 2670327714-•2771728352-2835728489-2849028504- 2850728514-•2852128525-2853028533-2853928565- 2857128617-■2862728646-2865028679-2868028688- 2869428709-■2871928754-2876028768-2878128800- 2882128875-■2888128910-2891328917-2892628933- 2893528943-■2895828964-2897229013-2901929031- 2903829071-■2907529087-2908829096-2910129128- 2914129179-•2918629234-2924129290-2929529327 29446-294502946629468-2947229478-2948029488- 294912949929501-2951829544-2954629551-29574 29590-2960429606-2961029614-■2963729642-29678 29685-2971729722-2974529762-•2984129870-29875 29877-2992229926-2993729964-•2996829986-29990 29995-3000130006-3000730011-■3001930038-30058 3006130085-3009330098-3010030109-3013430145- 3014930165-3017730271-3027830283-3029530321 30343-3034730352
Genomic Research BAC003 1578-1585405242054899-49034908-49105031-5032 DNA from Genetics 504253346796-68007061-70647067-708770917094- BAC (CITB 709973747392-73957426-74427467-74768839-8841 39316 BAC 885988628899-89029145-91499152-918192599269-
Library) 930410443-1044410759-1076410819-1086811076- 1107713130-13132131831375113815-1383215112 15120-1512315718-1572515847-1585415944-15945 15966-1596815991-1601816021 1760917653-17667 17683-1768417734-1773617771-1777617783-17786 17791-1779917805-178061782717837-1784117843- 1784717862-1786818010-1801218075-1808018152- 1815418175-1818318211-1821418216-1821818230- 1823918255-1825818275-1837021304-2132521501- 215152172821788-2179922498-2252222525-22530 22582-2259822603-2260622610-226172270223029- Tissue RNA Library SEQ ID NOS: origin Source Name
2304523702-2370323993-23998 2422724231-24245 24255-2426824300-2430724333 -2434424370-24374 24397-2440124436-2443824481 -2449024495-24498 24615-2462324727-2473524749-2477024794-24801 24889-2489924928-2493824964-2497724989-24992 24995-2499925027-2502825041 -2504825052-25060 25093-2509525101-2511125147-2517025359-25363 25418-2543525450-254602547525542-2555025633- 2565025660-2566525676-2567925690-2570425737- 2579525821-2583525883-2589625912-2594325957- 2596525969-2601626050-2605826068-2608926100- 2612626133-2619426629-2665326700-2670428504- 2850728514-2851728531-2853928663-2866528673- 2867828688-2869428699-2870728728-2874228748 28754-2876028775-2878128815-■2882128944-28958 28964-289722901229096-2910129128-2914129420- 2942529446-2945029462-2946529468-2947229501- 2950729535-2954329564-2956829583-2958629590- 2959329606-2961029614-2962029642-2964329645- 2965429673-2968429686-2970329736-2973929746- 2976129763-298362987929892-2989529923-29935 29964-2997429980-3001930022-3006430084-30100 30109-3013430179-3018230236-■3023930256-30259 30334-3034030360 adult Invitroge BLD001 221329-330716-717796-7971123-11272570-2574 bladder n 2674-26863006-300833773461-346335603649-3653 37074213-42144266-42725134-51355480-54825919- 59215943-5945615464607534-75357924-79258010- 801282968430-843384659475-9476953195849726- 97279729-97319763-97679851-98529956-995710254- 10261 10881-1088310904-1090510980-1098511345- 1135011731-1173412371-1237312432-1243412829- 1283012978-1298113601-136021376015290-15291 15617-1561815623-1562616568-165691664217010- 1701417286-1729117330-1733217455-1745618405- 1840618730-18732187771878918802-1880418806- 1880718923-1892419225193621937019387-19389 19395-1940219407-1941119415-1941719422-19431 19706-1970819757-1976319926-1992919962-19963 19972-1998120066-2006820099-2011020122-20127 201302028920300-2030420333-203352036020366- 2036820495-204962050120505-2051020524-20535 20537-2054120625-2062820631-2063420676-20680 20747-2075120868-2087021021-2102321171-21174 2123621284-2129421351-2135221428-2143321447- 2145321495-2150021554-2155621955-2195721962- 2196622003-220062204722050-2205522080-22083 22161-2216422366-2237122553-2255822678-22679 22728-2272922857-2285822881-2288623007-23021 23252-2325623321-2332723682-2368623839-23840 23878-2388023999-2400124021-2402425085-25090 25323-2532525331-253332537326024-2602826341- 2634426460-2646326860-2686226880-2688127807- 2781127861-2786427890-2789228099-2810028286 28352-283572842428426-2842829718-2972130195- Tissue RNA Library SEQ ID NOS: origin Source Name
30199 bone Clontech BMD001 3035-3848-4955-7387-95105130-143156164-167 marrow 170-171173-181184197-200227-230240-250255-256 273-278301329-330332368-371419-420425-427483 504-506515-517546-549585-588607635656716-720 738-740792-795798-803832-836870-871891912919- 922934-935937-940942-944969-971981-9851001- 1003100510281044-104510471049-10501065-1066 1092-10941097-10981158-11611163-11641240-1241 13071314-131913211388-14011414-141714311467- 14691474148314991667-16681681-16871692-1693 1705-17141721-1722177718401853-18541862-1863 1879-1880190919141945-19461980-19821986-1987 19902015-2018203020362062-206720722086-2087 2102-21132222-22262272-22742404-240624312471- 24742550-25522554-255525762595-259626672773 28062828-28452871-28732886-28872923-29242926 2951-29552983-29862988-2989299330013006-3008 3037-30383144-31463151-31543205-320732283326- 33313355-335634333461-346334743508-35103515- 351735473555-355836353649-3653367636863693 3776-37773834-38353939-39413949-395540644151- 41554160-41624239-424042564422-442344504509- 45114515-45164555-45564562-45684726-47274751- 47554857-48614967-49685091-509451925279-5283 5335-5336539053935480-54865499-55015519-5521 5525-55265557-55605572-55735584-55865591-5592 5744-57475758-576158025805-58075907-59095919- 59215980-59846011-602160846095-60986403-6404 6442-64446452-64536773-67746791-679468526877- 68786884-68857187-71907209-72127216-72197245- 72487258-72637407-74147447-74497536-75417554- 75557561-75627580-75977604-76057615-76177622- 76237630-76367640-76477657-76597678-76877693- 76947701-770377297734-77367742-77437745-7751 7757-77597776-778377947806-78087815-78187924- 7925797779888059-80638077-80788098-81048132- 81518182-81838219-82208230-823282348247-8248 82508253-82578263-82658301-83068349-83508361- 83628368-83698383-83898416-84198449-84528466- 84768497-849985098539-85428576-85828607-8609 8698-87008736-87378741-87508777-87798917-8918 8986-89919002-90049040-90469100-91449196-9200 9306-93119321-93239329-93309336-93389340-9346 9355-93579362-93659375-93769429-94309456-9459 9475-94769487-948895319561-95629577-95789590- 95929603-96049617-96269743-97449759-97629780- 97839794-979598269828-98329835-98409854-9855 9857-98619902-99049923-99249929-993599539969- 99809984-99859989-99929997-1000910017-10019 10025-1003110033-1004010164-1016610175-10252 10273-1027410277-102781030610451-1045710480- 1048210513-1051510531-1053210537-1053810543- 1054910588-1059410607-1060810628-1063010638- 1064410890-1089310936-1093810954-1095610980- Tissue RNA Library SEQ ID NOS: origin Source Name
10985 11029-11031 11123-11124 111351114311157-
1119711295-11296 11300-11301 11308-1131011314-
113151131711339- 1134411399- 1140211431-11432
1143411467-11474 11476-114781149511505-11506
11531-1153211538 -1154011544-1154611561-11562
11586-1160211607 -11609116491165011669-11671
11711 •1171211731 1173411745 1174811766-11768
11780-1178211809 11835-11836118401184211974-
1197612006-12010 12046-1205612117-1211812131-
121321214012150- 1215112171- 1217212186-12189
12192-1219312202 -1220512229-122301223912241-
122441225812360- 1236112374- 1237712446-12450
12455-■1245912519 1254012543 ■1254612555-12559
12565-■1256812573 1257412590 •1259212615-12618
12642 -1264412655 -1265812666 ■1266712701-12705
1272312725-12727 12755-1276212785-1278712796-
1279812872-12881 12984-1298512997-1300413068-
1307013077-13079 13120-1312313556-1355913592-
1359513598-13600 13603-1360513609-1361213627-
1362813632-13633 13638-1364113873-1387513896-
1389813903-13906 13922-1392413931-1393313937-
1393813954-13956 14030-1403114048-1405314058-
1405914061-14064 14192-1419514228-1423514261-
1426314304-14305 14318-1434314347-1435114355-
1435614386-14387 14546-1454914552-1455514557
1460414607-14609 14650-14651146831478415003-
1500915013-15024 15151-1515715182-1518615195-
1519615218-15219 15222-1522415250-1525115257
15277 ■1527815290- •15291153261532815356-15357
15406 1540715412- -15414155301553115563-15564
155881558915600- •1560115623 1562615699-15700
158691587015906- ■15907159881599016044-16046
160591606116075- ■16079161101611216160-16162
161681617116219- •16220162691628316290-16295
162971630016304- 163051630916403-1640716434-
1643916451-16452 16467-1646916517-1651916530-
1653116535-16538 16542-1654416576-1657716583-
1658516609-16611 16619-1662116636-1663716642
16649- 1665716672- 1667416704- 1670516714-16715
16828- 16831 16851- 1685316860- 1687016934-16939
17070- 1707117094- •171001711617120-1712217145-
1714617292-17296 17330-1733217335-1733917385
17454- 1745617489- 1749017924-1792617940-17942
17958- 1796218015- 1801618029-1803018055-18060
18069- 1807218157- 1816018371-1837218376-18377
18405- 1840618411- 1841818492-1849418500-18501
18516- 1851818520- 1852718536-1853718562-18581
18583- 1861718625- 1862818644-186501866118668-
1867018673-18677 18691-1869318717-1872618734-
1873718750-18758 18762-1876618777-1878018787-
187891879618802- 1880418806- 1880718825-18828
18836- 8841 18854 18856-1888018882-1888818899-
1890318908-18910 18915-18921 18934-1893518941
18955 1895918964- 1896918975 1897718983-18985
1900019019-19024 19029-1903519062-1906519071- Tissue RNA Library SEQ ID NOS: origin Source Name
1908519106-19118 19134-1913719153-1915819211- 19212192531925719260-1926219266-1926719283- 1929419316-1931719345-1934919351-1935419362 1937019380-1938419387-1940219413-1941719422- 1943119442-19445194611950319514-1952119560- 195621956619604- 196071962719672-1968719693 19698-1969919733 1974219749-19751 19759-19763 19768-19771 198101981819915-1991819921-19929 19938-1993919943-1994619953-1995719965-19968 19972-1998020029-2006320072-2007320087-20094 20099-2010220104-201122012020122-2012720146- 2014820161-2016420182-2018820208-2021420222- 2022520235-2024020244-202492025320257-20262 20265-20270202892029920308-2031520328-20330 20345-2035420393-2040020411-2044020456-20468 20472-2047420476-2047920482-2049420497-20498 2051020524-2053620542-2054520568-2057820599- 2060620614-2061920621-2062420629-2063420644- 2064520649-2065220676-2068020683-2068520688 20698-2070720714-2072520727-207342075220767- 2078720806-2081220822-2082320836-2084920855- 2086220865-2086720881-2088420897-2090020922- 2092520938-2094220948-2094920952-2095420963- 2097620985-2098821005-2100821058-2105921062- 2106621072-2107521112-2111421141-2114221145- 2114821176-2118021202-2120721225-2122621233- 2123521237-2127421277-2128021284-2129621343- 2135221377-2139721408-2140921434-2144521447- 2145021463-2146921485-2149121495-2150021516- 2153021532-2153321554-2155621587-2165521729- 2173321744-2174721892-2189421915-2191621955- 2196121967-2196821973-2197421978-2198222007- 2201522040-2204122060-2206122092-2209422141- 2214322160-2216422187-2219222195-2219822208- 2222422251-2225522261-2227622284-2228922292- 2230222310-2232222336-2234822358-2235922365- 223712237322377-•2238022383-2238822399-22404 22409-2241022440--2244822455-2246522531-22539 22571-2258122618--2262122634-2265122663-22664 22669-2267022675--226792269022696-2269722700- 2270122703-2270722728-227292273622749-22762 22768-2277022794--2279622805-2280922835-22845 22850-2285122857--2285822881-2288622909-22910 22925-229262295122955-2295722969-2297022974- 2297622990-2299423007-2302123046-2305023053- 2305523057-2305823063-2306723070-2307123074- 2307623098-2310123138-2314023206-2320923218- 2322223229-232332325123261-2326223343-23344 23379-2339023400--234082343323439-2345023470- 234742348623489-2349023492-2349323495-23496 23506-2351123514--2351823526-2353123543-23544 2359923675-2368123705-2370823718-2372123726- 237302376123771-2378023798-2379923802-23805 23827-23835238432384423871-2387523890-23892 24014-240182411624126-2414424495-2449825085-
Figure imgf000146_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
12050-120561214012202-1220512245-1225312258 12309-12311 1236212374-1237712560-1256112615- 126181263712676-1267812701-1270512782-12787 12796-1279812871-1288112894-1289712997-12999 1352813592-1359513627-1362813885-1388713925- 1392614137-1413814261-1426314546-1454914604 1462614714-147171478414809-1481314822-14838 15151-1515715182-1518315290-1529115329-15331 15336-1534015356-1535715412-1541415631-15642 15652-1565615855-158571587815906-1590716053- 1605816403-1640716422-1642916443-1644516473- 164781660016636-166371664216652-1665516894- 1689617046-1705717105-1710617120-1712217397 17399-1740017940-1794218136-1813818286-18288 1840318405-1840618500-185011852718579-18581 18757-1875818767-1876918771-1877218777-18780 18802-1880418899-1890318934-1893918964-18969 1901819059-1906519074-1908019106-1911819156- 1915819211-1921219221-1922219228-1922919254- 1925519260-1926219283-1929419351-1935419362 1937019373-1937419380-1938419390-1939419401- 1940219418-1943119442-1944419461 19512-19513 19515-1952119672-1967319679-196821969319698- 1969919733-1973519743-1975119800-1980319813 19819-1982219919-1992319938-1994219958-19963 19972-1998020015-2001720029-2004320074-20079 20087-200942012020146-2014820167-2017120195- 2019720208-2021420222-20225202412025320257- 2026220265-2027020290-2029620425-2042620432- 2044620476-2047920485-2049420497-2049820502- 2050320524-2053620563-2056720599-2060620620 20631-2063420653-2066520727-207322078820816- 2081820853-2085420858-2086220897-2090020938- 2094220948-2094920952-2095420977-2098421058- 2105921122-2112721141-2114221145-2114821157- 2116921197-2119821202-2120721216-2123221236- 2124721277-2128021284-2129421398-2140221408- 2140921463-2146521480-2148221485-2149421647- 2165521729-2173321744-2174721892-2189821917- 2192121948-2195421978-2198722026-2202922070- 220732208922092-2209422120-2212722141-22143 22171-2217522208-2222422292-2229922310-22311 22319-2232222336-2234622358-2235922440-22448 22644-2265122663-2266422678-226792269022838- 2284222846-2284922899-2290023047-2305023064- 2306523070-2307123074-2307623085-2308623252- 2325623382-2338523415-234182343323473-23474 23510-2351123543-2354423555-2356523720-23721 2376123793-2379723802-2380523827-2383323903- 2390425307-2531025323-2532525410-2541326024- 2602826196-26199262662628026285-2629026310- 2631426341-2634426373-263742659526744-26745 26832268432687926888-2689027100-2710227127- 2712827142-2714627193-272002720627209-27213 27216-272182729427476-2747727498-2750027544- Tissue RNA Library SEQ ID NOS: origin Source Name
2754527562-2756327636-2763927649-2765627814- 2781527848-2785227861-2786428050-2805928358- 283602842428426-2842829117-291232924629271- 2928329309-2931429339-2934029354-2935729367 29378-293792944530150-30156 bone Clontech BMD004 65610485335-53368768-87709956-995712872-12881 marrow 15290-15291 18673-1867718719-1872219156-19158 2197822669-2267023046253732684327193-27200 2721827790-2779627989-2799028199-2820328286 bone Clontech BMD007 373-37441744362-43658320-83229531 15908-15909 marrow 16044-160461665217160-1716718771-1877219749- 1975119814-1981520698-2070722310-2231123070 25370-253722626627702-27706
Adult Invitroge CLN001 170-171398-399700-701723-72510041235-12371273- colon n 12791608-16101667-16681961-196224392471-2474 2599-26032687-27063306-330836063735-37363789- 37943949-39534213-42144221-42384515-45164666 7216-72197258-72637536-75417742-774380468074- 80788297-83008323-832583428420-84218430-8433 8497-84998512-85138917-89189343-93469391-9392 9475-94769506-9509953196889735-9738982810148- 1016010263-102641031110506-1050710531-10532 10904-109051146711473-1147411544-1154511612- 1163912006-1201012150-12151124351272312725- 1272712872-128811294313540-1354213581-13583 14137-1413814277-1428014546-145491460414784 15093-150951525715406-1540715530-1553115855- 1585716345-1634916558-1656216570-1657316602- 166081699517105-1710617335-1733917394-17395 17958-1796218269-1827018412-1841818495-18499 18719-1872218778-187801879618925-1893319045- 190481913419283-1929419316-1931719368-19369 19390-194021956619683-1968719736-1974219759- 1976319926-1992919933-1993719972-1998020103 20106-2012020161-2016420189-2019420200-20207 20231-2023420300-2030420321-2032420328-20330 20336-2033820393-2040020480-2048120485-20490 2050120524-2053520676-2068020806-2081220819- 208232086520890-2089620963-2097221208-21210 21225-2122621256-2126221284-212942133421377- 2139721428-2143321447-2145021516-2152821877- 2188021955-2195722003-2201522026-2203422204- 222112225022343-2234622358-2235922405-22408 22551-2255222700-2270122810-2281822838-22842 22952-2295323053-230552307023074-2307623212- 2321523358-2336023415-2341823827-2383324005- 2401126307-26309263272633726341-2634426359- 2636526373-2637426421-2642226858-2685927100- 2710227156-2715727304-2730527640-2764127812- 278152818628255-282612826928468-2848430141- 30142
Mixture of Various CTL016 373-374847-8484581-45827465-74667745-77469267- 16 tissues Vendors* 926813638-13641 14344-1434515277-1527815356- 1535718825-1882819049-1905319306-1930719370 mR As* 19706-1970820263-2026420345-2034820425-20426 Tissue RNA Library SEQ ID NOS: origin Source Name
20712-20713 20792-20797 22343-22346 22881-22886 26409-26410 27989-27990
Mixture of Various CTL021 1667-16684206-42084562-4568 5335-533653705919- 16 tissues Vendors* 59217256-72578434-84489984-998510588-10591 10870-10872109651214912714--1271515126-15129 mRNAs* 15530-1553115543-1554415576-155771665216894- 1689618371-1837218516-1851819049-1905319211- 1921219283-1929419401-1940219422-1943119706- 1970819736-1974219749-19751 198132028921155- 2115621208-2121022101-221072230922358-22359 2237322632-226332269023074--2307627544-27545 27593-2759427989-2799029332-2933730135-30140 adult BioChain CVX001 3048-4951-52144-145156180-181201221255-256 cervix 301329-330332395419-420425-427515-517524-525 530-531543-547551-552694-695716-720738-740832- 836901-904934-9351001-100310471185-11871259- 12651273-12791302-13031424-142914311453-1454 1456-1460147414831499-15021563-156415961667- 16681705-171418001866-18721887-18891899-1900 2015-2018203620622220-22272275-227623072375 2404-240624312471-247425162540-25412599-2603 2626-26422644-26512653-26582661-26732692-2694 271028022921-29302951-29552988-298930013090 3205-32073245-32463268-32693290-33033421-3422 3455-345634663507-35103512-35203526-35293531- 35343555-35583661-36703705-37063763-37653817 3949-39534122-41234160-41624221-42384469-4470 4599-46014663-46654692-46944762-47634785-4789 4795-48094819-482349575084-50855092-50945109- 51155124-512651335335-533653645483-54845581- 558356545766-57675932-59365973-59755980-5984 5997-60006011-602162846371-63766403-64046484 6486-64896624-662667796877-68816932-69386980 7192-71937209-72127216-72197279-72827294-7295 7350-73527534-75507576-75777601-760376097640- 76427671-76737678-76897693-76947701-77037728 7742-77437747-775177717778-77837787-77887794- 780178907924-79257976798880208025-80268070 8084-80858115-81218132-81418145-81568184-8186 8258-82628270-82768313-831483428347-83608368- 83708383-838984158422-84258430-84338466-8476 8504-85068543-855086888751-87538758-87598869 8917-89338982-89838986-899190219029-90359037 9072-90749196-920093059321-93239337-93389391- 93929427-942894449456-94589487-94889506-9509 9522-95259531-95349617-96269684-968797099729- 97319763-97679794-97959815-9820982698289854- 98569899-99019916-99209923-99249958-99619969- 99809989-99929997-1000910017-1001910038-10040 10175-1019610273-10274102771030610451-10453 10484-1048910498-1050310508-1051510523-10532 10539-1054210588-1059410873-1087510877-10878 10886-1088910901-109021097911013-1101711028- 11031 11147-1115611217-1121911252-1127311300- 113031131711345-1135011423-1142611431-11432 Tissue RNA Library SEQ D3 NOS: origin Source Name
11468-1147211476-1147811505-1150611538-11540 11544-1154611553-1155411584-115851160911715- 1171611731-1173411749-1175011761-1176311777- 1177811803-1180411835-118361184211873-11875 11886-1188911921-1193012011-1201312042-12044 12121-1212612206-1220812215-1222512280-12282 12371-1237712420-1242912470-1247112519-12540 12555-125591256412593-125961259812601-12602 12606-126091264012668-1266912682-1268312707- 1271112720-1272312760-1276212785-1278712805- 1280612829-1283012832-12834129051294312978- 1298112984-1298513077-1307913560-1356213584 13592-1359513606-1360813645-1365113761-13795 13954-1395614023-1402614058-1405914192-14195 14261-1426314304-1430514546-1454914575-14604 14628-1463514645-1464814684-1468514702-14704 1478414789-14791 14866-1487015025-1506915182- 1518315190-1519115243-1524815292-1529315326- 1532815406-1540715486-1549015545-1554615588- 1558915699-1570015855-1585715863-1586815880- 1588315961-1596315988-1599016110-1611216141- 1614316145-1614616226-1622716299-1630016484- 1648616494-1649516508-1651116542-1654416579- 1658216609-1661116636-1663716643-1664816652 16686-1669316828-1683116851-1685316899-16900 16934-1693716993-1699417038-1704117046-17057 17268-1727417330-1733217335-1733917366-17368 17441-1745017454-1745617525-1752717924-17926 17958-179631800718015-1801618029-1803018069- 1807218136-1813818400-1840218412-1842018423- 1842518533-1853418587-1861618644-1865018680- 1868118691-1869218750-1875618759-1876018767- 1876918771-187761878918802-1880418806-18807 18811-1881318825-1883418836-1883818842-18845 1885418857-1888018894-1889618899-1890318910 18915-1891818923-1893318936-189391894118944- 1894518947-1895018960-189691897518989-18990 19029-1903519045-1904819059-1906519068-19080 19084-190851909019096-1910119106-1911819142- 1914719154-191551915919209-192101922519273- 1927419306-19307193621937019373-1937919387- 1939419401-1940219415-1941719422-1943419441- 1944419454-1945519458-19461 19484-1948619488- 1949319515-1952119526-1953119580-1958219604- 1960719656-1965719667-1966819683-1968719693- 1969819727-1973519749-19751 1975619759-19767 19804-1981519921-1992519933-1993919953-19957 19962-1996319965-1996719969-1997119981 19995- 2001720029-2004320048-2005220064-2006520069- 2007320080-2009820106-20112201202013020151- 2015420161-2016420167-2017920182-2018820200- 2021420231-2024120243-202492025320257-20264 20279-202802028820299-2030720328-2033020345- 2035420393-204072041120415-2042020427-20431 20441-2044620452-2045520469-2047920514-20520 Tissue RNA Library SEQ ID NOS: origin Source Name
20524-2054520559-2056220569-2057820621-20624 20649-2066520676-2068120683-206852068820698- 2070720718-207342075320758-2076620792-20812 20819-2082120836-2084320858-208622087120897- 2090020922-2092620929-2093220938-2094320948- 2094920963-2098420992-2099420999-2100421021- 210232102621062-2106621069-2108721101-21104 21122-2112721141-2114221145-2115121171-21174 21194-2119621213-2121521227-2123221263-21280 21284-2129421297-2129821351-2135221410-21414 21428-214332144621463-2146521467-2146921480- 2148221485-2150021647-2165521891-2189421917- 2192121925-219282193921955-2195721967-21968 21971-2197822000-2200222007-2201522020-22025 22030-2203422040-2204122050-2205522074-22076 22090-2209122135-2213822152-221562216022165- 2216822176-2218422187-2219222195-2219822208- 2221122218-2222422261-2228922292-2230222310- 223112231822333-2234222347-2235022357-22359 22365-223712237322405-224102249522534-22539 22553-225602256922607-2260922622-2262422634- 2264322653-226542269022700-2270122737-22739 22741-2274222750-2275922768-2277022794-22796 22810-2281622833-2283422843-2284522854-22856 22904-2290622952-2295322955-2295722969-22970 22978-2297922991-2299423007-230212304623053- 2305523059-2306023064-230672307123080-23083 23117-2311923125-231362314123206-2320923212- 2321523242-2324423251-2325623329-2333723356- 2335723364-2337023382-2338523402-2341123415- 2341823428-2343023486-2348723489-2349023492- 2349323515-2351823532-2353923543-2354423555- 2356523600-2362423675-236812370423726-23730 237612377023806-2380923820-2383323836-23837 23839-2384223867-2387523878-2388023990-23992 24021-2402424035-2404024042-2404324126-24144 24147-2415825085-2509025334-2533725347-25358 25376-2537725383-2540126024-2602826209-26213 26225-26231262522626626270-2627226285-26290 26294-2630226316-263212632726341-2634426358 264312645326460-2646426469-2647026596-26601 26676-26677266872674226748-2675426788-26789 26819-2682626842-2684326853-2685426873-26875 26980-269842698827012-2701427040-2704327052 27067-2707027100-2710227173-2717427177-27178 27193-2720027269-2727027348-2735327478-27479 27510-2751127520-2752127544-2754527557-27559 27600-2760127635-2764127662-2767227729-27739 27755-2775627781-2778827790-2780427861-27864 27890-2789227940-279472797027981-2798227989- 2799028008-2801128045-2804728095-2809628099- 2810028108-2812128157-281582816028199-28203 28255-282612828628311-2831328573-2857828768- 2877429117-2912329278-2928329321 -2932629354- 2935729409-2941629938-2994929960-2996330141- Tissue RNA Library SEQ ID NOS: origin Source Name
3014230150-3015630184-3019930218-3022030233- 3023530240-3024230361-30368 diaphragm BioChain DIA002 5581-55837978-79838187-82008349-83509072-9074 9366-936811123-1112411745-117481654619800- 1980319972-199802013020437-2044020554-20557 20614-2061520676-2068022212-2221723252-23256 25340-253412537326876-26878 endothelia Stratagen EDT001 19-2084124-125156158-159164170-171180-181192 I cells 197-200227-230251-252255-256260301307-308315- 316319-323329-330358-359373-374395414-418425- 427461-465505515-519524-525534-535553-554557- 559576-578619-620635656706-707712716-722832- 836845-848880-885891913919-922981-9851004- 10051044-104510471074-10871092-10981101-1103 1123-11271158-11611203-12041235-12371240-1241 1259-12661268-128913071310-131113341414-1417 14311453-14541467-1469148314991569-15731618 16201667-16681678-16801705-17141721-17221778- 17791792-17941910-19131925-192719671971-1974 19771986-19872015-2018202220722202-22032272- 227623062404-240624092432-24352471-24742570- 25742597-260326252724-27262886-28872903-2904 2923-292429262934-29352939-29432975-29772983- 298630013006-30083078-307930903107-31083115 3226-322832413255-325932663276-32773304-3309 3336-333733653374-33773421-342234333448-3450 3455-34563464-34653477-34783504-35073518-3520 35253555-355836063624368636933781-37843789- 37943834-38353949-39534067-40724080-40824084- 40854122-41234160-41624213-42144244-42474343- 434644504469-44824486-44884526-452745424555- 45564562-45734581-45824600-46014612-46144651 4785-47894857-48614966-49684971-49745089-5090 5092-509451925335-53365384-53865572-55735580- 55835594-5602565857005714-57165802-58125820- 582358805885-58975907-59095939-59415980-5984 5986-59886140-61436166-61686186-618963016342- 63436403-64056551-65676585-65926627-66286657- 66616669-66706791-67946832-685068526932-6938 7056-70577209-72127216-72197285-729573747415- 74207447-74497536-75417557-75597580-75977601- 760576097615-76177620-76217630-76367643-7647 7654-76557657-76657671-76737681-76877693-7696 7701-770377297742-77437745-77517784-77867794- 7797782178907906-79127921-79257946-79487972- 79757978-79837990-7992799480028010-80128017- 80198043-804480708090-80938117-81218152-8159 8163-81738182-81838187-82008211-82138216-8220 8227-822982348247-82488253-825783428347-8360 8363-83648368-837084158449-84528497-84998504- 85058507-85098512-85138539-85428554-85558584- 85878607-86098688-86908754-87578760-87618777- 87798936-89388986-89919002-90049013-90209038- 90459072-90749084-90859196-92009260-92639306- 931193169321-93239331-93339337-933893829391- Tissue RNA Library SEQ ID NOS: origin Source Name
93929445-94489454-94559475-94769478-94819487- 94909522-95259531-95349546-954995589561-9566 9586-95899617-962696599664-966596729682-9683 9729-97319735-97429757-97589763-97679794-9795 9815-982098269828-98349851-98529854-98559896- 98979902-99049923-99249927-992899539960-9964 9993-999610033-1004010167-1019610273-10274 1027710281-103041030610326-1032810451-10453 10480-1048210508-1051510537-1053810595-10599 10603-1060810612-106231062710638-1063910881- 1088310901-1090210913-1091810936-1093810963- 1096511035-1103711123-11124111351114311217- 1121911295-1130511314-1131511339-1135011359- 1136911372-1139611405-1140711431-1143211434 11465-1146611468-1147411476-1147811495-11498 11538-115401154611579-11606116091169611711- 1171211715-1171611731-1173411737-1175011780- 1178411786-1178811803-1180411835-1183611840 1184211873-1187511901-1190511941-1194411947- 1195012002-1200512050-1205612112-1211412121- 1213712150-121511217312190-1219112237-12238 12256-1225712259-1226212285-123021236112374- 1237712398-1240112446-1245012470-1247112477 12486-1254012545-1254612555-1255912590-12592 12615-12618126371264012642-1264412666-12667 1267012707-1271112714-127151272312755-12762 12796-1279812805-1280612829-1283012978-12981 12997-1299913077-1307913576-1357913581-13583 13592-1359713603-1360813629-1363113645-13651 13796-1379713873-1387513918-1391913999-14003 14044-1404514054-1405614058-1405914130-14131 14141-1414214170-1417214175-14181 14192-14195 1420614216-1422614236-1423814261-1426314299- 143011434614498-145011460414607-1460914681- 1468214684-1468514714-147171478414802-14808 14866-1487015013-1502415071-1507415093-15095 15165-1517015172-1517315182-1518315190-15191 1522015222-1522415233-1523615243-1524815250- 15251 1525715277-1527815283-1528615290-15291 15298-1530315323-1532415336-1534015356-15357 15406-1540715412-1541415457-1545915530-15535 15545-1554615548-1555015563-1556415576-15577 1558515588-1558915614-1561615699-1570015855- 1585715869-1587015880-1588315890-1589515988- 1599016044-160461605916075-160791608216096- 1609916103-1610416110-1611216141-1614316168- 1617116174-16176162371623916371-1637216388- 1639116422-1642916461-1646616479-1648116501- 1650416508-1651116536-165391654516563-16573 16576-1657716583-1658516600-16601 16609-16611 16619-16621 16628-1663716642-166481665216675- 1668216702-1670516771-1677216804-1680716833- 1684216851-1685316873-1687416894-1689616934- 1693717070-17071 1711417120-1712217131-17132 1724217284-17291 17297-1730617330-1733217335- Tissue RNA Library SEQ ID NOS: origin Source Name
173391739817454-1745617571-1757417937-17938 17958-1796217964-1796818015-1801618029-18030 18055-1806018136-1813818376-1837718407-18418 18421-1842518427-1842918492-1849418500-18501 18516-185181852718533-1853418536-1853918617 18624-1863718644-1865018655-1865818671-18677 18691-1869318717-1872618730-1873218745-18756 18759-1876118771-1878018784-1878918796-18807 18811-1881318822-1882818855-1888018882-18888 18894-1889618899-1890718919-1892118925-18939 189411894618964-1896918975-1897718989-18990 189961901219025-1903519045-190481905519057- 1906519071-19083190901913419138-1914019163- 1920119207-1921019213-192201922419251-19252 19254-1925719260-1926319266-192671927319295- 1929619306-1930719316-1931719345-1934919351- 1936019362-1936419368-1937019373-1937419380- 1938919401-1940219415-1941719422-1943419441- 1944419458-1946319484-194861950319514-19521 19530-19531 1953619560-1956219566-1957319586 19598-196011962519656-1965719659-1966119667- 1966819670-19671196881969319698-1969919727- 1974819759-197631977219800-1980319810-19815 19819-1982219855-1985619915-1991819921-19929 19933-1994619953-1996119965-1996719972-19981 20015-2001720026-2002720029-2004720066-20068 20072-2007920087-2009420099-2010220106-20110 20114-2012020122-201272013020133-2013620146- 2015420161-2016420167-2017920182-2019420200- 2021820222-2022520235-2024120250-2025220257- 2026220265-2027020274-2027820281-2028220289 20328-2033020345-203542036020393-2040020406- 204072041420418-2042620432-2045120456-20474 20476-2047920482-204942050120505-2051020521- 2053520542-2054520548-2055720563-2056720575- 2057820614-2061920625-2063420644-2064820653- 2067120688-2069220698-2070720710-2071120718- 2073220752-2075320758-207672078820806-20821 20824-2084320853-2086320879-2088020897-20900 20924-2092520927-2092820948-2094920955-20962 20973-2098821005-210082101521026-2103121060- 2106621069-2107121076-2108721101-2110421122- 2112721137-2114021145-2115121171-2117421176- 2118021197-2119821202-2120721225-2122621233- 2123521241-2124721256-2127421277-2128021284- 2129821301-2130321351-2135421359-2136021377- 2139721434-2145021467-2146921480-2148221495- 2150021516-2153021647-2165521718-2172421729- 2173321877-2188021892-2189421911-2191221915- 21921219242193921951 -2196621978-2198722000- 2200622019-2202522040-220442204722060-22076 22080-2208322092-2209422117-2211922128-22138 22141-2214322160-2216422169-2217022187-22192 22195-2219822200-2220322208-2221122225-22226 22251-2225222265-2228922292-222992230922314- Tissue RNA Library SEQ ID NOS: origin Source Name
2231722319-2233222336-2234222349-2235022358- 2235922365-223712237322377-2238022383-22388 22434-2243522455-224652249522531-2253922553- 2255822571-2258122607-2260922622-2262422634- 2263622644-2265122653-2265422663-2266722678- 226792269022700-2270122703-2270622726-22727 22730-2273122750-2276222768-2277022781-22782 22794-2279622810-2281622821-2282222852-22856 22870-2287422904-2290622909-2291122925-22938 22948-2295322955-2296122974-2297622978-22979 22991-2299423007-2302123046-2305223057-23060 23063-2306723070-230712308423098-2310123112- 2311623120-2313623138-2314123201-2320223218- 2322223237-2324123251-2325623258-2326323279- 232812333823343-2334423358-2336023379-23390 23415-2341823448-2345023470-2347223475-23479 23486-2348723489-2349023492-2349323499-23505 23508-2351123526-2353123555-2356523673-23674 23700-237012370423718-2371923726-2373623760- 2376123765-2376723771-2378023793-2379923806- 2380923827-2383323839-238482388223890-23892 23941-239562395923990-2399223999-2400124014- 240182402524029-2403324035-240402405624120- 2412924772-2477425085-2509025213-2521925293- 2529725307-2530825319-2532125334-2533525338- 2534125355-2535825370-2537525416-2541725534- 2553626024-2602826196-2619926205-2620626237- 2623926248-262522626626273-2627926285-26290 26316-2632126327263292633726339-2634026361- 2636526373-2637426409-2641126460-2646326469- 2647026559-2656026596-2660126678-2668026719- 26726267422674626748-2675626804-2680626843 26853-2685726860-2686226876-2687926980-26984 2703527040-270432705227067-2707027074-27077 27122-2712627133-2713427150-2715127173-27174 27193-2720227206-272132721827240-2725127254- 2726527269-2727027273-2727427281-2728727304- 2730527372-2737527439-2744027493-2749527501- 2750727510-2751127520-2752127544-2754527548- 2754927574-2758427586-2758727600-2760627636- 2764127649-2765427662-2767227698-2770627726- 2773927755-2775627814-2781827861-2786427890- 2789227896-2792727948-2796927989-2799028045- 2804728050-2805928095-2809628099-2810028105- 2810728138-2814028142-281562818628263-28268 2828628290-2829228311-2831328315-2831628361- 283622842428426-2842828468-2848428573-28578 28732-2874229117-2912329278-2928329304-29314 29317-2932029328-2933729339-293402935829406- 2941629418-2941929432-2943529614-2962029718- 2972129950-2995329960-2996130141-3014230150- 301563020530221-3022330361-30368
Genomic Genomic EPM001 880-882 1 142- 1 152 3281 -3282 4007-4052 4177 4180- clones DNA 4190 4899-4907 491 1 -4917 4921 -4922 4928 4945 4997- from the from 5023 5031-5035 5037 5042-5046 5202-5239 5241 5361 Tissue RNA Library SEQ ID NOS: origin Source Name short arm Genetic 6796-6800701970267028-70297031-70357041-7055 of Research 7058-70917094-709971737175-71777371-73727374 chromoso 7392-73957407-74147421-74257444-74457469-7472 me 8 74758466-84768710-87158800-88138820-88418850- 88538855-88598862-88688877-89029047-90489081- 90859088-90999145-91499152-918191939205-9208 9213-92299231 -92349239-92599269-92769291-9299 9314-93159962-996410320-1032510329-1033110334- 1034210443-1044410679-1068310723-1072410726- , 10741 10748-1075010755-1075710759-1076510775- 1077710822-1083010834-1086710879-1088011058- 1106211416-1141811890-1189112218-1222013000- 1300413127-1312813130-1313613142-1317413176- 1319013307-1331113316-1331813324-1333713425- 134951367713681-1368213695-1369813706-13707 13712-1372113723-1372713730-1375113815-13821 138231383113854-1386014876-1487714962-14977 15120-1512315593-1559915705-157081571015718- 15741 15773-1577415780-1578815815-1581715826 15828-1583115836-1583715842-1584515915-15916 15939-1594315946-1595615964-1597615978-15980 15991-1599616001-1601816047-1605216258-16268 1723217366-173681757917593-1759817602-17609 17637-1766717673-1768417688-1770017703-17707 17722-1772417734-1778917791-1779917805-17806 17818-1782417828-1782917836-1784917857-17861 17864-1786817871-1787317876-1787917881-17885 17888-1789417902-1790417909-1791618008-18012 18024-1802818066-1806818075-1808918139-18151 18155-1815618161-1816218166-1816818186-18187 18197-1820318207-1821418216-182181822618230- 1823918242-1824518247-1825318255-1826318275- 1827718286-1828818291-1829418308-1832918335- 1834918353-1835918365-1837018380-1839218398 21304-2132521501-2151521534-2155321557-21563 21568-2164621656-2171121718-217242172821738- 2174321748-2175121788-2179921863-2187622523- 2253322582-2258822596-2260222605-2260622610- 2261722658-2266022817-2281823029-2304523532- 2353923993-2399824183-2419624199-2422524227- 2426824274-2431224319-2432424333-2438724397- 2447024473-2449024607-2463124646-2466524681- 2470224722-2477024775-2481624820-2486224869- 2492324925-2495124964-2497724985-2506025093- 2511125114-2514625158-2517025178-2521225220- 2522425228-2524625249-2525325359-2536325366- 2536725450-2547025475-2549425506-2551425539- 2556125572-2559325598-2560325633-2565625660- 2567925690-2580025807-2584725853-2587525878- 2588025883-2589625906-2595125957-2596525969- 2601826050-2605326077-2608626090-2612026127- 2612926133-2614326157-2618426189-2619426219- 2622026237-2623926629-2665526691-2669427677- 2768827714-2771727809-2781128352-2835728489- 2851028514-2854828556-2857228579-2862728635- Tissue RNA Library SEQ ID NOS: origin Source Name
28719 28732-28799 28809-28824 28831-2883628841-
2889628901-2895828964-2897229012-2903829047-
2904929071-2908129087-2910129104-2910729116
29128-2914129147-■2915129154--2915629159-29168
29179- 2918629221-■2924129290--292952932729368-
2936929420-2942529446-29450294612946629468-
2947229486-2948729492-2949829501-2954629551-
2956829583-2960429606-2961029614-2962929631-
2963729645-2965429660-2967829686-2970329708-
2971429722-2973929741-2974229746-2983629842-
298712987629879-2991129919-2993529964-29968
29980- 2998129986-2999429998-3000130022-30031
30036-•3006430084-•3010030109--3013430167-30177
30179- 3018230236-•3023930253-■3027830281-30295
30297- 3030030310-•3031730321-■3032730330-30337
30342- 3034730352-30360
Genomic Genomic EPM003 1604-16074008-40364049-40514180-41904923-4928 clones DNA 52425326-53307067-708770937175-71777426-7442 from the from 74758807-88138822-88358839-88468850-88538855- short arm Genetic 885888628870-887388768892-88989088-90999152- of Research 91819205-92089226-92299239-925792599294-9296 chromoso 9301-93041034310347-1035010443-1044410726- me 8 1072710759-1076410775-1077710842-1086713000- 1300413307-133081334113422-1342313425-13495 13730-1373513737-137501382314981-1498315110- 1511115120-1512315718-1572515815-1581715827- 1583315939-1594315966-1596815991-1599616016- 1601817606-17608177011783617857-1787417876- 1788717891-1789418097-1813418145-1815118155- 181561818418255-1826318291-1829218335-18340 1835421549-2155321573-2158621660-2171121728 21788-2179921863-2187622525-2253022596-22598 22605-2260622658-2266024187-241932422724358- 2436924436-2443824473-2448024607-2460924722- 2472624749-2477124795-2481624869-2487024908- 249232496325017-2504825052-2506025078-25084 25091-2510025114-2513925147-2517025187-25192 25247-2524825359-2536325461-2547025489-25494 25515-2552125539-2555025572-255932562325633- 2565025676-2567925728-2573225741-2578225883- 2588925901-2590225906-2591125957-2595925969- 2601626059-2606726136-2613826150-2615226157- 2616426629-2665327943-2794728522-2852428533- 285392857128607-2861428646-2865028709-28719 2874828754-2876028768-2878128809-2881428822- 2882428882-2888728914-2891628933-2893529012 29031-2903829071-2908229087-2909529104-29107 29116-2914129154-291582932729446-2945029492- 2949429590-2960529614-296202963029645-29654 29663-2967829686-2970329741-2974229763-29836 29842-298692987929902-2991129964-2996830002- 3000330048-3005230084-3009330118-3013430145- 3014930167-3017730179-3018230201-3020430236- 3023930254-3025530275-3028130343-30347
Genomic Genomic EPM004 3281-32824924-492849454997-50205043-50445243- Tissue RNA Library SEQ ID NOS: origin Source Name clones DNA 52457061-70647067-708771677170-71777426-7442 from the from 7474-74758466-84768807-88138822-88388877-8902 short arm Genetic 9047-90489152-91819217-92259231-92339269-9286 of Research 9291-92939314-931510443-1044410446-1044810679- chromoso 1068310775-1077710842-1086713000-1300413145 me 8 13177-131781318913309-1331113328-1333013424- 1349513497-1349813719-1372113730-1373613815- 138181382313854-138601583415836-1583715842- 1584615946-1595015966-1596816006-1601817232 17585-1758617650-1765217674-1767617748-17751 17763-1777017850-1785117888-1792118075-18080 18097-1813418139-1814418161-1816218264-18268 18275-1827718286-1828818300-1830718323-18333 18353-1835418361-183701840419093-1909521549- 215532156321573-2158621661-217112172821780- 2187622523-2253022586-2258822605-2260623820- 2382623993-2399824187-2419324260-2426824333- 2434024358-2436924426-2443524449-2447024473- 2448024610-2461424681-2469324703-2471524757- 2477024775-2477924794-2480124827-2483724846- 248542495125005-2502625052-2506025096-25100 25112-2511325158-2523825247-2525625450-25460 25471-2547425515-2552125539-2554125551-25561 25673-2567925690-2570225716-2571825737-25782 25796-2580025807-2582025836-2584725853-25865 25901-2590225986-2601626077-2608626090-26099 26116-261202621626273-2627926629-2665327677- 2768828531-2853228540-2854728558-2856428617- 2862728695-2869828709-2872728768-2878128809- 2882128831-2883628841-2884728872-2887428901- 2890928933-289352894329031-2903829159-29170 29179-291972920329221-2923329290-2929529420- 2942529481-2948529488-2949129508-2951829529- 2953429551-2956329587-2958929631-2963729763- 2983629842-2986929872-2987529885-2989129936- 2993729980-2998129991-2999430059-3006030084- 3008730109-3011730179-3018230201-3020430271- 30274 esophagus BioChain ESO002 4581-45824810-48115335-53368466-84769475-9476 15588-155891665218781-1878319943-1994621277- 212802684327100-271023020530233-30235 fetal brain Clontech FBR001 202-203847-8481097-10981259-12621721-17222404- 24062951-29555276-527858027902-79038377-8379 9196-9200944398289969-998010273-1027410326- 103281087610976-1097811024-1102511476-11478 11731-1173411803-1180412127-1212812150-12151 13107-1311713581-135831460417366-1736817455- 1745618627-1862818964-189691901819211-19212 1936219387-1938919401-1940220328-2033020345- 2034820554-2055721256-2126621377-2139721434- 214392197822141-2214322200-2220322637-22643 22899-22900232222370923893-2390225416-25417 26307-2630926329268312711327245-2724627386- 2738927976-279822818630141-30142 fetal brain Clontech FBR004 1667-166819902114-2115238948185980-59847542- Tissue RNA Library SEQ ID NOS: origin Source Name
75507699-77007757-77597986- 79878274-82768307 8588-85979049-90629862-986310480-1048211018 11308-1131012581-1258215397-1539819074-19080 19306-1930719800-1980320087-2009420104-20105 2024120415-2041720491-2049420563-2056720607- 2061220698-207072086520897- 2090021105-21111 2193922087-2208822144-2215122625-2262823007- 2302123526-235312371025347-2535425534-25536 2632926347-2634826383-263852684328148-28156 2842429954-29955 fetal brain Clontech FBR006 3-4168192197-200240-250324-325329-330362373- 374464-465532-535553-554576-578613-614713-715 847-848912927934-935949-9501044-10451071 1097- 10981203-12041235-12371273-127913041411-1413 1488-14891569-15701591-15921678-16871705-1714 1923-192419772023-2029214522282231-22532259- 2264235623752396-24002404-240624312437-2439 24752525-25282599-26032656-26582663-26652707- 27162720-27322734-27452770-27722808-28112871- 28732889-28912931-29352951-2955300130393080- 30813105-31063205-320732133261-326333773477- 347835073512-35143555-355835963671-36733683 3687-3691369337083711-37123729-37303781-3784 38093939-39413949-39534055-406140654091-4100 4122-41264137-41394209-421045424562-45684574- 4576466746734683-46844720-47254765-47734795- 48094845-48514854-48564870-487149484964-4965 49705136-5137513952465251-52525291-52945392 5532-55335557-55605567-55685594-56025744-5747 6011-602161376155-61616209-62116217-62226378- 63886393-63956406-64106452-64536488-64906513- 65156542-65436669-66706674-66756775-67787194- 71977220-72277236-72377264-72667350-73527364- 73657426-74427452-74607482-75177557-75597580- 75977604-76057630-76367657-76597695-76967745- 77467778-77837787-77887898-7900794679577986- 798779938013-801680798137-81418152-81568162- 81738187-82008204-82058211-82138230-82338247- 82498263-82658301-83108313-83148320-83228335- 83368347-83488351-83608371-83748383-83898420- 84218426-84288457-84588461-84658497-84998506 8512-85138588-85978607-860986888733-87358758- 87598762-87668919-89338936-89458974-89778982- 89838998-90049029-90309043-904590689306-9311 9380-93819510-95189529-953195859603-96049729- 97319763-97679799-98009808-98129829-98329929- 99359958-99599969-99809989-99929997-10009 10015-1001610033-1003710449-1045310477-10478 1048310513-1051810523-1053010537-1053810603- 1060810638-1063910780-1078210901-1090210931- 10933109651102611081 11123-111241131711345- 1135011465-1147211476-11478115771167211711- 1171211731-1173411739-1174011803-1180411934 12102-1211012117-1211812131-1213212202-12208 12215-1221712226-1222812333-1233412374-12377 Tissue RNA Library SEQ ID NOS: origin Source Name
12403-1240512437-1243812441-1244512451-12453 12486-12518125981272312894-1289713503-13512 13529-1353513585-1358813796-1379713838-13844 1389813994-1399714023-1402614044-1404514061- 1406314127-1412814137-1413814173-1417414209- 1421514236-1423814261-1426314349-1435114391 14410-1441214605-146091462614681-1468214684- 1468914694-1470114709-1471314988-1499314999- 15001 15019-1502415253-1525515257-1525915300- 1530115358-1535915381-1538515397-1539815406- 1540715454-1546115491-155251556715855-15857 15861-1586615890-1589515902-1590316022-16023 16060-1606116110-1611216115-1611716123-16128 16130-1613216219-1622016365-1636616434-16437 16465-1646616470-1647216496-1649716553-16556 16570-16573165991662316628-1663116636-16637 16643-16651 16714-1672316836-1684216860-16870 16903-1690916938-1694116962-1696516996-16999 17002-1700917018-1702317026-1702817067-17069 17105-171061711517120-1712217264-1726717286- 1730617359-1736017372-1737417571-1757417685- 1769617857-1786117927-1792917949-1795217958- 1796218001-1800418038-1804218136-1813818411 18427-1842918500-1850118516-1851818536-18537 18625-1862618644-1865018655-1866018691-18692 18719-1872618730-1873218771-187721878918806- 1880718829-1883418857-1888018882-1888818925- 1893318944-1894518947-1895018955-1895918964- 1896919005-1900919029-1903519045-1904819055 19071-1907319081-1908319096-1910119138-19140 19159-1916119211-1921219266-192671927419295- 192961935019362-1936419375-1937919385-19389 19422-1943119442-1944419447-1944819526-19529 19548-1955319566-1957319598-1960219659-19661 19670-19671 1969319706-1970819733-1973519814- 1981819933-1993919948-1995019953-1996319972- 1998020029-2004320069-2007120095-2010220120- 2019420198-2022120235-2024020244-2025220265- 2027020274-2027820321-203242041420437-20440 20447-2045120456-2047120476-2047920505-20509 2054720559-2056820607-2061220621-2062420631- 2063420646-2064820676-2068120686-2068720698- 2070720718-2072520727-2073420747-2075120754- 2075720801-2080520827-208432086520929-20932 20952-2095420957-2096221062-2106621112-21114 21141-2114221157-2116921176-2118021197-21198 21213-2121521229-2123221253-2125521263-21266 21326-2133321353-2135421377-2139721410-21414 21454-2145721480-2148221495-2150021554-21556 21647-2165521712-2171721729-2173321744-21747 21881-218852189121899-2190421911-2191221915- 2192421951-2197421978-2198221995-2200222007- 2201822020-2202922035-2203622045-2204622056- 2205922062-2206922074-2207622080-2208322092- 2209422108-2211422128-2213422141-2214322152-
Figure imgf000161_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
81368152-81598182-81838187-82008227-82328253- 82578335-83368347-834883708404-84068487-8488 8497-84998502-85038607-86098689-86908762-8766 8917-89189013-901790369454-94589475-94769506- 950995319535-95379586-95929594-96029617-9626 9664-96659682-96839729-97319735-97389743-9754 9757-97589784-9785982698289851-98529854-9855 9896-98979909-99129927-99289956-99579969-9980 10017-100191030610451-1045310477-1047810520- 1052210603-1060610638-1063910780-1078210881- 1088310899-1090210913-1091410919-1092211135 1114311212-1121611228-1122911306-1130711345- 1135011538-1154011603-1160611680-1169611711- 1171311731-1173411739-1174011761-1176311772- 1177411803-1180411843-1185511873-1187512040- 1204112150-1215112194-1219612206-1220812430- 1243112455-124591263712755-1275912803-12804 12898-1289912997-1299913552-1355513585-13588 13592-1359513796-1379713918-1391913999-14003 14046-1404714123-1412814137-141381460414709- 147131478414822-1483815019-1502415182-15183 15233-1523615257-1525915290-1529115298-15299 15326-1532815486-1548915496-1552515576-15577 15588-1558915623-1562615867-1586816115-16117 16174-1617616371-1637216387-1639116576-16577 16614-166181662316643-166481679916851-16853 16894-1689617026-1702817038-1704117120-17122 17131-17132172421745417958-1796218001-18003 18136-1813818412-1841818427-184291862418655- 1865818750-187561878918806-1880718842-18845 18894-1889618904-1890518919-1892118947-18950 18964-1896918976-1897718989-1899018993-18996 19062-1906519068-1907019207-1920819266-19267 19308-1930919316-1934219345-193501936219380- 1938919407-1941119422-1943419441 19458-19460 1956619659-1966119667-1966819670-1967119699- 1970519709-1971019759-1976319810-1981319933- 19937199391995019962-1996319965-1996719972- 1998020099-2010220122-201272013020137-20145 20161-2016420167-2017120198-2020720215-20218 20231-202342024120363-2036520415-2041720452- 2045520469-2047120497-2049820514-2052020524- 2053520537-2054120575-2057820607-2061220625- 2062820631-2063420646-206482068120816-20818 20836-208432086520927-2092821005-2100821076- 2108721101-2110421137-2114021153-2115421171- 2117421229-2123221253-2125521284-2129421351- 2135221410-2141421447-2145021454-2145721467- 2146921485-2149121495-2150021532-2153321712- 2172421729-2173321877-2188021903-2190421924 21936-219382197821983-2198722000-2200222007- 2201522026-2202922042-2204622101-2210722135- 2213822152-2215622165-2216822200-2220322218- 2222622243-2224522284-2228922292-2229922309 22349-2235022358-2235922381-2238222394-22398
Figure imgf000163_0001
Figure imgf000164_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
21027- ■2103121275 -2127621516 •2152821967- •21968 21983 •2198722007-220152202 •2202522056- 22059 22333-•223352237322622-2262422637-22643 22669- 2267022700-2270122728-2272922887-22891 23059- 230602306323229-•2323323470-2347223903- 23904 25334-■2533525370-2537227269--2727027304- •27305 27386-■2738927570-2757128008--2801128234- ■28254 28311-2831329409-2941630085 3008730150- ■30156 fetal lung Invitroge FLG003 255-256412-413461-463534-535619-620785-786981- n 9851273-127913371678-16801792-17942207-2208 2341-23432471-24742774-27752779-27922874-2876 2923-29242936-29383326-333137084531-45334685- 46904795-48095337-53416235-62376530-65316932- 69387258-72637536-75417660-76657745-77467901 8017-80198074-80768113-81148152-81568318-8322 8349-83508422-84258434-84528457-84588502-8503 86888736-87379022-90259100-91449306-93119472- 947495849733-97349759-97629794-979598289833- 98439857-98619864-98669909-99129923-99249958- 995910015-100161027710523-1053011345-11350 11423-1142611431-114321157711610-1161111901- 1190512050-1205612455-1246412565-1256812581- 125821259812760-1276212978-1298113585-13588 13931-1393314236-1423814261-142631460414624- 1462515412-1541415558-1556015588-1558915902- 1590316145-1614616345-1634916636-1663716642 16851-1685317024-1702517046-170571745417934- 1793618015-1801618405-1840618655-1865818796 18899-189031891018915-1891819138-1914019159 19308-1930919375-1937919401-1940219623-19624 19800-1980319943-199461998120053-2006320103- 2010520222-2022520231-2025320263-2026420289 20340-2034120401-2040520412-2041320415-20417 20456-204682050120537-2054120607-2061220646- 2064820698-207072072620733-2073420827-20835 2086320957-2096220977-209842101521076-21087 21097-2110021157-2117421248-2125221277-21280 21284-2129421403-2140421428-2143321492-21494 21911-2191221939-2194622000-2200222007-22015 22056-2205922062-2206922120-2212722158-22160 22200-2220322265-2227622336-2234622358-22364 22561-2256822571-2258122708-2270922843-22845 2297723007-2302123098-2310123252-2325623506- 2350723532-2353923599-2362823827-2383324021- 2402424120-2412525373-2537525410-2541326237- 2623926294-263022632726350-263522635726359- 2636026383-2638526479-2654526607-2660926698- 266992674226860-2686226876-268792722527441- 2744327544-2754527790-2779627812-2781327845- 278522797028099-2810028133-2813728142-28145 28426-2842828768-2877429117-2912329950-29953 30195-30199 fetal lung Clontech FLG004 1569-15706005-60066403-640419074-1908019706- 1970820676-2068021157-2116923098-2310123543- 2354426383-26385 Tissue RNA Library SEQ ID NOS: origin Source Name fetal liver- Columbi FLS001 1-6810-1621-232530-3335-3848-5474-7981-120 spleen a 122-123126-129156158-159164167170-171177-178
Universit 180-181 185-192195-205215-224226-235237-239251- y 253255-256260273-284286-291307-309319-325328 331-332373-374395398-399404-405412-420461-465 505515-517524-525528546-547551-556576-578582- 584619-620626-630652685-688694-695697-699708- 711716-722729738-748754-757763771-772777-778 785-786796-797832-836847-848852-855864-867880- 882891-898900-911913919-922934-935969-971979- 9851001-10031005102810471067-10711080-1081 1092-10941097-10981109-11101114-11151123-1127 11291140-11411158-11611163-11641169-11791240- 1241 1259-12621273-12791295-12971302-13031307 1310-131313681372-13731388-14011424-14291432- 14371440-14471453-14541467-1469147814831544- 1546155116181658-16601667-166816751678-1687 1705-17141721-17221725-17261732-17371747-1749 177317771792-179418401879-18801912-19131915- 19161921-19241961-196219771986-19872015-2018 2030-20342058-20602063-20662114-21152201-2203 2222-22262272-22762284-229923062354-23552362- 23632383-23882393-23952404-2406243124392457 2471-24742517-25182540-25412561-25622570-2574 2599-26032649-265026662687-26882695-26982706 274528062808-28112814-28152871-28732889-2891 2918-29192923-292429262934-2935296729732975- 29772983-29863002-30033006-30083026-30283078- 307930903101-31043107-31083118-31193144-3146 3151-31543182-31873195-31973205-320732643276- 327732903304-3308334533523366-33693374-3377 3414-341734333435-34453464-34653468-34713477- 34783512-35203525-35293565-358836763686-3691 369337083789-37943820-38213834-38353840-3852 3879-38803939-39413949-39534084-40854091-4095 4101-41164141-41464151-41554160-416242114220- 42384244-424742554261-42654273-42754343-4346 4386-43904441-44434469-44824486-44874489-4493 4512-45164531-4533454245464555-45564562-4573 4581-45824623-46394660-466246664668-46724712- 47134720-47274738-47504780-47814785-47894795- 480948414857-48614897-48984967-49684980-4981 4984-49854989-49915057-5059506650825109-5115 51395151-51795190-51915269-52715279-52835335- 534153605362-53665368-537553815387-53895480- 54845487-548854905493-54965512-55185530-5531 5557-55605567-55685572-557355805591-55925658 5703-57045717-57185768-57725802-58045827-5828 58465907-59095919-59215939-59415943-59455967- 59695973-59755980-59845997-60006011-60216084 6135-61366166-61686226-62346291-62966335-6336 63776403-64046439-64406445-64496455-64586461- 64836488-64896503-65106523-65266533-65356540- 65436551-65676585-65926669-66726674-66756709- 67136745-67466773-67786791-679468526896-6899 Tissue RNA Library SEQ ID NOS: origin Source Name
6959-69607001-7007701172037209-72127216-7219 7245-72487256-72637267-72687275-72767296-7298 7368-73697447-74497534-75417551-75597572-7577 7604-760576087613-76177620-76217630-76367648- 76537660-76657671-76737681-76877693-76967698- 77037726-77277742-77437745-77517763-77757778- 77887794-77977802-78057815-78217891-78957898- 79017904-79157921-79257940-79457947-79487957 7972-79757978-798579887990-7992799479988002 8010-80128069-80738077-80798084-80858087-8093 8113-81148117-81218132-81418145-81568163-8173 8187-82008203-82058211-82138227-82298242-8248 8253-82578297-830683108313-83148337-83428347- 83508365-83668368-8369839084158422-84258434- 84488461-84648466-84768487-84898500-85018507- 85088512-85138537-85508554-85558588-86008607- 86098612-86168688-86908698-87008724-87328754- 87668768-87708777-87798917-89338939-89458978- 898389979002-90049007-90089013-90209028-9036 9038-90469049-90679072-90789196-92009264-9265 9267-92689306-931193169321-93239326-93289337- 93389340-93429353-93579362-93639366-93689375- 937693829391-93929395-93969420-94239427-9428 94449446-94489454-945894609472-947694789486 9510-95189522-95259531-953795589561-95629585- 95929603-96049609-96109617-96269664-96659667 96769682-96839726-97279729-97319735-97449757- 97679780-97839794-97959813-982098269828-9832 9841-98429851-98529854-98559857-98619896-9897 9899-99089913-99209923-99249929-993599539956- 99619969-998810011-1001210017-1001910025-10026 10033-1003710045-1012610164-1017210263-10274 10277-10278103061031210320-1032310449-10457 10470-1047610488-104901049410498-1050310508- 1051210519-1053210543-1055010588-1059410600- 1060110603-1060810612-1061410628-1063010638- 1064410873-1087510881-1088310886-1088910891- 1089310895-1089810901-1090210904-1090510907- 1091010913-1091410931-1093310963-1096410966 1097210976-1097810980-109851103811042-11044 11066-1106811071-1107511078-1112111123-11125 11128-1114211208-1120911212-1121911222-11229 11250-1127511281-1128211289-1129011295-11301 11306-1131011314-1131511339-1135311359-11369 11399-1140711423-1142611431-1143211460-11464 11468-1147211476-1147811493-1149511501-11506 11513-1151411538-1154511561-1156211565-11567 11574-1157611582-1158311586-116061160911612- 1163911644-1165011658-116591167211700-11707 11711-1171211731-1173411739-1174011745-11750 11761-1176311766-1176811775-1177611780-11782 11803-1180811811-1183211835-118361184211873- 1187511890-11891 11921-1193011939-1194011947- 1195012011-1201312027-1203912050-1205612121- 121261214012150-12151 12186-1218912192-12196 Tissue RNA Library SEQ ID NOS: origin Source Name
12202-12208 12226-1222812241 1225312256-12258 12280-1228412331-1233212371 123971242912436- 1243812446-1245012455-1245912470-1247112486- 1254012545-1254612562-1256312565-1256812590- 1259212601-1260212619-1262212637-1263812640 12642-1264412666-1266712673-1267812701-12705 1272312725- 1272712755-1275912792-1279312796- 1279912803-1280412824-1282712894-1289712963- 1296412968- 1296912978-12981 12984-1298512997- 1299913071-1307413077-1307913107-1311713512 13556-1355913567-1356813576-1357913581-13583 13592-1359513598-1360013609-1361613632-13633 13638-1364113652-1365913668-1367213761-13795 13838-138421386113867-1386813873-1387513898- 138991390713925-1392613954- 1395613999-14004 14023-1402614036-1403714044-.1404514048-14056 1410314127-1412814143-1416414170-1418114192- 1419514209-1421514236-142381425714261-14263 14268-1426914347-1434814389--1439014463-14497 14546-145491460414607-1460914628-1463514650- 14651 14661-1466514676-1467814681-1468214684- 1468914694-1470114709-14717147841479214818- 1482014866-1487015011-1506915093-1509615118- 1511915132-1515715174-1518315190-1519115197- 1520015222-1522415233-1523615243-1524815257- 1525915277-1527815283-1528615290-1529115298- 1530315321-1532215326-1532815336-1534015358- 1535915381-1538815406-1541415457-1545915530- 1553515545-1554615551-1555215558-1556015563- 1556415576-1557715588-1558915614-1562015629- 1563215661-1566315699-1570015855-1585715867- 158701587815880-1588515890-1589515969-15974 15988-1599016022-1602316041- 1604616053-16058 16060-160611606716115-1611716141-1614316163- 1616416168-1617116174-1617616200-1620116206- 1620816216-1621716233-16237162391625016255- 1626816279-1628016284-1629516304-1630516310- 1631416345-1634916365-1636616371-1637216399- 1640216418-1642916434-1643716440-1644816461- 1646416470-1647216479-1648116484-1648616489- 1649316508-1651116517-1651916536-1653816545 16550-1655216558-1656216579- 1658516601-16611 16619-16621 1662316636-1663716642-1664816652- 1665516686-1669316702-1670516714-1671516727- 1672916802- 1680316836-1684216851-1685316860- 1687016873-1687416894-168961691016913-16915 16918-1691916934-1693916949- 1695316967-16970 17026-1702817038-1704117067- 1706917078-17079 1711417120-1712217233-1723417237-1724217268- 1727417286-1729117330-1733217335-1735817366- 1736817441-1745017454-1745617463-1746417538- 1756517571-1757417581-1758217855-1786117934- 179361795517958-1796217995-1799718001-18003 18015-1801618018-1802318029-1803018038-18042 18046-1804818097-1813418136-1813818371-18372 Tissue RNA Library SEQ DD NOS: origin Source Name
18374 1837518400 -1840218405 - 18418 18421 -18425
18427 1842918495185191852718535- 18537 18577-
18578 18587-1861718621-1863718642-18660 18662
18668 1867718691 1869318717 18726 18730-18732
18738 18756187591876118771 18780 18784-18786
18789 -18793187961880718811 18813 18822-18834
18836 -1885518857 -1888018882 18888 18897-18905
18910 18915-1892118923-1893918941 18944-18945
18947 -18950189551897318975 1 8977 18986-18988
18993 1899619005 -190091901219018 19029- 19035
19045 190481905519057-1905819062- 19065 19071-
19085 19096-19101191321913419138- 19140 19142-
19148 19153-191551915919202- 19210 19213-19220
19224 -1922719251 -1925719260 -19262 19266-19267
19271 -1927219274 -1927919283 19296 19306-19309
19316 -193171934319350-1936419368-19370 19372
19375 -1938419387 -19402194071941 1 19413-19417
19422 -1943119438 -1944419454.19455 19461 19467-
19487 1950319512- 1952119526- 19529 19536 19548-
19553 19560-19562 19564-1957819583- 19586 19604-
19607 19609-19615 19617-1961919625 19627 19659-
19661 19663-19668 19670-1971019727-19748 19757-
19763 1977219800- 1981219814- -19822 19855-19856
19915 -1992919933 -199461995019953-19957 19962-
19963 19965-19967 19972-1998019995-20017 20026-
20047 20072-20080 20087-2009420099-20103 20106-
20112 20114-20120 20122-2013220146-20148 20151-
20154 20157-20181 20189-2021420226-20230 20235-
20241 20244-20253 20257-2027820281 -20282 20286-
20296 2029920321- 2032420328- -20330 20345-20354
20366- -2036820393 -204072041120415-20440 20447-
20451 20456-20474 20476-2047920482-20494 20497-
20498 20502-20503 20505-205092051 1 -20520 20524-
20535 20537-20541 20547-2055720559-20567 20569-
20578 20602-20612 20614-2063420639-20640 20644-
20648 20653-20671 20676-2068120683-20685 20688-
20692 20698-20707 20710-2072520727-20734 20747-
20752 20758-20766 20788-2079720806-20812 20816-
20849 20853-20857 208632086520871 20879-20880
20885 ■2088720897 -2090020924 -20932 20938-20943
20948 2094920951 -2095420957 -20976 20985-20988
20999 2100821027 -2105621058 -21 1 1 1 21 122-21 127
21137 ■2114021145 -2114821155 -21 169 21 171-21 174
21176 2118021199 ■2121021213 -21215 21225-21228
21233 2123521237 -2125221256 -21262 21267-21274
21277 •2129621334 21340-21342 21351-21352 21359-
21360 21377-21397 21405 21410- 21414 21434-21453
21463- -2146521467- -21476 21480 •21482 21485-21491
21495- 2150021516- -2153021532 ■21533 21554-21556
21587- ■2165521712- -21724 21744 •21747 21773-21775
21877- 2188021892- -21894 21899 •21902 21905-21912
21917- 2192121925- -21935 21939 21948-21968 21971-
21987 21995-22015 22019-22036 22040-22044 22047-
22079 22090-22094 22097 22101 - 22107 221 15-221 19
22128- 2213822141- -22143 22152 •22156 22160 22165- Tissue RNA Library SEQ ID NOS: origin Source Name
2216822171-2218622193-2219422204-2221122218- 2222622231-2223522246-2225022261-2228922292- 2232822336-2235122358-2235922362-2237322377- 2238022383-2239222399-2243222434-2243522440- 2244822455-224652249522531-2253922551-22560 22570-2258122599-2260222607-2260922618-22624 22632-2265122653-2265722661-2266722669-22670 22680-226842269022696-2269722700-2271522725- 2272722730-2273122760-2276422767-2277022781- 2278222794-2279622805-2281622821-2282722833- 2283422843-2284922852-2285622870-2287422887- 2289122904-2290622909-2291022916-2292422927- 2293822948-2295022969-2297722980-2298122991- 229942299823022-2302523046-2305523059-23060 23063-2306723070-2307123074-2307623088-23090 23094-2310123112-2311923138-2314123201-23202 23206-2320923212-2321523218-232212322623229- 2323323235-2323623242-23244232472325123261- 2328123356-233602337823382-2339023392-23398 23402-2340823415-2342123433-2343723470-23474 23486-2348723489-2349023492-2349323495-23496 23508-2351123515-2351823543-2354423547-23548 23555-2356523599-2362423673-2368123700-23701 23704-2370923725-2373023760-2376323771-23780 23793-2379923802-2380923816-2381923827-23835 23839-238482388223890-2390223906-2391023959 23999-2401124014-2401824021-2402524035-24040 24056-2409124105-2411324120-2412924194-24196 24749-2475424772-2477425085-2509025279-25288 25306-2530825319-2532125323-2532525331-25337 25340-2534125347-2535425359-2536325370-25375 25383-2540125403-2540725410-2541325534-25536 25598-2560325966-2596826024-2602826033-26049 26196-2619926205-2621326221-2624526248-26251 26253-26257262612626626273-2628026285-26290 26310-2631426316-26321263272633726347-26348 26350-2635226358-2636526373-2637426383-26385 2639826409-26411264232643126455-2646326469- 2647026559-2656026563-2656626596-2660226660- 2666126676-2668026691-2669426705-2672626731- 2673526737-2673926741-2675626782-2678526796- 2679826802-2680626813268272683526843-26845 26853-2685426860-2686226873-2687926888-26890 26980-2698727028-2702927044-2704727100-27102 2710527122-2712627133-2713427142-2714927173- 2717427177-2717827193-2720027209-2721327218 27240-2724627269-2727027275-2727627281-27282 27299-2730127304-2730527348-2735327386-27391 27414-2743127438-2744327458-274592749727501- 2750727510-2751127520-2752127544-2754527551- 2755427557-2755927586-2758727600-2760627636- 2763927649-2765427662-2767227698-2769927707- 2770827729-2774327750-2775627799-2780227812- 2782027825-2782627861-2786427896-2792727943- 2794727988-2799027992-2799728050-2809128095- Tissue RNA Library SEQ ID NOS: origin Source Name
28096 28099- -2810028105- ■2810728142- -2815628192- 281982823228234-282542828628308-2831328361- 28370284242844928533-2853928573- 2857829250- 2925229278-•2928329296-■2930029304--2931429327- 2933729339-■2934029343-•2934529354-■2935729367 29371-■2937929409-2941629508-2951829529-29534 29718-■2972129950-2995529960-2996130085-30087 30135 •3014230150-3015630195-301993020530218- 3023230236- 3024230361-30368 fetal liver- Columbi FLS002 1-25-621-2326-273035-3848-497477-7887-94108- spleen a 112129150-151164170-171180-181 188-192197-200
Universit 205218-220227-230235237-239251-252273-278304 y 309319-323328-330332362395414-420425-427505 515-519528532-533543-547582-584619-620674-675 685-688694-699708-711716-717742754-757763777- 778785-786825832-836847-848852-855892895-898 901-906910-912969-971979-98510051067-10701080- 10811092-10941109-11101163-11641185-11871202 1210-1211 1235-12371240-12411259-12621273-1279 1299-13001302-13031312-13131432-14351440-1447 1453-14541467-14691476-14781544-15461557-1558 161816251667-166816751678-16871705-17141725- 17261757-1758183818401866-18721879-18801912- 19131915-19161918-19191961-196219671986-1987 1991-19922015-201820362058206620722122-2124 2195-21962202-220522212275-2276227922992308- 232123482354-23552383-23882393-240024312457- 245825142517-25202529-25422544-25742663-2665 26912695-27002717-271927452808-281128192871- 28762908-29092913-291429172929-29302948-2949 2975-29772988-29893006-30103078-307930903112 31143126-31283130-31313165-31693182-31873226- 32273306-33093326-3331335233773392-33963418- 342034333461-34633468-34713508-35103526-3529 3555-35583562-35643661-36633676-36783687-3691 3693-36943699-37033869-387739343949-39533970- 40064077-40794101-410341314141-41464151-4155 4206-42084220-42384343-43464362-43654386-4387 4417-44194441-444944834489-44934515-45164526- 4527454645524562-45684623-46394658-46624666 4668-46724720-47254751-47554795-48094813-4815 4842-48514872-48984958-49594967-496849704980- 49814984-49854989-499150825087-50885109-5115 5129-51305152-515351925200-52015253-52665269- 52715279-52835288-52905311-53255368-53695381 5480-54825557-55605567-55685580-55835656-5658 5703-57045803-58045829-584658795912-59185932- 59365939-59415943-59455973-59755980-59845992 60846114-61256155-615961706172-61746186-6189 6223-62256346-6366638664006403-64046461-6484 6488-64896516-65196542-65436570-65716657-6661 6709-67116714-67476762-67646791-679468516877- 68996932-69846998-70147094-70997216-72197245- 72487256-72637267-72687275-72787296-72987364- 73657479-74807554-75557557-75597580-75977604- Tissue RNA Library SEQ ID NOS: origin Source Name
76057615-76177630-76367640-76427654-76557657- 76597671-76737678-76877693-76947701-77037726- 77277742-77437745-77517757-77597763-77707794- 779778057815-782079017924-79257947-79487972- 79757978-79837990-7992799880028008-80128017- 80198069-807080798084-80858090-80938098-8100 8107-81088117-81218129-81318137-81518157-8159 8163-81738178-818182038242-82468253-82628266- 82688297-83008308-83098313-83198339-83428347- 83508368-83698383-83898422-84258430-84488457- 84588461-84648487-84898497-84998504-85058509 8516-85188537-85388554-85558576-85828588-8597 8601-86098612-86178688-86908698-87008733-8735 8738-87408751-87668768-87998917-89338936-8945 89979002-90049013-90209031-90369038-90469049- 90629072-90749084-90859306-93119327-93289337- 93389343-93529362-93639366-936893829391-9392 9395-93969427-94289446-94489454-94599464-9469 9482-94869506-95189526-95289531-95379550-9551 95589561-95629577-95789594-96029609-96129617- 96269682-96839726-97289739-97429757-97699786- 97909794-97959813-98209828-98329835-98409848- 98509857-98619864-98669899-99089913-99209923- 99249927-99289942-99449953-99559958-99599969- 99809993-999610015-1001610033-1003710129-10145 10161-1016310173-1017410263-1026410273-10276 10278-1031110326-1032810449-1045310474-10476 10488-1048910491-1049310498-1050310508-10515 10531-1053210550-1055410588-1059910603-10611 10615-1062310628-1063010640-1064410783-10785 10873-1087510877-1087810881-1088310886-10889 10891-1089310904-1090510907-1091010913-10918 10923-1092610957-1095810965-1096610973-10975 11022-1102311029-110311103811040-1104111066- 110681108111123-1112411130-111321113511208- 1120911212-1121911226-1122711230-1124911281- 1128211308-1131011318-1132011339-1134411406- 1140711460-1146411473-1147411476-1147811505- 1150611574-1157611586-1160611649-1165011669- 1167111711-1171211715-1171611731-1173411739- 1174011761-1176311780-1178211811-1183211840 11901-1190511912-119141191611941-1194411947- 1195012011-1201312023-1203912050-1205612121- 1212612133-1213412150-1215112186-1219112194- 1219612206-1220812221-1222412245-1225312256- 1225712309-12311 12331-12332123621236812374- 12401 12403-12411 12446-1245012455-1245912468- 12471 12486-1251812547-1255412590-1259612637 1264012642-1264412666-1266712714-1271512720- 1272212725-1272712785-1278712794-1281212814- 1282212824-1282712829-1283012832-1283412872- 1288112894-1289712944-1294712963-1296412977- 12981 12984-1298512997-1300613062-1306313066- 13124135121351613545-1354813576-1357913585- 1358813596-1359713606-136121361713632-13633 Tissue RNA Library SEQ ID NOS: origin Source Name
13761 -1379713810 13873-13875 13930-1393313937- 139381400414023-1402614044-.1404514048-14056 1406114063141271412814137--1413814170-14172 14175- 14181142091421514261--1426314304-14305
14347-1434814355--14356143891439114550-14551
145601460414661--1466214681- 1468214692-14693
147141471714789-147911479314810-1481314841-
1487514999-1500115013-150701509015093-15096
151181511915182-1518315187-1518915197-15200
15209-1521015212-1521515221 -1522415243-15248
15250-1525115258-1525915283 -1528615290-15291
15321 -1532215358-1535915392•1539615406-15407
15460-1546115491 -1549515545 •1554615551-15552
15561 •1556215568-1557215606•1560715631-15632
15679-1568515688-1570015861 •1586615869-15870
15878-1588015988-15990160221602316041-16043
1606716075-1607916115-1611716121-1612216137
16141 •1614316168--1617116174-■161791620216206-
1621716233-1623716279-1628016290-1629516365-
1636616434-1643716443-1644516465-1646616470-
1647216496-1649716508-1651116517-1651916542-
1654516558-1656216579-1658216601 16609-16611
16632 •1663416643--166481665216656-1665716666-
1666716702-1670516771-1677216781-1678216836-
1684216860-1687016894-168961690016918-16919
16962 •1696617067--170691711417120-1712217131-
171321723717268-■1727417284-1728517297-17306
17335-1733917359--1736017365- 1737117399-17400
17436-174501745417497-1749817525-1752717534-
1757017575-1757817580-1758417891-1789417956-
1796218001-1800318008-1801218020-1802218038-
1804218069-1807218097-1813418136-1813818207-
1821018371-1837218378-1837918407-1841018412-
1841818421-1842518492-1849418500-1851818536-
1853718583-1858418624-1862618629-1863718644-
1865018655-1865818668-1867718680-1868118693
18717- 1871818723--1872618730-1873218738-18744
18750- 1875618759-•1876618771-1877218778-18780
18787- 1879318802--1880418822-1883418836-18853
18856- 1888018904-•189051891018915-1891818925-
1893518941 18944-1894518947-1895018955-18959
18964- 18973189751899619001- 1900419011-19012
1901819029-1903919045-1904819054-1905519062-
1906519068-1908319096-19118191321913419138-
19140191481915319202-1920619209-1921019213-
192201922519228-1922919251-192521925719266-
1926719271-1927219275-1927919298-1930019302-
1930319306-1930719355-1936419368-1937019380-
1938919401-1940219413-1941419432-1943419442-
1944419458-1946019484-19486195031953619560-
1956219564-195781958619598-1960119604-19608
19617-19622196251962719656-1965719659-19661
19667-1966819672196881969319733-1974819757-
1976319800-1980819813-1981519921-1992919933-
1993819940-1994619948-1994919958-1996319965- Tissue RNA Library SEQ ID NOS: origin Source Name
1996719972-1998119995-2001420018-2004720053- 2006320069-2007320087-2009420099-2010220106- 2011220114-2012020122-2012720133-2013620149- 2015020167-2017920189-2019420198-2021420231- 202412025320257-2027820281-202822028820317- 2032020340-2034120345-2035420366-2036820411- 2041320415-2043620447-2045120469-2047920482- 2049420497-2049820501-2050320505-2052020524- 2053520547-2055320563-2056720575-2057820602- 2061220614-2061920621-2062420631-2063420639- 2064020646-2065220666-2067120689-2069220698- 2070720712-2071720727-2073420747-2075120758- 2078820792-2079720801-2081220819-2082320827- 2083520844-2084920853-2086320865-2086720871 20888-2088920897-2090020922-2092620929-20932 2094320952-2095420957-2096220973-2098820999- 2100421016-2102521027-2103121046-2107121076- 2115121157-2116921171-2117421194-2119621202- 2120721229-2123221241-2125221256-2126221267- 2127421277-2129421297-2129821301-2130321334 21340-2134221351-2135221377-2140221405-21427 21434-2145721462-2146621470-2147621480-21500 21532-2153321554-2155621712-2172421744-21747 21877-2188521892-2189421905-2191021917-21921 21929-219352193921951-2195421967-2196821971- 2197721983-2198721989-2199321995-2200222007- 2201522019-2203622040-2204122047-2205522060- 2206122070-2207322077-2207922090-2209422101- 2210722115-2211622135-2213822141-2214322160- 2216822171-2218422187-2219222195-2219822208- 2221122218-2222622246-2225022256-2226022265- 2227622284-2228922300-2230222310-2234222351 22357-2235922377-2238022389-2239222399-22408 22411-2243222440-224482249522534-2253922551- 2255922561-2256522570-2258122599-2260222607- 2260922618-2262822632-2263322644-2265122653- 2265422661-2266222665-2266722669-2267422680- 226842269022696-2269722700-2270122703-22707 22760-2276422767-2277422801-2280422810-22816 22823-2282722846-2284922852-2285322870-22874 22904-2290622925-2293822948-2295022962-22970 22973-2297622982-2298623007-2302123046-23050 23053-2305523059-230602306323080-2308323112- 231192314123212-2321523229-2323323237-23239 232452324723251-232562326323279-2328123343- 2334423358-2336023375-2338523392-2339523402- 2341123428-2343023439-2344723475-2347923487 23492-2349323515-2352323545-2354823555-23565 23587-235932359923673-236862369023692-23693 237042372523760-2376123770-2378023793-23797 23802-2380923827-2383523839-2384023843-23848 238652388123883-2388923941-239562395923990- 2399224002-2400424014-2401824029-2404124044- 2404524056-2411624120-2412924145-2416224194- 2419624749-2475425085-2509025279-2528825300- Tissue RNA Library SEQ ID NOS: origin Source Name
2530425314 25319-2532125323- -2532525342-25343 25347-2535425359-2536325370-2537225374-25375 2538225403--2540725522-2553125534-2553625966- 2596826033--2604926196-2619926205-2620626209- 2621326221-•2622326240-2624326248-2625126261 2626626270--2627926285-2629026347-2634826354- 2635526359-•2636526373-2637426378-2638126409- 2641026414264232643126446-2644726469-26470 26559-2656026586-2660226617--2662826657-26661 26665-2666626684-2668626705--2670726719-26726 267412674626748-2675426796-•2679826806-26812 26876-2687826980-2698427012--2701427037-27038 27044-2704727067-2707327100--2710227114-27120 27129-2713227142-2714927154--2715527180-27187 27203-272052721827229-2723227240-2724427269- 2727027283-•2728727299-2730127304-2730527340- 2734227372-■2737527379-2738027401-2741027414- 274312743827441-2744327456-2745927487-27488 27493-2749527510-2751127520--2752127548-27549 27557-2755927586-2758727600-•2760127633-27634 27640-2764127674-2767627740--2775627758-27768 27812-2781327819-2782027890--278922792827995- 2799728040-2804128045-2804728050-2805928097- 2810028105-2810728133-2814028234-2825428278- 2827928290-2829528305-2831628347-2834928365- 2837028440-2846028463-2848829117-2912329265- 2926729278-2928329304-2931429321-2932629328- 2933129339-2934029354-2935729367-2940429409- 2941629427-2943129594-2960429718-2972129872- 2987529938-2995329962-2996330085-3008730195- 3019930221-3023230240-3025030325-3032730361- 30368 fetal liver- Columbi FLS003 674-6756773-677480708434-844811226-1122715588- spleen a 155891665216851-1685317958-1796218642-18643
Universit 18773-1877618802-1880419275-1927919375-19379 y 20189-2019420263-2026420602-2060621256-21262 22265-2227023890-2389224772-2477427193-27200 28424 fetal liver Invitroge FLV001 3-410-14112170-171255-256713-715847-848969- n 971981-9851001-10031097-10981123-11271142-1164 1273-1279147016251918-192720582272-22742414 2471-247424762663-26652871-28732939-29432948- 294930013021-302231643255-32593374-33773485- 34863512-35213555-3558362437083735-37363768- 37693818-38193949-39534562-45684600-46014623- 46395480-54825803-58045856-58585899-59026011- 60216166-61686551-65676932-69387216-72197421- 74237534-75357678-76807745-77467902-79037924- 7925795780708090-80938101-81048113-81168132- 81368157-81598230-82328253-82578274-82768416- 84198434-84488512-85138588-85978689-86908771- 87768936-893893059391-93929475-94769581-9584 9735-97389743-97449768-976998289833-98349909- 99129969-99809984-998510326-1032810474-10476 10480-1048210498-1050310592-1059410603-10606 Tissue RNA Library SEQ ID NOS: origin Source Name
10870-1087210881-1088310907-1091010913-10914 10936-1093711022-1102311123-1112411345-11350 11423-1142611658-1165911669-116721167911757- 1175811873-1187512050-120561214012190-12191 12455-1245912470-124711263712676-1267812723 1294312948-1296212978-12981 12997-1299913629- 13631 13638-13641 14044-140451411614460-14462 1460414640-1464214818-1482015011-1501215019- 1502415126-1512915277-1527815329-1533115576- 1557715661-1566315855-1585715886-1588716174- 1617616290-1629516403-1640716422-1642916496- 164971654516576-1657716599-1660816851-16853 17026-1702817150-1715217268-172741745417924- 1792617958-1796218001-1800318010-1801218395- 1839718516-1851818536-1853718585-1858618644- 1865018655-1865818771-187721878918829-18834 18846-1885318897-1889818919-189211897518996 1901219096-19101 19251-192521927419295-19303 19308-1930919345-193491937219390-1940219422- 1943119441-19444195141956319617-1961919625 19679-196821969819700-1970519727-1973219772 19814-1981519819-1982219926-1992919933-19937 19943-1994619948-1995019965-1996719972-19980 20133-2014520161-2016420167-2017120189-20194 20215-2021820257-2026420271-2027820281-20282 2028920321-2032420336-2033820340-2034120366- 2036820393-2040020432-2045120456-2046820514- 2052020563-2056720575-2057820607-2061220620 20625-2062820683-2068520836-2084320868-20870 20927-209322094320948-2094920952-2095421058- 2105921101-2111121157-2116921197-2119821208- 2121021248-2125221326-2133321351-2135221470- 2147621480-2148221495-2150021899-2190221917- 2192121925-2192821958-2196621983-2198722000- 22015220192204722074-2207622092-2209422101- 2210722161-2216422169-2217022204-2220722218- 2222622256-2226022277-222832231822343-22348 2249522551-2255222571-2258122625-2262822680- 2268422708-2270922760-2276222810-2281622823- 2282722852-2285323047-2305023080-2308323201- 232022341923433-2343723473-234742348723599 23673-2367423731-237362376123793-2379723845- 2384823876-238772388223890-2389223903-23904 23941-2395623999-2400124005-240112405624772- 2477425293-252972531525322-2532525331 -25333 25336-2533725374-2537525676-2567925966-25968 26024-2602826033-2604926196-2619926205-26206 262662632726347-2634826350-2635226359-26360 26383-2638526409-264112684326853-2685426860- 268622721827222-2722427304-2730527372-27375 27439-2744327510-2751127518-2752127544-27545 27636-2763927649-2765427814-2781528087-28091 28105-281072818628234-2825428365-2837028424 28426-2842828533-2853929409-2941729508-29518 29718-2972129960-299613006130085-3008730224- Tissue RNA Library SEQ ID NOS: origin Source Name
30232 fetal liver Clontech FLV002 3461-3463 6331-6334 7757-7759 10638- 10639 14098- 14099 14261-14263 20180-20181 23007-23021 23555- 23565 27317-27322 28277 fetal liver Clontech FLV004 218-219 1436-1437 5370 7216-7219 8247-8248 8512- 8513 8936-8938 9854-9855 10640-10644 1 1842 12121- 12126 12824-12827 16226-16227 16470-16472 16652 17924-17926 18500-18501 18516-18518 18527 18723- 18726 18778-18780 18964-18969 1921 1 -19212 19370 19401-19402 19422-19431 19461 19583-19585 19736- 19742 19919-19920 19924-19925 19943-1994620208- 20214 20437-20440 20792-20797 20948-20949 21277- 21280 21470-21476 22265-22270 22690 22730-22731 22846-22849 22881-22886 23047-23050 23356-23357 25319-25321 25323-25325 25966-25968 26843 27504- 27507 29343-29345 fetal Invitroge FMS001 158-159180-181251-252551-552697-699716-717832- muscle n 836847-8481123-11271371 1717-17192254-2258 2440-24442793-27992923-292430393107-31083377 35073555-35583686-36913773-37744127-41284343- 43464388-43904536-45384561-456846914795-4809 5372-53755965-59666403-64046669-66707209-7212 7620-76217681-76877695-76967745-77467815-7818 7891-78957921-792579467990-79928090-80938132- 813681628253-825783108313-83148345-83488598- 86008612-86168768-877088699306-93119321-9323 9343-93469366-93689479-94819491-94939517-9521 98289848-985098989927-99281030610320-10323 10451-1045310491-1049310638-1063910895-10898 10936-1093711217-1121911297-112991131711345- 1135011372-1139611731-1173411835-1183611840 12121-1212612129-1213012455-124591246512467 12569-125701272312743-1275012978-1298113581- 1358313592-1359513638-1364113885-1388713925- 1392614137-1413814270-142731460414640-14642 148751500915151-1515715243-1524815600-15601 15623-1562615863-158661613716147-1615116174- 1617616352-1635316576-1657716619-1662116836- 168421711417330-1733217335-1733917455-17456 17958-1796218008-1800918029-1803018400-18402 18412-1841818421-1842218500-1850118671-18672 18781-187831878918802-188041885618923-18924 1894219135-191371927319306-1930719355-19360 19362-193641937019380-1938419390-1940019422- 19431195031958619667-1966819679-1968219700- 1970519800-1980319813-198151993920029-20043 20151-2015420161-2016420200-2021420231-20234 20263-2027320281-202822028920328-2033020361 20401-2040520437-2044020491-2049420514-20520 2068120727-207322087120897-2090020952-20954 20957-2096220977-2098421005-210082101521123- 2112721157-2116921171-2117421208-2121021223- 2122421229-2123221284-2129421301-2130321351- 2135221377-2139721408-2140921958-2196121983- 2198722020-2202922080-2208322092-2209422160 Tissue RNA Library SEQ ID NOS: origin Source Name
22165 -22168 22212- -22217 22277 22283 22329-22335
22347 -22348 22405- •22408 22553 22558 22607-22609
22637 -22643 22669- -22670 22700 22701 22730-22731
22810 -22816 22881- •22891 22980 •22981 22991 -22994
23007 -23021 23084 23206-23209 23222 23229-23233
23329 -23337 23382- •23385 23510 -2351 1 23599 23827-
23833 23941 -23956 24014-24018 24446-24448 25085-
25090 25315 25319- 25321 25340- 25341 25403-25407
26024 ■26028 26258- -26260 26285 •26290 26307-26309
26421 •26422 26435 26559-26560 26737 26842 26879
27052 27091 27100- 27102 27304- 27305 27455 27520-
27521 27544-27545 27574-27577 27600-27601 27647
27698 -27699 27729- ■27739 27755- •27756 27825-27826
27853- •27857 27971 27989-27990 28008-2801 1 28142-
28145 2831 1-28313 28424 28426- 28428 29278-29283
29332- •29337 29956- 29957 30141 - ■30142 30189-30194
30236 30239 fetal Invitroge FMS002 534-53533774562-45684581-45826403-64047236- muscle 72379018-902012194-1219613638-1364116545 16851-1685318044-1804518655-1865818802-18804 1915319254-192551937019422-1943120263-20264 2028920437-2044020625-2062822020-2202522212- 2221722284-2228922362-2236422399-2240422531- 2253322669-226702269022887-2289125342-25343 26860-2686227602-27606 fetal skin Invitroge FSK001 35-38170-171192227-230315-316329-330373-374 n 425-427464-465551-552579-580605-606635685-688 713-715718-720785-786832-836969-971981-985 1001-100410311053-10571097-10981123-11271169- 11791203-120413071414-14171431 14511578-1585 1600-16011611-161616751705-17141729-17311778- 177918601885-18861912-19131915-19162136-2137 23072383-23882540-25412599-26032675-26782706 28002802-28122814-28152826-28272923-29242944 29462951-29553094-30963276-32793322-33233336- 3337335233773477-347835073526-35293531-3534 3555-355836863715-37163773-37743834-38353855- 38573860-38643873-38753926-39273939-39413949- 39534003-40064124-41264129-41314141-41434206- 42084213-42144343-43464358-43614441-44434489- 44934550-45514562-45684581-45824623-46394668- 46724686-46894705-47074795-480948184824-4831 4857-48614897-48984971-49744980-49815061-5062 50655480-54825581-558358025919-59215980-5984 6166-61686219-62226331-633463996403-64046623 66506657-66706692-66976877-68787209-72127216- 72197258-72637296-72987447-74497536-75417580- 759776087615-76177620-76217660-76657681-7689 7695-76967701-77037745-77517757-77597778-7788 7806-780878217824-78257904-79057924-79257972- 79757978-79837990-799279948077-80788080-8081 8090-80938113-81168129-81418152-815981628187- 82008216-82188242-82468253-82628274-82768310 8313-83148326-83278347-83608368-83698375-8376 84158434-84528487-84888502-850385368539-8542 Tissue RNA Library SEQ ID NOS: origin Source Name
8554-85558576-85828607-860986888724-87328768- 87708939-89458978-89819026-90289031-90369038- 90399043-90459079-90809264-92659305-93119321- 93239340-93429375-93769456-94599472-94769506- 95099517-951895319535-95379561-95629590-9592 9594-96029689-97079735-97389759-97629780-9781 98289846-98529854-98559857-98619867-98689874- 98959902-99049916-99209923-99249954-99559965- 998010020-1002410032-1003710161-1016310167- 1017210254-1026110480-1048210495-1050310513- 1051810523-1053010612-1062310628-1063010876 1090610936-109371096511035-110371115811250- 11251 11306-1131011345-1135011372-1139611431- 1143211465-1146611473-1147411496-1149811609 11612-1163911669-11671 11731-1173411745-11748 11757-1175811780-1178211803-1180411835-11836 1184011901-1190512011-1201312112-1211412140 1217312175-1217612280-1228212374-1237712422- 1242712460-124621246512468-1247112555-12559 12651-1265212666-126671267012676-1267812714- 1271512720-1272212728-1276612782-1278412803- 1280412823-128271294312978-1298112997-12999 13077-1307913100-1310613503-1351113596-13597 13603-1360513609-1361213629-1363113638-13641 13652-1365913834-1383713865-138661393013999- 1400414058-140591411714236-142381425714346 14355-143561460414626-1463514640-1464214725- 1472914784-1478814991-149921499615071-15074 15126-1512915182-1518315250-15251 1525715277- 1527815300-1530115341-1534215358-1535915404- 1540515408-1541415576-1557715588-1558915606- 1560715617-1561815623-1562615863-1586615925- 1593115988-1599016030-1604016044-1604616059 16121-1612216168-1617116174-1617616200-16201 16345-1634916365-163661637016388-1639116422- 1642916470-1647216593-16598166001662316636- 166371665216716-1672316851-1685316934-16937 17010-1701417026-1702917070-1707117104-17111 17156-171591723717244-1725017263-1727417278- 1732617335-1733917454-1745617571-1757417958- 1796218001-1800318015-1801618029-1803718136- 1813818393-1839418423-184251853518583-18584 18587-1861618624-1862618644-1865018655-18658 18671-1867218719-1872618730-1873218745-18749 18771-1877218787-1878918797-1880418811-18813 18829-1883418846-1885318894-1889618899-18903 18908-1890918915-1891818947-1895018955-18959 18976-1897719001-190041904019045-1905319062- 1906519074-1908019135-1913719142-1914719224 19228-1922919251-1925219258-1926219266-19267 19306-1930719316-1931719341-1934219350-19354 1936219368-1937019375-1938419395-1940019407- 19411 19422-1943419441 19458-1946019464-19466 19484-194861950219515-19521 19548-1955319564- 1956519574-1957819604-1960719617-1961919659- Tissue RNA Library SEQ ID NOS: origin Source Name
19661 19663-1966619670-19671 19683-1968819693 19700-1970519727-1974219764-1976719813-19815 19921-1992919933-1993719943-1994619972-19981 20029-2004320087-200942012020130-2014520161- 2016420180-2018120189-2019420198-2021420229- 202412025320257-2027020279-2028220286-20288 20290-202962029920328-2033020401-2040720414 20427-2043120447-2045120456-2046820476-20479 20485-204902050120505-2050920524-2053520537- 2054520548-2055320559-2056720629-2063020639- 2064020644-2064820666-2067120676-2068120683- 2068520689-2069220698-2070720710-2071120714- 2072520727-2073420747-20751207532076720789- 2079120806-2081220844-2084920868-2087020882- 2088420890-209002092620929-2095420957-20984 20989-2099021005-2100821026-2103121076-21087 21097-211002112221149-2115121157-2116921197- 2119821208-2121021213-2121521225-2122621241- 2125221263-2126621277-2128021284-2129421334 21351-2135221410-2141421428-2143321440-21445 21447-2145321463-2146521480-2148221495-21500 21529-2153021554-2155621718-2172421729-21733 21881-2188521911-2191221917-2192121929-21938 21958-2196621973-219742197821983-2198722020- 2202922062-2207322077-2208322092-2209422101- 2210722141-2214322160-2216822176-2217722195- 2219822208-2221122218-2222622246-2224922261- 2227022277-2228922292-2229922303-2231122347- 2235022358-2235922362-223712237322375-22380 2239322433-224352249522534-225392256022566- 2256822571-2258122599-2260222607-2260922622- 2262822637-2264322661-2266422669-2267022690 22700-2270122737-2273922801-2280422828-22829 22852-2285322870-2287422881-2289122904-22908 22916-2292222941-2294722952-2295322969-22970 2297323007-2302123047-2305023059-2306023071 2308423088-2309023201-2320223212-2321523218- 2321923229-2323323252-232562326323356-23357 23364-2337023373-2337423379-2338123386-23390 23392-2339523409-2341123415-234192343323470- 234722351423526-2353123543-2354423673-23689 2370423726-2373023771-2378023806-2380923845- 2384823865-238662388223893-2390223911-23940 23990-2399223999-240012402524035-2404024056 24092-2409724481-2449025085-2509025306-25308 2531325319-2532125331-2533325340-2534125461- 2547025598-2560326024-2602826196-2619926207- 2620826261262662628026316-263212632726337 26350-263522641426558-2656126676-2667726691- 2669426829-268302684326893-2689526980-26984 2705227067-270702709127100-2710227269-27270 27275-2727627283-2728727411-2741227480-27481 2749727520-2753927544-2754527600-2760127636- 2763927649-2765427677-2768827726-2772827755- 2775627812-2781527848-2785227858-2786827885-
Figure imgf000181_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
90359040-90429267-92689329-93309343-93469366- 93689391-939294439472-94749486-94889506-9509 9529-95309532-95349561-95629590-95929594-9602 9609-96109726-97279780-97819799-980098269828 9844-98459848-98529854-98559916-99209923-9924 9929-99359939-99409953-99551001210015-10019 10025-1002610038-1004110254-1026110277-10278 1030610498-1050310513-1051810531-1053210543- 1054910592-1059410609-1062310638-1063910873- 1087510931-109331093810963-109641096611006- 1100811116-1112111123-111241113511302-11303 11345-1135011359-1136911431-1143211467-11472 11561-115621160911658-1165911715-1171611731- 1173411803-1180411835-118361184211873-11875 11878-1187911901-1190511935-1193611947-11950 12127-1212812133-121341214012202-1220512221- 1222412259-1226212398-1240112430-1243112637 12666-126671272312774-1277812780-1278712791 12796-1279812824-1282712978-1298113068-13070 13077-1307913083-1308613505-1351113560-13562 13571-1357313592-1359513838-1384213865-13866 1400414058-1405914118-1412414130-1413114141- 1414214166-1416814170-1417214192-1419514236- 1423814261-1426314270-1427314304-1430514498- 14501 1460414607-1460914640-1464214684-14685 1470814726-147291478414789-147911487515096 15182-1518315218-1521915243-152481525715290- 1529115412-1541415496-1552515530-1553115548- 1555015576-1557715612-1561815699-1570015855- 158571588015988-1599016044-1604616121-16122 16174-1617616451-1645216484-1648616494-16497 16583-165851662316636-16637166421665216683- 1668516724-1672516752-1675316758-1676016851- 1685316938-169391697417026-1702817038-17041 17074-1707717120-1712217153-1715417233-17234 17286-1729117327-1733917454-1745617538-17565 17958-1796218015-1801618044-1804518069-18072 18097-181341841118421-1842518427-1842918492- 1849918516-1851818535-1853718644-1865018655- 1865818668-1867018691-1869218719-1872218750- 1875618759-1876018767-1876918771-1877218787- 187891879618802-1880418806-1880718842-18845 1885618899-1890318923-1892418941 18955-18959 18964-189691897519029-190351905519062-19065 19074-1908319096-1910119138-1914019209-19210 19226-1922719251-1925319316-1931719341-19342 1935019355-19360193621937019373-1937419387- 1938919407-1941119415-1941719422-1943119438- 1944019442-1944419461 19487194971950319515- 19521 19526-1953119560-195621956619574-19578 19580-1958619598-1960119604-196071962719667- 1966819670-196711969319698-1970519733-19735 19743-1974819764-19771 19804-198081981319921- 1992519933-1993719943-1994619953-1995719965- 1996719969-19971 19981 19995-2001420029-20043 Tissue RNA Library SEQ ID NOS: origin Source Name
20048-2005220074-2007920087-2009420106-20110 20114-2012020122-201272013020149-2015020161- 2016620172-2017920182-2019420198-2019920208- 202142024120263-202702028920298-2030420316 20328-2033020345-2035420401-2040520427-20451 20485-2049420497-2049820524-2053520548-20558 20563-2057820599-2060120616-2061920625-20628 20631-2063420649-206522068120698-2070720753 2076720788-2079720813-2081520836-2084320855- 2086320866-208702088120885-2088720927-20928 20963-2097220990-2104521076-2108721097-2llll 21145-2115121171-2117421208-2121521277-21283 21295-2129821377-2139721408-2141421434-21439 21446-2145321516-2152821554-2155621647-21655 21712-2171721905-2191021929-2193521951-21954 21962-2196621978-2198221989-2199322003-22006 22020-2202522050-2205522080-2208322101-22107 22120-2212722135-2213822141-2214322152-22156 2216022171-2217522185-2219222208-2221122261- 2227622343-2235022358-2235922362-2236422366- 223712237322399-2240422434-2243522455-22465 2249522534-2253922570-2258122599-2260222622- 2262822653-2265422663-2266422669-2267022690 22696-2269722700-2270122794-2279622843-22845 22852-2285322899-2290022909-2291022955-22957 22977-2298123046-2305023053-230552307123141 23206-2320923212-2321523220-2322223229-23233 232362324923251-2325623261-2326223353-23354 23364-2337023382-2339023399-2340123428-23430 2343323489-2349023508-235112354623682-23686 2370423723-2372423726-237302376023764-23767 23798-2379923806-2380923836-2383723839-23840 23867-2388023890-2390223941-239562395923990- 2399224021-2402424035-2404024092-2409724120- 2412925085-2509025289-252902530625319-25321 25340-2534125359-2536325368-2536925374-25375 25403-2540725416-2541725598-2560326196-26199 26207-26208262662628026285-262902632726337 26373-263742639426563-265662660326615-26616 26676-266772668726691-26694268062684326860- 2686226876-2687926980-2698427028-2702927050- 2705227067-2707027114-2712027203-2720527254- 2726027271-2727227501-2750327557-2755927562- 2756327586-2758727589-2759227633-2763427729- 2773927809-2781127859-2786427869-2787327890- 2789227989-2799028099-2810028105-2810728142- 2814528173-2817528272-282792828628311-28313 28361-283622842428573-2857829225-2923129278- 2928329321-2932629328-2933129339-2934029354- 2935729378-2937929960-2996130141-3014230150- 3015630195-30199 fetal brain GIBCO HFB001 16121 144-145148152156-163177-178192197-200 227-230255-256282-284307-308329-330332373-374 395398-399419-420425-427524-525532-535540-542 549557-561568-581635716-720750-752758-762785- Tissue RNA Library SEQ ID NOS: origin Source Name
786792-795825832-836845-851860-861901-904912- 913934-9351001-10051044-104510471067-1071 1097-10981101-11031112-11131123-11271165-1168 1205-12061228-12311235-123712461273-12821312- 13131411-141714231431 1453-14541467-14701499 1667-1668167517551792-17941961-19621980-1982 2015-20182083-20842136-21372202-22032275-2276 23482354-23552383-23882404-24062420-24212471- 24742550-255225762599-2603267927062724-2726 274528062871-28732903-290429262934-29352951- 29552975-29772983-29862988-29893205-32073276- 327732903322-33233336-333733523355-33563361- 336233643374-33773421-342234333441-34453468- 34713477-347835073512-35143555-35583612-3613 3729-37303781-378438603949-39534080-40824091- 40954124-41264151-41574206-421042484343-4346 4486-44874489-44934515-45164526-45274531 -4533 4562-45734581-45824612-461446514726-47274785- 47894825-48264975-49784980-49814984-49855065 5084-50855092-50945109-51155190-51915335-5336 5376-53785382-53835470-54775480-54825517-5521 5525-55265557-55605572-55735587-55925659-5698 5710-57115768-577258025808-58125899-59025907- 59095919-59215943-59455973-59755980-59845986- 59886029-60356088-60896166-616864006403-6404 6406-64106450-64536488-64896523-65266542-6543 6657-66616669-66706775-67787001-70077056-7057 7209-72127216-72197236-72377256-72577325-7340 7536-75417557-75597611-76127615-76217630-7636 7657-76597666-76737681-768976987701-77037726- 77277734-77367747-77517778-778377947798-7801 7806-78087824-782578907902-790379167921-7923 79577972-79757978-798379887990-799279948002 8017-801980708077-80798117-81218137-81568162 8187-82008219-82208242-824882508253-82578313- 83148349-83508368-83708409-84148466-84768497- 84998512-85138536-85508607-86098612-86168688- 86908736-87378754-87668919-89338982-89839007- 90089013-90209029-90309038-90429072-90749196- 92009306-931193169321-93239331-933393359337- 93399391-939294439459-94609482-94859487-9488 9506-95189522-952595319535-95379561-95629570- 95769617-96269664-96659682-968396889729-9731 9735-97429828-98329851-98529854-98559873-9895 98989902-99049923-99249927-99289956-99599969- 99809984-99859989-999210032-1004010167-10172 10263-1026410273-10274102771030610326-10328 10451-1045310470-1047310484-1048710498-10503 10508-1051210531-1053210588-1059110600-10601 10603-1060810615-1062310628-1063010638-10644 10881-1088310913-109141093810963-1096410980- 1098511066-1106811123-11124111351114311161- 1119711200-11201 11204-1120711297-1129911308- 113101131711326-1132711399-1140211431-11432 11460-1146411468-1147211476-1147811607-11608 Tissue RNA Library SEQ ID NOS: origin Source Name
11649-1165011674- 1167911696 11715-1171611731- 1173411739-1174011761-1176311766-1176811786- 1178811805-1180811835-118361184011843-11855 11937-1193812112- 1211412131 -1213212150-12151 12186-1218912192- 1219612225-1222812241-12244 1225812283-1228412374-1237712455-1245912468- 1246912486-1251812573-1257412590-1259212598 1263712642-1264412655-1265812666-1266712714- 127151272312725-1272712755- 1276412785-12787 12796-1279812829- 1283012832-1283412898-12899 12997-1299913077-1307913107-1311713576-13579 13585-1358813592-1359513606-1360813627-13628 13632-1363313638-1364113645 -1365113796-13797 13865-1386613873-1387513885 -138951389813918- 1391913954-1395613994-1399714023-1402614058- 140591409414127-1412814137- 1413814170-14172 14236-1423814261-1426314270-1427314283-14288 14314-1431714353-143541460414650-1465114684- 1468514714-1471714789-1479115013-1501815070 15093-1509515162-1516415182■1518315225-15230 15243-1524815250-1525115257-1525915290-15291 15323-1532415341-1534215358-1535915392-15396 15406-1540715412-1541415443 1545215530-15531 15545-1554615563- 15564155761557715588-15589 15600-1560115699-1570015780 ■1578815855-15857 15867-1587015884-158851596 1596315988-15990 16059-1606116075-1607916096 ■1609916121-16128 16141-1614316152-1615316168 1617116224-16225 16233-1623516281-162831629616371-1637216418- 16421 16449-1645316473-16481 16498-1650016521- 165261654516602-1661116632- 1663416636-16637 166421665216686-1669316702- 1670516716-16723 16804-1680716828-1683116836■1684216851-16853 16894-168961692416934-1693716954-1696117104 1711417120-1712217131-1713217145-1714617286- 17291 17330-1733217335-1733917454-1745617489- 1749017958-1796318001-1800318015-1801618029- 1803018055-1806018136-1813818393-1839418421- 1842518427-1842918492-1849418500-1850118516- 185181852718533-185371854018587-1861618625- 1862618644-1865418668-1867218691-1869218717- 1872618730-1873218738-1874418750-1875618761 18771-1877218778-1878018784- 1878918796-18804 18825-1883418842-1884518856- 1888018882-18888 18899-1890318919-1893518947- 1895018964-18969 18983-1898518989-189901899619001-1900919011 19018-1902419029-190351905519062-1906519074- 1908319096-19101 19138-1914019142-1914719153 19159-1916119163-19201 1925719260-1926219266- 1926719274193081930919316- 1931719345-19349 19351-19354193621937019380- 1938419395-19400 19407-19411 19415-1941719422- 1943419441-19444 19447-1944819464-1948319515- 1952419526-19531 19560-1956219564- 1956619583- 1958519604-19607 1962719659-19661 19663-1966819670-1967119674- Tissue RNA Library SEQ ID NOS: origin Source Name
19678 19683-19687 1969319700- 1971019727-19748
19759-197631977219800-1980819810-1981519819-
1982219921-1992919933-199371993919943-19946
1995019962-1996319972-1998120029-2005220069-
2007920087-2009420099-2010220106-2011020114-
2012020122-2012720133-2015420165-2016620172-
2019420200-2021820226-2022820235-2024120244-
202492025320257- 2026220265-2027020274-20278
2028920328-2033020345-2034820355-2035620393-
2040520415-2041720425-2043120437-2044020447-
2045120456-2047120485-204942050120505-20509
20524•2054520547 -2055820563-2056720569-20578
20607 •206122061 -2061920621-2062420629-20634
20639-2064020644 -2064520649-2067420676-20681
206982070720710--2071120718-2072520733-20734
20747-2075120753207672078820792-2079720806-
2081220819-2082120824-2084320853-2086220865
20868-2087120897-2090020926-2093220938-20942
20948-2094920955-2095620963-209842099120999-
2100421021-2102321060-2106121069-2107521097-
2110021137-2114021145-2114821153-2115421171-
2117421176-2118021197-2119821202-2120721213-
2121521229-2123221237-2124021248-2125221256-
2127421277-2128021284-2129421297-2129821301-
2130321326-2133321343-2135421377-2139721434-
2143921447-2145021454-2145721463-2146521477-
2147921485-2149121495-2150021516-2153021532-
2153321554-2155621587-2165521712-2171721729-
2173321877-2188521892-2189421911-2191221915-
2191621929-219352193921955--2195721967-21968
21971 -2197421978--219872199422000-2200222007-
2201522019-2202922037-2203922045-2204622056-
2206922074-2207622087-2208822090-2209422115-
2211622128-2213422141-2214322160-2216422169-
2217022187-2219222195-2219822208-2222622243-
2224522261-2226422271-2228922300-2230822312-
2231822323-2233222336-2235022358-2235922365-
2237122381-2238822399-2240422409-2241022434-
224352249522531-■2253922553-•2255822561-22565
2257022599-2260222607-2260922618-2262122625-
2262822634-2265122653-2265422663-2266722671-
2267722690226962269722700-•2270122726-22727
22730-2273122741 -2274822763 -2276422768-22770
22794■2279622810-2281622852-2285322859-22874
22899-2290022912-2292222927-2293822952-22961
22969-2297022973 -229762299 -2299423007-23021
23046 -2305523059-2306023070-2307123085-23086
23098 -2310123105 -2310823117-2311923125-23136
23203 •2320923220-2322123223 -2322523236-23239
23242 ■232452325123261-2326223279-2328123343-
2334423356-2335723382-2339023400-2340123415-
2341823420-2342123423-2343023434-2343723489-
2349023492-2349323508-2350923515-2351823555-
235652359923682-2368623700-237012370423718-
2372123731-2373623760-2376323768-2376923798-
Figure imgf000187_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
50545190-51915335-533653615567-55685572-5573 57005702-57045744-57475762-57645829-58455907- 59095919-59215929-59375939-59415956-59575960- 59645973-59755986-59886038-60396137-61396170 6235-62376403-64046406-64106511-65196775-6778 68556861-68627056-705771917209-72127216-7219 7236-72407258-72637299-73017304-731673627452- 74607542-75507554-75557557-755976097611-7612 7620-76217630-76367648-76537668-76737678-7689 7695-769777297734-77437752-77597778-77837787- 778877947798-78017898-79007906-791279167972- 79757986-79877995-799780028017-80198041-8042 8064-80678077-80788087-80938095-80968122-8124 8152-815681628187-82008204-82058234-82418247- 824883108339-83418343-83448347-83508368-8370 8403-84068420-84218461-84648492-84968504-8505 8512-851385368539-85508607-86098612-86168688- 86908701-87058736-87378758-87598768-87768822- 88358917-89188939-89458982-89919013-90209031- 90359040-90469063-90679069-90719264-92659305 9337-93399427-94289431-94339456-94589464-9469 9472-94749479-94819506-95099522-952595319535- 95379561-95629594-96029611-96129617-96269664- 966596769719-97229726-97279729-97319745-9756 9801-98029828-98329854-98559864-98669874-9895 9899-99019927-99289946-99529969-99809997-10009 1001110017-1001910033-1003710161-1016310167- 1017210254-1026110268-1027210326-1032810449- 1045010470-1047310477-1047810480-1048210484- 1048710491-1049410498-1050310520-1052210531- 1053210588-1059110607-1060810615-1062310627- 1063010634-1064410780-1078210873-1087510899- 1090210913-1091410919-1092210936-1093710972 10976-1097810980-1098511022-110251113511198- 1119911222-1122411292-1129411300-1130111306- 1131011345-1135011468-1147211524-1152511531- 1153411541-115431160911656-1165711721-11729 11772-1177611805-11808118401184211901-11905 11937-1193811941-1194412040-1204412112-12114 12141-121461214912194-1219612206-1220812215- 1221712226-1222812280-1228212305-1231112437- 124401245312470-1247112480-1248212541-12542 12589-1259812623-126241263712673-1267512723 12755-1275912785-1278712832-1283413077-13079 13107-131171351313543-1355113556-1355913576- 135791358413592-1359513638-13641 13645-13651 13668-1367213752-1375313838-1384213918-13919 13954-1395613999-1400314023-1402614038-14041 14044-1404514054-1405614065-1408914098-14099 14123-1412414141-1414214170-1417214209-14215 14236-1423814283-1428814355-1435614360-14369 14456-1445814561-145681460414640-1464214650- 1465114709-1471714789-1479114841-1484315002 15013-1501815093-1509515158-15161 15182-15183 15233-1523615243-1524815250-15251 15253-15255 Tissue RNA Library SEQ ID NOS: origin Source Name
15257- -1525915277 1527815283 1528615326-15328
15358 -1535915397 1539815415 -1543715473-15475
15530 1553115543 1554615551 -155561556715588-
15589 15623-15626 15780-15788 15867-1586815871-
15877 15896-15900 15988-15990 160821610516133-
16134 16168-16171 1620216345- 163491635916365-
16366 16418-16431 16451-16452 16508-16511 16532-
16535 1654516570- 1657316623 16643-1664816653-
16655 1666216702- 1670316714- 1672316802-16803
16833 1684216851 1685316860 ■1687017074-17081
17153 -1715417238 1724217286 -1729117297-17306
17489 1749017924 -1792617958 -1796318010-18012
18029 -1803018395 -1839718407 -1841018412-18418
18423 1842518500 -1850118516 -185181852718535
18583 1858418644 -1865018655 -1866018668-18677
18691 -1869218719 1872218730 -1873218750-18756
18759 -1876118777 -1878018787 -1879618815-18821
18825 -1883418836 -1883818842 -1884518857-18880
18899 -1890518910 18919-18921 18923-1893518947-
18950 18964-18969 18976-18977 18989-1899018993-
18996 1899919049- -1905319055 19068-1907019074-
19080 19093-19101 19106-19118 191321914819153
19159 -1916119207 -1920819213 -1922019228-19229
19251 -1925219257 19266-19267 19308-1930919316-
19317 19345-19350 19363-19364 1937019387-19411
19422 -1943119447- -1944819454 -1945519461 19511
19526 1952919560 -1956219564 -1956519567-19578
19604 1960719620 -1962219626 -1962719659-19661
19663 1966819672 -1967819683 -1968719706-19710
19733 1974219759 -1976319800 -1980819810-19815
19855 1985619921 -1992319926 -1992919931-19937
19939 19943-19946 19965-19967 19972-1998120026-
20027 20029-20043 20069-20079 20099-2010220111-
20112 2012020122 2012720137 •2015420157-20164
20167 2017120180 2018120189 -2019420198-20207
20235 2024120274 2027820281 -2028220290-20296
20317 2032420328 2033020345 -2034820393-20407
20415 2041720421 2043120437 -2045520469-20481
20485 2049020511 2051320524 -2053520547-20553
20559 2056220568 2057420607 -2061220614-20619
20635 2063620646 2064820676 -2068020683-20685
20689 2069220698 2070720712 -2072520727-20732
20747 2075320767 20806-20812 20819-2082120824-
20843 20853-20854 20865-20870 20879-2088020882-
20887 20897-20900 2092620929- 2093220948-20949
20952- 2095420957 ■20972 20999 2100821069-21071
21097- 2110021105 ■21 1 1 1 21 123 2112721141-21142
21145- 2114821153 ■21 169 21 171 2117421197-21198
21208- 2121021213 -21215 21225- 2123221241-21274
21284- 2129421297 -21298 21301 2130321343-21354
21377- 2139721408 .21414 21434 2145021454-21457
21480- 2148221485 ■21491 21529 2153021532-21533
21554- 2155621587 21655 21718 2172421729-21733
21881- 2188521892 ■2189421899- 2190221911-21914
21922- 2192421929 2193521940- 2194621958-21961 Tissue RNA Library SEQ ID NOS: origin Source Name
21973-2197421978 21989-21999 22003-2202922035- 2203622045-2204922070-2207322084-2208822090- 2209422098-2210022108-2211622120-2213822144- 2215622161-2216822171-2217522187-2219222195- 2219822208-2221722225-2222622243-2224522250 22261-2226422277-2228922300-2230922312-22313 2231822336-2234222358-2235922381-2238822405- 2240822434-2243522531-2253922553-2255822561- 2256522570-2258122607-2260922625-2262822637- 2265122653-2265422671-226742269022700-22701 22760-2276422810-228162286922881-2288622892- 2289322916-2292422941-2294722955-2295722969- 2297022991-2299423022-2302523047-2305223064- 2306523085-2308623088-2309023112-2311623125- 2313623212-2321523223-2322523229-2323323242- 23245232512326323353-2335423373-2337423399 2341923425-2342723434-2343823495-2349623508- 2350923512-2351323543-2354423555-2356523704 23720-2372123771--2379723802-2380523816-23819 23839-2384023845--238482388123890-2390223941- 2395623999-2400124005-2401124014-2401824021- 2402424201-2422024481-244902479425085-25090 25279-25290253132532225340--2534125383-25401 25403-2540725410--2541325416--2541725666-25672 25966-2596826205--2620626209-2621326217-26218 26235-2623926258--262592628026285-2629026304- 2630626310-26321263272633726347-2634826361- 2636526373-2637426412-26413264232643626469- 2647026604-2660626705-2670726788-2678926804- 268052684326860-268622687927028-2702927036- 2703827067-2707027100-2710527201-2720227288- 2729127304-273052744427520-■2753927544-27545 27588-2759627602--2760627649--2765427709-27710 27814-2782027865-•2786827890--2789227987-27991 27995-2799728042--2804328045--2804728108-28121 28133-2813728142--281452818628199-2820328234- 282542828628311-2831328426-2842828458-28460 28549-2855529117--291232924429278-2928329288- 2928929296-2930829328-2933129339-2934029343- 2934529418-2941929427-2943129508-2951829718- 2972129954-2995530085-3008730135-3014030195- 301993020530218-3022030233-3023530240-30242 infant Columbi IB2003 144-145165-166360-361619-620763796-797852-855 brain a 900934-935955-956981-9851001-10031097-1098
Universit 1312-1313154716751692-16931792-17941931 1961- y 1962207224392675-267827102724-27262886-2887 2934-29352951-29552975-29772983-29863182-3187 3322-33233353-335433773681-36823781-37843936- 39384117-41184160-41644362-43654471-44824562- 45684720-472548184967-49685572-55735594-5602 5829-5845593760846403-64046775-677868556861- 68626877-68816959-69607542-75507615-76177630- 76367648-76537660-76657668-76707695-76967726- 77277784-77887806-78087906-79127924-79257972- 79757978-79837990-79928017-80198087-80898129- Tissue RNA Library SEQ ID NOS: origin Source Name
81368152-81568162-81738187-82008343-83448349- 83508368-83698383-83898404-840685098612-8616 86888733-87538939-89458992-89949013-90179069- 907193059594-96029617-96269664-96659726-9727 9801-980298289896-98979947-99529965-99809989- 999210033-1003710161-1016310268-1027210312 10326-1032810470-1047310477-1047810491-10493 10498-1050310592-1059410612-1062310628-10630 10638-106391097210980-1098511022-1102311135 11406-1140711419-1142211541-1154311835-11836 1184011941-1194412006-120101214912280-12282 1272312755-1275912785-1278712973-1297613552- 1355913609-1361213638-1364113865-1386613885- 1388714127-141281425714264-142651460414650- 1465114709-1471314817-1482115013-1502415209- 1521015243-152481525715326-1532815397-15398 15406-1540715863-1586815884-1588515890-15895 15988-1599016133-1613416434-164371654516570- 1657316804-1680717074-1707717297-1730617489- 1749017958-1796218015-1801618492-1849418516- 185181853518655-1865818673-1867718691-18692 18730-1873218750-1875618759-1876018778-18780 18790-1879318815-1882118839-1884518919-18921 18925-1893318947-189501899619045-1904819074- 1908019148191591925719316-1931719363-19364 19387-1938919395-1940019405-194061948719526- 195291953619564-1956519659-1966119706-19710 19800-1980319816-1981819921-1992519933-19937 1993919943-199461995019965-1996719972-19980 20087-2009420099-2010220111-2011220151-20154 20157-2016020189-2019420198-2020720235-20240 20274-2027820281-2028220290-2029620317-20324 20328-2033020349-2035420415-2041720425-20431 20437-2045120456-2047120514-2052020554-20557 20559-2056220607-2061220614-2061920625-20628 20635-2063620639-2064020644-2064520698-20707 20710-2071120718-2072520727-2073220768-20787 20806-2081220824-2082620836-208492087120926 20929-2093220999-2100821016-2102021069-21071 21123-2112721153-2115421171-2117421208-21210 21213-2121521225-2122621248-2125221256-21262 21272-2127421277-2128021284-2129421340-21354 21377-2139721447-2145021495-2150021529-21530 21647-2165521729-2173321877-2188521911-21912 21917-2192121929-2193521958-2196622007-22029 22070-2207322084-2208622101-2211422135-22138 22152-2215622161-2216422208-2221122218-22226 2223022246-2224922261 -2226422271-2228922381- 2238222394-2239822405-2240822434-2243522495 2257022644-2265122669-226742269022700-22701 22743-2274822760-2276222810-2281622861-22868 22881-228862297123047-2305023112-2311623201- 2320223212-2321523229-2323323237-2323923321- 2332723379-2338123412-2341823425-2342723434- 2343723526-2353123555-235652370423793-23797
Figure imgf000192_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
86038698-87008760-87618982-898389979013-9017 9072-90749306-9311944394609475-94769487-9488 9506-95099531 -95349617-962696599757-97589826 9851-985298739981-998310273-102771030610498- 1050310520-105221090310980-1098511476-11478 11538-11540115461160911674-116781169611731- 1173411739-1174011766-1176811803-1180411835- 1183611912-1191412283-1228412486-1251812545- 1254612555-1255912615-126181263712642-12644 12760-1276212977-1298113592-1359513645-13659 14058-1405914270-1427314607-1460914663-14665 14684-1468514714-147171478415093-1509515290- 1529115302-1530315548-1555015568-1557515855- 1585716006-1601516168-16171 162371630116360- 163611637016576-1657716636-1663716851-16853 16873-1687416934-1693717330-1733217335-17339 17455-1745617491-1749218015-1801618029-18030 1806118097-1813418423-1842518492-1849418516- 1851818536-1853718682-1868818719-1872218750- 1875618759-1876018796-1880718811-1881318825- 1882818857-1888018882-1888818936-1893918941 18955-189591897518989-189901899619001-19009 19011-1901219057-1905819062-1906519081-19083 19096-19101 19213-1922019251-1925219254-19255 19266-192671927319316-1931719345-1934919362- 193641937019373-1937419385-1938919395-19400 19442-194451950319530-1953319598-1960119625 19709-19710197561981319819-1982219921-19925 1993919969-1997120044-2006320106-2011020130 20146-2014820180-20188202432025320257-20262 2028820300-2030420345-203482036020421-20424 20469-2047120485-2049020542-2054520563-20568 20575-2057820629-2063020698-2070920718-20725 20733-2073420747-2075120758-2076720788-20791 20836-208432086320879-2088020897-2090020929- 2093220938-2094220973-2097620985-2098821027- 2103121060-2106121101-2110421149-2115121197- 211982123621272-2128021351-2135221361-21374 21451-2145321744-2174721917-2192121951-21954 2197822003-2200622141-221432216022187-22192 22261-2226422271-2227622284-2228922314-22317 22343-2234622349-2235022358-2235922365-22371 223732249522534-2253922632-2263622644-22651 22663-2266422797-2280022843-2284522854-22856 22899-2290622955-2295722974-2297923046-23050 2306323066-23067230702314123218-2322223249 23358-2336023386-2339023409-234112348623515- 2351823700-237012370423726-2373023760-23763 23806-2380924126-2412925085-2509025096-25100 25334-2533525368-2537226024-2602826209-26213 26217-2621826248-2625126285-2629026596-26601 2673726748-267562684326876-2687926980-26984 2705227067-2707027203-2720527247-2725127283- 2728727379-2738027522-2753927636-2763927662- 2767227861-2786427896-2792728099-2810028108- Tissue RNA Library SEQ ID NOS: origin Source Name
2812128142-2814528315-2831628361-2836228424 28430-2843428573-2857829225-2923129250-29252 29278-2928329309-2931429328-2933729339-29340 2935829940-2995329960-2996129964-2996830141- 3014230150-3015630233-3023530361-30368 lung tumor Invitroge LGT002 35-3851-5287-94105136-138158-159167192197-200 n 216227-230255-256260329-330333-335360-361373- 374398-399406-407419-420425-427524-525528534- 535543-545549553-554560-561619-620631-633635 667-668694-695697-699706-707716-717777-778783- 790792-795847-848852-855901-904912-913919-922 964981-9851001-10051035-10371044-10451047 1097-10981123-11271158-11611203-12041235-1237 1263-12651273-1279128913041312-13131322-1344 1346-13471349-13541414-14171432-14351440-1447 1467-1469147414991569-15701667-16681678-1680 1692-16931705-17141721-17221792-17941879-1880 1912-19131925-19271941-19461949-19501956-1960 2015-20182058-20612259-2266230624312440-2444 2457-24592471-247425132570-25742599-26032663- 26652675-26782702-2705274527802871-28762934- 29352939-29442951-29552975-29773006-30083037- 3038309031123255-32593306-33083316-33213334- 33403374-3377343334663472-34733477-34783555- 35583561367636863729-37303789-379438173827- 382938323840-38523873-38753998-40024053-4054 4134-41364151-41554158-41624215-421642204273- 42744340-43414386-43874402-44034469-44704489- 44934531-453345424562-45684581-458246204623- 46434757-47584785-47894795-48114813-48154824 4845-48514857-48614891-48964967-49684980-4981 4984-49855109-51155127-51285152-515351925253- 52665335-534153645532-55335557-55605572-5573 5581-558356585695-56985703-57085717-57185731- 57425766-576758025805-58075827-58285919-5921 59285943-59455973-59755980-59845986-59886163- 61686223-62256330-63346403-64046406-64106533- 65356625-66266630-66376775-67786879-68816883 6932-69387209-72127216-72197256-72577285-7293 7296-72987302-730373747604-76057611-76127620- 76217624-76297640-76427671-76737678-76897701- 77037726-77287745-77517778-77867794-77977802- 78047806-78087815-78187902-79057924-79257972- 79757977-798579887993-799480028008-80098025- 80268041-804280468059-80638074-807680798084- 80858090-80938098-81058107-81088113-81148117- 81218129-81368142-814481628178-81818184-8185 8187-82008211-82138216-82188242-82468253-8257 8271-82738313-83148323-83258339-83428349-8360 8363-83648368-83698371-83748430-84338449-8454 8497-84998502-85038510-85188534-85358588-8597 8607-86098612-86168688-86908698-87008768-8779 9013-9017902890379063-90689072-90749196-9200 92669306-93119337-93389375-937693829391-9392 94449454-94609475-94769510-95189522-95259529- Tissue RNA Library SEQ ID NOS: origin Source Name
95319535-953795589563-956695849590-95929603- 96049609-96109617-9626967697289735-97429757- 97589763-976997919829-98329841-98429848-9850 9857-98619874-98959899-99019923-99249954-9955 9965-1001010161-1017210175-1019610263-10264 10273-102741027710279-102801030610326-10328 10451-1045710480-104821049010498-1050310506- 1050710516-1051810539-1054210592-1059410603- 1060810612-1061410638-1064410873-1087510877- 1087810895-1089810929-1093010936-1093810953 10976-1097810980-109851101011066-1106811123- 11124111351114311212-1121611252-1127311295- 1129611300-1130111306-1130711339-1134411359- 1136911405-1140711419-1142211431-1143211460- 1146611476-1147811493-1149411499-1150411538- 1154011584-1158511644-1164811651-1165511660- 1166111711-1171211731-117341176511769-11771 11835-118361184011873-1187511886-1188911937- 1193812040-1204112045-120561214012150-12151 12186-12191 12329-123321233512374-1237712398- 1240112468-124711247712486-1251812545-12546 12565-1256812571-1257212590-125921263712666- 1266712673-1268312698-1269912720-1272212755- 1276212785-1278712829-1283012832-1283412872- 128811294312965-1296712978-1298112997-12999 13077-1307913540-1354213563-1356413576-13579 13585-1358813592-1359513598-1360013603-13605 13627-1362813632-1363313638-1364113873-13875 13918-1391913925-1392613931-1393313998-14004 14044-1404514054-1405614058-1405914137-14138 14166-1417214209-1422614236-1423814299-14305 14355-143561439014498-14501 14546-1454914645- 1464814718-147201478414789-147911480914999- 1500115019-1506915093-1509615142-1515015182- 1518315187-1518915195-1519615233-1523615243- 1524815257-1525915290-15291 15321-1532815332- 1533515392-1539615406-1540715412-1541415454- 1545615496-1552515536-1553715545-1554615561- 1556215576-155771558615588-1558915600-15601 15623-1562615631-1563215855-1585715869-15870 1588015884-1588515961-1596315983-1598415988- 1599016041-1604616141-1614316145-1614616174- 1617616365-1636616403-1640716451-1645216470- 1647216479-1648116508-165111654516558-16562 16576-1657716602-166111662316636-1663716652- 1665516661 16672-1667416716-1672316730-16733 16802-1680316828-16831 16851-1685816894-16896 1691016934-169371694217058-170641710417114 1711917127-1713217145-1714617153-1715417235- 1723717330-1733217335-173391745417857-17861 17930-17931 17958-1796218015-1801618029-18030 18271-1827218376-1837718412-1841818423-18425 18495-18501 18516-185181852718579-1858118633- 1863718644-1865018655-1865818668-1867018673- 1867718680-1868118691-1869318717-1872618730- Tissue RNA Library SEQ ID NOS: origin Source Name
1873218745-1875818761 18773-187891879618802- 1880418806-1880718825-1882818839-1885318855 18857-1888018882-1888818894-1890518923-18935 18944-1894518964-1896918975-1897718993-18996 19005-190091901219045-1905319057-1906519074- 1908019084-190851913419138-1914019142-19148 19153192241927319283-1929419298-1930019302- 1930319306-1930719316-1931719350-1935419362- 193641937019380-1938419387-1940219415-19417 19422-1943419441-1944419454-1945519458-19460 19488-19493195031951419526-1952919532-19533 19560-1956219564-1957319598-19601 1960819617- 196191962719667-1966819670-196781969319727- 1975119759-197631977219814-1981519819-19822 19855-1985619921-1992519933-1993819943-19946 1995019953-1995719965-1996719972-1998020015- 2001720026-2002720029-2004320048-2005220069- 2007120087-2009420099-2010220111-2011220122- 2012720131-2013220137-2015020157-2016020165- 2016620172-2017920200-2021820235-2024020244- 202492025320271-2028020286-202872028920317- 2032020328-2033020336-2033820345-2034820361 20366-2037420393-204052041420418-2042020432- 2044620456-2047920485-2049020497-2049820501- 2050320510-2052320548-2055720559-2056720575- 2057820607-2061220614-2061920621-2062820646- 2065220676-2068120683-2068520698-2070720718- 2072520727-2073420747-207512075320758-20766 20789-2079120806-2081220816-2081820822-20854 20858-2087020885-2088720890-2090020922-20928 20938-2094320952-2095420957-2098820999-21008 21032-2104521060-2106621072-2107521112-21114 21122-2112721133-2113521149-2115121157-21169 21194-2119821225-2122621233-2123521241-21252 21256-2127121277-212942133421351-2135521359- 2136021377-2139721410-2141421428-2143321447- 2145021467-2146921480-2148221485-2149121516- 2152821647-2165521712-2172421744-2174721881- 2188521892-2189421905-2191221925-2193521955- 2195721962-219662197821995-220292204722056- 2205922062-2206922074-2207622084-2208622090- 2209122141-2214322160-2216822187-2219422208- 222112225022253-2225522261-2228922300-22308 22310-2231722323-2232822333-223502235722373 22405-2241022440-2244822455-224652249522531- 2253922607-2260922622-2262822634-2263622644- 2265122653-2265422661-2266222671-2267422690 22696-2269722700-2270122741-2274222760-22764 22794-2279622810-2281622838-2284922852-22853 22887-2289122923-2293822948-2295022952-22954 22977-2298622991-2299422999-2300623046-23050 23059-2306023070-2307123074-2307623098-23101 23125-2313623138-2314023206-2320923212-23215 23223-232282324723252-2325623261-2326323328 23343-2334423382-2339023402-234112341923433 Tissue RNA Library SEQ ID NOS: origin Source Name
23439-2345023473 -2347423486 -2348723489 23490 23492-2349323512 -2351323526-2353123543-23544 23555-2356523675 -23681236902370423726-23730 23760-2376323770 23806-2380923827-2383323838- 2384223871-23875 23878-2388023890-2389223941- 2395623990-23992 23999-2400124021-2402424029- 240332405624116 2466524772-2477425017-25026 25293-2529725306 25309-253102531525319--25321 25334-2533525340 -2534125355--2535825368--25369 25373-2537725383- -2540125410 -2541325966--25968 26024-2602826196 -2619926205-2620626235--26239 26248-2625226260 26270-2627926292-2629326305- 263062632726337 26341-2634426361-2636526421- 264222643126469- •2647026479-•265462655626559- 2656026596-26601 26607-2660926657-2665926681- 266832673726748- ■2675626776-■267812680626813 26843-2684526860- -2686226873 -268792703927067- 2707027074-27079 27100-2710227150-2715127173- 271742720627209- ■2721327216-•2721827269-■27270 27275-2727627280 27283-2728727304-2730527354- 2735727439-27444 27501-2750727544-2754527548- 2754927562-27563 27600-2760127636-2764127649- 2765427662-27672 27700-2770127714-2771727726- 2772827750-27756 27814-2781827845-2785227861- 2786427888-27889 27896-2792727989-2799027992- 2799428045-28047 28050-2805928095-2809628099- 2810028105-28121 28133-2813728142-2814728161 28169-2820328255- -2826128290-•2829228311--28313 28315-2831628424 28426-2842828573-2857829250- 2925229278-29283 29328-2933729339-2934029354- 2935829406-29417 29508-2951829529-2953429594- 2960429655-29659 29718-2972129938-2995330085- 3008730141-30142 30195-3019930224-3023230240- 3025030361-30368 lymphocyt ATCC LPC001 310-314332360-361373-374464-465506576-578792- es 795912934-9351123-11271273-127913341432-1435 14701667-16681705-17141721-17221971-19742015- 20182086-20872234-22352275-227623892396-2400 2570-25742599-26032923-29242988-298929933001 3006-30083078-30793110-311132443326-33313441- 34423508-35103781-3784386038783913-39153939- 39414137-41394151-415542034221-423842554343- 43464531-45334751-47554891-48964971-49745057- 5059508251805269-52715525-55265530-55335717- 57185943-59455980-59846081-60826769-67787368- 73697447-74497536-75417561-75627580-75977609 7630-76367671-76737678-76877693-76947726-7727 7734-77367742-77437778-778377947806-78087906- 79127924-792580028084-808582038211-82138247- 824983428349-83508368-83698383-83898453-8454 8512-85138537-85388607-86098688-86908736-8737 8758-87598762-87668768-87708914-891690379264- 92659408-94149427-942894609506-95099517-9518 9526-95319535-953796769735-97389763-97679815- 982098289851-98529909-99129916-99209962-9964 Tissue RNA Library SEQ ID NOS: origin Source Name
9986-998810033-1003710146-1016310167-10172 10197-102521030610311 10516-1051910533-10534 10551-1055410595-1059910870-1087210895-10898 10904-1090510913-1091410936-1093710980-10985 11042-1104411123-11124111351115911406-11407 11431-1143211468-1147211493-114941155411603- 1160611766-1176811878-1187912006-1201312050- 1205612150-1215112439-1244012470-1247112475- 1247612555-1255912565-1256812615-1261812637 12642-1264412673-126751272312805-1280612823- 1283012835-1286712872-1288112977-1298113576- 1357913598-1360013632-1363313638-1364113922- 1392413927-1392913954-1395614304-1430514346 1460414785-147861480014802-148081507015141 15151-1515715233-1523615250-152511530515326- 1532815408-15411 15460-1546115545-1554615623- 1563215869-1587015890-1589516174-1617616226- 1622716284-1628916614-1661516628-1663116649- 16651 16730-1673316851-1685317105-1710617153- 1715417380-173951739817677-1768117924-17926 18015-1801618412-1842018492-1849418516-18518 1852718659-1866018730-1873218745-1875618761 18773-187771879618802-1880418822-1882818842- 1884518897-1889818925-1893318964-1896918976- 1897719029-190351905519059-1906119081-19083 19154-1915519213-1922019251-1925219306-19307 19316-1931719351-193541937019415-1941719441 19484-1948619512-1951319548-1955319604-19607 19659-1966119663-1966619670-19671 1968819698 19706-1970819768-1977119813-1981519933-19939 19943-1994619953-1996319965-1996719969-19980 19995-2001420029-2004320048-2005220106-20110 20137-2014820161-2016420172-2017920182-20188 20208-2021420229-2023020265-202702028920299 20349-2035420366-2036820427-2043120441-20446 20485-2049420497-2050020524-2053520542-20545 20602-2060620614-2061520631-206342064120646- 2065220683-2068520698-2070720718-2072520727- 2073420792-2079720806-2081220824-2083520885- 2088720890-2090020927-2092820938-2094220952- 2095420963-2097220977-2098421027-2103121062- 2106821076-2108721097-2110021105-2111121145- 2115621170-2117521194-2120721211-2121221233- 2123521241-2124721277-2128021284-2129421301- 2130321458-2146121463-2146521480-2148221495- 2150021917-219212192421962-2196621973-21974 21983-2198722000-2200222007-220152201922077- 2207922092-2209422117-2211922161-2216822187- 2219422208-2221122218-2222422253-2225522284- 2228922314-2231722319-2232222336-2234622399- 2240822440-224482249522607-2260922634-22636 22661-2266222700-227012270722823-2282722835- 2283722861-2286822941-2294723022-2302523046 23064-230652307023125-2313623206-2320923212- 2321523242-2324423261-2326223264-2327823356- Tissue RNA Library SEQ ID NOS: origin Source Name
23357 23379-23381 23392-23395 23400-2340123494
23508-2350923555-2356523731--2373623762-23763
23841 -2384223845-2384823883-2389223906-23910
24014-240182405624772-2477425213-2521925293-
2529725309-2531025323-2532525331-2533325336-
2533725340-2534125347-2535425436-2544025598-
256032628026307-■2630926341--2634426373-26374
266032668426691-266942684326860-2686226873-
268752687926893-2689527074-•270772721827304-
2730527510-2751127544-27545275502756427640-
2764127767-2776827814-2781527819-2782027970
27989-2799028097--2809828133--281372828628315-
2834928365-283702842428426--2842828468-28484
28511- 2851329328-•2933129339--2934029343-29345
2937029427-2943129594-2960429718-2972129940-
2995330085-3008730130-3013430195-3019930205
30361 •30368 leukocyte GIBCO LUC001 10-1442-4451-52112156164167170-171177-178 180-181 197-200202-203206-211221227-230255-256 259282-284289-292319-327329-330332-335339343- 353357-363366-367372-374395398-399414-420425- 427464-465505-506509-519522-525527-528534-535 540-542546-547549576-578591-593626-630635656 667-668706-707758-759785-786792-795825847-848 852-855880-882901-904912-913919-922934-935945- 946981-9851001-1003100510281044-10451047 1065-10661101-11031109-11101123-11271153-1155 1169-11791203-12041235-123712461269-12821307 1312-131313341346-13471431-14351453-14541467- 147014741476-147814991569-15731667-16681705- 17141721-1722177017771792-179418601879-1880 1887-18891912-191319671980-19821986-19872015- 20182023-20342052-2054206220722086-20872202- 22032222-22262267-22692272-22762383-23882437 2460-24612471-24742540-25412599-26032656-2658 2780280628462874-287629052918-29192923-2924 2934-29352951-29552983-29862988-29892993-3010 3014-30193026-30333105-31083144-31463205-3207 324132443255-325932653274-327732903306-3308 3310-33113326-333133523355-335633653374-3378 3421-342234333477-34783507-35103512-35203550- 35583649-36533681-368236933729-37303741-3753 3760-37623768-37693781-37843834-38353840-3852 3873-38753949-39534091-40954119-41204137-4139 4141-41434160-41624200-42034209-421042554266- 42744367-43684386-43904486-44874489-44934531- 45334555-45564562-45764581-45824612-46144640- 464346664692-46944712-47134720-47274785-4789 4833-48364857-48614971-49744980-49815057-5059 50825092-50945119-512051925253-52665269-5271 5285-52875335-53365480-54845499-550155105522 5525-55365557-55605572-55735579-558057005766- 576758025805-5807582358465910-59215932-5936 59465973-59755980-59846103-61046106-61096135- 61366154-615961756331-63366388639564006403- Tissue RNA Library SEQ ID NOS: origin Source Name
64046450-64516516-65196523-65266533-65356542- 65436551-65676585-65926638-66396772-67746791- 67946879-68816898-68996932-69387209-72127216- 72197245-72487252-72557258-72637267-72687296- 72987355-73577447-744974817536-75417556-7559 7580-76057608-76097611-76127615-76177620-7621 7630-76367660-76657671-76897693-76947701-7703 77297737-77437745-77517757-775977717778-7786 7789-77977802-78047815-7818782278907898-7900 7904-79127921-792579577972-79757984-79857988 7990-79928010-80128049-80568059-80638098-8104 8113-81148132-81448157-81598178-81818184-8185 8187-82008211-82138247-82488253-82578271-8273 8297-830083428349-83508365-83668368-83708409- 84148416-84258430-84528461-84648487-84888502- 8503850685098512-85138537-85428607-86098612- 86168688-86908698-87008741-87508754-87578760- 87668917-89188986-89918998-90089018-90209031- 90369043-90459063-90679072-90749196-92009306- 93119321-93239337-93389380-93819406-94079445 9456-94589475-94769487-94889510-95189522-9534 9561-956295849617-962696599664-96659726-9727 9729-97319739-97449763-97679782-97839792-9793 9808-981298269828-98329851-98529854-98559857- 98619874-98959898-99089916-99209923-99249927- 99359956-99579965-99929997-1000910017-10019 10025-1002610161-1016310167-1017210268-10274 10277103061031110320-1032310326-1032810449- 1045710484-1048710498-1050310508-1051210520- 1052210531-1053210543-1054910551-1055410588- 1059410603-1060610615-1062310627-1063010638- 1064410873-1087510891-1089310895-1090010904- 1090610913-1091410938109531097211071-11075 1108111086-1111511123-111241113511208-11209 11226-1122711295-1129611302-1130311306-11311 11314-113151131711332-1133311339-1135011359- 1136911372-1139611406-1140711423-1142611431- 1143211467-1147411476-1147811544-1154611561- 1156211582-115831160911640-1165011660-11661 11669-1167111674-116781169611711-1171211731- 1173411739-1174011780-1178211803-1180411811- 1183211835-11836118401184211873-1187512023- 1202612040-1204112050-1205612121-1212612133- 121371214012150-1216912186-1219612202-12205 12258-1226212280-122821233612361-1236212374- 1237712398-1240112421 12446-1245012455-12459 12470-12471 12475-1247612519-1254012545-12559 12590-1259212601-1261412619-126221263712642- 126441264612653-1265812666-1266712701-12705 127231272912755-1275912785-1278712796-12798 12803-1280612824-1282712829-1283012835-12843 12872-12881 12978-1298112997-1299913077-13079 1351213556-1355913565-1356813571-1357313592- 1359513598-1360013603-1360513609-1361213638- 13641 13645-13651 13754-1375513796-1379713898 Tissue RNA Library SEQ ID NOS: origin Source Name
13918 1391913925 -1393013954- •1395613994-13997
1400414044-1404514054-1405614105-1410614137-
1413814173-1417414209-1421514228-1423514264-
1426514283-1428814304-143051434614355-14369
1454614549145571460414607-1460914650-14651
14666 -1466814681 -1468214684- 1468514714-14717
14784 14822-1483814871-1487415019-1502415182-
15183 15193-1520015243-1524815250-1525115257
15290 -152911530515321-1532415326-1532815336-
15340 15356-1535915406-1540715412-1541415454-
15456 15530-1553115545-1554715558-1556015568-
15572 15576-1557715588-1558915600-1560115623-
15626 15686-1568715699-1570015855-1585715861-
15862 15867-158681588015888-1588915988-15990
16044 -1604616053 1605816060- 1606116075-16079
16082 16096-1609716110-1611216141-1614316163-
16164 16168-1617116206-1620816219-1622016226-
16227 16233-1623516284-1629516339-1634116350-
16353 1638016434- 1643916443-1644516451-16452
16456 -1646916473-1647816498-1650016508-16511
16536 -165381654516576-165771660016609-16611
16623 16636-1663716642-1664816652-1665516675-
16682 16686-1669316704-1670516752-1675316828-
16831 16833-1684216851-1685316860-1687016878-
16881 16894-1689616911-1691516934-1693716974
17024 -1702817046-•1705717082-1708417105-17106
17114 17117-1711817160-1716717264-1726717286-
17291 17330-1733217335-1733917454-1745617791-
17799 17857-1786117924-1792617930-1793117958-
17962 17969-1799418015-1801718029-1803018055-
18060 18097-1813418136-1813818371-1837218400-
18402 18407-1841018412-1841818421-1842218490-
18501 18516-185181852718533-1853418536-18537
18583 ■185861861718624-1862818633-1863718644-
18650 18661-1866218668-1867718680-1868118689-
18693 18717-1872618730-1873218745-1876918771-
18776 18778-1878018787-1878918796-1881318822-
18828 18839-1885318856-1888818893-1889618899-
18905 18908-1891018925-189391894618964-18969
18975- ■1897719001- 190091901819029-1903519045-
19053 1905519062-1906519068- 1907319081-19083
19090 19106-1911819138-1914019142-1914819153
19207- 1921019213- 1922019228- 1922919251-19252
19257 19266-192671927419283- 1929419306-19312
19316- ■1931719345- 1936019362- 193641937019372-
19374 19380-1940219407-1941119413-1941719422-
19434 19438-1944419454-1945519461-1946319502
19512- 1952119530- 1953119560- 1957319598-19601
19604- 1960719617- 196191962719656-1965719659-
19661 19667-1966819670-1967319679-1968719693
19698- 1969919706-1970819727- 1974819759-19763
19800- 1980819813-1982219915- 1991819921-19929
19933- 1994619948- 994919953- 1996319965-19980
20015- 2001720026-2002720029-2004320048-20063
20072- 20079200872009420099-2010220106-20112 Tissue RNA Library SEQ ID NOS: origin Source Name
20120 20122-20127 2013020137 2014820151-20154
20157-2016620172■2017920182-2019420200-20214
20231 -202342024120244-2024920257-2026220265-
2027020274-2027820281-2028220286-2028920299
20328-2033020345-203542036020366-2036820393-
20405204112041420418-2045120456-2046820472-
2047420476-2050320510-2052020524-2053520537-
2054120548-2055320559-2056820575-2057820599-
2060620614-2061920625-2062820631-2063420644-
2065220666-2067420676-2068020683-2068720698-
2070720718-2073420752-2075320758-2076720789-
2079720816-2082120824-2084920853-2086320866-
2087020879-2088420888-2088920897-2090020922-
2093220938-2094320948-2094920952-2095420957-
2097620999-2100821027-2103121060-2106621069-
2108721097-2110021122-2112721145-2115121153-
2116921171-2117421176-2118021194-2119821202-
2120721225-2122621233-2123521241-2125221256-
2126621272-2127421277-2128021284-2129421297-
2129821301-213032133421351--2135521359-21360
21377--2140221434--2143921447--2145021458-21461
21463--2146521467--2146921480--2148221485-21500
21529--2153021554--2155621587--2165521729-21733
21744--2174721881--2188521891 -2189421915-21921
2192421955-2196821973-219742197821983-21987
21995--2201522019--2202522042--2204422056-22076
2208922092-2209422115-2212722135-2213822141-
2214322158-2216422187-2219222195-2219822204-
2221122218-222262223022246-•2224922253-22264
22277--2230222310--2232822336--2234622358-22359
22365--2237322377--2238022383--2238822399-22410
22440--2244822455--224652249522531-2253322571-
2258122607-2260922634-2265122653-2265422661-
226642269022696-■2269722700-•2270122703-22707
22726--2272722732--2273522760--2276422794-22796
22805--2281622821--2282722833--2283722843-22849
22852--2285322875--2287622881--2289122904-22906
22925-■2293822952--2295722974--2297722982-22986
22991-2299423007--2302123046--2305023059-23060
23064-2306523070--2307123074--2307623098-23101
23125-•231362314123201-2320223206-2320923212-
2321523218-2322123236-2323923242-2324423249
2325123261-2326223321-2332723343-2334423379-
233902339923415-2341823425-:234272343323438-
2345023473-2347423486-2348723489-2349023492-
2349323495-2349623506-2351323515-2351823543-
2354423555-235652359923673-2367423682-23686
236902370423723 2373023760- 2376123806-23809
23827-•2383323856■2386023878 ■2388023890-23892
23906-2391023941 2395623999 ■2400124014-24018
24021-•2402524035 ■24040240562411624126-24129
25085-■2509025289■2529025316 •2532125323-25325
25336-■2533725340 ■2534125355-2535825370-25377
25416-2541725436■2544025598 •2560326019-26028
26196-2619926205 ■2621326237- 262432625226260
Figure imgf000203_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
20559-2056220616-2061920631 -2063420676-20680 20801-2080520855-2086220938-2094220948-20949 20957-2096220977-2098421067■2107121123-21127 21145-2114821225-2122621241 -2124721277-21280 21351-2135221377-2139721467•2146921647-21655 21892-2189421915-2191622003 -2200622077-22079 22101-221072216022169-2217522187-2219222250 22253-2225522303-2230822343-2234622366-22371 22405-2240822534-2253922661-226622269022794- 2279622843-2284522881-2288622991-2299423070 23343-2334423402-234082341923439-2345023488 23495-2349623508-2351123856--238602388223906- 2391024014-2401826237-26239262662628426327 26361-2636526469-2647026796-■267982689627052 27074-2707727193-272002721827261-2726527492 27544-2754527551-2755427562-■2756327648-27654 27989-2799028142-2814528263-■282682842428573- 2857829278-2928329343-2934530150-30156 melanoma Clontech MEL004 10-143048-4951-52164366395576-578635697-699 from cell 716-717785-786798-803832-8369671044-10451071 line ATCC 1440-14471488-14891705-17141778-17791912-1913 #CRL 2275-2298230625492692-26942699-27002950-2956
1424 2967-29713078-30793151-31543226-32283255-3259 3336-33373738-37393856-38573949-39534515-4516 4600-46014623-46394819-48235109-51155335-5336 5569-557158025805-58075932-59365980-59846166- 616863776491-65106814-68316857-68587536-7541 7630-76367681-768777717778-778677947972-7975 7995-79978145-81518301-83068332-83338380-8390 8409-84148461-84648543-85508601-86038762-8766 8998-90019040-904290689337-93389340-93429360- 936194459529-953096599735-9738982698289862- 98689927-99289989-99921030610449-1045010488- 1048910508-1051210520-1052210615-1062310876 1115811419-1142211473-114741160911731-11734 11766-1176811780-1178211805-1180811910-11911 11921-1193012127-121281214012190-1219112206- 1220812547-1255412560-1256112581-1258212585- 1258812615-1261812824-1282712978-1298113077- 1307913589-1359113629-1363113638-13641 13954- 1395614137-1413814170-141721460414607-14609 14628-146351464914681-1468214694-1470115222- 1522415290-1529115329-15331 15358-1535915412- 1541415576-1557715699-1570015863-1586615881- 1588316075-1607916121-16122161351615616623 166421665216683-1668516704-1670516860-16870 17070-170731724217297-1730617330-1733217454 18015-1801618097-1813418405-1840618411 18533- 1853418583-1858418761 18777-187801878918825- 1882818857-1888018923-1892418947-1895018964- 189691897519045-190481905519062-1906519081- 1908319096-19101 19106-19118191481925319297 19316-1931719345-19350193621937019373-19374 19385-1938919422-1943119442-1944419515-19521 1969319709-1971019727-1973519768-19771 19804- Tissue RNA Library SEQ ID NOS: origin Source Name
1980819813 -1981519855-19856 19921-1992519969- 1998020029 -2004320087-2009420099-2010220106- 20110201202013020146-2014820172-2017920235- 2024120265-202702028920418--2043120511-20513 20537-2054120607-2061220625-2062820646-20648 2068820698--2070720733-2073420752-2075320788- 2079120816--2081820824-2082620836-2084320858- 2086220888-■2088920929-209322094320948-20949 20963-2097220977-2098421005--2100821024-21025 21062-2106621069-2107121076--2108721149-21151 21202-2120721275-212802144621554-2155621587- 2165521744--2174721881-2188521892-2189421903- 2190421922-•2192321951-219542197822016-22018 22042-2204422062-2207322087--2208822117-22119 22152-221562216022169-2217022178-2218422187- 2219222195-2219822225-2222622231-2223522261- 2226422271-■2227622300-223022231822343-22346 2237322434-224352249522607-•2260922632-22633 22637-2265122661-2266222700--227012270722760- 2276222948-•2295022974-2297623046-2305023053- 230552306323138-2314123206-2320923220-23222 23252-2325623261-2326223358-•2336023379-23385 23392-233952343323448-234502348623682-23686 23711-237192376123798-2379923827-2383323893- 2390225331-■2533325374-2537525583-2559326209- 2621326316-263212632726361-2636526469-26470 26596-2660126860-2686226980--269842703527193- 2720027245-•2724627254-2726027299-2730127439- 2744027586-2758727698-2769927814-2781527825- 2782627861-2786427896-2792727983-2798528040- 2804128099-2810028142-2814528204-2820928732- 2874229328-2933729339-293402935829427-29431 29746-2976130141-30142 mammary Invitroge MMG001 167170-171180-181255-256260319- -325358-359395 gland n 398-399406-407419-420425-427461 -465534-535582- 584604619-620706-707754-757776785-786826-827 832-837847-848852-855900-904912969-971981-985 1001-10041035-1037107110781082- 10851092-1094 1097-10981123-11271131-11331196--119812421273- 12791346-13471432-14351470147814831486-1492 1498-14991578-15851703-17191721--17261728-1731 1866-187219021925-19271939-19401961-19671980- 19822015-201820302055-20582284-22992306-2307 2382-23882410-24132471-24742540-■25412577-2584 2599-26032663-26652675-26782776-■27782820-2821 2874-28762886-28872889-28912903-•29042923-2924 2934-29352951-29553078-30793205-■32073261-3263 3276-327733123322-33233326-33313374-33773452- 34533459-34603477-34783483-34843507-35103512- 35143526-35293531-35343550-355836113649-3653 3686-36913729-37303768-37693781-37843789-3794 3834-38393873-38753949-39534091-40954129-4130 4141_41434160-41624206-42084404-44054515-4516 4526-45274531-45334562-45734600-46014623-4639 4644-46504686-46894785-47894795-48154857-4861 Tissue RNA Library SEQ ID NOS: origin Source Name
4980-4981505050735109-511551175147-51485291 - 52945335-534153715480-54825530-55315572-5573 5581-55835594-56025695-569857095717-57185803- 58045829-58455899-59025919-59215960-59645973- 59755980-59846085-60896140-61436166-61686184- 61856235-62376389-63926403-64046450-64516488- 64896638-663966456650-66556879-68817209-7212 7216-72197238-72407296-730173747421-74237534- 75357604-76057615-76177620-76217660-76657678- 76967745-77517787-77887795-78047815-78187821 7896-78977902-79157921-79257930-79327972-7975 7978-798579887990-79927994-799780028006-8007 8010-801280208071-80788084-80858090-80938106- 81108113-81148129-81368152-81598187-82008204- 82138227-82328247-82488253-826283108313-8314 8323-83258351-836483708383-83898404-84068434- 84488461-84648502-85058509-85158554-85558576- 85828607-86098689-86908760-87618771-87768869 8917-89338936-89388998-90019007-90089013-9017 90289040-90459072-90749196-92009264-92659305- 931193169337-93389340-93429366-936893829427- 94289456-94589472-94769506-95169522-95259529- 95319563-956695859594-96029617-962696599664- 966596769726-97279733-97389759-97679792-9795 9815-9820982698289833-98349841-98429857-9861 9864-98669874-98959899-99049909-99129916-9920 9927-99359965-99839989-999210011-1001610032 10161-1017210254-1026110263-1026410268-10272 10278103061031110326-1032810451-1045310480- 1048210491-1049310498-1050310519-1053010543- 1054910603-1060610612-1061410624-1062610638- 1063910820-1082110870-1087210881-1088310886- 1088910899-1090010913-1091810931-1093310936- 1093810957-1095810980-1098511006-1100811010 11022-1102311032-110341108111217-1121911345- 1135011372-1139611406-1140711423-1142611431- 1143211465-1146611505-1150611538-1154011544- 1154511582-1160811644-1164811660-1166111711- 1171211731-1173411761-1176311780-1178211803- 1180811835-118361184011843-1185911873-11875 11890-1189111901-1190511937-1195012050-12057 12131-121321214012150-1215112175-1217612190- 1219612202-1220812241-1224412309-1231112362 12378-1239712430-1243112470-12471 12480-12482 12637126401264612673-1267512701-1270612714- 1272312755-1275912829-1283012872-1288112943 12997-1299913505-1351113552-1355513565-13566 13592-1359513629-1363113638-136411366513752- 1375313896-1389713908-1391013918-1391913925- 1392613935-1393613999-1400414044-1405314058- 140591410414118-1412214175-1418114192-14195 14199-14201 1420514228-1423514274-1428014344- 1434614360-1436914452-1445314546-1455514604 1462614628-1463514640-1464214650-14651 14666- 146681468314714-1471714784-1478614789-14791 Tissue RNA Library SEQ ID NOS: origin Source Name
14981-1498315019-15024150701509615126-15129 15182-1518315187-1518915218-1521915222-15224 15232-1523615243-1524815257-1525915268-15270 15290-15291 15294-1529715326-1532815341-15342 15412-1541415460-1546115476-1547815491-15495 15568-1557215587-1558915661-1566315679-15685 15688-1569415855-1585715863-158681588015896- 159001590815991-1599616129-161321616716174- 1617616233-1623516359-1636116365-1636616388- 1639116395-1639616438-1643916470-1647216479- 1648116496-164971654516558-1656216602-16608 16619-16621 1662316642-1664816702-1670316707- 1670816710-1671216716-1672316761-1676716828- 1683116836-1684216851-1685816894-1689616934- 1693717026-1702817046-170571710417238-17243 17249-1725017284-1728517292-1730617330-17332 17335-1733917394-1739517454-1745617791-17799 17956-1796218029-1803718136-1813818269-18270 18371-1837218376-1837718412-1841818516-18518 18536-185371854018579-1858118585-1858618624 18644-1865818668-1867018673-1867718717-18718 18730-1873218750-1875818771-1877218778-18780 18787-187931879618806-1880718839-1885318856 18899-189031891018925-1893318947-1895018955- 18959189751899619005-1900919012-1901719019- 190241905519057-1906119081-1908319096-19101 1915119209-1922019251-1925219266-1926719274 19283-1929619302-1930319306-1931219316-19317 19345-1936019362-1937019375-1938419390-19402 19407-1941119415-1941719422-1943419441-19444 19458-1946019464-1946619484-1948719494-19496 19515-1952119526-195291953619567-1957319586 19598-1960119617-196191962719659-1966119663- 1966819670-1967319683-1968719700-1970519727- 1975119759-197671977219800-198031980919813- 1981819855-1985619926-1992919933-1994619950 19962-1996319965-1996719972-1998120044-20047 20066-2006820099-2010220120-2012120128-20130 20133-2015420157-2016420167-2017120189-20194 20200-2020720231-2024120244-2025220257-20262 20274-2028020286-2028720289-202962029920305- 2030720321-2032420328-203302033920349-20354 2036120363-2036820393-2040020406-2040720411- 2041320415-2041720432-2046820472-2047420485- 204902050120505-2050920514-2053520537-20541 2054620548-2055720559-2057420607-2061220620 20625-2062820631-2063420646-2067120676-20681 20686-2068720698-2070720727-207322075220758- 2076620768-2079120798-2080020806-2081220822- 208432086320868-2087120879-2088120885-20900 20927-209282094320948-2094920952-2095420957- 2096220977-2098420999-210082101521021-21023 21026-2104521060-2106621072-2107521122-21127 21133-2113521137-2114021155-2116921171-21174 21176-2118021197-2119821208-2121021213-21215 Tissue RNA Library SEQ ID NOS: origin Source Name
21223-212262123621248-2125221263-2126621272- 2127421284-2129421297-2129821301-2130321334 21353-2135421377-2139721408-2140921428-21433 21440-2144521447-2145321467-2146921480-21482 21485-2149121495-2150021554-2155621712-21717 21788-2179921877-2188021899-2190221911-21912 21917-2192121925-2192821936-2193821958-21966 21983-2198721995-2199922003-2203422040-22041 2204722062-2206922074-2207622080-2208622090- 2209422101-2210722115-2211622141-2214322152- 2215622160-2216822193-2219422204-2221122218- 2222622256-2228922303-223082231822323-22328 22333-2233522343-2235022357-2235922362-22365 22377-2238022405-2240822434-2243522440-22448 22455-224652249522531-2253922553-2255922571- 2258122607-2260922622-2262822644-2265122661- 2266422669-2267422700-2270122730-2273122741- 2274222760-2276222805-2281622821-2282722838- 2284222846-2284922852-2285322870-2287422881- 2288622907-2290822925-2292622940-2294722952- 2295322969-229712297322999-2302123046-23052 2306323066-230672307023072-2307623080-23086 23098-2310123201-2320223212-232152322223229- 2323323237-232392324723252-2325623258-23263 23279-23281233282334123356-2337023375-23377 23379-2338123386-2339023392-2339523409-23411 2341923433-2343823473-2347923508-2350923514 23543-2354423555-2356523673-236862369023709 23720-2372123726-2373623760-2376123771-23797 23802-2380523827-2383323839-2384223845-23848 23867-2387523878-238802388223890-2390223990- 2399223999-2400124005-2401124035-2404024056 24120-2412524402-2442524772-2477425017-25026 25085-2509025279-2528825293-2529725300-25304 2530625319-2532125323-2532525331-2533325336- 2533725340-2534125366-2536725373-2537725383- 2540125403-2540725416-2541725542-2555026024- 2602826140-2614326175-2618426195-2619926205- 262062626626285-2629026305-2630626324-26325 263272633726339-2634426350-2635226359-26360 26370-2637426395-2639626421-2642226455-26459 26469-2647026479-2654526684267372674226807- 268122682826860-2686226876-2688126985-26987 27040-2704327050-2705127074-270772709127100- 2710227129-2713227150-2715127156-2715727180- 2718327193-272002721827229-2723227269-27270 27354-2737527439-274432745527458-2746227510- 2751127520-2752127544-2754727586-2758727600- 2760627633-2763427636-2764127649-2765427662- 2767227719-2772027726-2773927755-2775627799- 2780227816-2781827848-2785227861-2786427890- 2789227896-279272797027981-2798227995-27997 28099-2810028105-2810728142-281452818628192- 2820328212-2822128233-282612828628365-28370 28416-284172842428426-2842828549-2855529278- Tissue RNA Library SEQ ID NOS: origin Source Name
2928329288-2928929328-2933729354-2935829364- 2936629409-2941729427-2943129508-2951829594- 2960429611-2961329718-2972129746-2976129960- 299613006130085-3008730141-3014230150-30156 30189-3019930221-3023230243-3025030325-30327 induced Stratagen NTD001 188-191289-291576-578617-618716-71710301034 neuron e 1097-109816462540-25412599-26032675-26782724- cells 27262871-287329733326-33313374-33763649-3653 3908-391039303949-39534165-41704486-44874515- 45164581-45824984-49855053-50545272-52755335- 53365777-577858025919-59215980-59846403-6404 6787-67906795-68007170-71727258-72637325-7343 73637368-73697557-75597688-76897693-76947701- 77037745-77467778-77837906-79127990-79928187- 820082508497-84998689-86908758-87598998-9001 9029-90309040-90429047-904890879321-93239559- 95609829-98329909-99129947-99529993-999610477- 104781049410531-1053210592-1059410615-10623 10842-1086710980-109851104511228-1122911314- 113151140511431-1143211541-115431154611609 11700-1170711739-1174011803-1180811886-11891 11941-1194412131-1213212241-1224412258-12262 12898-1289912902-1290512997-1299913592-13595 13609-1361213652-1365914304-1430515182-15183 15190-15191 15290-1529115588-1558915969-15974 16028-1602916180-161811654516619-1662116642 17292-1729617401-174321743517455-1745618029- 1803018097-1813418300-1830718400-1840218412- 1841818691-1869218771-1877218796-1880118839- 1884118846-1885318899-1890318989-1899019001- 190041901219074-1908019106-1911819207-19208 19256-1925719266-1926719306-1930719316-19317 1934319355-1936019387-1938919447-1944819458- 1946019488-194931956619598-1960119617-19619 19659-1966119736-1974219804-198081981319939 19972-1998020029-2004320099-2010220106-20110 20114-2011920122-2012720182-2018820208-20218 20485-2049020521-2052320607-206122068120698- 2070720827-2083520853-208542087121105-21111 21248-2127121275-2128021284-2129421463-21465 21495-2150021587-2164621929-2193522020-22025 22045-2204622070-2207322141-221432216022187- 2219222195-2219822243-2224522358-2235922365- 2237122381-223882243322653-2265422671-22674 2269022916-229222297723201-232022325123358- 2336023420-2342123700-2370123798-2379923806- 2380923890-2389224749-2475424928-2493824985- 2498825029-2504025279-2528825376-2537926024- 2602826205-2620626209-26213262662628026310- 263142632726361-2636526678-266802709127100- 2710227269-2727027446-2744927522-2753927544- 2754527729-2773927861-2786427896-2792727989- 2799028315-2831628361-2836429117-2912329328- 2933729343-2934529418-2941929426-2943129885- 2989130167-301773035230361-30368 Tissue RNA Library SEQ ID NOS: origin Source Name retinoid Stratagen NTR001 2599-260344504581-45825038-50395253-52665335- acid e 53367695-76967787-77888537-85388612-86169049- induced 90629563-956612755-1275912978-1298113638-13641 neuronal 15530-15531 15588-1558916110-161121654516652 cells 16836-1684217238-1724117335-173391779018673- 186771879618825-1882818936-189391897519152 19306-1930719363-193641937019387-1938919759- 1976319962-1996320328-2033020437-2044020554- 2055721208-2121021277-2128021467-2146921587- 2164621881-2188522020-2202522187-2219222265- 2227023261-2326223771-2378023843-2384425306 2537326205-2620626237-262392632726860-26862 27156-2715728142-2814528426-2842829328-29331 neuronal Stratagen NTU001 240-250373-374425-427576-578847-8481388-1401 cells e 1432-1435147014991778-177923062599-26032944 3040-30763107-31083326-33313421-34223477-3478 3483-34843789-37943912-39154055-40584171-4172 4206-42084515-45164562-45684581-45824785-4789 5186-51895253-52665272-52755279-52835525-5526 5943-59456135-61366403-64047264-72667346-7352 7784-77867815-781882038227-822984658497-8499 8716-87188998-90049007-90089038-90399049-9068 9472-94749479-94819535-95379594-96029735-9738 9929-99359969-998010161-1016310167-1019610516- 1051810615-1062310873-1087510915-1091811308- 1131011334-1133511775-117761184012150-12151 1225812590-1259212653-1265412716-1271912997- 1299913552-1355513638-13641 13847-1384914038- 1404114044-1404514137-1413814277-1428014640- 1464214814-1481515025-1506915100-1510915277- 1527815408-1541115530-1553115563-1556415576- 1557715588-1558915863-1587016141-1614316174- 1617616182-1618916545166421665216836-16842 16851-1685317284-1728517435-1744017451-17454 17958-1796218029-180301804318097-1813418500- 18501 18562-1857618671-1867218796-1880118825- 1882818857-1888018925-189331897518993-18995 19049-190531915319298-1930019306-1930719316- 1931719351-193541937019375-1937919395-19400 19415-1941719432-194341951119515-1952119526- 1952919564-195661962519659-1966119683-19687 19813-1981519855-198561993819940-1994219965- 1996719972- 1998020026-2002720099-2010220106- 2011020161-2016420200-20207202412028920336- 2033820406-2040720437-2044020485-2049020521- 2052320537-2054120602-2061220631-2063420639- 2064020646-2064820666-2067120792-2079720827- 208432087120897-2090020924-2092520927-20928 20938-2094220957-2096221032-2104521208-21210 21241 -2124721256-2126221272-2128321295-21296 21554-2155621587-2165521899-2190421936-21938 21951 -2195421958-2196622007-2201522040-22041 22047-2204922070-2207322152-2215622165-22168 22171 -2217522218-2222422243-2224522253-22255 22292-2229922312-2231322343-2234622372-22373
Figure imgf000211_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
952195319561-95629617-96269680-96819763-9767 982810309-1031110470-1047310588-1059410628- 1063010820-1082110901-1090210941-1094310965 11496-114981154611780-117821180911835-11836 1184011951-1196912027-1203912171-1217312194- 121961225812615-126181264012997-1299913576- 1357913592-1359713609-1361213843-1384414058- 1405914092-1409314130-14131 14166-1416814389 1460414684-1468514821 15182-1518315190-15191 1543815454-154561547915699-1570015855-15857 15863-1586615890-1589515961-1596315988-15990 16053-1605816168-1617116392-163941664216773- 1678016851-1685316949-169531711417330-17332 1745417958-1796218015-1801618500-1850118516- 1851818533-1853418625-1862618655-1865818671- 1867218719-1872618762-187661879618802-18804 18857-1888018899-1890319059-1906519081-19083 1909019135-1913719154-19155192741937219415- 1941719442-1944419683-196871969319700-19705 1977219814-1981519921-1992519943-1994619972- 1998020048-200522013020161-2016420172-20179 20182-2018820265-202702028920305-2030720339 20357-2035920393-2040020537-205412055820575- 2057820625-2063020644-2064520698-2070720753 20758-2076720792-2079720806-2081220819-20821 2088120948-2094920957-2096221016-2102321069- 2107121101-2110421225-2122621237-2124021284- 212942142721495-2150021529-2153021647-21655 21892-2189421905-2191021973-219742197822026- 2202922040-2204122090-2209422212-2221722230 22284-222892231822336-2234222347-2234822358- 223642257022622-2262422634-2263622653-22654 22700-2270122768-2277022857-2285822899-22900 22955-229612297322991-229942306323070-23071 23098-2310123218-2321923252-2325623364-23370 23415-2341823473-2347423519-2352023700-23701 23704237612377023802-2380523827-2383324005- 2401124772-247742531325340-2534125374-25375 25416-2541726339-2634026657-2665926755-26756 26804-2680526850-2685426873-2687527100-27102 2721827463-2746627468-2747527655-2765627700- 2770127816-2781827825-2782627861-2786427989- 2799028108-2812128204-2820928426-2842829278- 2928329432-2943529960-2996130085-3008730150- 3015630184-3019430236-30242 rectum Invitroge REC001 180-181534-535540-5421681-16871705-17141721- n 17221745- 17462272-22742675-26782695-26982813- 28463326-33313555-35583687-36913717-37203834- 38353949-39534692-46954857-486152465337-5341 5572-55735802-58045919-59216209-62116400-6404 6406-64107209-72127426-744276097747-77517757- 77597778-77838074-80788323-83258329-83348453- 84548741-87508986-89919029-90309043-90459305 9375-93769391-93929456-94589531958598289921- 99229986-998810263-10264102771030610480-10482
21 I Tissue RNA Library SEQ ID NOS: origin Source Name
10491-1049310498-1050310523-1053010907-10910 10919-1092211006-1100811372-1139611399-11402 11660-1166111835-1183611886-118891214012150- 12151123621243512455-1245912472-1247312571- 1257212642-126441272312755-1275912788-12790 12824-1282712872-1288113576-1357913601-13602 13629-1363113638-1364114546-145491460414636- 146421509615233-1523615336-1534015623-15626 15855-1585716716-1672317010-1701417031-17035 17038-1704117956-1796218374-1837518395-18397 1862418651-1865418717-1872618757-1875818767- 18769187891879618944-18945191321922419257 19283-1929419306-1930719380-1938419390-19394 19401-1940219407-1941119422-1943419454-19455 19488-194961962519733-1973519743-1975119814- 1981519972-1998020122-201272013020137-20145 20189-2019720235-2024020274-2027820297-20298 20300-2030720321-2032420328-2033020441-20451 20456-2047120476-2048120524-2053520569-20574 20625-2062820758-2076620806-2081220827-20835 20858-208622087120882-2088420952-2095421005- 2100821021-2102321145-2114821171-2117421229- 2123221256-2126221297-2129821351-2135221428- 2143321440-2144521447-2145021467-2146921480- 2148221492-2149421877-2188021903-2190422003- 2201522020-2202922050-2205522062-2206922092- 2209422117-2211922200-2220722261-2226422277- 2228322292-2229922349-2235022455-2246522495 22571-2258122622-2262422663-2266422760-22762 22781-2278222838-2284222974-2297723074-23076 2308423229-2323323252-2325623392-2339523433 23543-2354423599-23624236902376123806-23809 23878-2388024021-240242405624092-2409724120- 24125246652530625347-2535825966-2596826347- 2634826359-2636026421-2642226448-2645126455- 2645926855-2685726879-268812709127129-27132 27193-2720027245-2724627269-2727027304-27305 27510-2751127544-2754527799-2780227814-27815 27869-2788627888-278922798328199-2820328232 28255-2826128511-2851329278-2928329309-29314 salivary Clontech SAL001 260307-308331551-552832-836969-971981-985 gland 1312-13131721-17221810-1811207223032306-2307 2599-260328472850-28603151-315436573723-3728 37373840-38523949-39534515-45164531-45334555- 45564581-45824857-48614971-49745269-52715525- 55265652-5653565857006337-63386411-64126442- 64496762-67647452-74607678-76877701-77037745- 77467778-7783780579888145-81518187-82008337- 83428383-83898554-85558986-89919018-90209038- 90399427-942895319535-95379782-978398289899- 99019923-99249997-100091030610531-1053210607- 10611 108761100911123-111241160911644-11648 11669-11671 11731-1173411835-1183612040-12041 12175-1217612202-1220512229-122301236212434 12468-124691247412565-1256812573-1257412642- Tissue RNA Library SEQ ID NOS: origin Source Name
1264412701-12705 1272312794- -1279512803-12804 12872-12881 12984 1298512997-1299913077-13079 13545-1354813592 -1359513638-13641 13937-13938 14132-1413614546 1454914552-145551460414650- 14651 14714-14717 15003-1500815294-1529715542 15545-15546155481555015614-1561615884-15885 16168-161711662316636-1663716653-1665516716- 1672316851-1685317058-1706617094-1710017255- 1725617330-1733217489-1749018010-1801218412- 184181852718624- 1862618673--1867718682-18688 18691-1869218717-1871818723-187261878918796 18829-1883418934-1893518964-189691897518993- 1899519096-191011913419209--1921019283-19294 19308-1930919350-1935419362-1936419380-19384 19401-194021969919706-1970819749-1975119759- 1976319855-1985619921-1992320111-2011220120 20146-2014820244-2024920288-2028920316-20320 20325-20335204142050120710-•2071120752-20753 20758-207662078820792-2079720813-2081520858- 2086320902-209212094320952--209542101521248- 2125221281-2129421403-2140421451-2145321529- 2153021532-2153321647-2165521955-2195721978 22026-2202922077-22079220972216022208-22211 2223022314-2231722323-2232822343-2234622358- 223592237322440-224482249522571-2258122690 22737-2273922838 -2284222923--2292422980-22981 22991-2299423059-2306023066--230672307023074- 2307623212-2321523235-2323623251-2325623524- 2352523726-237302376123878-•2388023911-23940 2405625085-250902530625323-■2532525340-25341 25370-2537325383--2540126024--2602826341-26344 26373-263742642326596-2660126844-2684527067- 2707027281-2728227304-2730527740-2774327814- 2781527861-2786427893-2789528105-2810728192- 2819828212-282162823328255-2826128311-28313 2842429278-2928329332-2933729940-2994930150- 3015630233-30235
Salivary Clontech SALs03 981-9858698-870011538-1154014546-1454920316 gland 27814-2781527989-2799028424 skin ATCC SFB001 1307 3374-3376 6285-6288 6791-6794 10306 12258 fibroblast 17026-17028 18029-18030 1901 1 19939 19972-19980 20182-20188 22141-22143 22160 22495 23415-23418 28424 30150-30156 skin ATCC SFB002 29265805-58076166-6168103061225817026-17028 fibroblast 17038-17041 17455-1745618029-1803019011 19548- 195531981321060-2106122141-221432216022373 2249522531-225332687927636-276392842430150- 30156 skin ATCC SFB003 5803-5804 6166-6168 12258 17335-17339 18029- 18030 fibroblast 18778-18780 19062-19065 19548-19553 20182-20188 22007-22015 23051-23052 23419 25340-25341 27269- 27270 27814-27815 28424 30150-30156 small Clontech SIN001 83 87-94 195-200 307-308 332 373-374 557-559 674-675 intestine 783-784 852-855 901 -904 1071 1240-1241 1470 1678- 1680 1755-1762 1764-1766 1769-1772 2030 2048 2089-
2 i : Tissue RNA Library SEQ ID NOS: origin Source Name
21002337-233824682599-26032871-28733006-3008 3151-31543461-34633487-35033661-366336754137- 41394531-4533454945524640-464346515279-5283 5744-57475980-59846095-60996103-61046155-6159 6186-61896235-623764846982-69847267-72687407- 74147576-75777681-76877778-77867798-78017906- 79127972-79758074-807681628211-82138247-8248 83708539-85428762-87668917-89189196-92009456- 94589475-947694789529-953098289905-99089925- 99269997-100091049010508-1051210592-10594 10624-106261093010936-10937109651102111505- 1150611538-1154011612-1163911772-1177411886- 1188911977-1197812027-1204112642-1264412814- 1282212872-12881 12978-1298112997-1299913087- 1308913592-1359513999-1400314090-1409114283- 1428814546-145511460414663-1466515182-15183 15258-1525915294-1529715326-1533115336-15340 15532-1553515869-1587015961-1596316096-16097 16163-1616416601-166081662316702-1670316724- 1672516836-168421689916918-1691917268-17274 17366-1736817455-1745617489-1749017525-17527 17958-1796218020-1802218421-184221852718655- 1865818719-1872218757-187581876118842-18845 1915119283-1929619306-19307193501937019387- 1938919401-1940219422-1943119488-1949619617- 196191962719663-1966619733-1973519749-19751 19814-1981519924-1992519972-1998020026-20027 20111-2011220151-2015420165-2016620231-20234 20265-2027020343-2034420349-2035420425-20426 20432-2043620447-2045120469-2047120491-20494 20514-2053520548-2055320575-2057820646-20648 20666-2067120698-2070720714-2071720747-20751 20758-2076620768-207872092620938-2094220952- 2095421027-2103121069-2107121097-2110021176- 2118021377-2139721647-2165521744-2174721925- 2192821948-2195021967-2196821971-2197422077- 2207922117-2211922152-2215622200-2220322218- 2222422323-2232822343-223462236522399-22404 22700-2270122763-2276422838-2284222870-22874 22881-2288622969-22970229982304623074-23076 2323623252-2325623373-2337823386-2339023434- 2343723543-2354423806-2380923890-2389224056 25085-250902530625338-2533925383-2540125705- 2571526196-2619926237-262392625226361-26365 2687926893-2689527304-2730527476-2747727497 27600-2760127729-2773927853-2785627861-27864 27940-2794227989-2799028199-2820328468-28484 29309-293142942630141-30142 skeletal Clontech SKM001 170-171255-256912981-9851259-12621280-1282 muscle 1436-14371483268329722975-29773661-36633740 4091-40954338-43394388-43904515-45164562-4568 4708-47115525-55265581-55835591-55926671-6672 6773-67747681-76877784-77867806-78087815-7818 7902-79037972-79757990-79928216-82188351-8360 8502-85038512-85138554-85559100-91449327-9328
Figure imgf000216_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
1220812237-1223912259-1226212337-1236112430- 12431 12477-1247912555-125591263713077-13079 13592-1359513796-1379714032-1403314048-14053 14228-1423814264-1426514289-1429714445-14447 1460414794-1479515133-1514015182-1518315290- 15291 15360-1538015543-1554415561-1556215867- 1586815884-1588516062-1606516115-1611716418- 1642116496-1649716579-1658216724-1672516783- 1678616851-1685317074-1707717330-1733217335- 1733917441-1745017454-1745617940-1794217963 1800418029-1803018044-1804518055-1806018219- 1822318371-1837218423-1842518490-1849118516- 185181852718673-1867718719-1872218761 18796- 1880118811-1881318894-1889618906-1890718964- 1896918976-1897718989-1899019057-1905819066 19068-1907019102-191051915119209-1921219256- 19257193621937019401-1940219438-1944019442- 1944419548-195531956619580-1958219590-19592 19604-1960719617-1961919667-1966819672-19673 1969319709-1971019727-1973219736-1974219764- 1976719810-1981219943-1994720029-2004320048- 2005220074-2007920087-2009420111-2011220122- 2012720137-2014520151-2015420161-2016420250- 2025220274-2027820281-2028220366-2036820401- 2040520415-2042020437-2044020452-2045520491- 204942050420514-2052020524-2053520563-20567 20569-2057420653-2066520676-2068020718-20725 20727-207342075320806-2081220827-2083520882- 2088420938-2094220999-2100421032-2104521122 21157-2116921253-2125521272-2127421343-21350 214272144621463-2146521485-2149121529-21530 21587-2164621729-2173321973-219742197822047 22074-2207622090-22091221572216022187-22192 22208-2221122218-2222622246-2224922261-22264 22284-2228922347-2234822381-2238222411-22432 22434-2243522571-2258122637-2265122661-22662 22700-227012274022768-2277022828-2282922833- 2283422854-2285622870-2287422952-2295322977 2304623064-2306523070-2307123080-2308323212- 232152323523251-2325623379-2338523412-23414 2348723489-2349023492-2349323515-2351823802- 2380523867-238702388223893-2390223941-23956 24005-2401125085-2509025289-2529025334-25335 25347-2535425368-2536925403-2540725966-25968 26024-2602826258-2625926294-2630226347-26348 2635726409-2641026448-2645126604-2660626843 26860-2686227048-270492710527147-2714927225 27269-2727027281-2728727570-2757127595-27596 27607-2761527729-2773927758-2775927861-27864 27890-2789228008-2801128142-2814528311-28313 2842428882-2888729343-2934529409-2941629940- 2995329958-2995929962-2996330233-30235 adult Clontech SPLcOl 188-191546-547667-668697-699792-795934-935981 spleen 98510471259-12621411-14131705-171417772052- 205420722599-26032649-26503326-333133773661-
Figure imgf000218_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
23046 2307023112- 2311623201- 2320223358-23360 23492- ■2349323543 -235442359923798-2379923841- 2384223845-2384823871-2387523893-2390224126- 2412925340-2534125966-259682660227067-27070 27080-2709027269-2727027302-2730327480-27481 2765728204-2820929328-293312935829418-29419 29950■2995330141 -3014230195-30199 thalamus Clontech THA002 124-125398-399464-465901-9041044-10451295-1297 1334-1336151415471806-18071809-18121817-1818 1820-18231826-18311833-18351837-18391925-1927 220123392354-23572410-24132471-24742561-2562 2675-267828062983-29863182-31873322-33233618- 36213789-37943853-38543926-39274515-45224531- 45334562-45684845-48514950-49515557-55605899- 590259926040-60426166-61686176-61836282-6283 6406-6410700073627580-75977615-76177681-7689 7978-798380208132-81368152-815681628206-8210 8230-82328253-82578701-870587168724-87328771- 87768822-88359049-90629305-931193829535-9537 9590-959297089719-97229792-979398289981-9983 9986-998810167-1017210326-1032810488-10489 10780-1078211405-1140711460-1146411766-11768 11835-11836118401190011937-1193812202-12205 1263712671-1267212701-127051272312829-12830 12978-1298113500-1350213552-1355513796-13809 1422714264-1426514314-1431714505-1454514604 14793-1479515025-1506915093-1509515661-15663 16096-1609716118-1612016141-1614316787-16788 1692016943-1694517038-1704117156-1715917242 17454-1745617571-1757418015-1801618097-18134 18651-1865418691-186921879618842-1884518919- 1892118925-189331901819096-191011913419256 19260-1926219295-1929619306-1930719355-19360 1944619512-1951319594-1960319620-1962219659- 1966119674-1967819727-1973219800-1980319814- 1982219919-1992019943-1994619950-1995720106- 2011020133-2013620146-2014820161-2016420167- 2017120180-2018120215-2021820235-2024020317- 2032020406-2040720432-2043620456-2046820514- 2052020524-2053520676-2068020827-2084320853- 2085421101-211042112221153-2115421211-21212 21229-2123221284-2129421353-2135421447-21450 21463-2146521477-2147921492-2149421554-21556 21718-2172421881-218852192422003-2200622040- 220412204722080-2208322090-2209122120-22127 22161-2216422225-2222622243-2224522256-22260 22290-222992236522381-223822239322531-22533 2257022625-2262822661-226622269022870-22874 22952-2295322969-2297023007-230212307123223- 2322523229-232332334223412-2341423420-23421 23489-2349023726-2373023781-2379223843-23844 2388223999-2400124005-2401124021-2402424120- 2412924231 -2424024471 -2447225085-2509025342- 2534325374-2537526205-2620626359-2636026375 2639826460-2646326678-2668027012-2701427397 Tissue RNA Library SEQ ID NOS: origin Source Name
27482-2748527662-2767227750-277562778127819- 2782027890-2789228095-280962828628361-28370 28468-284842936329508-2951830189-30194 thymus Clontech THM001 112180-181216227-230302329-330332524-525546- 547576-578697-699796-797900934-9351065-1066 1210-12111259-12621346-13471431 149915481551 1667-166816751681-16871705-17141721-17221777 2072-20752083-20842122-21242202-22032383-2388 2570-2574263628062913-2920299331283151-3154 33773508-35103649-365536863789-37943949-3953 4067-40724137-41394160-41624469-44704489-4493 4640-46435057-50595092-509451925335-53365654 5808-581262896403-64046450-64516488-64897216- 72197275-72767640-76427681-76877745-77467778- 77837902-79038113-81148132-81368163-81738253- 82578297-83068313-8314836784658607-86098758- 87598768-87708917-89188982-89839029-90369038- 90399049-90629196-92009391-93929456-94589472- 947494869517-95219529-95309594-96029617-9626 9739-97429794-979598269828-98329851-98529857- 98619909-991210273-102741027710498-10503 10508-1051210520-1053010533-1053410537-10538 1090611011-1101211069-1107011081 1116011252- 1127311314-1131511345-1135011406-1140711476- 1147811496-1149811579-1158111607-1160811715- 1171611731-1173411739-117401184011901-11905 11947-1195012121-1212612202-1220512283-12284 1247712555-12561126371272312984-1298513083- 1308613107-1311713556-1356213581-1358313592- 1359513603-1360813796-1379713896-1389713999- 1400314175-141811460414640-1464214694-14701 14818-1482015182-1518315277-1527815321-15322 1534415392-1539615406-154071543815568-15572 15576-1557715588-1558915988-1599016053-16058 16121-161221623716434-1643716461-1646416496- 16497166231664216653-1665516851-1687016899 17074-1707717105-171061711417238-1724118008- 1801718029-1803018097-1813418421-1842218579- 1858118583-1858418629-1863218644-1865018655- 1865818691-1869218750-1875618759-1876118806- 1880718836-1883818842-1884518857-1888018899- 1890318947-1895019001-1900419068-1907019209- 1921019283-1929419310-1931219316-1931719345- 1935019362193701937219380-1938419415-19417 19441195661962719659-19661 19670-19671 19693 19706-1970819727-1973219736-1974219749-19751 19813-1981519921-1992319943-1994619962-19963 20029-2004320087-2009420111-2011220128-20129 20146-2015420161-2016420182-2018820208-20214 20244-2025320265-202702028920328-2033020360 2036220406-2040720412-2041320415-2041720425- 2042620437-2044020485-2049020495-2049820505- 2050920514-2052020524-2053520542-2054520563- 2056720621-2062420631-2063420646-2064820681 20698-20707207532076720853-208542088120897-
Figure imgf000221_0001
Tissue RNA Library SEQ ID NOS: origin Source Name
8059-80638113-81168132-81368206-821082508258- 82628301-83068351-83608422-84488453-84608465 8490-84918502-85038519-85258539-85428588-8597 86888758-87618771-87798790-87998917-89188998- 90019013-90209029-90309049-90679075-90789306- 9311931693539420-94239456-94589482-94859506- 950995319603-96049617-96269726-97279729-9731 9743-97449763-97699782-978398269848-98509857- 98619874-98959909-99249929-99359965-99809984- 99929997-1000910015-1001610161-1016310167- 1019610273-1027410281-103041030610326-10328 10454-1045710470-1047310498-1050310506-10507 10531-1053210588-1059110628-1063010638-10639 10931-1093310941-1094211024-110251108111161- 1119711212-1121611220-11221 11308-1131011314- 1131511339-1134411493-114941154611658-11659 11680-11691 11745-117481184011901-1190512050- 1205612117-1211812175-1218112192-1219312202- 1220512245-1225312312-1232812331-1233212362 12374-12377124341-2562-1256312590-1259212606- 1261212623-1263612638-12640127231272912785- 1278712796-1279812805-1280612824-1282712829- 1283012869-1287012872-1288112973-1297412997- 1299913505-13511 13517-1352313668-1367213796- 1379713854-1386013865-1386613954-1395614004 14044-140451410014127-1412814137-1413814187- 14191 1431314380-1438114552-145551455714607- 1460914661-1466914673-1468314787-1478814822- 1483814999-1500115019-150241509615182-15183 1525715277-1527815300-1530115329-1534015358- 1535915392-1539615439-1544215454-1545615480- 1548415496-1552515545-1554615551-1556215576- 1557715600-1560115653-1565615661-1566315726- 15741 15861-158681587815881-1588315890-15895 15988-1599016053-1605816075-1607916158-16159 16163-161641623716352-1635316403-1640716418- 1642116470-1647216614-166151662316636-16637 1664216653-1665516771-1677216812-1681416836- 1684216873-1687416894-1689616911-1691216946- 1695317043-1705717074-1707717094-1710317105- 1710617112-1711617145-1714617297-1730617356- 1735817455-174641757917924-179261793917951- 179521800418015-1801718029-1803018062-18064 18097-1813418157-1816018376-1837918400-18402 18412-1841818421-1842218562-1857618629-18632 18644-1865018668-1867018673-1867718691-18692 18717-1871818750-1875818761 18767-1876918789- 1879318825-1882818842-1884518882-1888818899- 189031891018919-1892118955-1895918964-18969 19001-1900419059-1906519096-191011913419266- 1926719283-1929619306-1930719331-1934019350 19363-193641937019380-1938419387-1940219415- 1941719438-1944019464-194661948719515-19521 19526-19531 1956319567-1957319598-1960119604- 196071962519663-1966819670-196711969319699 Tissue RNA Library SEQ ID NOS: origin Source Name
19706-1970819733-1974819752-1975519759-19763 19804-1980819814-1981519823-1985419915-19918 19921-1992319926-1992919938-1993919948-19949 19953- 19961 19965-1998020029-2004320074-20079 20106-2011220122-2013020137-2014520167-20171 20208-2021420231-2023420244-2024920274-20278 20289-2029620345-2035420357-2035920401-20405 20441-2044620456-2046820472-2047420476-20479 20485-2049620502-2050320505-2051020514-20553 20558-2057420599-2060620616-2063620639-20641 20646-2065220676-2068020698-2070720718-20725 20727-207342075320789-2079120819-2082120824- 2084320853-2085420868-2087020879-2088020890- 2090020924-2092520938-2094220952-2095420957- 2096220977-2098821005-2100821016-2102021026- 2104521060-2106621072-2108721097-2110021105- 2111121123-2112721133-2113521176-2118021194- 2119821208-2121521233-2123521241-2124721277- 2128021295-2129621301-2130321359-2136021377- 2139721410-214142141721428-2143321440-21445 21447-2145021492-2150021516-2153021554-21556 21647-2165521729-2173321881-218852189121915- 2191621924-2192821948-2195021962-2196821975- 2197721989-2199322007-2201522020-2202922074- 2207922090-2209422101-2210722128-2213822152- 2215622161-2216422171 -2217722204-2220722212- 2222422253-2227622284-2229922310-2231122319- 2232222336-2234622349-2235022358-2236122365- 2237122377-2238022383-2238822440-2244822495 22561-2256522571-2258122607-2260922644-22651 22653-2265422663-2266422678-226792269022708- 2270922763-2276422801-2281622823-2282722835- 2283722854-228582291122925-2292622941-22947 2297123007-230212304623051-230522306323066- 2306723074-2307623085-2308623098-2310123109- 2311123125-2313623138-2314023206-2320923212- 2321523218-2321923252-2325623261 -2326223343- 233442334623356-2335723379-2338523392-23398 23415-2341823439-2345023470-234722348623492- 2349623521-2352323555-2356523675-2368123704 23718-2371923731-2374223760-2376723802-23805 23812-2381323834-2383523839-2384023845-23848 2386523867-2387023903-2390423990-2399224014- 2401824021-2402424035-2404024481-2449025289- 2529025323-2532525347-2535425383-2540125461- 2547026029-2603126116-2612026270-2627926294- 263022633726341-2634426373-2637426383-26385 26389-263932641526455-2645926467-2647026610- 2661226657-26659267322674226757-2677126796- 2679826855-268572687926882-2688427100-27102 27127-2713227173-271742721827247-2725127269- 2727027497-2750027520-2752127562-2756327586- 2758727600-276012762127649-2765427674-27676 27784-2778827807-2780827812-2781327865-27868 27938-2793927995-2799828001-2801128018-28039 Tissue RNA Library SEQ ID NOS: origin Source Name
28042-2804328045-2808628099-2810028142-28145 28192-2819828263-282682828628315-2831628360 28363-283642842428468-2848429250-2925229278- 2928329304-2930829328-2933729343-2934529354- 2935729432-2943529529-2953429655-2965930150- 3015630189-3019930361-30368 thyroid Clontech THR001 197-200221255-256259-260309324-325329-330333- gland 335395464-465524-525546-547635656684690-691 847-848852-859872-875891900-904913919-922929- 931957-958969-971981-9851034-103710471092- 10941109-11101123-11271235-123712421273-1279 130713371409-141014311453-14541467-14691488- 14891553-15551557-15591667-166816751705-1714 182818401850-18521856-18601862-18631865-1873 1879-18801885-18891912-19131925-19271945-1946 1964-19661969-19701990205820622130-21312222- 22262275-227623062354-23552360-23672383-2388 2393-23952471-24742480-25112513-25142540-2541 2597-26032663-26652679277928062826-28272871- 28732951-29552983-29862988-298930043006-3008 3107-310831313144-31463151-31543306-33083310- 33113316-33213366337734333441-34423448-3450 34663512-35143526-35293555-35583565-35883679- 368236933763-37653789-37943834-38353944-3955 4067-40724124-41264160-41624489-44934512-4516 4555-45564562-457346074612-4614465546664701- 47044712-47134720-47254785-47894795-48094868- 48714971-4974506550735109-5115511851925276- 52785285-52875335-53365519-552158025805-5807 582358465899-59065919-59215994-60006106-6109 6155-61596284-62886290-629663006341-63436386 6403-640464906533-65356542-65436551-65676859- 68766982-698472037209-72127216-72197350-7352 7450-74517482-75177536-75417554-755576087611- 76127615-76177630-76367671-76737678-76877701- 77037734-77367757-77597763-77707805-78087815- 781878227972-79757978-798379887990-79927994 80028049-80568059-8063807080798087-80898113- 81148137-815981618174-81778206-82138221-8229 823482498251-82658297-83008313-83148349-8362 8368-83698383-83898409-84148434-84528465-8476 8497-84998506-85088512-85138534-85358537-8550 8607-86098751-87538762-87678917-893389979037 9040-90459072-90749264-92659331-93339360-9361 9391-93929395-93969408-94149429-94309444-9448 94609472-94779486-94889522-95349561-95669617- 96269682-96839709-97139726-97279729-97319735- 97389757-97629770-97719782-97839794-97959828 9835-98409848-98529899-99049923-99249929-9935 9954-99559965-99689981-99839989-99929997-10009 10015-1001610025-1002610033-1003710164-10166 10275-1027610278103061031110326-1032810451- 1045310470-1047310490-1049310498-1050310520- 1052210533-1053410537-1053810543-1054910592- 1059410609-1061110615-106231062710881-10883 Tissue RNA Library SEQ ID NOS: origin Source Name
10895 -1089810903 10931-10933 1094510957-10958
1097110980-1098511035-1103711042-1104411071-
11075110811113511147-111561116011217-11219
11274-1127511295-1129611345 -1135011419-11422
11468-1147211476•114781154611607-1160811660-
1166111731-1173411766-1176811780-1178211811-
1183211835-118361184211901- 1190511921-11930
12011 •1201512017•1202212175 ■1217612186-12193
12206 ■122081222512241-1224412259-1226212366-
1239712470-1247112486-1251812545-1254612555-
1255912615-1261812637127231272912760-12762
12785 •1278712796■1279812803 •1280412829-12830
12968 •1296912975 ■1297612978 •1298112984-12992
12997■1299913087■1308913107 ■1311713425-13495
13531 ■1353513560■1356213592■1359513603-13605
13931 •1393313954■1395614023 •1402614058-14059
14130 •1413114170141721420614228-1423814261-
1426314298-1430514382-1438514505-1454514604
1462614644-1464814650-1465114666-1466814681-
146851478414789- 1479115182- 1518315257-15259
15290 •1529315306•1532015332 ■1533515345-15346
1535815359153861538815400-•1540315454-15456
15538 ■15541155451554615576-■1557715588-15589
1566115663156991570015789- 1581415869-15870
1588015902-1590315988-1599016044-1604616075-
1607916096-1609716121-1612216130-1613216168-
1617116200-1620116219-1622016269-1627016279-
1628016451-1645216461-1646416470-1647216484-
1648616496-16497165351655716570-1657316579-
165821662316632- 1663416636-166371665216686-
1669316727-1672916851-1685316875-1687716894-
1689616911-1691216921-1692316943-1694516954-
1696117038-1704117046-1705717094-1710017105-
1711117131-1713217237-172421727917284-17285
1733017332173351733917372-■1737417441-17450
1745417456174931749617610-■1763617924-17926
1795817962179981799918001-■1800318015-18016
1802918030183761837718421-■1842518492-18494
1850018501185331853418536-■1853718625-18626
186551866018671 1867218691-■1869218717-18718
1872318726187301873218738-■1874418767-18769
1877818780187871878918796-1880418806-18807
1882518828188351883818842-1884518857-18880
18894189031891018919-1892118925-1893518941
18946- 1895018960- 1897318976189771899619001-
1900919029-1903519045-190481905519057-19058
19062- 1906619074-19083190961910119106-19118
19138- 1914019142- 1914819154191551915919254-
192551925719266-192671927419283-1929419306-
1931219316-1931719345-19360193621937019380-
1940219405-1940619415-1941719422-1943419442-
1944419458-1946119484-1948619494-1949619515-
1952119548-1955319560-1956219564-1956619574-
195781959319604-1960819617- 1961919627-19655
19667-1966819683-19687196891969319727-19735 Tissue RNA Library SEQ ID NOS: origin Source Name
19743-19751 1975619759-1976319768-1977219804- 1980819814-1982219855-1985619921-1992919933- 1993719939-1994619948-1994919953-1995719962- 1996319965-1996719972-1998120015-2001720029- 2004320048-2006520074-2007920087-2009420099- 2010220106-201132012020122-2012720130-20132 20137-2014820167-2017920189-2019420208-20214 20235-2024020250-2025320265-2027020274-20278 20289-202962029920305-2030720316-2032020328- 2033020345-2035420391-2040020415-2041720425- 2043120437-2045120485-2049020497-2049820501 20510-2052020524-2053520537-2054520554-20567 20575-2057820614-2061920642-2064320676-20681 20686-2068820698-2071120726-2073220752-20753 20758-2076620789-2079720806-2081220819-20821 20824-2084920853-2085420858-208632086520881 20890-2090020922-2092320929-209352094320952- 2095620963-2097220977-209882099120999-21004 21027-2104521060-2106621069-2108721097-21114 2112221145-2115121202-2120721223-2122621233- 2123521248-2125221256-2127121277-2129421301- 2130321326-2133421340-2134221351-2135221377- 2139721408-2140921434-2143921447-2145021458- 2146121477-2148221495-2150021516-2153021587- 2165521913-2192121929-2193521967-2196821973- 219742197821989-2199321995-2201822020-22029 22042-2204422050-2205922074-2207622092-22094 2209722115-2211622141-2214322152-2215622160 22187-2219222195-2219822204-2222622246-22255 22271-2228922292-2229922314-2231822323-22328 22333-2233522343-2234822358-2235922365-22371 22383-2238822399-2240822434-2243522440-22448 2249522534-2253922553-225582256022571-22581 22599-2260222607-2260922622-2262822634-22643 22653-2265422661-2266222700-227012270722741- 2274222760-2276222768-2277022805-2281622823- 2282922833-2283422846-2284922854-2285822870- 2287422927-2293822948-2295022955-2295722969- 2297122991-2299423047-2305023059-2306023066- 2306723070-2307123074-2307623085-2308623098- 2310123117-2311923123-2312423218-2321923242- 23244232492325123261-2326223358-2336023382- 2338523392-2339523439-2344723470-2347223489- 2349323510-2351123514-2351823524-2352523532- 2353923543-2354423682-236862370423718-23719 2376123771-2378023793-2379723806-2380923827- 2383323843-2384423856-2386023871-2387723890- 2389223906-2391023941-2395623999-2400124005- 2401124014-2401824026-240282405624126-24129 24772-247742530625316-2531825323-2532525334- 2533525340-2534125370-2537525534-2553625966- 2596826024-260282619526209-2621326235-26236 26248-2625226294-2630226307-2630926316-26321 263272633726341-263442634926359-2636526373- 2637426383-2638526401-26402264052641126424 Tissue RNA Library SEQ ID NOS: origin Source Name
26440-26445 2645326596 -2660226615- •2661626732 2673726796--2679826843 •2684526850-2685226858- 2686226876--2687926891 -2689226980-•2698426988 27048-270492705227074 -27077270912710527122- 2712627147--2714927209 -2721327229-•2723227269- 2727027275--2727627414 ■2743127439-■2744327468 27486-274922749627498 2750027510-2751127589- 2759227602--276062762227636-2763927649-27654 2765827662--2767227698 ■2769927726-■2773927744- 2774827767--2776827816 ■2781827840-•2784427848- 2785227861--2786427890•2789227896-2792727989- 2799028019-■2803828097■2810028142-2814728161 28192-2819828204-2820928290-2829228311-28313 28315-2831628361-2836228365-283702842028424 28435-284392846428468-2848429117-2912329278- 2928329309-•2931429332-2933729364-2936629378- 2937929940-2995330135-3014230150-3015630195- 3019930224-■3023230240-3024230361-30368 trachea Clontech TRC001 177-178180-181255-256395518-519546-547847-848 919-922934-9351312-1313143115611667-16681875- 18771879-18801884-18922015-20182370-23732516 29262951-295530013165-31693322-33233441-3442 35253622-362336763840-38523949-39534067-4072 4151-41554158-41594386-43874515-451650565253- 52665337-53415530-553158466002-600461136226- 62347618-76217745-774677947806-780879888041- 804283428449-84528510-85118543-85508576-8582 8919-89339049-90629264-926595319729-97319739- 97429786-979098289848-98501030610449-10450 10551-105541086910980-1098511123-1112411252- 1127311731-1173411906-119071236012374-12377 12432-1243312760-1276212872-1288113609-13612 13632-136331400414048-1405314058-1405914105- 1410614170-1417214207-1420814546-1454914604 15290-1529115491-1549515588-1558916434-16437 16636-1663716666-1666716727-167331707317455- 1745617958-179621852718633-1863718673-18677 1879618857-1888018882-1888818894-1889618975 19057-1905819074-1908019084-1908519138-19140 193621937019401-1940219422-1943119494-19496 19749-1975119764-1976719953-1995719962-19963 19972-1998120257-2026220265-2027020289-20296 20441-2045120472-2047420548-2055320631-20634 20698-2070720727-2073220792-2079720813-20815 20929-2093220952-2095420973-2097621060-21066 21097-2110021137-2114021171-2117421297-21298 21403-2140421410-2141621447-2145021973-21974 2197822135-2213822171-2217522358-2235922383- 2238822455-2246522551-2255222634-2263622760- 2276222828-2282922870-2287422955-2295722969- 2297022972230712322223261-2326223382-23385 23726-2373023762-2376323827-2383323871-23875 23890-2389224126-2412924772-247742530625347- 2535426270-2627226341-2634426469-2647026665- 2666626876-268792705227129-2713227557-27559 Tissue RNA Library SEQ ID NOS: origin Source Name
27600-2760127814-2781527819-2782028142-28145 282332936730141-30142 uterus Clontech UTR001 124-125188-191227-230251-252301329-331738-740 919-922102810471453-14541562-15641705-1714 1893-1901 1912-191323662374-237729262988-2989 30013205-32074557-45604712-47134971-49745903- 59065919-59216114-61366235-62376403-64046533- 65356625-66266932-69387678-7680772877717798- 78017921-792379468010-80128084-80858090-8093 8137-81418313-83148368-837084158420-84218689- 86909072-90749264-92659517-95219535-95379577- 95789828-98329848-98509929-9935995310033- 1003710268-1027210508-1051210537-1053810980- 1098511071-110751113511505-115061154611609 11731-1173411803-1180412023-1202612046-12049 12190-12191 12378-1239712432-1243312894-12897 13107-1311713592-1359713888-1389513954-13956 14058-1405914261-1426314445-144471460414650- 1465114988-1499215182-1518315187-1518915290- 152911539015576-1557715699-1570015855-15857 16145-1614616174-161761660016643-1664816716- 1672316851-1685317330-173321745417958-17962 18015-180161852718655-1865818673-1867718761 1878918825-1883418894-1889618899-1890318936- 1893919036-1903919074-19083193621937019375- 1937919387-1938919442-1944419560-1956219609- 196151969319727-1973219764-1976719816-19818 19926-1992919933-19937199501998120029-20043 2012020122-2012720146-2014820151-2015420289- 202962029820328-2033020366-2036820401-20405 20427-2043120469-2047120491-2049420554-20557 20602-2060620629-2063020649-2065220689-20692 2075320758-2076720801-2080520858-2086220864 20938-2094221005-2100821072-2107521213-21215 21281-2129421377-2139721911-2191221955-21957 21978-2198222019-2202522050-2205522090-22091 22187-2219222218-2222422251-2225222261-22264 22358-2235922362-223642237322405-2240822571- 2258122622-2262422644-2265122663-2266422887- 2289122955-2295722969-2297023047-2305023094- 230972314123425-2342723439-2344723543-23544 24029-2403324130-2414425085-2509025340-25341 25374-2537525416-2541726221-2622326270-26272 26285-262902632726607-2660926676-2667726755- 2675626853-2685426860-2686227173-2717427294 27348-2735327493-2749627602-2760627636-27639 27649-2765427729-2773927861-2786427896-27927 28105-2812128133-2813728311-283132842428426- 2842829339-2934029378-2937929962-2996330150- 30156
*The 16 tissue-mRNAs and their vendor source, are as follows: 1) Normal adult brain mRNA (Invitrogen),2) normal adult kidney mRNA (Invitrogen), 3) normal adult liver mRNA (Invitrogen), 4) normal fetal brain mRNA (Invitrogen), 5) normal fetal kidney mRNA (Invitrogen), 6) normal fetal liver mRNA (Invitrogen), 7) normal fetal skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontech), 9) human bone marrow mRNA (Clontech), 10) human leukemia lymphablastic mRNA (Clontech), 1 1) human thymus mRNA (Clontech), 12) human lymph node mRNA (Clontech), 13) human spinal cord mRNA (Clontech), 14) human thyroid mRNA (Clontech), 15) human esophagus mRNA (BioChain), 16) human conceptional umbilical cord mRNA (BioChain).
Figure imgf000230_0001
Figure imgf000231_0001
Figure imgf000232_0001
Figure imgf000233_0001
Figure imgf000234_0001
Figure imgf000235_0001
Figure imgf000236_0001
Figure imgf000237_0001
Figure imgf000238_0001
Figure imgf000239_0001
Figure imgf000240_0001
Figure imgf000241_0001
Figure imgf000242_0001
Figure imgf000243_0001
Figure imgf000244_0001
Figure imgf000245_0001
Figure imgf000246_0001
Figure imgf000247_0001
Figure imgf000248_0001
Figure imgf000249_0001
Figure imgf000250_0001
Figure imgf000251_0001
Figure imgf000252_0001
Figure imgf000253_0001
Figure imgf000254_0001
Figure imgf000255_0001
Figure imgf000256_0001
Figure imgf000257_0001
Figure imgf000258_0001
Figure imgf000259_0001
Figure imgf000260_0001
Figure imgf000261_0001
Figure imgf000262_0001
Figure imgf000263_0001
Figure imgf000264_0001
Figure imgf000265_0001
Figure imgf000266_0001
Figure imgf000267_0001
Figure imgf000268_0001
Figure imgf000269_0001
Figure imgf000270_0001
Figure imgf000271_0001
Figure imgf000272_0001
Figure imgf000273_0001
Figure imgf000274_0001
Figure imgf000275_0001
Figure imgf000276_0001
Figure imgf000277_0001
Figure imgf000278_0001
Figure imgf000279_0001
Figure imgf000280_0001
Figure imgf000281_0001
Figure imgf000282_0001
Figure imgf000283_0001
Figure imgf000284_0001
Figure imgf000285_0001
Figure imgf000286_0001
Figure imgf000287_0001
Figure imgf000288_0001
Figure imgf000289_0001
Figure imgf000290_0001
Figure imgf000291_0001
Figure imgf000292_0001
Figure imgf000293_0001
Figure imgf000294_0001
Figure imgf000295_0001
Figure imgf000296_0001
Figure imgf000297_0001
Figure imgf000298_0001
Figure imgf000299_0001
Figure imgf000300_0001
Figure imgf000301_0001
Figure imgf000302_0001
Figure imgf000303_0001
Figure imgf000304_0001
Figure imgf000305_0001
Figure imgf000306_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
3400 33768 A 3437 2052 MVLVVVAVVVVVLVVAVIVV
VVVVVAAVVVGAVVVVVVVV
MVVVVVVVVVEEDNQHKTGA
INNNNTAKNPQQSPFHSPATST
GAEATQMRRNQKTNPHNMTK
QVSLTPPKITLAHQQWIQTKKK
YLIYLKKHSGVNKIPRNPTYEG
CEGPFQGELQTTAQQNKGGHK
QTEDHSMLMDRKNQYCENGH
TAQAVPNPYTLLSQIPEDAEWF
TVLDPKHAVFCIPVHPDSQFLF
AFEDPSNPMSQLIWTVLPQGFR
NSPHLFGQALAQDLSQFSYLDT
LVLRYMDDLLLATHSETLCHQ
ATQALLNFLATCGYKVSKPKA
QLCSQQVKYLGLKLSKGTRTLS
EERIQPILGYPHPKTLKQLTAFL
GITGFCQIWIPRYSKIARPLNTRI
KETQKANTHLVRWTPEAEVAF
QALKKALTHAPVLSLPVGQNFS
LYVTEKYTGIALGVLT/PGTSAQ
LAELIALTRAPELGEGKRVNIY
ANSIGREREFLTSKGTLVKHQE
AIKRLLLAVQKPKEVAVLHCW
GHQKGKEREIEENRQADIEARR
A ARQDPPLEM LTEGPL AFEL A
MATARAELSLAIHHCCLPPPPQ
TRC WLPSLRIRQG VCC I PDPAR
AITLTAWPKIPFLGIRKAKNPRS
EKTRLATILEAACCHFGSGPPPS
WELWEQGPPVTVQTHILRSHL
Figure imgf000308_0001
Figure imgf000309_0001
Figure imgf000310_0001
Figure imgf000311_0001
Figure imgf000312_0001
Figure imgf000313_0001
Figure imgf000314_0001
Figure imgf000315_0001
Figure imgf000316_0001
Figure imgf000317_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkπown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
3450 5818 3487 2302 MTECLDRFIDS1AAVPRSTKSTV
QKCLPCLSDGEDKIPDLIAITWT
PRQGELLEKNVISETGTLLPTPC
LDTSTKETADKSTSGKTIHQSIK
TVLKDLSGSIDDLPTGTEATLSS
AVSASGSTSSQGDQSNPAQSPF
SPHASPHLSSIPGGPSPSPVGSPV
GSNQSRSGPISPASIPGQDPGYG
NS/DKSMGHEYSQR/SFLEDRFP
IAVWWPRPLRLKNCLSVLSYSS
PSEVTPHPKSESSGTS/SAAQDL
QGCSQDVGQPASSSGGSTREQS
TSSFIRIVAASSPSSCWKLQVLL
SG/AGGDYSPVLLIGGYSRVCLP
Q*SDASAATREP/GQNPVPIPP*
ASHQCHRKEGPPCRQQAGASQ
MLSRD*AKQLKPSSSHTLSKHK
TT/GTRKSLLFGIKKAYNFTNKY
YSELMTQTRPQSTPSIPSPLPLD
DAGLERSQGNVSASSFMVLGN
RERGEDTTGAGFGRSRNKEEVP
CTIYVGAESP/EMC*WMDHT*R
KEGKGGLVGVPCV/SREHLEEW
QYQLQR*ISLKTQQV*RRKSEV
LLGRS/SNTAQACSCWQLTCFM
AGTQRNPQMAQYGPQQTGPSM
SPHPSPGGQMHAGISSFQQSNSS
GTYGPQMSQYGPQDGGGDVSD
VVMAIDDDGSCHLLGSAVPGA
VLVTFNLLLIIVVTLQMTEPQFR
EYITGDPLESTCRHASLALAVV
LHQETAMTMITDSLAVVPHSG
Figure imgf000319_0001
Figure imgf000320_0001
Figure imgf000321_0001
Figure imgf000322_0001
Figure imgf000323_0001
Figure imgf000324_0001
Figure imgf000325_0001
Figure imgf000326_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possiblc nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
3499 53867 A 3536 2502 MTELTGIQQPQIVLFEHKGHKL
VQGSSSDAGKVNRIYQHYEAS
DKFNYTTGLAWKTAPEQTGKT
VRKQQIKLNVKKMESRSKMQE
HSSSPPMEQSWRENDFDELREE
AFRRSNYSELQEEIQTKGQEVK
NFEKTLDEYITRITNTEKCLKEL
MELKAKARELREECRSLRSRCD
QLEERVSVMEDEMNEMKREG
KFREKRIKRNEQSLQEKWDYV
KTPNLRLIGVPESDGENGTKLE
NTLQDIIQENLPNLVRQANIQIQ
EIQRTPQRYSSRRATPRHIIVRFT
K VEM KEKMLRA AREKEIQTTIR
E Y YKH L Y ANKLENLEEMDKFL
DTYTLPRLNQEEVESLNRPITGS
EIVAIINSLPTKKSP/GPVGFTAE
FCQRKVEGILSISFCEASIILIPKL
GRDTTKKENFRPISLMTIDTKIF
NKILANQIQQHIKKLIHHDQVG
FIPGMQGWFNICKSINVIQHINR
TKDKNHMIISIDAEKAFDKIQQL
FMLKTLNKLGIDGTYFKIIRAIY
DKPTANIILNGQKLEAFPLKTGT
RQGCPLSPLLFNIVLEVLAGAIR
QEKEIKGVQLGKEEVKLSLFAD
DMIVYLENHIVSAQNLLKLISNF
SKVSGYKINVQKSLAFLYTNNR
QTESQIMSELPFTIASKRIKYLGI
QLTRDVKDLFKENYKPLLNEIK
EDTNKWKNIPCSWVGRINIVK
MAILSKVIYRFNAIPINLPITVFT
SEQ ID SEQ ID NO: Met SEQ ID NO- Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possιble nucleotide sequence 09/540,217 codon lor peptide ol peptide sequence deletion, \=possιble nucleotide insertion) sequence
3500 33868 3537 2197 MNNAKENFLGRFQDGRIGTAP
VYSPQHQRRRRRVISALPTEPPL
VIPRQTGFGVDLQQTPTDLQLR
VLTVRRKTTKQEGHSTKTPSVR
YHHQRPKEDKTTKMGRNQSRK
AENSKNESASSPPKECSSSPATE
QSWMENDFDKYTEVGFRQLVI
TNFSELKEDVQTHHKEAKNLE
KRLDEWLTRINSIENTLIDLMEL
KTMARELRDSCTSFSRQFDQVE
ERVSVIEDQMNEMKREEKFRE
KKMLEVLPRAIRQEKEIKGIQL
GKEEVKLSLFADNMTVYLENPI
ISAQNLPKLISNFSKVSGYKINV
QKSQ AFLYTNN RQTESQIMSEL
SFTIASKRIKYLGIQLKRDVKEL
FKNYKPLLKEIKEDTN K WKN IP
CSWVGRTNIVKMAILPKIIYRFN
A I PIKPPMTFFTELEKTTLKFI RN
QKRAHIAKTILSKKNKAGGIML
PDFKLYYKATVTKTAWYWYQ
NRDIDQWYRAEASEIMPHIYNY
L1FDKPEKNKQWGKDSLFNKW
CWENCLAICGKLKLDPFLTPYT
KINSRWIKDLNVRPKAIKILEEN
LGNTIQDTGMGKDFMSKTPKA
MATKAKIDKWDLIKLKSFCTA
KETTIRVNRQPTKWEKIFATYS
SDKGLISRIYNELKQ1YKKKTN
NSINKRAKDMNRHFSKEDIYAA
KRHMKKCSSSLAIREMQIKTTM
RYHLTPPEVEVVLETL/NH/RSW
3501 33869 3538 242 NLEEMDKYLDTYTLPRLNQEEF
ESLNRPITGSEIEAIINSLPTKKSS
GPDGFTAKFYQSIVLEVLARAI
RQEKEIKGIQLGKEEVKLSLFA
DDMIVYLENPIISAQNLLKLLSN
FSKVSGYKINVQKSQAVLYTN
NKQTESQIMSEPSFTIASKRIKY
LGIQRTRDVKDLFKENYKPLLN
KIKEDTNKWKNTPCSWIGRINI
MKMAIVPKVIYRFNAIPIKLPM
TFFTELEKTTLKFIWNQKRARIA
KSILSQKN
Figure imgf000329_0001
Figure imgf000330_0001
Figure imgf000331_0001
Figure imgf000332_0001
Figure imgf000333_0001
Figure imgf000334_0001
Figure imgf000335_0001
Figure imgf000336_0001
Figure imgf000337_0001
Figure imgf000338_0001
Figure imgf000339_0001
Figure imgf000340_0001
Figure imgf000341_0001
Figure imgf000342_0001
Figure imgf000343_0001
Figure imgf000344_0001
Figure imgf000345_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /^possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
3587 53955 3624 2056 REALQGIQVRLKHLRTFG1IVPC
QSPCNTLLLPFPKPRTKDYSQV
QDLRLLHQATLTFHPTVPNPTT
LLGLLPAKDSGFTCLDPKDAFF
PIRLAPERQKLFAFQWEDPESG
VTTQYTWTGLPQGFKNSPTIFG
EAWARDLQKFPSRDLGCVLLQ
*VDDLLLGHPTAVGCAKGTDA
LHRHLEDCGCKVSKKKAQICR
QQALAATALRVQEANKLTLEQ
NLNIKASRAVVTLMNTKGHHW
LTNARLTQYQTWLCENPRITIE
VCNSLHPATLLPVSESPVEPRC
VEVLDTIDSSRPDLRGQPWASV
DWELYVDGSSFFNPQGERGAG
CAVITLDTVVEARSLSQATSAQ
KAELIAFIRALELS/EGRKGLSPG
RGKDK*WRKDGFGYRMGEYC
ATAARSCSCTGCARNHPSTSGV
TGKVVRPVFLHLAFVS\FAKTV
RQRCVTCRQHDARQGPAVLPGI
G A YG A APFEG LQ VDFTEMPKC
GGNKYVLVLVCTYSGWVEAYP
TLTEKAREVTRVLLRDLIPRFRP
PLRIGSDKGPAFLAALLQKTAK
MGTRSDTQLAHIGTVLRDIHVS
VCSDGPNLRTGLNVILGGVEW
QSTPGNLVRRQGETGLHLHIYH
WWQAVAIFPVYLGSSLHMKVG
GRSFEQEEDTEHIPVSYDREGQ
ECDTELKGQEGDELEAGSVVP
3588 33956 3625 491 964 R1QLCCRTRGTGAQKKRMKVS
SRCTPAPATRGTGAWQPQAQQ
APGVRATEAPRL*AHDEVSQPA
PAPPSTRHSPRR* PVAGKEHLE
AAVDKERHEVAQAVVTHVLEG
QLEDVAPAHAAQI/GSPPWAGK
RLRTNPAPRPCHPIQTLSRRLGP
QNHTLLH
3589 33957 3626 131 351 NVGLKGTAGER\GGSGPPS*PPA GRNSGPAGRRPPAARAPTPGSA AR*PAPPGPPRPPAGRGAAAAA GPAGGGA
3590 53958 3627 428 G EWEA PPLLRHTRPG PA/PAPPA
PSGASCAPCGGQTCRPRPLRQA
PPSPITTGHARIWLGQPRPRSSS
ATPPKELP*GPTE\PHTGELWVA
SGSCPSGTKLPEEGSGSNTYFSA
VSAGDTQSNIIWNGPPANSNRP
AAEGPDC
Figure imgf000347_0001
Figure imgf000348_0001
Figure imgf000349_0001
Figure imgf000350_0001
Figure imgf000351_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
3614 53982 3654 854 5009 VNSHSQLLQRE*NT*ESNLQGM
*RTSSRRTTNHCSMK*KRIQTN
GRTFHAHG*E/RVN1VKMAILP
KIQSDLTSHEISLEEMKKHNQG
KEAAQRVLSQIDVAQKKLQDV
SMKFRLFQKPANFEQRLQESK
MILDEVKMHLPALETKSVEQE
WQSQLNHCVNLYKSLSEVKS
EVEMVIKTGRQIVQKKQTENPK
ELDERVTALKLHYNELGAKVT
ERKQQLEKCLKLSRKMRKEMN
VLTEWLAATDMELTKRSAVEG
MPSN LDSEV A WGKATQKEI EK
QKVHLKSITEVGEALKTVLGKK
ETLVEDKLSLLNSNWIAVTSRA
EEWLNLLLEYQKHMETFDQNV
DHITKWIIQADTLLDESEKKKP
QQKEDVLKRLKAELNDIRPKV
DSTRDQAANLMANRGDHCRK
LVEPQISELNHRFAAISHRIKTG
KKPSWRRGVSNLGEMLVEVYL
KALMSEDLRKGINQDEFSPTIY
YFPITVFGSEGDLLLGKIRWIQG
AYCLMIGQDVFMDTRLRVSAC
FLKTKMKTVLWFDQNEDNEG
TVKELLQRGDNLQQRITDERKR
EEIKIKQQLLQTKHNALKDLRS
QRRKKALEISHQWYQYKRQAD
DLLKCLDDIEKKLASLPEPRDE
RKIKEIDRELQKKKEELNAVRR
QAEGLSEDGAAMAVEPTQIQLS
KRWREIESKFAQFRRLNFAQIV
3615 33983 5655 44 953 GVHNGVEELILVRRMQKSPGP
GEMESGSLEKEPLGTQTGPVPS
E/EYGIGLSQSI STKH PETSPKDS
RIRENDVTADGRTTEDHITADP
GTTEDSVTADPGTTEDNVTVDP
GTTEGSVTADPATTKDYVSADP
GTTKDSVTADPGTTENFVTADP
GTTKDSITADPRTTENFVTADP
GTTKHSITVDPGTTEDSVTADP
GTTKHSITADPGTTEDSVTADP
GTTEDETTKHGDTHLL*TTSVT
AVKPTRLLTPMGIILISLAATTV
TVVLFVGLGFIVKECFLPPLNPS
TRVIYHPHVMDYSTP
3616 33984 A !656 200 542 CSPPSTRPGPGP/SGTAWPGPRG TKRSSPSSSSSSPSTTSTTTSSSSS SSSSSSSSAPPRGFSSTRPSPLRR LLPPSSSSPSSSSSSPSTTSTTTSS SSSSSSASAGGRRAGTRG
Figure imgf000353_0001
Figure imgf000354_0001
Figure imgf000355_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
3633 34001 A 3673 1270 MGDPSRRRTCRAMQAKYPLVF
KGCGVCWGSLDPRCRVASQV
WPIPKRLSRGWPFHNAVGRQV
SDWKSGQDFADFGTTHQTGFS
PAGANQRGPLAATLSGPGGEG
QSAVARLTGEKKNHPGAQYAN
RLSPRVGRFINAAGTTGFPTGK
RAVSATQLMILCLLPGYLCNGK
RKLSAIQGLLDNGSELSLFPENP
KRHCGLPVKVGAYGGQKTDRS
WRKTVDYCKLNQVVAPIVAAV
PDV/WSLLEQINTSPGTWYAAI
DLTNAFFSIPVHKAHQKQFAFS
WQGQQYTFTVLPQGRWEINMT
KIQGPSTSVKFLGVQWCGACQ
DIPSKVKDKLLHLVPPTTKKEA
QHLTGLFGFRRKYIPYLGVLLC
PIYQVTRKAASFQWRPEQEKAL
QQVQAAMQAALPLGPYDPAGP
MVLEIAVADTEAVWGH
3634 34002 3674 1978 LTIYAVNLSLILPQGDLWPFTRV
TVH*GKGNDQTFQELLDTGSEL
TLIPGYPKRHCCPPVKVRVYGG
QVINGVLAQV*LTVGPVGPRTH
PWISPVPECIIMLSSWQNPHIGF
LTGRARAIMVGKAKWKPLELT
LPRKIVNKKQYHILGGTVEISAT
IKDLKDTEAVTPTTSPFΝSPIWP
VQKTDGSWRMTVDYCKLΝQV
VTPIAAAVPDWSLLEQIΝTSPG
TWFEWSPK\KALQQVQAAVQA
ALPFGPYDPADPMVLEVSVAD
RDAIWSLWΝAAIGESQRRPLGF
WSKALLSSADΝYSPFERQLLAS
YWALVETERLTVGHQVTLRPE
LPIMΝWVLSDPSSHKVSGAQQ
RSIIKLKWYIHDWVRAGPEGTS
KLHEEVAQMPMVSTPATLPSLS
QPALMASGGVPYYQLTEEEKT
RAWFTDGSARYAGTTQKWTA
AALQPFSRTPLKDSCEGKSPHH
PVIAQWAHEQSGHGGRDGGYL
WAQQHGFPLTKADLAMATAE
CPICQQQRPTLSPRYGTIPQGDQ
PATWWQVDYMGPLPSWKGQR
FVLTGIDTYCGYGSAYSARΝAS
AKTTIHGLTECLFHCLG1PHSIA
SDRGTHFMDKEAPSASVLGLA
LALLAPQLADSLLEDPVIVKGT
DEAEYFQSVREEPDSGVKRKK
MLKSGKΝY
Figure imgf000357_0001
Figure imgf000358_0001
Figure imgf000359_0001
Figure imgf000360_0001
Figure imgf000361_0001
Figure imgf000362_0001
Figure imgf000363_0001
Figure imgf000364_0001
Figure imgf000365_0001
Figure imgf000366_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
3692 34060 3732 5695 MGKRSFERVLVDKCDSGRSCL
RKQHENECAFIQILDTQSLMIPG
QRGSFRLADSQHTDRVLCTLM
AEKWDRKALSYTRNSFRAQIRL
RKTRFQGKGCMICKKSRVLPY
QAAYVSQHGSACQPSSHLPSVG
SLSSTGDDEEEEEIVHMGNAIM
SFYSALIDLLGRCAPEMHLIQTG
KGEA1RIRSILRSLVPTEDLVGII
SIPLKLPSLNKDGSVSEPDMAA
NFCPDHKAPMVLFLDRVYGIK
DQTFLLHLLEVGF
3693 34061 3733 2523 MKQFLLYLDESNALGKKFIIQDI
DDTHVFVIAELVNVLQERCHTR
LGYTEFLVAWRVTFGLCVEAV
TLHLKYQILIRGLLEMMSFSDA
DILKQLPVTVPGLFPASLSPSSL
LGNSPPSWLRHNSESKVSAVSS
PSATKTLSTGIGKLDPGHKEMA
EESELLKNKMQAPPLSRCPESQ
KCQHQLRLHHWKPSVRHQVKR
RSPAVLRSAMPPADCPAVLEAT
TATHPEKGTALSKHLPSSDSMS
LKVDVE A LENSPGATYI WKGG
KVTRDSQPKEQGKGDLKKKKK
GKLPKNYDPKLTPDPERWLPM
QECSFYQGRKKGKKKDQMGK
GTQGATAGASSELDARKTVSSP
PTSPRPGSAATLSASTSNIIPPRH
QRPAGAPATKKKQQQKKKKG
GKGFPVLREITWKVDTLWFQ
ILEERLSVFHIQYDTSYPFSTVDI
EDHECAVWLLLRKSKSDDKTT
RLEAVREMSETHHWHDAEKAF
DKIQQPFMLKTLNKFGVDGTY
LKI I RAI YDKPTAN I ILNGQKLE
AFPLKTGTRQGCPLSPLLFNTV
LEVLARAIRQEKEIKGIQLGKEE
VKLSLFAGDIIVYIENSIVSAPKL
LKLISNFSKVSEYKINVQKSQAF
LYTNNRJ-ΠΈSQIMSKLPFTIATK
RIKYLGIQLTRDVKDLFKENYK
PLLNEIKEDTNEWKNIPCSWVG
RINIMKMAILPKVIYRFNAISIKL
Figure imgf000368_0001
Figure imgf000369_0001
Figure imgf000370_0001
Figure imgf000371_0001
Figure imgf000372_0001
Figure imgf000373_0001
Figure imgf000374_0001
Figure imgf000375_0001
Figure imgf000376_0001
Figure imgf000377_0001
Figure imgf000378_0001
Figure imgf000379_0001
Figure imgf000380_0001
Figure imgf000381_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
3774 34142 3814 75 807 GIAGFVNIHLDSLSFLTGVPGVK
AERF\E*RMTAKHCALSLVGEPI
MYPEINRFLKLLHQCKISSFLVT
NAQFPAEIRNLEPVTQLYVRVD
ASTKDSLKKIDRPLFKDFWQRF
LDSVKALAVKYLQRIGSRTPM
DTKIYSYCPAVHPAEPTDMKS
WPSLFEVPTSLEYCPFYLQLVES
ADAEGTQKYRRLTAYYIPVYTE
PPLITKEPSCLWKQAEFGDLGK
HVWLVEQFSSTRVQEHGVGW
3775 34143 3815 35 2088 KVMNKRSQTQNGTRYMTPPPR
SSHTKQHLL\PTPPPRSSHTKQH
PLHDPITTKLTH RT/CTR YTTPSP
RSSDTEQHPL\PAPSPRSSDTEQ
HPL\PAPSSRSSDTEQHPLHDPT
TTKLTYRTAPATRPHHHEAHTQ
NSTRYTTPSPRSSDTEQHPLHGP
ITTKLTHRTAPATRPHHHEAHT
QNSTRYTAPPPRSSDTEQHPLH
GPTTTKLRHTTAPATRPHHHEA
HTQNSTRYTAPSPRSSDTEQHP
LHGPITTKLTHRTAPAT/PAPSP
RSSHTEQHPLPAPPPRSSDTEQ
HPLHGPTTTKLTHRTAPATRPH
HHEAHTQNSTRYTAPPPRSSDT
EQHPL\PAPPPRSSHTEQHPL\PA
PSPRSSHTEQHPL\PAPSPRSSHT
EQHPLHGPITTKL/STQNSTRYT
APSPRSSDTEQHPL\PTPSPRSSH
TEQHPL\PTPSPRSSHTEQHPLH
GPITMKLTHRTAPATRPHHHEA
HTQNSTRYTAPSPRSSHTEQHP
L\PAPSPRSSHTEQH PL\PAPSPRS
SHTEQHPLHGPITTKLTHRTAP
AP/PTPSPRSSDTEQHPLHGPITT
KLRHRTAPATRPHHHEVQEQA
KPIK*PPRPSPETTRAQPREPAV
TLLPSGALGQACPCDATAGPHG
TTLWPAVPPRWQQHLTRELLH
PVPRACP*QGQGQPFTAGPGRG
SHPYDPTGASPKGQSSIL
3776 34144 3816 83 84 RLTLPDRLGSPPDTH*AQHITRA VLPQGFTDSPH
Figure imgf000383_0001
Figure imgf000384_0001
Figure imgf000385_0001
Figure imgf000386_0001
Figure imgf000387_0001
Figure imgf000388_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possiblc nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
3819 34187 5859 852 DEEEWAREEEEEEEEEEMVPE
ESMASAGPEDFEQDGEEAALA
RGAPAVDSLGMEEEVDIETEPV
AHEKRPSMLDEPPLPVGVEEPA
DSREPPEEPGLSQEGAMLLSPEP
PAKGLAHPNGSQKV1FRVPLRV
IHGPKAVELQVFPGLHKQPTNQ
PK/TEPCDPHSWFKSCYHLLFIP
VGISRPP/HNPTITATIFASTASV
LW/PVLDTCMSSNSGYFKAVLE
SYSSKVLSVTQYGNPRATGSAG
LRGRPGS\PGSSGSRGPAWP*PQ
AAPRCPPSSGRPGPTSQSPS
3820 34188 A 5860 1997 AQGSWPGLFWAFLQLEVNCL
LESPIIQGKFHFRLER1SWEPQE
RKRLSFRKSEI*P*K*SLVKKL*E
RLKTRKQMQLAΝRLRRYGYSV
VES*FPΝLKVSSSVSTTPTTTYIP
MTHKAIFSSYFLWDGRSAFLTI
YKMMSSHPQEEEEEEEEEGGE
GEERKRRKKEEERGKRRKRRR
RMK*RRRRTRKRRKRRKMK*R
RRRRRRNM RKKRKEGKNMKK
KM/REEIKRQNALYEIEMRKKL
EKKREEMHESRRRFLAPLFSSP
TANCSTSLVPRLRLASLPAALPS
NRWRVTTPPAGVRGAWRHSH
FSRSRSIMDTSSEMLVRFGRRC
GRAKESTGRDWNSLKSSEEDR
KMWESLELPRDLLNAFDQNAD
SDMDNKMQAEMVSDGDEELS
GNWSKGDSCYVLAKRLASFYL
CPRDLWNFEKDDLGYLAEEISK
QQSIQEAQRSRRKKWFYGPGPG
SLCCVQPIDLVPCVPAAPAMAE
RGQCRAHAVASEGGSPKPWQL
PHGVEPVGAQKSRIEVWEPPPR
FQKMYGNAWMSRQKFAAEAG
PHGEPLLGQCRRELWGRSSHVE
SLMGHYLVELLSIGAMGIKVQR
PRCFFDIAINNQPGEKGTGKSTQ
KPLHYKSCLFHRVVKDFMVQG
GDFSEGNGRGGESIYGGFFEGP
AMGPNATNNFTKLAG
Figure imgf000390_0001
Figure imgf000391_0001
Figure imgf000392_0001
Figure imgf000393_0001
Figure imgf000394_0001
Figure imgf000395_0001
Figure imgf000396_0001
Figure imgf000397_0001
Figure imgf000398_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
3859 34227 5899 2289 GELHGVAEQAEGDPREGSPGP
AEQASGTELREVPGPHWPLHPP
EAPVCQHYHRPMVQKGN*GSV
WGRGESLVQG/AHGQTSQRSV
QMTGGGAWTGQTSQRSVQMT
GGGGTPRGSRSSSPRTTYTPGTA
EDTEGEPAGAGEQAAGRPVRP
LHGHPGAGQEAAAGVRELPPA
EPAAHLHPQPALLIQQHPISHAR
SQHPRRPCCLPGPGLRAGGTAE
GLPCAFCSQRDERAEGRERDLE
GGGEAASGPGRQAQAPGQGGH
LGPPLTPAAPLPWWLEGHHRE
ATGRPRGG*GRPPGRGPTGRRK
ASRAQDISSGQNLPRGHPA*VA
SPRHEPPAHLQPAARDHCRGA\
PGSQACPADRGPANGTPPPLPA
RSSPPSP\GMSVASPWTASCGPP
GPPP* PMGPEALPEGGPALPPKP
PPVPA PSEPPQQPPG PCCSPQRP
PAPGPEGQRSRGLGGAHRTAG
AAQCPGGHAGPSPGGGTAPAP
GPAAAAG*GQGRQCQAKGPAH
TRGDAALPTSRLRL*GP*E*GD
QGSSGVAAGLSGGRHTQPAGPG
RAQRTEAAATQDCALDKPLDL
SEWGRARGQDTPKPAGQHGSL
SPAAAHTASPEPPTQSGPLTRSP
QALSNGTKGTRVPEQEEASTPM
PPDLDGHP\GPARLKC*DQSPTN
WM RQTPQA A\SG PELPGGG\PT
STTGEGPEC1CTQEHGQGPPRK
3860 34228 3900 3169 ASQLVLTLAYQANCVSVSYTD
LLGKPGGSYFTFLYVLNIRSRSR
LKKDYDDFRRQPDHDTFNREL
WTTDEGEGDLGKDSPKGEISKS
IDSTEPLDILEKDHFDSDDMKLS
EIDFPMARSKLLKKELPSKDLP
KTLLKTLKRQSKQTDYVDDST
KELSPRKKAKLSTNETTVENLE
SDVQIDCFSESKHTEPSFPESFA
SLDSVPVSTLQKGTKPIQALLA
KNIGNKVTLTNQLPPSTGRNAL
AVEKPVLSPPEAS
Figure imgf000400_0001
Figure imgf000401_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possiblc nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
3866 34234 A 5906 4527 MGFCCRECRPRLLKGRPCIQHA
GPVAAFKVATPYSLYVCPEGQ
NVTLTCRLLGPVDKGHDVTFY
KTWYRSSRGEVQTCSERRPIRN
LTFQDLHLHHGGHQAANTSHD
LAQRHGLESASDHHGNFSITMR
NLTLLDSGLYCCLVVEIRHHHS
EHRVHGAMELQVQTGKDAPSN
CWYPSSSQDSESNHGNNFRIH
VSNGLLMRGPRPLDRERNSSHV
LIVEAYNHDLGPMRSSVRMRK
LRQSTALAQHWTGTALDR
5867 34235 3907 2180 MALTFPCRKFEWYGRRQPEVR
YSVPASHQLKATDADEGEFGR
VWYRILHGNHGNNFRIHVSNG
LLMRGPRPLDRERNSSHVLIVE
AYNHDLGPMRSSVRMRKLRQS
TAL\DSTGQAQHWTESRSGSPG
SPVAPTCSART*QTSAS\VHLCL
SGKSHHAWPP*TPFKLYYVH\E
YSAHIHKENLVLVIVYVEDIND
EAPVFTQQQYSRLGLRETAGIG
TSVIWQATDRDSGDGGLVNY
RILSGAEGKFEIDESTGLIITVNY
LDYETKTSYMMNVSATDQAPP
FNQGFCSVYITLLNELDEAVQF
SNASYEAAILENLALGTEIVRV
QAYSIDNLNQITYRFDAYTSTQ
AKALFKIDAITRILGTQMDTKM
NKTLLSPQRVLRLEVEMELIQD
ANQS ATRRCAEN YN RG V VEPL
RAQQSYLAGEAGRLHGRGGFP
VECEREEGIQQTECPGEVMPDR
GSDMEGVITVQGLVDREKGDF
YTLTWADDGGPKVDSTWSG
TRVYITVLDENDNSPRFDFTSDS
AVSIPEDCPVGQRVATVKAWD
PDAGSNGQVVFSLASGNIAGAF
EIVTTNDSIGEVFVARPLDREEL
DHYILQWASDRGTPPRKKDHI
LQVTILDINDNPPVIESPFGYNV
SVNENVGGGTAWQVRATDRD
IGINSVLSYY1TEGNKDMTFRM
DRISGEIATRPAPPDRERQSFYH
Figure imgf000403_0001
Figure imgf000404_0001
Figure imgf000405_0001
Figure imgf000406_0001
Figure imgf000407_0001
Figure imgf000408_0001
Figure imgf000409_0001
Figure imgf000410_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno>vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
3900 54268 5940 1566 IQTTIGEYYKHLYTNKLENLEE
MDKFLDTYTLPRLNQEEGESLK
RPMAGSEIEAIINSLPTKNSPGP
DRFTAEFYQRYKEELLISNFSK
VS/VIQNQWEKITSIPIHQ*QTNR
EPNHE*TPIHNCFKENKI\LGIQL
TRDVKDLFKENYKPLLSEIKED
TNKWKNIPCSWIGRTNIVKMAI
LPKDKTSKYIDVDENEGSHCGK
RKYKYGMEKALEILARAIRQEK
EIKGIQLGKEEVKLSLFADDMI
VHLENPIISAQNLLKLISNFSKV
SGHKINVQKSQTFLYTNNRQTE
SQIMSGLPFKIATKRIKYLGIQL
TRDVRDLFKENYKPLLNETKED
TNKWKKNILSSWIGRINIVKMA
ILPKVIYRFNAILINLPMTFFTEL
EKTTLKFIWNQKRACIAKTILSQ
RNKAGGITLRDFKPYYKATETK
TASEMKYYLENKIPFKVLHMV
YNVPTHPPFIGDLHPNTKWSL
PPNITSLIEPMNQGVISAFKDCY
LRKTFVQAVATPEGETEMTVM
QFWKDYNT
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO. of peptide hod in USSN location of first codon tor last ammo acid *=Stop codon, /=possιblc nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possιble nucleotide msei tion) sequence
3901 34269 A 3941 2580 MVKGSIQQEELTILNRYAPNTG
APRSIKQVLSDLQRDLDSHTIIM
GDFNTPLSTLDRSTRQKVSKDI
QELNSVLHQADLIDIYRTLHPK
STECTFFSAPHRTYSKIDHIVGS
KALLSKYKRTEIITNCCSDHSAT
KLELRINKLTQN RSTT WKLNN L
LLN DY W VHNEM KAQIKMFFET
NENKDTAYQNLWDTFKAMCR
GKFIALNAHKRKQERSKIDTLT
SRLKELEKQEQTLHSKDSRRQE
INAEKAFDKIQQPFMLKTLNTL
DIDETYLKIIRAIYDKPTVNIILN
GQKLEVFPLKTGTRQGCRLSPL
LFNIMLEVLARAIRQEKEIKGIQ
LGKEEVKLSLFADDMIVYLENP
IISAQNLLKLISNFSKVSGYKIN
VQESQAFLYTNNRQTESQIMSE
LP\FTIASKRIKYLGIQLTRDVKD
LFKENYKPLLKEIKEDTNKWK
NIP\CSWVGRINIVKMAILPKVI
YRFNAISNKLPMTFFTELEKTTL
K\FI*KQKRACIAKSILSQKNKA
GGITLPDFKLY\YYKAIVTKTA
WYWYQNRDIDQWNRTEPSEIIP
HIYNHLIFDKPDKNKKWGNDS
LFNKRCWENWLAICRKLKLDP
FLTPYTKINS\RWIKDLHVRPKT
IKTLEENLGNTIQDIGMG KDFM
TKTPKAMATKS\KIDKWDLIKL
KSFCTAKETΠIRVNRQPTEWK
KI FTI YPSDKGLI SRI YKEPKQI Y
Figure imgf000413_0001
Figure imgf000414_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
3908 34276 A 5949 2377 SLFHGKVCHFLHEPLPLVYLSL
CTGYQLFKPSL1SWLEEEEELST
LPRVLQEWKMCLKTKGPALW
QDNFCLKTLNGIQLARNQNGEE
LYDCKQCEDVFCKHPCLKTNM
STQNRGNTSECIQYAKDLLSLY
NKTSTIRKVSVFSKHGKSFRL\F
*MFRSRESVHKINPLK/CTDYGK
AFIYQSYLEAHRKTQSGEKLNE
WKQCGEAFTHSTSHAVNVETH
IIKNPYECKECGKDFRYPTHLN
NHMQTHIGIKPYKCKHCGKTFT
VPSGFLEHVRTHTGEKPYGCKE
CGKAFGTSAGLIEHIRCHAREK
TFKCDHCGKAFISYPSLFGHLR
VHNGEKPYEHKEYGKAFGTSS
GVIEDRRSNTGQKRFDCDQCG
KVFVSFSSLFAHLRTHTGEKPF
KCYKCGKPFTSSACLRIHMRTH
TEERLYQCKKCGKAFTKCSYLT
KHLRTH AGEKPYECM KCG KAF
TERSYLTKHLRRHSGEKPYECK
KCGKAFTERSDLTKHLRRHTG
DKPYEYKDCGKAFWSSSLVD
HLRTHTGYKPYKCNACEKAYS
RSCVLTQHLKTHAAEKTSECN
ACGNSFRNSMCFHDRLKTLTKI
KPYKCKDCGKAFTCHSDLTNH
VRIHTGEKPYKCKECGKAFRTS
SGRIQHLRTHMGEKPFECDQCG
KAFA\FSQLVLHI*KHTREKPCG
CEECGKTFAVSSSLTEHVKIHR
3909 34277 5950 455 GLLHERQAEARCSICLDYLRHP
MTTDCRHYI*SARIHQCW*ELQ
DISPCPVCLQHCPDKNLKRNFQ
LCHMTDIAKQLLTTARRKRKL
QGEEPVCRKSDVALFCEKDPEL
LCHQYRVSLDH*DH/SPMPIEQ
AAAKHRKQFESY1EPLEKQV
Figure imgf000416_0001
Figure imgf000417_0001
Figure imgf000418_0001
Figure imgf000419_0001
Figure imgf000420_0001
Figure imgf000421_0001
Figure imgf000422_0001
Figure imgf000423_0001
Figure imgf000424_0001
Figure imgf000425_0001
Figure imgf000426_0001
Figure imgf000427_0001
Figure imgf000428_0001
Figure imgf000429_0001
Figure imgf000430_0001
Figure imgf000431_0001
Figure imgf000432_0001
Figure imgf000433_0001
Figure imgf000434_0001
Figure imgf000435_0001
Figure imgf000436_0001
Figure imgf000437_0001
Figure imgf000438_0001
Figure imgf000439_0001
Figure imgf000440_0001
Figure imgf000441_0001
Figure imgf000442_0001
Figure imgf000443_0001
Figure imgf000444_0001
Figure imgf000445_0001
Figure imgf000446_0001
Figure imgf000447_0001
Figure imgf000448_0001
Figure imgf000449_0001
Figure imgf000450_0001
Figure imgf000451_0001
Figure imgf000452_0001
Figure imgf000453_0001
Figure imgf000454_0001
Figure imgf000455_0001
Figure imgf000456_0001
Figure imgf000457_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possiblc nucleotide insertion) sequence
4175 34543 A 4216 896 1626 NATTIRYEHQRVLKAAQYLHQ
QG1TRCNSSSTLTSAAPGRVDSP
PTSMIVAPALIIRFACFTASISEL
CVPPSEKESGVTLRIPITHLMRE
LTCRLIRDKSATVTGSHTLWA
RHCCYAAPGGCCLLANWLKPG
LFGPIGVLSRRGTSVILPIGGFY
QWNPMICPNGWPMQHG*RRG
LAQERPLEEWLPVCRDMLNAF
FLPDAETEAAMTLIEQQWQAII
AEG LG AQ YGDA VPLSLLRDEL
AQRLDQERISQRFLAGPVNICTL
MPMRSIPFKVVCLLGMNDGVY
PRQLAPLGFDLMSQKPKRGDRS
RRDDDRYLFLEALISAQQKLYI
SYIGRSIQDNSERFPSVLVQELI
DY1GQSHYLPGDEALNCDESEA
RVKAHLTCLHTRMPFDPQNYQ
PGERQSYAREWLPAASQAGKA
HSEFVQPLPFTLPETVPLKRYN
DSVRAPTCAESRAIFTSTRNNTL
QLFFNANFRRPWARGLATNVN
DRRASVDHQIRMFHRIYQRVM
RATI RKG I RRD VEDPH YSSD AG
TYLSPNSRQISNGNWQSHIGRSP
SLLLCRTWGLLFTGKLVETRFI
WPNRGVIPTGNERYIAHRRFLP
MEPDDMPQWRCPHATWLAEA
KMFDSLAKAGKYLGQAAKLMI
GMPDYDNYVEHMRVNHPDQT
PMTYEEFFRERQDARYGGKGG
4176 34544 4217 838 1575 CFFLSPSPPSSPPSPNRSQTTEEE
TKRQE/ERERKREEEEEKKGRR
KETKKRRNRQEGKQHRKEEKE
GEKQTKQRTETERETKRRRENE
QAKAHKGTRKRKEEQKKAKA
ARRRTHKRQNPSRGREGTHPK
QRQGKEE/VNRQNKEEAKQKR
EEAGRTRR/EDRGRKDDKKERR
QQQTEKKAKPKAEHGQERTDT
TTKKARQREGRPSERRRREREE/
MSKHDPQNRAEKTNEEKEEGR
QHER*TQKSSTGI
Figure imgf000459_0001
Figure imgf000460_0001
Figure imgf000461_0001
Figure imgf000462_0001
Figure imgf000463_0001
Figure imgf000464_0001
Figure imgf000465_0001
Figure imgf000466_0001
Figure imgf000467_0001
Figure imgf000468_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknovvn,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
4239 54607 4280 2661 MTMNFVADSHTGRNPLASAAG
AKTGLRPLPRPCGARVWNPPD
AGGGGVGSLKTSTPLGPLSAAN
SPVHQGSVPQTRARGGGTLFQE
WTSRTLAFRNSLSAFTEVTSG
TVSGRKGGRSTHLAGRRVSGG
EGSRKAAAAALAAVAAAPGPV
RRCSSQSCFSSSGSSHYSARTSP
VRVRPRRSLSSRSAAGNRAEAT
ESAMEKTLETVPLERKKREKEQ
FRKLFIGGLSFETTEESLRNYYE
QWGKLTDCVVMRDPASKRSRG
FGFVTFSSMAEVDAAMAARPH
SIDGRWEPKRAVAREESGKPG
AHVTVKKLFVGGIKEDTEEHHL
RDYFEEYGKIDTIEIITDRQSGK
KRGFGFVTFDDHDPVDKIVLQK
YHTΓNGHNAEVRKALSRQEMQ
EVQSSRSGRGDGYGSGRGFGD
GYNGYGGGPGGGNFGGSPGYG
GGRGGYGGGGPGYGNQGGGY
GGGYDNYGGGNYGSGNYNDF
GNYNQQPSNYGPMKSGNFGGS
RNMGGPYGGGIWKNTSITERK
KSRKLDLIQSKKGSRTKEAPQP
PVASLCMHLGHWSRLMVSPGA
QLTGKNSHGLSVSSVRKSNVGP
RRLCAAMKATGPDNAQSQVSP
PGHAPSAEDPTGSRTVSSPCTD
RPHPFLSRPKPPTQISLVLPLKT
DGALERMPQQL/HIASS/GAKVP
NPSTQTPPVLLAFFYPFNLPP*N
4240 54608 4281 908 MRKVKGKNRQSFKCLPPPSGA
LQAHGAAAPHGSLLTLHLHLV
PVSSAAMKATGPDNAQSQVSP
PGHAPSAEDPTGSRTVSSPCED
RPHPFLSWPTWISLALLLKTDG
ALERMPQQLPSLHPSQGTQSIH
PDPSSTSSFLLPFQPPTLKRAAFP
CPPSIVNPAVWDTSTPSVAEHH
TPIRITLKEPTQFLSQKQYPIPQA
ALVGLQPIISHLLASHLLRPTDS
PFNTPILPVKKPNGTYRLVQDL
RLINQAVLPLVQE/DYSVLLYLP
LNVTPGLPPATAFSYPPSPGPVA
RARLASRLHSHAA
Figure imgf000470_0001
Figure imgf000471_0001
Figure imgf000472_0001
Figure imgf000473_0001
Figure imgf000474_0001
Figure imgf000475_0001
Figure imgf000476_0001
Figure imgf000477_0001
Figure imgf000478_0001
Figure imgf000479_0001
Figure imgf000480_0001
Figure imgf000481_0001
Figure imgf000482_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
4343 3471 1 A 4384 495 EDTGTFRIY\ESAGAVKKARGF
LEFVEDFIQVSKNLIGKVIGKNG
KVIQEIVDKSDMVPVRIEGDSE
NKLPREDKDDRDSRHQRDSRR
CPGGRCRSVSGRRGRGGPRGG
KSSISSVPKDPDSNPYSVLDN/T
ESDQTADTDASKSHHSTNRHTR
SRRRRTDEDAVL
4344 54712 A 4385 550 TESERKDELSDWSLAGEDDRDS
RHQRDSRRRPGGRGRSVSGGR
GRGGPRGGKSSISSVLKDPDSN
PYSLLDNTESDQTADTDASESH
HSTNRRRRSRRRRTDEDAVLM
DGMT\ESDTASVNENGL\AKDV
IEEHGPSEKAINGPT\SASG\DDIS
KLQRTPGERKRLIP*KKENTQE
AAVLNGVS
4345 3471 4386 2063 MAELTVEVRGSNGAFYKGFIK
DVHEDSLTVVFENNWQPERQV
PFNEVRLPPPPDIKKEISEGDEV
E VYSRAN DQEPCG WWL AKVR
MMKGEFYVIEYAACDATYNEI
VTFERLRPVNQNKTVKKNTFFK
CTVDVPEDLREACANENAHKD
FKKAVGACRIFYHPETTQLMIL
SASEATVKRVNILSDMHLRSIR
TKLMLMSRNEEATKHLECTKQ
LAAAFHEEFVVREDLMGLAIGT
HGSNIQQARKVPGVTAIELDED
TGTFRIYGESADAVKKARGFLE
FVEDFIQVPRNLVGKVIGKNGK
VIQEIVDKSGWRVRIEGDNEN
KLPREDGMVPFVFVGTKESIGN
VQVLLEYHIAYLKEVEQLRME
RLQIDEQLRQIGMGFRPSSTRGP
EKEKGYATDESTVSSVQGSRSY
SGRGRGRRGPNYTSGYGTNSEL
SNPSETESERKDELSDWSLAGE
DDRDSRHQRDSRRRPGGRGRS
VSGGRGRGGPRGGKSSISSVQY
RSNIHNCSTLKRIFLASDMNIVL
KDPDSNPYSLLDNTESDQTADT
DASESHHSTNRRRRSRRRRTDE
DAVLMDGMTESDTASVNENGL
DDSEKKPQRRNRSRRRRFRGQ
AE\DRQPAIDFIYKEVEKVVSL
WQAKDVIEEHGPSEKAINGPTS
ASGDDISKLQRTPGEEKINTLKE
Figure imgf000484_0001
Figure imgf000485_0001
Figure imgf000486_0001
Figure imgf000487_0001
Figure imgf000488_0001
Figure imgf000489_0001
Figure imgf000490_0001
Figure imgf000491_0001
Figure imgf000492_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
4394 34762 A 4435 2571 MKAEIKMFFETNENKDTTYQN
LWNTFKAMCRGKFIALNAHKR
KQERSNTDTLTSQLKELKKQEQ
THSKPSRRQEITKIRAEMKEIET
QKTLQKIKESRTWFFEKINKIDR
LLARLTKKKREKNQIDAIKNDK
GDITTDPTEIQTTIREYYKHLYA
NKLENLEEMDKFLDTYTLPRLN
QEEVESLNRPITGSEIEAIINSLP
T/KKCPGPDGFTAEFYRRKRGM
LPNSFYEASIILIPKPGTDTTKKE
NFRPISLMNIDVKILNKILANRI
QQHIKKLIHHDQVGFIPGMQG
WFNIRKSINIIQHINRAKDKNH
MIISIDAEKAFDKIQQCFMLKTL
NKLGIDGTYLKIIRAIYDKPTAN
IILNGQKLEVFPLKTGTRQGCPL
SPLLFNIVLEVLARAIRQEKEIK
GIQLGNEEVKLSLFADDMIVYL
ENPIISAPNLLKLINNFSKGSAY
KIKVQKSQAFLYTNNRQTESQI
MSELPFTIASKRIKYLGIQLTRD
VKDLFKENYKPLLKEIKEDTNK
WKNIPCSWVGRINIMKMAILPK
VIYRFNAILIKLPMTFFTELEKST
LKFIWNQKRARIAKSILSQKNK
AGGITLPDFKLYYKATVTKTA
WYWYQNRDIDQWNGTEPSEIM
PHIYNYL1FDKPEKNKQWGKDS
LFNKWCWENWLAICRKLKLDP
FLTPYTKINSRWIKDLHVRPKTI
KTLENLGNTIQDIGMGKDFMSK
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possiblc nucleotide insertion) sequence
4395 34763 4436 1965 MTLESEQTFVYAVTATQTGAK
EGTRMSKSNVAGQQGDSGEKA
LQKTYQKILREKESALEAKYQA
MERAATFEHDRDKVKRQFKIF
RETKENEIQDLLRAKRELESKL
QRLQAQGIQVFDPGESDSDDNC
TDVTAAGTQCEYWTGGALGSE
PSIGSMIQLQQSFRGPEFAHSSID
VEGPFANVNRDDWDIAVASLL
QVTPLFSHSLWSNTVRCYLIYT
DETQPEMDLFLKDYSPKLKRM
CETMGYFFHAVYFPIDVENQYL
TVRKWEIEKSSLVILFIHLTLPRI
KYLGIQLTRDVKDLFKENYKPL
LNEIKEDTNKWKNILCSWTGR
NNVMKMATLPKVIYRFNAIPIK
LPMTFFTELEKTTLKFIWNQKR
AHIAKTILSEKNKAGGIMLPDF
KLYFKATVTKAAWYWCQNRD
IDQWNRTEASEITPHIYNHLIFD
KPDKNKKWGKDSLFNKWCWE
NWLAICRKLKLDPFLTPYTKIN
SRW1KDLN VRPKTI KTLEENLG
NAIQDIGMGKDFMTKTPKAMA
TKAKIDKWDLIKLKSFCMAKET
PIGVNRQLTEWEKIFAIYPSDKG
LISRIYKELKQTYKKKTNNPIEK
LAKEMNRHLSKEDIYAANRHK
KKCSSSLVIREMQIKTT/MRYHL
TPVRMAIIKKSGNNRCWRGCG
4396 34764 4437 300 476 PDLSL WLPLTFFPSFQL W* I*QL
CVLELLFSRSIFVAFSVFPEFES
WPALLGWGSSPG
Figure imgf000495_0001
Figure imgf000496_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Ammo acid sequence ( λ=Unkno\vn,
NO: of peptide hod in USSN location of first codon for last ammo acid *=Stop codon, /=possιble nucleotide sequence 09/540,217 codon tor peptide of peptide sequence deletion, \=possιble nucleotide insertion) sequence
4407 34775 4448 1802 MSYPADDYESEAAYDPYAYPS
DYDMHTGDPKQDLAYERQYE
QQTYQVIPEVIKNFIQYFHKTVS
DLIDQKVYELQASRVSSDVIDQ
KVYEIQDIYENSWTKLTERFFK
NTPWPEAEAIAPQVGNDAVFLI
LYKELYYRHIYAKVSGGPSLEQ
RFESYYN YCNLFN Y I LNADGPA
PLELPNQWLWDIIDEFIYQFQSF
SQYRCKTAKKSEEEIDFLRSNP
K1WNVHSVLNVLHSLVDKSNIN
RQLEVYTSGGDPESVAGEYGR
HSLYKMLGYFSLVGLLRLHSLL
GDYYQAIKVLENIELNKKSMYS
RVPECQVTTYYYVGFAYLMMR
RYQDAIRVFANILLYIQRTKSM
FQRTTYKYEMINKQNEQMHAL
LAIALTMYPMRIDESIHLQLREK
YG\DKMLRMQKGDPQVYEELF
SYSCPKFL\SPWPNYDNVHPN
YHKE\PFLQ\QLKGVF**SSSQQ
AQLS/TPIRSFLKLYTYTMPWAK
LAGFPGPSQSQEF\RI PG FF VFKQ
QDERTSVWTQRVFSAPGW*NF
SQASEVDF\YI\DKDMI\HIADTK
VA\RRYG\DFFIRQI\HKF\EELNR
TLKEGWGQRPWMIFHTHFREP
GFECIIGQGSVFC
Figure imgf000498_0001
Figure imgf000499_0001
Figure imgf000500_0001
Figure imgf000501_0001
Figure imgf000502_0001
Figure imgf000503_0001
Figure imgf000504_0001
Figure imgf000505_0001
Figure imgf000506_0001
Figure imgf000507_0001
Figure imgf000508_0001
Figure imgf000509_0001
Figure imgf000510_0001
Figure imgf000511_0001
Figure imgf000512_0001
SEQ ID SEQ II) NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
4508 34876 A 4549 1602 NLSPILPQDLWPFTRVTVHWGK
GNDQTFQGLQDTGSELMLIPGD
PKRHCSPPVKVGSYGGQVINGV
LAQVRLTVGTVGPRTHPWISP
VPEC1IDIDILNSWQNPHIDSLTG
RVKAIMVGKAKWKPFEPLLPIK
IVNQKQYRIPGGIAEISATIKDL
KDAGWIPITLPFNSPFWPVKKT
DGSWRMKWYCKLNQWTPIT
AAVPDV/WSLLEQINTSPGTW
YAAIDLANAIFSIPVHKAHQKQ
FAFSWQGHQNTFTVFTILLHIH
KVGHAQQHSIIKWKWYIHDGA
RAGSEGTSKLNEEVPQMPMVT
TSAALPSLPRPAPMASWGVLY
DQLTEEEKTRAWFTDGSARYA
GTTQKWTAAALQPLSRTSLKG
SGEGKSSQWAELQAVHLWHF
SWKDKWPDVRLYIDSWAVAN
GLAGWSGTWKKHDWKIGDKEI
WGRGMWMDLSEWPKPVKIFG
SHVSAHQWVISAEEDFNNQVD
KMTCSVDITQPLSPATPVITQW
AHKQSGHGGRDGGYTWAQQH
GLPLTKTGLAMATAECPI
Figure imgf000515_0001
Figure imgf000516_0001
Figure imgf000517_0001
Figure imgf000518_0001
Figure imgf000519_0001
Figure imgf000520_0001
Figure imgf000521_0001
Figure imgf000522_0001
Figure imgf000523_0001
Figure imgf000524_0001
Figure imgf000525_0001
Figure imgf000526_0001
Figure imgf000527_0001
Figure imgf000528_0001
Figure imgf000529_0001
Figure imgf000530_0001
Figure imgf000531_0001
Figure imgf000532_0001
Figure imgf000533_0001
Figure imgf000534_0001
Figure imgf000535_0001
Figure imgf000536_0001
Figure imgf000537_0001
Figure imgf000538_0001
Figure imgf000539_0001
Figure imgf000540_0001
Figure imgf000541_0001
Figure imgf000542_0001
Figure imgf000543_0001
Figure imgf000544_0001
Figure imgf000545_0001
Figure imgf000546_0001
Figure imgf000547_0001
Figure imgf000548_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
4745 351 13 A 4786 832 ERHHPPPQAGPHPPRRRKERNT
HRQPTPQPHGA/WKISVPQTTN
KGISSSITSTQRNQPQATTVEKA
HNESTRTPEEPKGAVRGTR*KK
KKTSRDTPPQTP*EKSSSATRNK
AGGKDSDKRERKATEDQTTPS
KRRKLH PM AKKLKTLKKN *MN
G*LE*PMQRSP*RT*WS*KPRHE
NYVMNAQASVTDAINWKKGPP
PRPTTNDAKIVTRPTHPTCSHH
RPPRKPPRTHPTPTPTQNKISQ*
NGYTPPRGKRVREDRCKPPQAP
TSAPRAAKQRQS
4746 351 14 4787 1580 4673
4747 351 15 4788 462 MKLEHQEAQRRSGQGREKRR
WPEGKAGPGCEGAWILCPESQ
DDSKQEGDNNMIWSRNAVRS
VKAEFQGDNLENWESCSEAIIV
QGRDNGVPHKAHGLGTEEEGT
VLKISERQNWLDLVDFILLGGD
LFHENKPSRKTLHTCLELLRKY
CMGDRPVQFEILSDQSVNFGFR
KLTNRKDIHTKNPSVRHHHQRP
KVDETIKMGKTQSRKTRNSKN
QSTSPPPKERSSSPAIEQSWMEN
DFDELKEEGFRRSNYSELKEEV
RTNGKEVKNLEKKLDKWITRIT
NAEKSLKDLMELKTIAQELRDE
CTSLSNQCDQLEERVSVMEDQ
MNEMKREEKFREKRIKRNEQS
LQEI WDY VKRPN LCLIG VPESD
GENGTKLENTLQDIIQVNFPNL
ARQANIQIQEIQRMPQRYSLRR
ETPRHIIVRFTKVEMKEKMLRT
AREKGLECSGAGLAHCKLWLL
GPSDPPDCSSVSPVLRVHLVLPS
SLPHSVGTPFLGSVSIPPSVPRFP
DRVFHPYPYTHYCDNLKTCHT
SHGSVMAETAVINHKKRKNSP
RIVQSNDLTEAAYSLSRDQKRM
LYLFVDQIRKSDGTLQEHDGIC
EIHVAKYAEIFGLTSAEASKDIR
QALKSFAGKEWFYRPEEDAG
DEKGYESFPWFIKRAHSPSRGL
YSVHINPYLIPFFIGLQNRFTQFR
LSETKEITNPYAMRLYESLCQY SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λniino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
4748 55116 A 4789 1314 2221 KNRNYNKLSLRPQCNPTRTQD*
ETHSKPLNYMETEQPSPE*LLG
T*RNEGRHKAVL*NQ*EKRHNI
PESLGHI/AKQCVEGNL*H*MPT
RESRKDLKLTP*/PSQLKELEKQ
EQTHSKAGRRQE/TN*DQSRIEG
DRDTKNPSKNQ*IQELVF*KDQ
QN**TASKT/IQRRKERRIK*TQ*
KMLKEDITTDPTEIKTIIREYYK
HLYAHNLENLEEMDKFLDTYT
LPRLNQEEAESLNRPITNSEIET
VINSLKEKAQDQKDLQLNSTRA
LFTI AKA WNQPKCPSMTDE I K/I
NVEHIHHGILCSHQKE
4749 351 17 4790 2260 TKDKNHMIISIDGGKAFDKIQQ
PFMLKTLNKLGIDGTYLKRIRAI
FDKPTANIILNGQKLEAFPLKTG
TRQGCPLSPLLFNIVLEVLARVI
RQEKEI GIQFGKEEVKLSLFA
DDMTVYLENPIFSAQNLLKLIS
NFSNVSGYKINVQKSQAFLYTN
NSQIMSELPFTIATKRITYLGIQL
ARDVKDLFKENYKPLLNEIKED
TNKWKNIPCSQIGRILWPYCPR
QEDENFNSLLQNGDILNSSTEE
KFKAHDKKDFNLPEYDLNVEE
RLVLIEKSVDSTATADDTHKLD
HINMNLNKLITNDTFQPEIMERS
KTQDIVLGTSFLSINSKEETEHL
ENGNKYPNLESVNKVNGHSEE
TSQSPNRTEPHDSDCSVDLGISK
STEDLSPQKSGPVGSWKSHSIT
NMEIGGLKIYDILSDNGPQQPST
TVKITSAVDGKNIVRSKSATLL
YDQPLQVFTGSSSSSDLISGTKA
IFKFDSNHNPEEPN 11 RGPTSGPQ
SAPQAYGPPQYNIQYSSSAAVK
DTLWHSKQNPQIDHASFPPQLL
PRSESTENQSYAKHSANMNFSN
HNNVRANTAYHLHQRLGPARH
GEMWAISPNDRLIPAVTRSTIQR
QSSVSSTASVNLGDPGSTRRAQ
IPEGDYLSYREFHSAGRTPPMM
PGSQRPLSARTYSIDGPNASRPQ
SARPS1NEIPERTMSVSDFNYSR
TSPSKRPNARVGSEHSLLDPPG
4750 35118 4791 1516 1729 ILAPHSLLACRVSAERSAVSPM GFPLWVTQPFSLAALN/DFFLHF NFG/R QLCVLELLFSRSrWAF SEFP
Figure imgf000551_0001
Figure imgf000552_0001
Figure imgf000553_0001
Figure imgf000554_0001
Figure imgf000555_0001
Figure imgf000556_0001
Figure imgf000557_0001
Figure imgf000558_0001
Figure imgf000559_0001
Figure imgf000560_0001
Figure imgf000561_0001
Figure imgf000562_0001
Figure imgf000563_0001
Figure imgf000564_0001
Figure imgf000565_0001
Figure imgf000566_0001
Figure imgf000567_0001
Figure imgf000568_0001
Figure imgf000569_0001
Figure imgf000570_0001
Figure imgf000571_0001
Figure imgf000572_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Ammo acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possιble nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possιble nucleotide insertion) sequence
4888 S5256 4929 2502 MFFETNENKDTTYQNLWDTFK
AVCRGKFIALNTHKRKQERSKI
DTLTSQLKEPEKQEQTHSKASR
RQEITKIRAELKEIETQKTLQKIL
QKINESRSWFFEEINKIDRLLAR
LIKKKREKNQIDAIKNDKGDITT
DRTEIQTTIREYYKHLY ANKLE
NLEEMDKFLDTYNLPRLNQEE
VESL\HRLITGSEIEAIINSLPTKK
SPGPDGFTAKFYQRYKEELVPF
LLKLFQSIEKEGILPNSFYEANII
LIPKPGTDTTKKENFRPISLMNI
DVKILNKILANRIQQQIKKLIHH
DQVGFIPGMQGWFNIHKSINVI
QHINRTKDKNHMIISIDAEKAF
DKIQQPFMLKSLNKLVLEVLAR
AIRQEKEMKGIQLGKEEVKLSL
FADDMIVYLENPIISAQDLLKLI
SNFSKVSGYEINVQKSQASLYT
NNRQTESQIMSELPFTIASKRIK
YLG1QLTRDVKDLFKENYKPLL
NEIKEDTNKWKNIPCSWVGRIN
IVKMAILPKVIYRFNAILIKLPM
TFFTELEK\TTLKFIRNQKRACIA
KS1LSKKE\KAGGIMLPEFKL*Y
KATVTKTVWYWCQNRYIDQW
NRTEPSEIIPHIYNHLIFDKPDKN
KKWGKDSLFNKWCWEHWLAI
CRKLKLDPFLTPYTKINSRWIK
DLNVRPKTIKTLAGHLGNTIRDI
GLGKDFMTKTPKAMATKAKID
KWDLIKLKSFCTAKETAIRVNR
4889 35257 A 4930 4187 MGDFNTPLSTLDRSMRQKVNK
DSQELNSALHQADLTDICRTLH
PKSTEYTFFAAPHHTYSKIDHIV
GSKALFSKCKRTEIITNCLSDHS
A I KLELRIKKLTQNRSTTWKLN
NLLLNDYWVHNEMKAEIKMFF
ETN ENKDTTYQNL WDTFKAVY
RGKFIAFKAVCRGKFIALNAQN
RKQERSKIDTLTSQLKELEKQE
QTHSKASRRQEITKIRAELKEIE
TQKTLQKMSESRSWFFEKTNKI
DRPLARLIKKKR
Figure imgf000574_0001
Figure imgf000575_0001
Figure imgf000576_0001
Figure imgf000577_0001
Figure imgf000579_0001
Figure imgf000580_0001
Figure imgf000581_0001
Figure imgf000582_0001
Figure imgf000583_0001
Figure imgf000584_0001
Figure imgf000585_0001
Figure imgf000586_0001
Figure imgf000587_0001
Figure imgf000588_0001
Figure imgf000589_0001
Figure imgf000590_0001
Figure imgf000591_0001
Figure imgf000592_0001
Figure imgf000593_0001
Figure imgf000594_0001
Figure imgf000595_0001
Figure imgf000596_0001
Figure imgf000597_0001
Figure imgf000598_0001
Figure imgf000599_0001
Figure imgf000600_0001
Figure imgf000601_0001
Figure imgf000602_0001
Figure imgf000603_0001
Figure imgf000604_0001
Figure imgf000605_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
5088 35456 5129 2462 MRALWVLGLCCVLLTFGSVRA
DDEVDVDGTVEEDLGKSREGS
RTDDEWQREEEAIQLDGLNAS
QIRELREKSEKFAFQAEVNRM
MKLIIHSLL*NKGIFLERLISNSS
C\ALDRLSLISLTDENALSVNE*
LTVNMKCDKEKNLLHVQHTGV
GMT\REELVKN\LGTIAQIWDQ
ASFLNKMP*SHRKMGPVNLPEL
IGPVWVSGFLFPPSLVADKVIVT
SKHKQPIPQH I WG V WTPNGIFL
*LLDPRGKHS*DRGNRQFTLCP
* KEE ASDLP WNLDTI KNLRQKN
I HSFIKLFLIYVW\SSKTETV*G A
PWGEEEAAKEEKEESDDEAAW
EEEEEEKKPKTKKVEKTVWDW
ELMNDIKPIWQRPSKEVEEDEY
KAFYKSFSKESDDPMAYIHFTA
EGEVTFKSILFVPTSAPRGLFDE
YGSKKSDYIKLYVRRVFITDDF
HDMMPKYLNFVKGWDSDDL
PLNVSRETLQQHKLLKVIRKKL
VRKTLDMIKKIADDKYNDTFW
KEFGTNIKLGVIEDHSNRTRLA
KLLRFQSSHHPTDITSLDQYVE
RMKEKQDKIYFMAGSSRKEAE
SSPFVERLLKKGYEVIYLTEPVD
EYCIQALPEFDGKRFQNVAKEG
VKFDESEKTKESREAVEKEFEP
LLNWMKDKALKDKIEKAWSQ
RLTESPCALVASQYGWSGNME
RI M KAQA YQTGKDISTN Y YAS
5089 35457 5130 45 416 RWGLAMFPSLDNKSETLSQKK
KKKDLCPHISFEMDLTTADAR
WVQDAATRLLATTRCPPNSRG
SR**ISWRAGCCPGGWAGPHW
GHDGCL*PSCVSSS
5090 35458 5131 477 GRLPRHRGGEPGAAQPAGRQP
RGGAAATAAGAPRAGGGAAA
GNARRCREAAARDPHHGEGSG
PEPSQCEGAAHQRAGRAQGD*
GGSGATGEGGRRGHDSHNPLG
RVRGSTLPPS**N*VGL*VPSWP
QCGPAHPPGQHPALQEIHQRLP
LESGGHRW
Figure imgf000607_0001
Figure imgf000608_0001
Figure imgf000609_0001
Figure imgf000610_0001
Figure imgf000611_0001
Figure imgf000612_0001
Figure imgf000613_0001
Figure imgf000614_0001
Figure imgf000615_0001
Figure imgf000616_0001
Figure imgf000617_0001
Figure imgf000618_0001
Figure imgf000619_0001
Figure imgf000620_0001
Figure imgf000621_0001
Figure imgf000622_0001
Figure imgf000623_0001
Figure imgf000624_0001
Figure imgf000625_0001
Figure imgf000626_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
5270 35638 531 1 2333 TGRGYCGDHESSFGAMEEPGA
TPQPYLGLLLEELRRWAALPE
GMRPDSNLYGFPWELVICAAV
VGFFAVLFFLWRSFRSVRSRLY
VGREKKLALMLSGLIEEKSKLL
EKFSLVQKEYEGYEVESSLKDA
SFEKEATEAQSLEATCEKLNRS
NSELEDEILCLEKELKEEKSKHS
EQDELMADISKRIQSLEDESKSL
KSQVAEAKMTFQIFQMNEERL
KIAIKDALNENSQLQESQKQLL
QEAEVWKEQVSELNKQKVTFE
DSKVHAEQVLNDKESHIKTLTE
RLLKMKDWAAMLGEDITDDD
NLELEMNSESENGAYLDNPPK
GALKKLIHAAKLNASLKTLEGE
RNQI YIQLSEVDKTKEELTEH I K
NLQTQQASLQSENTHFENENQ
KLQQKLKVMTELYQENEMKL
HRKLTVEENYRLEKEEKLSKV
DEKISHATEELETYRKRAKDLE
EELERTIHSYQGQIISHEKKAHD
NWLAARNAERNLNDLRKENA
HNRQKLTETELKFELLEKDPYA
LDVPNTAFGRGSRGPGNPLDH
QITNERGESSCDRLTDPHRAPS
DTGSLSPPWDQDRRMMFPPPG
QSYPDSALPPQRQDRFCSNSGR
LSGPAELRSFNMPSLDKMDGS
MPSEMESSRNDTKDDLGNLNV
PDSSFPVAKKEATGPGFVPPPLA
PVRGPLFPVDARGPFLRRGPPFP
Figure imgf000628_0001
Figure imgf000629_0001
Figure imgf000630_0001
Figure imgf000631_0001
Figure imgf000632_0001
Figure imgf000633_0001
Figure imgf000634_0001
Figure imgf000635_0001
Figure imgf000636_0001
Figure imgf000637_0001
Figure imgf000638_0001
Figure imgf000639_0001
Figure imgf000640_0001
Figure imgf000641_0001
Figure imgf000642_0001
Figure imgf000643_0001
Figure imgf000644_0001
Figure imgf000645_0001
Figure imgf000646_0001
Figure imgf000647_0001
Figure imgf000648_0001
SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
35838 5516 114 2695 GSACRDPDPDPLRLVGGGAAA
PPCSSVSPSAWRQSRPREYYEG
GRGLHAPDRETALGLG1ATSER
MLGICRGRRKFLAASLSLLCIPA
ITWIYLFSGSFEDGKPVSLSPLE
SQAHSPRYTASSQRERESLEVR
MREVEEENRALRRQLSLAQGR
APSHRRGNHSKTYSMEEGTGD
SENLRAGIVAGNSSECGQQPW
EKORRNPLHFHLIADSIAEQILA
TLFQTWMVPAVRVDFYNADEL
KSEVSWIPNKHYSGIYGLMKLV
LTKTLPANLERVIVLDTDITFAT
DIAELWAVFHKFKGQQVLGLV
ENQSDWYLGNLWKNHRPWPA
LGRGYNTGVILLLLDKLRKMK
WEQMWRLTAERELMGMLSTS
LADQDIFNAVIKQNPFLVYQLP
CFWNVQLSDHTRSEQCYRDVS
DLKVIHWNSPKKLRVKNKHVE
FFRNLYLTFLEYDGNLLRRELF
GCPSEADVNSENLQKQLSELDE
DDLCYEFRRERFTVHRTHLYFL
HYEYEPAADSTDVTLVAQLSM
DRLQMLEAICKHWEGPISLALY
LSDAEAQQFLRYAQGSEVLMS
RHNVGYHIVYKEGQFYPVNLL
RNVAMKHISTPYMFLSDIDFLP
MYGLYEYLRKSVIQLDLANTK
KAMIVPAFETLRYRLSFPKSKA
ELLSM LDMGTLFTFRYHV WTK
GHAPTNFAKWRTATTPYRVEW
Figure imgf000650_0001
Figure imgf000651_0001
Figure imgf000652_0001
Figure imgf000653_0001
Figure imgf000654_0001
Figure imgf000655_0001
Figure imgf000656_0001
Figure imgf000657_0001
Figure imgf000658_0001
Figure imgf000659_0001
Figure imgf000660_0001
Figure imgf000661_0001
Figure imgf000662_0001
Figure imgf000663_0001
Figure imgf000664_0001
Figure imgf000665_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Uπkπo\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
5616 35984 A 5662 2142 MIILIDAEKAFDKIQQPFMLKTL
NKLG1DGTYLK1TRAIYDKPTA
NIILNGQKLEAFPLKTGTRQGCP
LSPLLFNIVLEVLAQAIRQEKEI
KGIQLGKEEVKLSLFADDMILY
LENPIVSAQKLLKLISNVSKVSG
YKINVQKSQAFLYTNNRQTESQ
IMSEFPFTIATKRIKYLGIQLTRD
VKDLFKKYKPLLNK1KEDTNK
WKNIACSWIGRINIMKMAFPR
WELNNENTWTQEGEHHTLGPV
VGWGKRGGIALVDIPNVNDKL
MVLEVLARAIRQKKEIKGIQLG
KEEVKLSLFADDMIVYLENSIV
SAQNLKLISNFSKVSGYKINVQ
KSQAFLYTNNRQTESQIMSEFP
FTIATKRIKYLGIQLTRDVKDLF
KENYKPLLKEIREDTNKWKNIP
CSRIGRINIMKMAILPKVIYRFN
DIPIKLPMTFFTELEKTTLKFIW
NQKRACIAKTILSKKNIAGGITL
PDFKLYYKATVTKTAWYWYQ
NRDIDQWNRTEASEVTSHIYNH
LIFYKPDKNKKWGNDSLFNKW
CWENWLAICRKLKLDPFLTPYT
KIHSRWIKDLNVRPKTIKTLEEN
LGNTIQDIGMGKDFMTKTPKA
MATKAKVDKWDVIKLKSFCTA
KETTIRVSRQPTEWEKIFAIYPS
DKGLISRIYKELKQIYRKK\TNN
PIKKWAKNMNRHFSKEDIYAA
NRQMKKCSSSLVIREMQIKTTM
Figure imgf000667_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last A ino acid sequence ( X=Unknovvn,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
5619 35987 5665 2460 MPHAGHARSPGHTLIKLAEWW
MSRWMEWNPGFPLSIDAKCHK
DLPRDIQFDSEKGVDFVLNYSK
ALNQEEVESLNRPITGSEIEAIIN
SLPTQKSSGPDGFTAEFYQRYK
EELVPFLLKLFQSIEKEGILPNSF
YEASIILIPKPGRYTHKKNNFRPI
SLMΝIDAKILΝKILAΝRIQQHIK
KLIHQDQVGIIPGMQSWFΝ1HK
SIΝVIQHIΝRTKDKΝHMI1SIDAE
KAFDKIQQPFMLKTLΝKL\GIK
YLRIQLTRDVKDLFKENYKSLL
NEIKEDTNKWKNIPCSWIGRMN
IIKMAI/LPKVIYRFNVIPIKLPMT
FFSELEKSTLKFIWNQKRARIAK
TILSQKN KAGGIMLPDFKL Y YK
ATVTKTAWYWYQNRDIDQWN
RTEPSEMTPHIYNHLIFDKPDKN
KQWGKDSLFNKWCWENWLAI
GRQLKLDPFLTPYTKINSRWIK
DLNVRPKTIKTLEENLGNTIQDI
SMGKDFMSKTPKAMATKAKM
DKWDLIKLKSFCTAKETTIRVN
RQPTEWEKNFAIYSSDKGLISRI
YKQLKQIYKKKTNNPIKKWAK
DMNRHFSKEDVYAANRHMKK
CSSSLAIREMQIKTIMIYHLTPV
TMAIIKKSGNNRCWRGCGEMG
TLLYCWWDCKLVQPLWKTLW
QFLRDLELGIPFDPAI PLLG I YPK
DYKSCCYKDTCTPKLARDDQI
HILKQHRRKELETRQKQYRAW
5620 35988 5666 689 1909 LIAYQPKKSRTRWIHNQILPERI
KYLGIQLTRDVKDFLKEKYKPL
LNEIKEDINKWKNIPCSWIGRIN
LVKMAIL/PQAICRKLKLDPFLT
TYTKINSRWIKDLHVRPKTTKT
LEENLGNTIQDIGMGKDFMSKT
PKAMATEAKIDKWDLIKRKSFC
AAKETTTRVNRQPTDWEKMFA
IYSSDKGLISRIYEELKQ1YKKK
TNNPINKWAKDMNRHFSKEDI
YAANRHMKKCSSSLAIREMQIK
TTVSPFAAQWVSQRYGKWQEA
HPSNNDGKGYRGKPRFQSLSD
VHGPQDQKKIIKESALPPTPTFI
NILMNAKTIENGQFPYLLNALK
QQQPHDAENLFTWGKENAAIV
SPCIEVSAALSQWKVPAWPQRS
GIPNR1LRSPIPGLGSWVAFLDL
VWVRGDPTALK
Figure imgf000669_0001
Figure imgf000670_0001
Figure imgf000671_0001
Figure imgf000672_0001
Figure imgf000673_0001
Figure imgf000674_0001
Figure imgf000675_0001
Figure imgf000676_0001
Figure imgf000677_0001
Figure imgf000678_0001
Figure imgf000679_0001
Figure imgf000680_0001
Figure imgf000681_0001
Figure imgf000682_0001
Figure imgf000683_0001
Figure imgf000684_0001
Figure imgf000685_0001
Figure imgf000686_0001
Figure imgf000687_0001
Figure imgf000688_0001
Figure imgf000689_0001
Figure imgf000690_0001
Figure imgf000691_0001
Figure imgf000692_0001
Figure imgf000693_0001
Figure imgf000694_0001
Figure imgf000695_0001
Figure imgf000696_0001
Figure imgf000697_0001
Figure imgf000698_0001
Figure imgf000699_0001
Figure imgf000700_0001
Figure imgf000701_0001
Figure imgf000702_0001
Figure imgf000703_0001
Figure imgf000704_0001
Figure imgf000705_0001
Figure imgf000706_0001
Figure imgf000707_0001
Figure imgf000708_0001
Figure imgf000709_0001
Figure imgf000710_0001
Figure imgf000711_0001
Figure imgf000712_0001
Figure imgf000713_0001
Figure imgf000714_0001
Figure imgf000715_0001
Figure imgf000716_0001
Figure imgf000717_0001
Figure imgf000718_0001
Figure imgf000719_0001
Figure imgf000720_0001
Figure imgf000721_0001
Figure imgf000722_0001
Figure imgf000723_0001
Figure imgf000724_0001
Figure imgf000725_0001
Figure imgf000726_0001
Figure imgf000727_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, V=possible nucleotide insertion) sequence
6261 56629 2052 MGDFNTPLSTLDRSMRQKVNK
DTQELNSALHQADLIDIYRTLH
PKSTEYTFFSAPHHTYSKIDHIV
GSKALLSKCKRTEI ITN YLSDHS
AIKLELRIKNLTQNRSTTWKLN
NLLLNDYWVHNEMKAEIKMFF
ETNENKDKTYQNLWDAFNAV
CRGKFVALNAHKRKQEGSKID
TLTSQLKELEKQEQTHSKASRR
QEITKVRAELKEIETQKTLQKIN
ESRS\ARLIKKKR£KNQI/DAIKN
DKRDITIDPTEIQTTIREYYKHL
Y ANKLENLEEM DKFLDTYTLP
RLNQEEVESLNRP1TGSEIVAIIN
SLPTKKSPGPEGFTAEFYQRYK
EELVPFLLKLFQSTEKEGILPNS
FYEASIILIPKPGRDTTKKENFRP
I SLMNI DAKILNKIL AKRIQQHI
QKLIHHDQVGFIPGMQGWFNIC
KSINVIQHINRTKDKNHMIISID
AEKVFDKIQQRFM LKTLNKL V
LEVLARAVRQEKKIKGIQLGKE
ELKLSLFADDMIVYLENPIVSA
QNLLKLISNFSKVSEYKINVQKS
QAFLYTNNRQTESQIMSELPFTI
ASKRIKYLGIQLTRDVKDLFKE
NYKPLLKEIKEDVNKWKNIPCS
WVGRINIVKMAILPKVIYRFNAI
PIKLPMTSFTELEKTTFKFIWNH
KGARIAKSILSQKNKAGGITLPD
FKLYYKATVTKTAWY
Figure imgf000729_0001
Figure imgf000730_0001
Figure imgf000731_0001
Figure imgf000732_0001
Figure imgf000733_0001
Figure imgf000734_0001
Figure imgf000735_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
6307 56675 6357 2569 MRTKTQHTRISWDAFKAVCRG
KFIALNAHKRKQERSKIDTLTS
QLKELEKQEQTHSKASRRQEIT
KIRAELKEIETQKSLQKINESRS
WFFERTNNIDRPLARLIKKKREK
NQIDTIKNDKGDITTDPTEIQTTI
REYYKHLYANKLKNLEEMDKF
LDTYTLPRLNQEEVESLNRPITG
SEIVTIINSLPTKKSPGPDGFTAE
FYQRYKEELVPFLLKLFQSIEKE
GILPNSFYEASIILIPKPGRDTTK
KENFRPISLMNIDAKILNKILAN
RIQQHIKKLIHHDQVGFIPGMQ
GWFNIRKSINVIQHVNRAKDKN
HMIISIDAEKDFDKIQQPFMLKT
LNKLG I DGTYLKI I RAI YDKPTA
NIILNGQKLEAFPLKTGTRQGCP
LSPLLFNIWEVLASAIRQEKEI
KAQNLLKLISNFSKVSGYKINV
QKSQAFLYTNNRQTESQIMSEL
PFIIASKRIKYLGIQLTRDVKDLF
KENYKPLIKEIKEDTNKRKNIPC
SRVGRINIVKMAILPKVIYRFNA
IPIKLPMTFFTELEKTTLKFIWN
QKRAHIAKSILSQKNKAGGITLP
DFKLYYKATVTKTAWYSYQNR
DIDQWNRTEPSEITPHIYNYLIF
DKPEKNKQWGKDSLFNKWGW
ENWLAIWRKLKLDPFLTPYTKI
NSRWIKDLNVRPKTIKTLEENL
GITIQDIGMGKDFMSKTP/TSNG
NKRQN*QMGSNETKELLHSKR
6308 36676 6358 1702 2252 FLSLFFF1SLASSLSILLIHIMNSW
IPHSPIFTDLΝVESSRLCPLGDIM
M*IPLTWYLIVTCTHLSREITTV
PRGLARLWSFTRICPRTHCKPIP
AWLLRAGCPPSCWWDQDTHS
QLPSKRSSYTLFFCTVTEYPFLD
*WP*ΝHGTPC*TAPLWSMF/PCS
CWYPVTGPGVSCSPSCVQVFSL
FISHL*VRTCGVWFFVLAIVC*E
*WFPASSMSQAVLVTVAL*YSL
KSGSVMPPA\FSFGLGLTWRCR
LFFGSIRTLK*IFPIL*RKSLVA*
WGWH*IYKLRWAVWPFSRY*F
FLPMSM/VMFFHLFVSSFISLSS
GL*FSLKRSFMSLVSWIPKYFIL
FEAIVNGSSLM1WLSVCLLLLH
KNACDFCTLILYPETFLKLLISL
RRFWAETMGFSRYT1MSSANR
DNLTSSFPN
Figure imgf000737_0001
Figure imgf000738_0001
Figure imgf000739_0001
Figure imgf000740_0001
Figure imgf000741_0001
Figure imgf000742_0001
Figure imgf000743_0001
Figure imgf000744_0001
Figure imgf000745_0001
Figure imgf000746_0001
Figure imgf000747_0001
Figure imgf000748_0001
Figure imgf000749_0001
Figure imgf000750_0001
Figure imgf000751_0001
Figure imgf000752_0001
Figure imgf000753_0001
Figure imgf000754_0001
Figure imgf000755_0001
Figure imgf000756_0001
Figure imgf000757_0001
Figure imgf000758_0001
Figure imgf000759_0001
Figure imgf000760_0001
Figure imgf000761_0001
Figure imgf000762_0001
Figure imgf000763_0001
Figure imgf000764_0001
Figure imgf000765_0001
Figure imgf000766_0001
Figure imgf000767_0001
Figure imgf000768_0001
Figure imgf000769_0001
Figure imgf000770_0001
Figure imgf000771_0001
Figure imgf000772_0001
Figure imgf000773_0001
Figure imgf000774_0001
Figure imgf000775_0001
Figure imgf000776_0001
Figure imgf000777_0001
Figure imgf000778_0001
Figure imgf000779_0001
Figure imgf000780_0001
Figure imgf000781_0001
Figure imgf000782_0001
Figure imgf000783_0001
Figure imgf000784_0001
Figure imgf000785_0001
Figure imgf000786_0001
Figure imgf000787_0001
Figure imgf000788_0001
Figure imgf000789_0001
Figure imgf000790_0001
Figure imgf000791_0001
Figure imgf000792_0001
Figure imgf000793_0001
Figure imgf000794_0001
Figure imgf000795_0001
Figure imgf000796_0001
Figure imgf000797_0001
Figure imgf000798_0001
Figure imgf000799_0001
Figure imgf000800_0001
Figure imgf000801_0001
Figure imgf000802_0001
Figure imgf000803_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
6923 37291 6974 261 2667 TWCSGRYMVGCR/DWHLLLV7
KGLRKLTIVAEGEGGAGMSLG
ESGNKIKRDKEGHYIMVNGSIQ
QEELTILNIYAPNTGAPRFIKQV
LRDLQRDLDSHTIIMRDFNTPLS
ALERSTREKVNKDIQELNSALH
QADLVDIYRTLHPKSTEYTFFS
APHRTYSNIDHIVGSKALLSNC
KRTEIITKCLSDHSTIKLELRIKK
LTQNRSTTWKLNNLLLNDYWV
HNEMKAEIKMFFETIENKDTTY
QNLWDTFKAECRGKFIALNAH
KRKQERSKIDTLTSQLKELEKQ
EQTHSKASRRQEITKIRAELKET
ETQKTLQKINESRSWFFERXNKI
DRPLARLIKKKREKNLIDAIKN
DKGDITTDPTEIQTTIREYYKHL
YANKLENLEEMDKFLDTYTLP
RLNQEEVESLNRPITGSEIVAIIN
SLPTKKSPGPDGFTAEFYQRHT
VSILISYCQGGSLMSQVYWNQQ
EKPPSSSNAPPGLSVNKAQHRH
VCCRGEMLKRVQVHYVRAGES
DSASFHCVAQGPRAFGKPLILS
KENHLLTVTAIAKLNHKLCEIK
SMG VFINTEPL Y DSCLEQKHAL
FASPPATTHSTTMISKAKVIPML
SLTPGSFQTVESSLAPSAGKFQG
TETSSLKTLINLCCPKLPSPSPA
HLGGLPGGSQERPSRQYLYESA
RTTVLLGLGCPLKQIQLRSQHS
SPLEYLESLPKKDRKKREKIQIN
Figure imgf000805_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
6926 37294 6977 1921 MGESVQSLLTLAQKRSMMIQV
AVAPRTPTLLLKLYKEKASKTQ
LKKVNLWQDGQSLESIEVHD
HEPADTYKLLSLVKEPAEEIPR
VIQNVFGHRSKKENLKDSYGLR
QSSTMVATTWNNVSLMQHHSL
RRPNRHLTASYPHFHFQEIPVNF
FHSGRDSESFGWNQHIKKDKE
GHHIMVKGSIQQEELTMLNIYA
PNTGAPRFINQVLSDIERDLDSH
KI I MGDFNTPLSTLDRSTRQKV
NKDTQELNSALHQADLTDIYRT
LHPKSTEYTLFSAPHHTYSKIDH
IVGSKALLSKWKRTEIITNCLSD
PRAIKLELRIKKLTQNRSSIWKL
NNVLLNDYWVHNEMKAEIKIF
FETNENKDTSCQNLWDTFKAV
CRGKFIALNAHRRKQERSKIDT
LTSQLKELEKQERTHSKASRRQ
EITKIRAEMKEIETQKYTLQKINE
SRSWFFEKINKIDRPLARLIKKK
REKNQIDAIKNNKGDITTDPTEI
ETTIREYYKHLYTNKLENLEEM
DKFLDTYTLPRLNQEEVESLNR
PITGSEVVAIINSLPTKKSPGPDG
FTAEFHQRYKEKLISELLPMPN
HTALKKQTQDLSDCKAMFLYY
QSVSVQTAIINYHRLRGLNNRN
LFLTVLEYSSLRSGCEHSRVL
Figure imgf000807_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Uπkπo\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
6930 S7298 A 6981 2156 KSLKDLMELKIMAQELCDEWT
SLSSRCNQLEERVSVMEDEMN
EMKDKHRPKIKEWRKIYQANG
QQKKTRVANLVSDKTEFKPTKI
KRDKEGHYIMVKGSIQQEELTI
VNIYAPNTGAPRFIKKVLSDLQ
RALDSHTIIMGDFNTPLSTLDRS
TRQKVNKDIQELNSALHQADLI
DIYRTLHPKSTEYTFFSAPHYTY
PKIDHIVGSKALLSKCKRTEIIT
NCLS\DHSAIKLELKIKKLTQNL
STTWKLNNLLLKDYWVHKEM
KAEIKMFFETNKNKDTTYQNF
WDTFKAVCRGKFIALNAHKRK
QEGFKIDTLTLQLKELEKQEQT
HSKASRRINKIDRPLARLIKKKR
EKNQIDTIKNDKGDITTDPTEIQ
TTIREYYKHLYANKLENLEEM
DKFLDTYTLPRLNQEEVESLNR
PITGAEIVAIINSLPTKKSPGPDG
FTAKFYQRYKEELVPFLLKLFQ
SIEKEGILPNSFYEASIILIPKPGR
DITKKENFRPISLMNIDAEILNKI
LASQMQQHIKKLIYHNQVGFIH
GMQGWFNIHKSMNVIQHINRTK
DKNHTIISLDAEKAFDKIQQPF
MLKLKTLNKLGIDGMYLKIIRA
IYDKPTANIILHWQKLEAFPLKT
GTRQGCPLSPLLFNIVLEVLAR
AVRKEKEIKRTQIGREEIKLSLF
ADDMIVYLENPIVSAQNLLKLIS
NFTKVSGYKINVKKSQAFLYN
6931 37299 A 6982 3721 8717 MLPIKRHRL AN WI K\SQDP\SVC
CIEETλHLTCRDAHRHKIKGWR
KIYQANGKQNKTKQQKKQGLQ
ILVSDKRDFKPTKIKRDKEGHYI
M VKG PI QQVEELTILNI YAP\NTG
APRFIKQVLSDLQRDLDSHTLI\
MGD\FNTPLSTLDRSMRQKVNK
DTQELNSALHQADLIDIYRTLH
PKSTEYTFFSAP\HHTYSKIDHIL
GSKALLSKCKRTEIIT\NYLSD\H
SA\IKLKLRIKNLTQNHSTT\WK
LNNL
Figure imgf000809_0001
Figure imgf000810_0001
Figure imgf000811_0001
Figure imgf000812_0001
Figure imgf000813_0001
Figure imgf000814_0001
Figure imgf000815_0001
Figure imgf000816_0001
Figure imgf000817_0001
Figure imgf000818_0001
Figure imgf000819_0001
Figure imgf000820_0001
Figure imgf000821_0001
Figure imgf000822_0001
Figure imgf000823_0001
Figure imgf000824_0001
Figure imgf000825_0001
Figure imgf000826_0001
Figure imgf000827_0001
Figure imgf000828_0001
Figure imgf000829_0001
Figure imgf000830_0001
Figure imgf000831_0001
Figure imgf000832_0001
Figure imgf000833_0001
Figure imgf000834_0001
Figure imgf000835_0001
Figure imgf000836_0001
Figure imgf000837_0001
Figure imgf000838_0001
Figure imgf000839_0001
Figure imgf000840_0001
Figure imgf000841_0001
Figure imgf000842_0001
Figure imgf000843_0001
Figure imgf000844_0001
Figure imgf000845_0001
Figure imgf000846_0001
Figure imgf000847_0001
Figure imgf000848_0001
Figure imgf000849_0001
Figure imgf000850_0001
Figure imgf000851_0001
Figure imgf000852_0001
Figure imgf000853_0001
Figure imgf000854_0001
Figure imgf000855_0001
Figure imgf000856_0001
Figure imgf000857_0001
Figure imgf000858_0001
Figure imgf000859_0001
Figure imgf000860_0001
Figure imgf000861_0001
Figure imgf000862_0001
Figure imgf000863_0001
Figure imgf000864_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
7485 37853 7539 169 PPLLIPRQTGSGVDLQQTPTDLQ
LRVLTVRRKTNKQKGHPHQNSI
CTSPSSKTKDFKPTKIKRDKEG
HYIMVKGSIQQEELTILNIYTPN
TGAPRFIKQVLRDLQRDLDSHT
IIVGDFNTPLSTLDSSKRQKVDK
DIQELNSALYQADLTDIYRTLH
PKSTEYTFFSAPHHTYSKIDHIV
GSKALLSKCNRTEIITNCLSDHS
AIRLELRIKKLTENRSTTWKLN
NLLLKDYWVHNEMKAEIKMFF
ETNENKDTTYQNLWDTGKAAC
RGIFIAINAHKRNQKRSKMDTL
TSQLKELEKQEQRHSKASRRLN
QEEVESLNKPRTGSEIEAIINSLP
TKKSPGADGFTAEFYQRYKEE/
PGTIPSETIPINRKRGNTP*LIL*G
QHHPDTKA
7486 37854 7540 404 MDEFLNTYTLPRLNQEEVESLN
RPITGSEIVAIINSLPRKKSPGPD
GFTAEFYQRYKEELVPFLLKLF
QSIEKEGILPNSFYEASIILIPKPG
RDTTIKENFRPISLMNIDAKILN
KIILANRIQQH I KKLIHHDQ VGFI
PGMQGWFNIRKSINVIQHINRT
KDKNHMIISTDAEKAFDKIQQP
FMLKTLKKFGIDGTYLKIRIKYL
GIQLTRDVKDLFKENYKPLLNE
IKEDTKKWKNIPCSWVGRINIM
KMAILPKVIYRFNAIPIKLPMTF
LTELEKTTLKFIWNQKRARIAK
SILSQKNKAGGITLPDFKLYYK
ATVTKTAWYWYQKRDIHQWN
TTEPSEITPHI YN HLIFDKPEKNK
QWGKDSLFNKWCWENWLAIW
RKLKLDPFLTPYTKINSRWIKD
LNVRPKTIKTLEENLGITIQDIG
MGKDFMSKTPKAMATKAKIDK
WDLIKLKSFCTAKETTIRVNRQ
PTKWEKIFTTYSSDKGLISRIYN
ELKQIYKKKSDNPIKKWVKDM
NRHFSKEDIYAAKKHMKKCSP
SLAIREMQIKTTMRYHLTPPSLL
IPRQTGSGVDLQQTPTDLQLKV
LTVRRKTNKQKVHPHQNPICTS
PSSKTKELEKQEQTHSKASRRQ
EITKIRAELKEIETQKNLQKINES
RSWFLEKVNKIDRLLARLIKKK
REKDQIDAIKNDKGDITTDPTEI
QATIREHYKHLYANKLENLEE
Figure imgf000866_0001
Figure imgf000867_0001
Figure imgf000868_0001
Figure imgf000869_0001
Figure imgf000870_0001
Figure imgf000871_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
7527 57895 7581 2677 TKETCFIRGPKTPVPVTDWEGS
LPLVFNHCRDTSVIIHPRFKGVR
PRRDACLGPSPLAASPAFLGKG
QAELGPNSSSASAPPPYNLFIAS
PPHT WSG LQFRS VTSPPPPAQQF
TLKKVAGAKGIVKRLKTDTAR
SPWKTPRPSRTPSFRKAERTKG
LLKIHLTKLSHQLKKDWTILLP
LSLLRIQACPRNATRLATGQLG
YPFISQSYVLVNGFQTVEDLCE
AADLRVSVADLRVSVTALKVA
RLELFVPPGGLWSLASAVKLQ
TFAVLQLIKAKRWDWGTLEQG
AALIGEARDAQEPTEGVGGSG
MAGCRSRDLPRGKAAKARREI
ERSAGLTIKKERCI RNG YSKEK
MKLIWSHGLHVNDLQHKLTL
FTKETYTYLARDSEKQKQGYL
AGLEGAHANRVNQQISDNLW
KTGALPPPYRTKGEKLYFYMSR
QNCLFSVSGQLLQGFFPNGGQA
APNPYALLSQIPEEAEWFTVLD
LKDDFFCIP/VHPDSQFFFAFEDP
SNPTSQLTWTVLPQGFRVHLFG
QALAQDLSQFSYLDTLVLCGVL
KSPIIIVWELNQIDTIKNDKGDIT
TDPTEIQTTIREYYKHLYTNKLE
NLEEMDKFLNTYTLPRLNQEEV
ESLNRPITGAEIEAIINSLPTKKS
PGPDGFTAEFYQRVADVKLREE
KDENMLL VPLSDLL YQ V H APIP
QLPEVLAANSPTCHLDPPTITES
7528 37896 7582 459 MAVRYTDENVLRKGTREAGT
MMRLRGTTECCADFAAWDVT
HDALRATGGELPGAQVMLTTT
ECGRHVDFCDRAVWLPRMWG
YPLALPGEMRKLYTVRMAGRD
ILAKAGAIIHLNIGEGTPVRCPL
L/EGINPEVWKTEGQYRQAKNA
RPVQVKL
Figure imgf000873_0001
Figure imgf000874_0001
Figure imgf000875_0001
Figure imgf000876_0001
Figure imgf000877_0001
Figure imgf000878_0001
Figure imgf000879_0001
Figure imgf000880_0001
Figure imgf000881_0001
Figure imgf000882_0001
Figure imgf000883_0001
Figure imgf000884_0001
Figure imgf000885_0001
Figure imgf000886_0001
Figure imgf000887_0001
Figure imgf000888_0001
Figure imgf000889_0001
Figure imgf000890_0001
Figure imgf000891_0001
Figure imgf000892_0001
Figure imgf000893_0001
Figure imgf000894_0001
Figure imgf000895_0001
Figure imgf000896_0001
Figure imgf000897_0001
Figure imgf000898_0001
Figure imgf000899_0001
Figure imgf000900_0001
Figure imgf000901_0001
Figure imgf000902_0001
Figure imgf000903_0001
Figure imgf000904_0001
Figure imgf000905_0001
Figure imgf000906_0001
Figure imgf000907_0001
Figure imgf000908_0001
Figure imgf000909_0001
Figure imgf000910_0001
Figure imgf000911_0001
Figure imgf000912_0001
Figure imgf000913_0001
Figure imgf000914_0001
Figure imgf000915_0001
Figure imgf000916_0001
Figure imgf000917_0001
Figure imgf000918_0001
Figure imgf000920_0001
Figure imgf000921_0001
Figure imgf000922_0001
Figure imgf000923_0001
Figure imgf000924_0001
Figure imgf000925_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possi hie nucleotide insertion) sequence
7829 38197 7884 809 KPRLENYVKNAEASGADA1NW
KKGY/LVMEDEMNEMKREGKF
REKRIKRNEQSLQEIWDYVKRP
TLHLIDVPETLNAHKRKQERSK
TDTLTSQLKELEKQEQTHSKAS
RRQEITKIRAELKEIETEKTLQKI
N ESRS WFFERINK IDRPLARLIK
KKREKNQIDAIKNDKGDITSDP
TEIQTTIREYYKHLYANKLENL
EEMDKFLDTYTLPRLNQEEVES
LNRPITGSEIVAIINSLPTKKSPG
PDGSTAEFYQRYKEEL
7830 38198 7885 2142 MIILIDAEKAFDKIQQPFMLKTL
NKLGIDGTYLKITRAIYDKPTA
NIILNGQKLEAFPLKTGTRQGCP
LSPLLFNIVLEVLAQAIRQEKEI
KGIQLGKEEVKLSLFADDMILY
LENPIVSAQKLLKLISNVSKVSG
YKINVQKSQAFLYTNNRQTESQ
IMSEFPFTIATKRIKYLGIQLTRD
VKDLFKKYKPLLNKIKEDTNK
WKNIACSWIGRINIMKMAFPR
WELNNENTWTQEGEHHTLGPV
VGWGKRGGIALVDIPNVNDKL
MVLEVLARAIRQKKEIKGIQLG
KEEVKLSLFADDMIVYLENSIV
SAQNLKLISNFSKVSGYKINVQ
KSQAFLYTNNRQTESQIMSEFP
FTIATKRIKYLGIQLTRDVKDLF
KENYKPLLKEIREDTNKWKNIP
CSRIGRINIMKMAILPKVIYRFN
DIPIKLPMTFFTELEKTTLKFIW
NQKRACIAKTILSKKNIAGGITL
PDFKLYYKATVTKTAWYWYQ
NRDIDQWNRTEASEVTSHIYNH
LIFYKPDKNKKWGNDSLFNKW
CWENWLAICRKLKLDPFLTPYT
KIHSRWIKDLNVRPKTIKTLEEN
LGNTIQDIGMGKDFMTKTPKA
MATKAKVDKWDVIKLKSFCTA
KETTIRVSRQPTEWEKIFAIYPS
DKGLISRIYKELKQIYRKK\TNN
PIKKWAKNMNRHFSKEDIYAA
NRQM KKCSSSL V I REMQIKTTM
Figure imgf000927_0001
Figure imgf000928_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
7844 38212 7899 1751 MIISIDAEKAFDKIQQPFMLKTL
NKLGIDGTYFKIIRANYDKPTA
NIILNGQKLEALPLKNGTRQGC
PLSPLLFNTVLEVLARAIRQEKE
IKGIQLGNVEVKLSLFADDMIV
YLENPIVSAQNLLKLISKFSKVS
GYKINVQKSQAFLYTNNRQTES
QIMSELPFTIASKRIKYLGIQLTR
DVKDLFKENYKPLLKEIKEDTN
KWKNIPCSWVGRLNIMKMAILP
KVIYRFNAIPIKLPMPFFTELEK
TTLKFIWNQKRARIAKSILSQK
NKAGGITPPDFKLYYKATVTKT
AWCWYQNRDIDQWNRTEPSEI
TPHIYNYLIFDKPEKDKQWGKD
SLFNKRCWENWLAICRKLKLD
PFLTPYTKINSRWIKDLKVRPKT
IKTLQENLGFTIQDIGMGKDFM
SKTPTAMGTKDKIDKWDLIKL
KSFCTAKETTIRVNRQPTKWEK
IFTTYSSDNGLISRIYNELKQIYK
KKTNNPI KQWAKDMNRHFSIE
DIYAAKKHMKKCSSSLAIREM
QIKTTMRYHLTPVRMAIIKKSG
NNRNHLDFKH I RI LGIC YL/D* KI
YQP*LHLVSRNRKPEKLSPKQA
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
7845 18213 7900 2540 MGDFNTPLSTLDRSTRQKVNK
DTQELNSALHQPDLIDIYGTLHP
KSTEYTFFSAPHHTYSKIDHILG
SKALLSKCKRTEIITSYLSDHSAI
KLELRIQNLTQNHSTTWKLNNL
LLNDYWLHNEMKAEIKMFFET
NENKDTTYQNLWDAFKAVCR
GKFIALNAHKRKQERSKTNTLT
SINKIDRPLARLIKKKREKTQID
AIKNDKGDITTDPTEMQTTIRE
YYKHLYANKLENLEEMDKFLN
TYTLPRLNQEEVESLNRPITGAE
IVAIINSLPTKKSPGPDGFTAEFY
/HEL\LKENKIPRNPTY/RGCEGP
LQGELQTTAQ*NKRGYKQMEE
HSMLMDRKNQYRENGHTAQG
NLQIQCHPH K ATNDFLHRSGKN
YFKVHMEPKKSPHRQVNPKPK
EQSWRHHAI*LQTILQGYSNQN
SMVLVPKQRYRSMEKNRALRN
NAAYLQLSNL*QT*EKQAMGK
GFLI**MVLGKLASHM*KAETG
SLPYTLYKNQFKMD*RLKH*T*
NHKNPRRKPRHYHSGHRHGQG
LHV*NTKSNGNKSQN*QMGSN
TKELLHSKRNYHQSEQATYKL
GENFCNLPI*PRANNQNLQ*TQ
TNLQEKNKQPHQKVGKGHEQT
LLKRRHLCSQKTHEKMLSITGH
QRNANQNHNEIPSHTS*NGNH*
KVRK*QVLNGLDDVQLFR*NK
QPCCSHKACLWSSYRRA*SLV
Figure imgf000931_0001
Figure imgf000932_0001
Figure imgf000933_0001
Figure imgf000934_0001
Figure imgf000935_0001
Figure imgf000936_0001
Figure imgf000937_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
7876 38244 7931 3146 MGDFNTPLSTLDRSSRQKVNK
DTQELNSTLHHADLIDIYRTLHP
KSTEYTFFSAPHHTYSKIDHW
GSKALLSKCKRTEIITNCLSDHS
AIKPELRIKKLTQNRSTTWKLN
NLLLNDYWVHNKMKAEIKMFF
ETNENKDTTYQNLWDTFKAVS
RGKFIALNAHKRKQKRCKIDTL
ASQLKEVEKQEQTHSKASRRQ
EITKIRAELKEIETQKTLQKINES
RSWFLERINKIDRPLARLIKKKR
EKNQIDVIKNDK
7877 38245 7932 2669 MHNTDGNRFLSHWGYRQALSI
SKPASASLHPSSKTKPLGTQSKT
WAKRNREHGKKERSSSPAME
QS WMEN DFDELREEGFRRSNY
SELREDIQTKGKEVENFEKNLE
ECITRITNTEKCLKELMELKTK
ARELREECRSLRSRCDQLEERR
KQERSKIDTLTSQLKELEKQEQ
THSKAGRRQEITKIRAELKEIET
QKTLQKINESRSWFFERINKIDR
PLARLIKKKREKNQIDTIK\NDK
GNITTDPTEIQTTIREHYKHLYA
NKLENLEGMDKFLDTYTLPRL
NQKEVESLNRPITGSEIVAIINSL
PTKKSPGPDGFTAEFYQRYKEE
LHINRAKDKNHMIISIDAEKAF
DKIQQPFMLKTLNKLVLEVLAR
AIRQEKEIKGIQLGKEEVKLSLF
ADDMIVYLENPIVSAQNLLKLL
SNFSKVSGYKINVQKSQAFLYT
NNRQTESQIMSELPFTIASKRIK
YLGIQLTRDVKHLFKENYKPLL
KEIKEDTNKWKNIPCSWVERIN
IVKMAILPKVIYRFNAIPIKLPM
TFFTELEKTTLKFIWNQKRARIA
KSILSQKNKAGGITLPDFKLYY
KATVTKTAWYSYQNRDIDQW
NRTEPSEILPRIYNYLIFDKPEKN
KQWGKDSLFNKWCWENWLAI
CRKLKLDPFLKPYTKIKSGWIK
DLNVRPKTIKTLEENLGITIQDI
GMGKDFMSKTPKAMATKAKID
Figure imgf000939_0001
Figure imgf000940_0001
Figure imgf000941_0001
Figure imgf000942_0001
Figure imgf000943_0001
Figure imgf000944_0001
Figure imgf000945_0001
Figure imgf000946_0001
Figure imgf000947_0001
Figure imgf000948_0001
Figure imgf000949_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod n USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
7939 38307 7994 2367 HRVTVISGETGCGKTTQVTQFI
LDNYIERGKGSACRIVCTQPRRI
SAHSSCGKSSCRKGQNLVASGN
STGY\QIRLPESVAQGNRVLSYT
VPTRNHPSSGSQSDPYLSSVSHI
VLDEIHERNLQSDVLMTWKD
LLNFRSDLKVILMSATLNAEKF
SEYFGNCPMIHIPGFTFPWEYL
LEDVIEKIRYVPEQKEHRCQFK
RGFMQGHVNSQEKEEKEAIYK
ERWPDYVRELRRRYSASTVDVI
EMMEDDKVDLNLIVALIRYIVL
EEEDGAILVFLPGWDNISTLHD
LLMSQVMFKSDKFLIIPLHSLM
PTVNQTQVFKRTPPGVRKIVIA
TNIAETSITIDDWYVIDGGKIK
ETHFDTQNNISTMSAEWVSKA
NAKQRKGRAG\RVQPGSLLFICI
NGS*EASLLGWT1QLPEIFEE\PF
WEGTLFTK*RFLRLGEIAYFLSR
*MDPPSNEPVLLSIRHL\RSLNA
LDKQEELTPLGS\HLARLPVEPH
IGKMILF\GALFCCLDPVLTIAAS
LKFQ\SPFVIPLGKEKIADARRK
ELAKDTRSDHLTWNAFEGWE
EARRRGFRYEKD\FCWEYFLSS
NTLQMLHNMKGQFAEHLLGA
GFVSSRNPKDPESNINSDNEKII
KAVICAGLYPKVAKIRLNLGKK
RKMVKVYTKTDGLVAVHPKS
VNVEQTDFHYNWLIYHLKMR\
TSSIYLY\DCTEVSP\YCLLFFG\G
7940 38308 7995 25 312 WLIYVITDARMYRQRGRYYFL
VSSSLDKVLMISFLLGWRRSSR
RLRAFLVLHLRASPWAPRSFVR
PVSAFSVGMCHMKTRVFENTF
PFTLGHTVRYVGREI F/CRRVFT
RYRESIKISLPTYLTVCPRVKGN
VFSNTRVFMWHIPTLKADTGRT
KLLGAQGEARRCKTKKARKRR
EERLQPNKKEIIKTLSKEEETKK
7941 38309 7996 417 QQIRKLIKVGLIIHKPVIVHSQV
GCQKSTFS/RTRKGRHMGTG*R
KSTANFQMPEKVTWMRRMRIL
HWLFGRYHESKKTDHHMYHSL
YLKVQGNMFTNKQILMEHNHK
LKADKAHKKFLADQAEARSSK
TNKASKLREER
7942 58310 7997 124 KKRPELRDLLNQARKETL/ARK EDRSASSGAEGDVSSEREP
Figure imgf000951_0001
Figure imgf000952_0001
Figure imgf000953_0001
Figure imgf000954_0001
Figure imgf000955_0001
Figure imgf000956_0001
Figure imgf000957_0001
Figure imgf000958_0001
Figure imgf000959_0001
Figure imgf000960_0001
Figure imgf000961_0001
Figure imgf000962_0001
Figure imgf000963_0001
Figure imgf000964_0001
Figure imgf000965_0001
Figure imgf000966_0001
Figure imgf000967_0001
Figure imgf000969_0001
Figure imgf000970_0001
Figure imgf000971_0001
Figure imgf000972_0001
Figure imgf000973_0001
Figure imgf000974_0001
Figure imgf000975_0001
Figure imgf000976_0001
Figure imgf000977_0001
Figure imgf000978_0001
Figure imgf000979_0001
Figure imgf000980_0001
Figure imgf000981_0001
Figure imgf000982_0001
Figure imgf000983_0001
Figure imgf000984_0001
Figure imgf000985_0001
Figure imgf000986_0001
Figure imgf000987_0001
Figure imgf000988_0001
Figure imgf000989_0001
Figure imgf000990_0001
Figure imgf000991_0001
Figure imgf000992_0001
Figure imgf000993_0001
Figure imgf000994_0001
Figure imgf000995_0001
Figure imgf000996_0001
Figure imgf000997_0001
Figure imgf000998_0001
Figure imgf000999_0001
Figure imgf001000_0001
Figure imgf001001_0001
Figure imgf001002_0001
Figure imgf001003_0001
Figure imgf001004_0001
Figure imgf001005_0001
Figure imgf001006_0001
Figure imgf001007_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8246 38614 8301 2558 FEVLWSDSSITSVTKSSSEVTEFI
SKLCQLYPEENLEKLIPCLAGPD
AFYVERNHVDLDSGLRYLASLP
SHVLKNDHVRRFLSTSSPPQQL
QSPSPGNPSLSKVGTVMGVSGR
PVCGVAGIPSSQSGAQHHGQHP
AGSAAPLPHCSHAGSAGSALA
YRTQMDTSPAILMPSSLQTPQT
QEQNGILDWLRKLRLHKYYPV
FKQLSMEKFLSLTEEDLNKFES
LTMGAKKKLKTQLELEKEKSE
RRCLNPSAPPLVTSSGVARVPPT
SHVGPVQSGRGSHAAELRVEV
EQPHHQLPREGSSSEYSSSSSSP
MGVQAREESSDSAEENDRRVEI
HLESSDKEKPVMLLNHFTSSSA
RPTAQVLPVQNEASSNPSGHHP
LPPQMLSAASHITPIRMLNSVH
KPERGSADMKLLSSSVHSLLSL
EERNKGSGPRSSMKVDKSFGSA
MMDVLPASAPHQPVQVLSGLS
ESSSMSPTVSFGPRTKWHAST
LDRVLKTAQQPALWETSTAA
TGTPSTVLHAARPPIKLLLSSSV
PADSAISGQTSCPNNVQISVPPA
IINPRTALYTANTKVAFSAMSS
MPMGPLQGGFCANSNTASPSR
HPSTSFANMATΛMPSCPAPSSSP
ALSSVPESSFYSSSGGGGSTGNI
PASNPNHHHHHHHQQPPAPPQP
APPPPGCIVCTSCGCSGSCGSSG
LTVSYANYFQHPFSGPSVFTFPF
8247 38615 8302 266 YCGPL*SVWVARNPPGFAFVEF EDPRDAAGVV*ELDGRTLCGP\ PSWGHRP*DDYCRRGLPPRRRP PRRRNLSCSRSRSLYSPWTLSL
8248 38616 A 8303 37 535 ESWLVLGRRKAGRLIGACGFEP
PHFLTLDLEMHRDSCPL\DCKV
YVGNLGNNGNKTELERAFGYY
GPLRSVWVARNPPGFAFVEFED
PRDAADAVRELDGRTLIGC\RV
REE\LSNGEKKK/SRNRGPPPS\W
GRPPSENDNRKEESFPPR\RRSP
R\RRSFS\RSRSRSLF
Figure imgf001009_0001
Figure imgf001010_0001
SEQ ID SE ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of pept ide hod in USSN location of first codon for last amino acid *=Stop codon, /=possιble nucleotide seq nee 09/540,217 codon for peptide of peptide sequence deletion, \=possιble nucleotide insertion) sequence
8262 58630 8318 186 2123 CPAHCWQEAMLSAWRLVMSP
GG PCTQHPAPACHPS VLTTPRK
AGEPRP*WPPCPAWGPLADSSK
PHRWRPKPPASCRPLVRRPWGP
QRTLTPTLTSHWRASIR*SWNW
TPPSSCFPQGLGAPRLSWPRAPC
Q* ERRRNLKPWT* STSR* PPPDQ
GATMAPSTAPAPLSPRPSAPFA
WASSFPETSPERHEAAVRASSS
LGTRAGGTSALCPPQRVSP/HRP
PNSPSISIPCMGSKASSPHGLGSP
LVASPRLEKRLGGLAPQRGSRI
SVLSASPVSDVSYMFGRTPHSP
PLAKEHA\AAAPHPSPTPWWTY
PLC* STAAQN/RGSSPPQRTPGH
QNSVQPGAASPSNPCPATRSNS
QTLSDAPFTTCPEGPARDMQPT
MKFVMDTSKYWFKPNITREQG
CPGGAVSISDSRIVPAIELLRKE
EPGAFVIRDSSSYRGSFGLALK
VQEVPASAQSRPGEDSNDLIRH
FLIESSAKGVHLKGADEEPYFG
SLSAFVCQHSIMALALPCKLTIP
QRGCHTLYLSSVSVETLTGALA
VQKAISTTFERDILPTPTWHFK
VTEQGITLTDVQRKVFFRRHYP
LTTLRFCGMDPEQRKWQKYCK
PSWIFGFVAKSQTEPQENVCHL
FAEYDMVQPASQVIGLVTALL
QDAERM
8263 38631 8319 15 909 LHVSHPGHADARGGLLQPCAA
PPLWLCRIQCPPRWFQTPSQVF
YHAATEHGGKDVYPGQCLWE
GCEPFQRQRFSFITHLQAANCR/
EAQ/RSTPRAQKAIVNHPSAAL
MALRRGSRNLVFRDFTDEKEG
PITKHIRLTAALILKN IGKYSEC
GRSWPRASRYQQGHQDLFILRS
DLPSQVFIRDKLMERRNRRTGR
TEKARI WEVTDRTVRTWIG E A
VAAAAADGVTFSVPVTPHTFR
HSYAMHMLYAGIPLKVLQSLM
GHKSISSTEVYTKVFALDVAAR
HRVQFAMPESDAVAMLKQLS
Figure imgf001012_0001
Figure imgf001013_0001
Figure imgf001014_0001
Figure imgf001015_0001
Figure imgf001016_0001
Figure imgf001017_0001
Figure imgf001018_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
83 13 38681 8369 130 LDFQHHNYKAMRKLMKQVQE
QCPNITRIYSIGKSYQGLKLYV
MEMSDKPGEHE/LGYWHGEW
GEVGTGQGPRHEPAASPHVSPG
EPEVRYVAGMHGNEALGRELL
LLLMQFLCHEFLRGNPRVTRLL
SEMRIHLLPSMNPDGYEIAYHR
GSELVGWAEGRWNNQSIDLNH
NFADLNTPLWEAQDDGKVPHI
VPNHHLPLPTYYTLPNATVSILR
AAVEVLWGADPCLCLLPLLTCP
IQVAPETRAVIKWMKRIPFVLS
ANLHGGELVVSYPFDMTRTPW
AARELTPTPDDAVFRWLSTVY
AGSNLAMQDTSRRPCHSQDFS
VHGNIINGADWHTVPGSMNDF
SYLHTNCFE VTWELFCDM FPH
ENELPHEWENNTDA
8314 38682 8370 2229 MWGLLLALAAFAPAVGPALGA
PRNSVLGLAQPGTTKVPGSTPA
LHSSPAQPPAETANGTSEQHVR
IRVIKKKKVIMKKRKKLTLTRP
TPLVTAG PL VTPTPAGTLDPAE
KQETGCPPLGLESLRVSDSRLE
ASSSQSFGLGPHRGRLNIQSGLE
DGDLYDGAWCAEEQDADPWF
QVDAGHPTRFSGVITQGRNSV
WRYDWVTSYKVQFSNDSRTW
WGSRNHSSGMDAVFPANSDPE
TPVLNLLPEPQVARFIRLLPQT
WLQGGAPCLRAEILACPVSDPN
DLFLEAPASGSSDPLDFQHHNY
KAMRKLMKQVQEQCPNITRIY\
RIGKSYQGLKVYWEMSDKPG
EHELGEPEVRYVAGMHGNEAL
GRELLLLLMQFLCHEFLR\GNP
RVTRLLSEMRIFLLPSMNPDGY
EIALH\RSS*LVGLSEGRWNNQS
IDLNHNFADLNTPLWEAQDDG
KVPHIVPNHHLPLPTYYTLPNA
TVAPETRAVIKWMKRIPFVLSA
NLHGGELWSYPFDMTRTPWA
ARELTPTPDDAVFRWLSTVYA
GSNLAMQDTSRRPCHSQDFSVL
NGNIINGA\DWHTVPG\SMNDFS
YLHTNCFEVTVELSCDKFPPEE*
ICPKEWEVNNKDALLTYLEQVR
MGIAGVVRDKDTELGIADAVIA
VDGINHDVTTALGPGIIWRLA*
PPGD\YMVDLPSSEGYHSVTRN
Figure imgf001021_0001
Figure imgf001022_0001
Figure imgf001023_0001
Figure imgf001024_0001
Figure imgf001025_0001
Figure imgf001026_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8342 38710 8399 89 1685 KGPLLLSLPCRAGAMAETSVAT
GACGEAMAAAEGSSGPAGLTL
GRSFSNYRPFEPQAFGASSPS\W\
RLDGAFSGNERAEGCKVPQEG
RLLKLLAG\LTRP\DVRPP\LGRG
\LVGGQEEASQEAGLPAGAGPS
PTFPALGIGMDSCVIPLRHGGLS
LVQTTDFFYPLVEDPYMMGRIA
CANVLSDLYAMGITECDNMLM
LLSVSQSMSEEEREKVTPLMVK
GFRDAAEEGGTAVTVGQTVVN
PWDLLIGGVATCSIANQFEFAM
PDSAWGDVLWLTKPFRNPGL
LFNAHQWLG*FLERWD*S*RW
WFSREE\VEL\AYQEA\MFNMA
TLNQNCCRV*MHTFNAHAATD
IT\GFG\ILGHSQNL\AKQQRNEV
SFVIHNLPIIAKMAAVSKRASGT
VWGFFQGTSA\ETSGGITGFCLP
R\EQGGLAFCS\EIQILPKYGRGS
PKAWMVGIVEKGNRTAPDPLTS
PRVIEV\LPRG\ATAAVLAP\DSS
NALLLSLAREIERTEWWTLEP
LSTITDG SQELI VKKFPKKA ACI
WPAALSR
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8343 3871 A 8400 30 2391 ARGRPLLAARSSRPCGPRLPFK
RAGPAPAAPHRGDQEARWFSG
AAPSRLPPAPRFSGPAAIFLSAQ
GPPSGAMQPTLLLSLLGAVGLA
AVNSMPVDNRNHNEGMVTRCI
I E VLSN ALSKSS APPITPECRQV
LKTSRKDVKDKETTENENTKFE
VRLLRDPADASEAHESSSRGEA
GAPGEEDIQGPTKADTEKWAE
GGGHSRERADEPQWSLYPSDS
QVSEEVKTRHSEKSQREDEEEE
EGENYQKGERGEDSSEEKHLEE
PGETQNAFLNERKQASAIKKEE
LVARSETHAAGHSQEKTHSRE
KSS/QFSAA*EAGSQENRPQESK
GQPRSQEKSEEGDEDATSEVDK
RRTRPRHHHGRSRPDRSS\QGG
SFPS\EEKGHPQ\EESEESNVSM
ASLGEKRDHHSTHYRASEEEPD
YGEEIKGYPGVQAPEDLEWER
YRGRGSEEYRAPRPQSEESWDE
EDKRNYPSLELDKMAHGYGEE
SEEERGLEPGKGRHHRGRGGEP
RAYFMSDTREEKRFLGEGHHR
VQENQMDKARRHPQGAWKEL
DRNYLNYGEEGAPG\KWQQQG
VLQDTK\ENRVEARFQDKQYSS
HHTAEKRKRLGELFNPYYDPL
QWKSSHLERRDNMIDNFLEGE
EENELTLNEKNFFPEYNYDWW
EKKPFSEDVNWGYEKRNLARV
PKLDLKRQYDRVAQLDQLLHY
Figure imgf001029_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possiblc nucleotide insertion) sequence
8349 38717 8406 10 580 KQCALLTFICSLPMSFGFAPA1T
VTWN/PRHPLIRDMLRRIKEDE
DDKTVLDLAWLFETATLRSG
YLLPDTKAYGDRIERMLRLSLN
IDPDAKVEEEPEEEPEETAEDTT
EDTEQDEDEEMDVGTDEEEET
AKMS*ALLTFICSLPMSFGFAPA
ITVTWNHFVLCSKTGSQVFRS
ENWKVWAESSRGDHDDCLDL
CSVLCWGRTATGAFSCPAQLIF
AVSSSSSVPTSISSSSSCSVSSW
SSAVSSGSSSGSSSTFASGSMFK
LRRSILSILSPYALVSGKRYPDR
SVAVSNKTTARSKTVLSSSSSLI
RRSMSLISGCRD
8350 38718 8407 1967 MTMAAAAWARGAGARAATA
AALRGGCGTAARGRPCAGPAR
PLCTAPGTAPDMKRYLWERYR
EAKRSTEGTKKYTTSLNARHY
YTHFTEENEDEINSSSSYASQKK
TFEINPRHPLIRDMLRRIKEDED
DKTVLDLAWLFETATLRSGYL
LPDTKAYGDRIERMLRLSLNID
PDAKVEEEPEEEPEETAEDTTE
DTEPDEDEEMDVGTDEEEETG
KESDDPMAYIHFTAEGEVTFKS
ILFVPTSAPRGLFDEYGSKKSDY
IKL YVRRVF ITDDFHDM MPKYL
NFVKGWIRKKLVRKTLDMIK
KIADDKYNDTFWKEFGPTTKL\
GVIEDHSNRTΛRLAKLLRIQSSH
HPTDITSLDQYVEK\MKEKQDK
IYFMAGSS\RKEAESSPFVERLL\
KKGYEVIYLTEPVD\EYCIQALP
\EFD\GKRF\QNWQGKECKFHE
S* ENLRK* *CKSVEQRIS ALCLN
W\MKDKALKDKI*K/ALWVSSA
ALTESPVLLLVAQPVRDWSWP
TLGE/RSMKAQAYQTGQGHLL
QIYYAESERKHFEINPQTPRLIR
D\MLRRIKEDEDDKTV\LDLA\V
VLV*NRQRLGSGYLFTQTLKAY
GDW/RLERMLRLSLNI*PLMPK
VEEEPR/EEEPEETAEDTTΛEDTE
\QDED\EEMDVG\TDEKG RTSKG
8351 38719 8408 181 462 GMASASDSSKTGFRGKQWAG
AVGDSAKESQGQPPGAPPRRA
AGL/WRRHHCSF*AD*TTNAPG
GKNPRPGKGAGGEQGTAGDFT
CQMPRTQNTL
Figure imgf001031_0001
Figure imgf001032_0001
Figure imgf001033_0001
Figure imgf001034_0001
Figure imgf001035_0001
Figure imgf001037_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8394 38762 8451 2460 MEEKSKRRGVLWSLLILALKFG
DEPPLGSPNSCKYKLTTLKELG
WREVFSSLQFCLCTSVASQMLS
QYFEYTQPHPGDICHPETCALQ
KVFLTSSGQSSSTSSSSQQFVLQ
INAIQTILEKTPTHISTPTPTPTLA
NPTAVGLEEDREQDVGTSSRG
GAWDTRIKRELEKMSEKCGGQ
INGGDIAVFASRAGHGVCWHPP
CFVCTVCNELLVDLIYFYQDGK
IYCGRHHAECLKPRCAACDEIIF
ADECTEAEGRHWHMKHFCCFE
CETVLGGQRYIMKEGRPYCCH
CFESLYAEYCDTCAQHIGEAIF
GSQ/HWSFC/RCGHLCFGEWG*
PQLLWMLC*LELTKGPRDAIMS
SLHSWHLRKGVRRRRRKKEDK
SPEVLP*KS\SSGTLSH*R*RPH*
SLFLGSRVSL\PSGECRGPGPPYS
HSLRLYSQQALPTGRGPSSSQFP
FCQFYCCYKKSLLCDCLSKSPA
LEKAMRGIDQGQMTYDGQHW
HATETCFCCAHCKKSLLGRPFL
PKQGQIFCSRACSAGEDPNGSD
SSDSAFQNARAKESRRSAKIGK
NKGKTEEPMLNQHSQLQVSSN
RLSADVDPLSLQMDMLSLSSQT
PSLNRDPIWRSREEPYHYGNKM
EQNQTQSPLQLLSQCNIRTSYSP
GGQGAGAQPEMWGKHFSNPK
RSSSLAMTGHAGSFIKECREDY
YPGRLRSQESYSDMSSQSFSET
8395 38763 8452 942 NKNPKNHYPESYVSLKNDFTK
LTSRRHRGVEQALVQLCHVFV
CEHGHVIVTLSSGFIVYRTRAN
YSSSAQGVGIAAAALDAVLIRA
HQRTQGCHHELSALLAFEKGSE
KKEEEEGGQVSRSSSLKVQAQE
LSATEDKGHISPFFWEAVYHYA
SGECRGPGPPYSHSLRLYSQQA
LPTGRGPSSSQFPFCQFYCCYK
KSLLCDCLSKSPALEKAMRGID
QGQMTYDGQHWHATETCFCC
AHCKKSLLGRPFLPKQGQIFCS
RACSAGEDPNGSDSSDSAFQNA
RAKESRRSAKIGKNKGKTEEP\
C*TSTASCK
Figure imgf001039_0001
Figure imgf001040_0001
Figure imgf001041_0001
Figure imgf001042_0001
Figure imgf001043_0001
Figure imgf001044_0001
Figure imgf001045_0001
Figure imgf001046_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmiπo acid sequence ( X=Unkno n, NO: of peptide hod n USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8435 38803 8492 2563 MGDFNTPLSTVDRSMRQKVDK
DTQELNSALHQADLIDIYRTLH
PKSTEYTFFSAPHHTYSKIDNIIG
SKALLSKCKGTEIITNSLSDHSA
IKLELRIKKLTQNHSTTEKLNNL
LLNDYWVHNEMKAEIKMFSET
NENKDTTYQNLWDTFKAVCRG
KFIALNAHKRKQERSKIDTLTS
QLKELEKLNQEEVESLNRPITG
AEI VAIINSLPTKKSPGPDG FTA
KFYWRYKEELVPFLLKLFQSIE
KEGILPNSCYEASIILIPKPGRDT
TKKENFRPISLMNIDAKILNKIL
ANRIQQHIKKLIHHDQVGFIPG
MQGWFNICKSINVIQHINRTKD
KNHMIISIDAEKAFDKIQQPFML
KTLNKLAQNLLKLISN/SQQSLR
IQNQCTKITSILIHQ*QTNREPNH
E*TPIHNCFKENKIPRNPTYKGC
EGPLQGELQTTAQGNKRGYKQ
MEEHSMLMGRKNQYLENGHT
AQGNL*I/QMPSPSSY**LSSQN
WKKLL* SS YGTKK\GPHSQVN P
KPKEQSWRHHTT*LQTILQGYS
NQNSM VLAPKQR* RPMEQNRA
LRNNAAYLQLSDL*QT*QKQA
MGNGFPI**MVLGKLASHMEK
AETGSLPYTLYKN*FKMD*RLT
H*S*NHKNPRRKPRQYHSVHRH
GQGLHV*NTKSNGNKSQN*RM
GSN*TKELLHSERNYHQSEQAT
HKMGENFRNLLI*QRANIQNLQ
Figure imgf001048_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8440 38808 A 8497 1614 3085 AGEGNKADKQLQQSLRIQNQC
TKITSILIHQQQTNREPNHESTPI
HNCFKENKIPRNPTYKGCEGPL
QGELQTTAQGNKRGYKQMEE
HSMLMGRKNQYHENGHTAQG
NLQI/QMPSPSSYQ*LSSQNWKK
LLSSSYGTKK\GLHHQVNPKPK
EQSWRHHAT*LQTILQGYSNQN
SMVLVPKQRYRSMEQNRALRN
NAAYLQLSDL*QT*EKQAMGK
GFPI * * M VLGKL ASHM* KAETG
SLPYTLYKNQFKMD*RFKR*T*
NHKNPRRKPRHYHSGHRHGQG
L/LCPKHQKQW/HTKAKIDKWD
LIKL/IELLHSKRNYHQSEQATY
KMGENFRNLLPQRANIQNLQ*
TQTNLQEKNKQPHQKVGEGHE
QTLLKRRHLCSQETHEKMLIIT
GHQRNANQNHNEIPSHTS*NGN
/QLKSQETTG AG EDVEK*EH F Y
TVGGTVN*FNHCGSQCGDSSGI
*N*KYHLTQPSHYWVYTQRTIN
HAAIKTHAHVCLLRHYSQ
Figure imgf001050_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8443 3881 8500 3 2202 EGRNKDVI*NQREQRHNIPESL
GHIQSSV*REI*NTKCPQEKAGK
IQN*HPNITIKRTRKARANTFKS
*QKARNN*NQSRIEGNRDTKNP
SKN**IQTTIREYYKHLHANNLE
NLEEMDKFLDTYTLPRLNQEEV
ESLNRPITGSEIVAIINSLPTKKS
PGPDGFTAKFYQRYKEELRIKY
LGIQLTRDVKDLFKENYKPLLK
EIKEDTNKWKNIPCSWVGRINI
VKMAILPKVIYRFNGIPIKLPITF
FTELEKTTLKFIWNQKRARVAK
SILSQKNKAGGIMLPDFKLYYK
ATVTKTAWYWYQNRDIDQWN
RTEPSEIMPHIYNYLIFDKPEKN
KQWGKDSLFNKWCWENWLAI
CRKLKMDPFLTPYTKINSRWIK
DLNVRPKTIKTLEENLGNTIQDI
GMGKDFMSKTPKAMATKAKID
KWDLIKLKSFCTAKETTIGVNR
QPTKWEKIFATYSSDKGLIFRIY
NELKQIYKKKTNNPIKKWAKD
MNRQFSKEDIYAAKRHMKKCS
TSLAIREMQIKTTMRYHLTPVR
TAIIKKSGNNRCWRGCGEIGTL
LHCCWDCKLVQPLWKSVWRF
LRDLELEIPFDPAIPLLGIYPKDY
KSCCYKDTCTRMFTAALLTIAK
TWNQPKCPTMIDWIKKMWHIY
TMES YAA I NDEFMSF VGT WM
KLEIIILSKLSQGQKTKQRIFSLI
DGN
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8444 38812 8501 437 2163 KQKCNISHLWNAVKVERIKYL
GIQLTRDVKDLFKENYKPLLKE
IKEDTNKWKNIPCSWVEES1P*K
WPYCPR*FTDSMPSPSSYQ*LSS
QNWKKLL*SSYGTKKEPASPSQ
S*AKRTKLEASHYLTSNYSTRL
Q*PPGKRGPPEVQGHNADENN
HTEDRQGNRRHEHSIDQLAVT
DVHSTTAEYTFLSTRGGFSKID
HMLGCKTNLNKYKEIEIIQSIFS
NHNEIKQEINSRRKTRKSTNLW
ELSSALLMNGSEKKTQVNLDNI
WKLMKAKMQHITLMECSESRK
NKIPRNPTYKGREGPLQGELQT
TAQGNKRGYKQMEEHPMLMG
G RINTVKM A ILPK VI YRFNAIPI
KLPMTFFTELEKTTLKFIWNQK
RARIAKSILSQKNKAGGLTLPD
FKLFYKATVTKTAWYWYQNR
DIDQWNRTEPSEIMPHIYNYVIF
DKPEKNKQWGKDSLFNKWCW
ENWLAICRKLKLDPFLTPYTKI
NSRWIKDLTVRPKTIKTLEENL
GITIQDIGMGKDFMSKTPKAMA
TKAKIDKWDLIKLKSFCTGKET
TIRVNRQPTKWKKIFATYSSDK
GLISRIYNELKQIYKKKTNNPIK
KWVKDMΝRHFSKEDIYAAKK
HMKKCSPSLAVREMQIKTTMR
YHLTPVRMAIIKKSGNNRCWR
GCGEIGTLLHCWWDCKLVQPL
WKSVWQFLRDLELEIPFDPAIPL
8445 38813 8502 2001 MPHIQVMLMQEMDSHSLGKLC
PYGFAEYSPTSSCFHGFVLSVC
GSSRDKATQRVDALVRQVIVN
GGIVLDQFAVHDRVALADLIDL
LVDLSAMMLCGVPTTGDPFVA
FALGHPDDFDHLILPKHLVDRY
LLLEPLLGPVQLLSHSASVHLD
LHQFCSDFSYGIQLTRDVKDLF
KENYKPLLKEIKEDTNKWKNIP
CS WVGRINIMKMAMλRPR* FI D
SMPSPSSYQ*LSSQNWKKLL*SS
YGTKKEPVSPSQALA
8446 38814 B 8503 152 3166
Figure imgf001053_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last A ino acid sequence ( X=Unkno n,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8454 38822 851 1 1990 EPHRIASPRFRQAFDSDGTDRTL
*FLSDIEEEELTRGDPEQRHVPL
RRKSEWKWAADRAAIDSRWN
WLQAHVSDLEYRIRQQTDIYK
QIRANKGLIGLGEVPPP\EHTTD
LFLPLSSEVKTDHGTDKLIESVS
QPLENHGAPIIG\HISESLSTKSC
G ALRP VNG VINTLQPVL ADH I P\
GDCSDAEEQLHKKQRLNLVSSS
SDGTCVAARTRPVLSCKKRRLV
RPNSIVPLSKKVHRNSTIRPGCD
VNPSCALCGSGSINTMPPEIHYE
APLLERLSQLDSCVHPVLAFPD
DVPTSLHFQSMLKSQWQNKPF
DKIKPPKKLSLKHRAPMPGSLP
DSARKDRHKLVSSFLTTAKLSH
HQTRPDRTHRQHLDDVGAVPM
VERVTAPKAERLLNPPPPVHDP
NHSKMRLRDHSSERSEVLKHH
TDMSSSSYLAATHHPPHSPLVR
QLSTSSDSPAPASSSSQVTASTS
QQPVRRRRGESSFDINNIVIPMS
VAATTRVEKLQYKEILTPSWRE
VDLQSLKGSPDEENEEPASPDV
SSSHSLSEYSHGQSPRSPISPELH
SAPLTPVARDTPRHLASEDTRC
STPELGLDEQSVQPWERRTFPL
AHSPQAECEDQLDAQERAARC
TRRTSGSKTGRETEAAPTSPPIV
PLKSRHLVAAATAQRPTHR
8455 38823 8512 19 739 ADDRDHLHIQTWCAKAPVFLQ
PGSSWLSTKGKKM/PETPASDE
PPLRQPAVRFEFLQHAPPDSLQP
LGSPGHS*GPSSDTGQPAAGRW
PRRQP*AAVQDLGPQTRGSASP
ESQTVLPAAPSAPGSARPHGSM
SASAPSPTASPQMSKPPCAAAC
LRGSGAGGGTVGPCLEPGGSAP
PRARLPRCAAGSARPGAAAGPP
PGAGTTRAVP
8456 38824 8513 343 MRMCLRLPTLMTKCTNWHVT
GHLCRPT/MREPTPEP/WTGDTT
TA*METRWPAEQPALAERRLFR
AAPRGAFAGIAAPSPAAPPQPG
SLGILPEPTPAQAWAARNQLRG
RIKERGS
Figure imgf001055_0001
Figure imgf001056_0001
Figure imgf001057_0001
Figure imgf001058_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8477 38845 8534 569 1278 GHTACEWYSQGRHPGPELSPFL
*GPG*NAGHVISMPAARPMEPC
LGTGGLDVLLHHLRPDLPPAAL
PG S A WPGEG ALPLPGQVSGRG
RSAACLQPGAWPGRGAAPVRR
RPRPAAAGSVREAELLESPHLA
AHRDC*LARKLNSGGYIPQAEY
GKHAGLPLPGARALSCGCRAR
APVPAALPHPLLDHQDPGPAGR
IYGTGAGTQTLDGSAPRTSRAL
GSTQMPEARPGLAGWGVAGV
WSRG* SR*TRGSAS/SPRPRRCA
LLALPGAAVSGDLRARNPRGSR
PPKAPSGRGGPLLPPHRSTPRR
8478 38846 8535 2494 YQRQSSSPEDPAETASPTYALIA
ACQSEIQLQTRQQPCDMAPGPL
DCDPVKGSSRKRMFLMNAPPV
VALQPKWEASVPPGSFRFPECF
SEADKGVESMSPTMHKELPAL
AACGLVADFDPVGEEETADFG
PLVLDSDSDDSVDRDIEEAIQEY
L/KGGGSRCKRELAHSS/APTAL
CSPKLVPGSGG/GGPGSQVGSS
KDQGSASPVSMSRADSFEQSIR
AEIEQFLNEKRQHETQKCDGSV
EKKPDTHENSAKSLSKSHQEPA
TKWHRQGLMGVQKEFAFCRP
P\RLAKTNVQPRSLRSKVTTTTT
QEKEGSTKPATP/TRPSEAVQN
KSGIKRSASTARRGKRVTSAVQ
APEASDSSSDDGIEEAIQLYQVQ
KTRKEADGDPPQRVQLQEERA
PAPPAHSTSSATKSALPETHRKT
PSKKKPVPTKTTDPGPGDLDAD
HSPKIPKETKAPPPTSPASRSKF
VEWSSCQADTSAELRAVLDIFK
TILP/APMEGSDGSLSASPLFYSP
NVPSRSDGDSSSVDSDDSIEQEI
WTFLALKVQSRSLLARGAAGQ
APERVEKQAPAGAEELPKSKRD
SCEGSRKKPPSVFGSRAERTKP
RPSCSSVDSDDSIELEIRKFLVE
KAKESGVPGQAQPCLWKPTLA
GEEGLRSPAKQEGLTSAIVAGIS
AALGPEHTWVPGATTPRTPSVA
PVLVPAVPTHWKKLAPACGGP
Figure imgf001060_0001
Figure imgf001061_0001
Figure imgf001062_0001
Figure imgf001063_0001
Figure imgf001064_0001
Figure imgf001065_0001
Figure imgf001066_0001
Figure imgf001067_0001
Figure imgf001068_0001
Figure imgf001069_0001
Figure imgf001070_0001
Figure imgf001071_0001
Figure imgf001072_0001
Figure imgf001073_0001
Figure imgf001074_0001
Figure imgf001075_0001
Figure imgf001076_0001
Figure imgf001077_0001
Figure imgf001078_0001
Figure imgf001079_0001
Figure imgf001080_0001
Figure imgf001081_0001
Figure imgf001082_0001
Figure imgf001083_0001
Figure imgf001084_0001
Figure imgf001085_0001
Figure imgf001086_0001
Figure imgf001087_0001
Figure imgf001088_0001
Figure imgf001089_0001
Figure imgf001090_0001
Figure imgf001091_0001
Figure imgf001092_0001
Figure imgf001093_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8652 39020 8709 1779 MGDFNTPLSTLDRSTRQKVNK
DTQELNSALHQADLIDIYRTLH
PKSTEYTFFSAPHHTYSNIDHIV
GSKALFSKCKRTEIITNCLSDHS
AIKLELRIKKLTQNHSTTWQLN
NLLLNDYWVNNEMKAEIKMFF
ETNENKDTTDQNLWDTFKAVC
RGKFMALNAHKRKQERSKMG
TLTSQLKELEKQEQTHSKASRR
QEITKIRAELKEIET/QKTLQKIN
ESRYTEIQTTIREYHKHLYANKL
ENLEEMDKFLDTYILPRLNQEE
VESLNRPVTGSEIEAIINSLPTKK
SPGPYGFTAEFYQRYKEELHIN
RTKDKNHMIISIDAEKAFDKIQ
QPFMLKTLNKLGIDGTYLKILR
AIYDKPTANIILNGQKLEAFHLK
TGTRQGCPLSPLLFNIVLEVLAR
AIRQEKEIKGIQLGKEEVKWSL
FADDMIVYLENPIVSTQNLFKLI
SNFSKVSGYKINVQKSQAFLYT
NNRQTESQIMSELPFTIASKRIK
YLGIQLTRDVKDLFKENYKPLL
NEIKEDTNKWKNIPCSRAGRINI
VKMAILPKVIYRFSAIPIKLPMT
FFTELEKTKVFIWNQKRAHIAKS
ILSQKHKAGGITLP
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /^possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8653 39021 8710 2093 MEITTNCLSEHSAITLQLRIKKL
TQNCTTTWKLNYLLLNDYWV
NNEMKEEIKMFFETNENKDTT
YQNLWDVFKAVCRGKFIALSA
HKRKQERSKTDTLTSQLKELEK
QEQTHSKASRRQEITKIRAELKE
IETQKTLQEINDSRSSFSEKLNKI
DRPLARLIKKKREKTQIDAIKN
DKGEINTNPTEIQTTIREYYKYL
YANKVENLEEMDTFLDTYTLP
RLNQEEVESLNRPITGSEIEAIIN
SLPTKESPGPDGFTAEFY/HEGN
QERERNKGYSIRKRGSHIVPVC
R*HDCLFRKPHHLSSKSHEAVK
QLQQSLRIQNQWAKITSMPVH
Q*QTNREPNHE*TPIHNCYKEN
KIPRNPTYEGCEGPLQGELQTT
AQQNKRGQKEMEKHSMLMDR
KNHYHENGHTAHGNL*IQCHP
HQATNDFLHRIGKNYVNFHME
PKKSLHCQENPKQREQSWRHH
AT*LQTILQGYSNQNSMVLVPK
QIHRTMEQNRGLRNNTTHLQPS
DL*QI*QKQEMGKQFPI**MVL
GKLASYM*KAETGSLPYTLYK
N*FKMD*RLKC*T*NHKNPRRK
PRQYHSGHRHGQGLHD*NTRS
NGNKNQNRQMGSN*TKELLHG
KRNYHQSAQTTYRMGENFCNL
PI*QRANIQNLQRT*TNLQEKNK
QPHQKVGKGHEQTLLKRRHLC
SQQTHEKM L I ITGHQRN ANQN
Figure imgf001096_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8657 39025 8714 1842 VPPTTCGNYGSTIQDEIWMGTQ
NQTISVSEEMMFQAEEIASAQK
MWNQHHITPSKVDHIVGSKAL
FSKCKRTEIITNYLSDHSAIKLK
LRIKKLTQNRSTTWKLNNLLLN
DYWVHNEMKAEIKIFFETNEN
KDTTCQNLWDTFKAMCRGKFI
TLNAHKRKQQRCKIDTLTSQLK
ELEKQEQTHSKASRRQEITKIRA
ELKEIEAQKTLQKINESRSWIFE
KINKTDRLLARIIKKKREMNQI
DAIKNDKGDITTNPTEIQTTIRE
YCKHLYANKLENLEEMDKFLN
TYTLPRLNQEEIESLNRPITGSEI
EAIINSLPTKKSPGPDGFTAEFY
QRYKEEMIEKKAFDKSQQPFM
LKTLNKLGIDGTFLKIRRAIYDK
PTANIILNGQKLEAFPLKTGTRQ
GCPLSPLLFNIVLEVLAREIRQG
KEIKHIQLGKEEVKFSLFADDVI
AYLENPIVSAQNLLKLISNFSKV
SGYKINVQKLQAFLYTNNRQTE
SQIMSEFPFTIASKRIKYLGIQLT
RDVKDLFKENYKPLLNKIKEDT
NKWKNIPCSWIGRINIMKMAIL
H KVIDRFSA I PFKL/PKD/FFTEL
EKTTLKFIWNQKRPRISKTILSK
KNKAGGITLR
Figure imgf001098_0001
Figure imgf001099_0001
Figure imgf001100_0001
Figure imgf001101_0001
Figure imgf001102_0001
Figure imgf001103_0001
Figure imgf001104_0001
Figure imgf001105_0001
Figure imgf001106_0001
Figure imgf001107_0001
Figure imgf001108_0001
Figure imgf001109_0001
Figure imgf001110_0001
Figure imgf001111_0001
Figure imgf001112_0001
Figure imgf001113_0001
Figure imgf001114_0001
Figure imgf001115_0001
Figure imgf001116_0001
Figure imgf001117_0001
Figure imgf001118_0001
Figure imgf001119_0001
Figure imgf001120_0001
Figure imgf001121_0001
Figure imgf001122_0001
Figure imgf001123_0001
Figure imgf001124_0001
Figure imgf001125_0001
Figure imgf001127_0001
Figure imgf001128_0001
Figure imgf001129_0001
Figure imgf001130_0001
Figure imgf001131_0001
Figure imgf001132_0001
Figure imgf001133_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence- 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
8874 39242 8932 2466 MLRGVNVLADAVKVTLGPKG
RNVVLDKSFGAPTITKDGVSVA
REIELEDKFENMGAQMVKEVA
SKAN D A AG DGTTT AT VL AQ A 11
TEGLKAVAAGMNPMDLKRAS
RSRKGYGPARIRQELNQKGISR
EATEKAMRECDIDWCALARDQ
ATRKYGEPLPTVFSEKSRKLIA
GVKRAFTHQAVNAGFCTQPAV
SVIAGDFDRHGFNTRHFTFRLF
DDFSFETTRFCPAQIHTLKHARP
VLCFRTARPRLNIEVAVGAVIF
AREHTAELKLRQFFQGAPQKA
VISIPVVPQVTGVVIEVTDKKNT
LIKKGEVLFRLDPTRYQARVDR
LMADIVTAEHKQRALGAELDE
MAANTQQAKATRDKFAKEYQ
RYARGSQAKVNPFSERDIDVAR
QNYLAQEASVKSSAAEQKQIQS
QLDSLVLGEHSQIASLKAQLAE
AKYNLEQTIVRAPSDGYVTQVL
IRPGTYAASLPLRPVMVFIRSET
SLRTHCPGHADYVKNMITGAA
QMDGAILVVAATDGPMPQTRE
HILLGRQVGVPYHVFLNKCDM
VDDEELLELVEMEVRELLSQY
DFPGDDTPIVRGSALKALEGDA
EWEAKILEL/RWLPGFLYSGTR
ACD*QAVPAADRRR/SSPSPVV
VPLLPVV*NAVSSKLVKKLK\R
WYPRDSEVYLYWR*NVPPTAG
RRPC W*ERRCSAAWYQT* RNR
Figure imgf001135_0001
Figure imgf001136_0001
Figure imgf001137_0001
Figure imgf001138_0001
Figure imgf001139_0001
Figure imgf001140_0001
Figure imgf001141_0001
Figure imgf001142_0001
Figure imgf001143_0001
Figure imgf001144_0001
Figure imgf001145_0001
Figure imgf001146_0001
Figure imgf001147_0001
Figure imgf001148_0001
Figure imgf001149_0001
Figure imgf001150_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n,
NO: of peptide hod in USSN location of first codon for last ammo acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possιble nucleotide insertion) sequence
8995 59363 9054 1737 MKEKPAVEVRLDKWLWAARF
YKTRALAREMIEGGKVHYNGQ
RSKPSKIVELNATLTLRQGNDE
RTVIVKAITEQRRPASEAALLYE
ETAESVEKREKMALAQMIMPQ
HDQLHRYLFENFAVRGELVTV
SETLQQILENHDYPQPVKNVLA
ELLVATSLLTATLKFDGDITVQ
LQGDGPMNLAVINEΓNERAQNP
LLCAFRCVNTYGSYECKCPVG
YVLREDRRMCKAVFPYCQNES
SDVLWQFDSTSQIFLKGTPKTT
VRRGGEHNLKNPDECEDNPNIC
DGGQCAGIPGRNRCLCYDGFM
ASEDMKTCVDIDECSLPNICVF
GTCHNLPGLFRCECEIGYELDR
SGGNCTEDPDHSKSRSGRSQND
WGRRRKT/S*KRSAVYR*RQLY
IRRQP Y A ERPSLR* RQRP* IPCIR
SQHGNRKRMRRRRIFI/STKRTT
AAARTSSVKKARSFSRALKNA
MLS*RTAPSVS*S*IMITH*KK**
SR*SLQTR*LMKSSARVFSK*TA
NGTWSL/HSRGSKVTIDGINSND
IYMLGYVSNSLTGPYRPVDKTG
LGLEMGLDPNDVTLTYSHFRVP
EAEAAMWVSHAT
Figure imgf001152_0001
Figure imgf001153_0001
Figure imgf001154_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9010 39378 9069 1687 ICVPKSVPSFLWAGRGPHCLLP
CGSSSP*SGGLARA*GVFQGRA
GVARAPSHLSFHNSPCVWSLST
CCRAGSQDGPCPHAAGQGPGK
QTSSGTGASSSPWTSCL*KSLPK
\VCS*GLETTIGRGRVKPIKVTD
TAQ*CAASVTAPLLSS*LQDGG
STAKVTRCT/TAGGRAVYGTAE
G KAFPE A PELTCRS/ALAALGT
WPLPPLQVSCGLRNPSPDLPSV
VGSRASV*TKSGSSPAKVWPLA
EWFKSGSA\PVTGQEAWLLWA
SDSSSELSLCPRKHAAPPHSAEH
RQISQGLASSCWGAGGDKVVS
ASMVGSAPGQCPMSALMTGSA
SGQCPGHPCWSGTNADS*GRDP
HRASRLLALPTASYEQ*PQPHW
GSWTAHGQNGLGIFFQRVSWF
V*GQTMVSRPRISGLGQPVCLS
AASPQAAPLGSHDPTPAGR*GL
EPGGRQPSSS*IYVWEAPPGLW
VPPGPFLSH*SWRFGRTKAAPL
KEPRPGGRCPGDRPSPRAEAGL
PQTRKPALPAIGR/PKKACSLRR
PRVILEQSSQHLAAALSSWMQP
ALLETGGKAVFFTPGGL
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
901 39379 A 9070 66 1956 PLLAIPAHSTFPARKPEEPPRPD'
LCPHIHCAAV*GEIRTVP/SKTV
RGVATRTRPRSASSLGTTTGLR
VSSVRRVQLSIKPGCSRVNENSS
NSPCSSSQPGTΛSVLPCPTVCRE
AGP/AGP*PPG*ALSLPAAGFTT
ATEWKQSQLSSGG*THSGEDM
GRATAAATEKAQEQCMLREKL
GTRQK AESG SRG AVAGLQHFCS
APRTAWQPRASRA/RAGVRAW
PCADPM*PPSHSPSFPKPFIPAA/
PGAQTLGAAGVEHVAGSARIPC
CRTGLE/PHPHSPRRTLERQARI
SGAGRRSSPNAATGARPRPAAG
SPVSARRAPRRRPA*SSPCSGPW
TAVSAPRSPGPRGGGRPPSPGR
RRPCCSVFSGGASPSSARVSSPS
GSCSASPPAAPAVPGASGAVVA
GPVVGSPREHGG*LRPPLPRLG
VPGLLAARTSGAEVIHEQAAAQ
GTAADTPGAPGLLLAPAAAGA
V/TRPGPPGASGCPRPAPRPPTP
ASRSGRPWPGGGRGRRGGRGA
GGEALQGAVVGGQVGGSGAL
A/ATARGSPAAGVSRWTAPRCR
HRRSDASASGTPARFPGAGRKG
RTLWQ*SWRSAPPPGLSELL*G
LPRPLPESGRSKAPLPSGSTPLPS
GRTPLPSGRSPSTRRSPS
9012 39380 9071 95 1423 GLENKRAKGTEAHTRACP*GK
STEQDTTWTNTGRNEQQGPRP
KGEQTQRKTERKPGRERQVQV
LSSPPGRWC*DVGPLGGLPSRA
KAEAALTAAQKAQGRRGSPGS
LPKSSPLPSAPGKR/TPPRGHLL
VPLPSPYYPALLSQHPPQGPPCG
ASAPPSALLQRAPPPGTRTL\QR
PPGPRVTCGTTP/MAAKAKSQS
NAPRGSLLKKSNSFYNIPLKQV
SCLITISKHSRNEVSVYQTSRPR
AAATDVPRTΛPSLSVGCSWSCT
RGHWPSSCCPTAPRGQGL*EHS
QGPGSRSKLPWPTGPCGGLTPS
CWSLAGSAGERVDLNQAAASR
PHCAARTMAGKTLSGGCGCVD
VGAASAHRSPRLRCTTSPTDGP
NTERGSRKTGWTPGPPGFWPCP
RPPTSNTTTPLGLMDSARPGKA
PAPFTDFRTPSGLLLPGWAVRA
RREEDGCG
Figure imgf001157_0001
Figure imgf001158_0001
Figure imgf001159_0001
Figure imgf001160_0001
Figure imgf001161_0001
Figure imgf001162_0001
Figure imgf001163_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9047 39415 9106 1249 MAVLIFLGCLLGGIAIGLPIAWA
LLLCGAALMFWLDMFDVQIMA
QTLVNGADSFSLLAIPFFVLAGE
IMNAGGLSKRIVDLPMKLVGH
KPGGLGYVGVLAAMIMASLSG
SAVADTAAVAALLVPMMRSA
NYPVNRAAGLIASGGIIAPIIPPSI
PFIIFGVSSGLSISKLFMAGIAPG
MMMGATLTLGKRARFTEDDTR
ERDAISVINHQRNAVLIRQRVK
LQQLFVIRVLAMRIFVYGSLRH
KQGNSHWMTNAQLLGDFSIDN
YQLYSLGHYPGAVPGNGTVHG
EVYRIDNATLAELDALRTRGGE
YARQLIQTPYGSAWMYVYQRP
VDGLKLIESGDWRVLRWLRVH
SIDVPDPIFGLTVTGDRTDRAG
KTSPAQAGCKLFLTNPLGIGVL
TTAEKKSLLKPEHQGLATEVM
CRMNIAGASFANIEGVKAMTD
VTGFGLLGH LSEMCQG AG VQA
RVDYEAIPKLPGVEEYIKLGAV
PGGTERNFASYGHLWLR*LAL
AP/LEVVEANGSL*CSGERGCL
YGQTTTIQLNFRPAFCPEHAIAG
QKVAILRLNMQIDIKRVSSRTH
GNHFTNFKFTVQHYRSCLT
Figure imgf001165_0001
Figure imgf001166_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknovvn,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9066 59434 A 9125 2340 QFGIWLDSSSPEQTVPYLWSEE
TPATPIGQAIHRLLLIQAFRPDR
LLAMAHMFVSTNLGES\FMSIM
EQPLDLTHIVGTEVKPNTPVLM
CSVPGYDASGHVEDLAAEQNT
QITSIAIGSAEGFNQADKAINTA
VKSGRWVMLKNVHLAPGWLM
QLEKKLHSLQPHACFRLFLTME
INPK\VPV\NLLRAGRIFVFEP\PP
GWKANMLRTFSSIPVSRICKSP
NERARLYFLLAWFHAIIQERLR
YAPLGWSKKYEFGESDLRSAC
DTVDTWLDDTAKASGRQNISP
DKIPWSALKTLMAQSIYGGRV
DNEFDQRLLNTFLERLFTTRSF
DSEFKLACKVDGHKDIQMPDGI
RREEFVQWVELLPDTQTPSWL
GLPNNAERVLLTTQGVDMISK
MLKMQMLEDEDDLAYAETEK
KTRTDSTSDGRPAWMRTLHTT
ASNWLHLIPQTLSHLKRTVENI
KDPLFRFFEREVKMGAKLLQD
VRQDLADVVQVCEGKKKQTN
YLRTL1\NELVKGILP\RSWSHYT
VPAG\MTVIQWGVPISARRI\KQ
LQNISLAAASGGAKELKNIHVC
LGGLFVPEAYITATRQYVAQAN
SWSLEELCLEVNVTTSQGATLD
ACSFGVTGLKLQGATCNNNKL
SLSNAISTALSLTQLRWVKQTN
TEKKASVVTLPVYLNFTRADLI
FTVDFEIATKEDPRSFYERGGG
9067 39435 A 9126 6375 HKVAAPDVVVPTLDTVRHEAL
LYTWLAEHKPLVLCGPPGSGK
TMTLFSALRALPDMEVVGLNFS
SATTPELLLKTFDHYCEYRRTP
NGVVLAPVQLGKWLVLFCDEI
NLPDMDKYGTQRVISFIRQMVE
HGGFYRTSDQTWVKLERIQFV
GACNPPTDPGRKPLSHRFLRHV
PVVYVDYPGPASLTQIYGTFNR
AMLRLIPSLRTYAEPLTAAMVE
FYTMSQERFTQDTQPHYIYSPR
EMTRWVRGIFEALRPLE
Figure imgf001168_0001
Figure imgf001169_0001
Figure imgf001170_0001
Figure imgf001171_0001
Figure imgf001172_0001
Figure imgf001173_0001
Figure imgf001174_0001
Figure imgf001175_0001
Figure imgf001176_0001
Figure imgf001177_0001
Figure imgf001178_0001
Figure imgf001179_0001
Figure imgf001180_0001
Figure imgf001181_0001
Figure imgf001182_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9140 59508 A 9199 2491 MGDFNTPLSTLDRSTRQEVNK
DTQELNSALHQVDLIDIYRTLH
PKSTEYTFFSAPHYTYSKIDHIV
GSKALLSKCKRTEIITNCLSDHS
AIKLELRIKKLTQNHSTTWKLN
NLLLNDYWVNNEVKAEIKMFF
ETNENKDTTYQNLWDTFKAIN
KIDRPLARLIKKKREKNQIDAIK
NDK\WDITTDPTEIQTTIREYYK
HLYANKLENLEEMDKFLDTYT
LPRLNQEEVESLNRPITGSEIEAI
TNSLTΛTKKSPG PDGFTA KFYQR
YKEELVP\FLLKLFQLIEKEGILP
NSFYEASIILIPKPGRDTTΛKQKK
NFRPISLMNIDAKILNKILANRI
QQ\HIKKLIHHDQVGFIPG\MQG
LFNICKSINVIHHINRTKDKNYM
IIS/I*DAEKAFDKIQQPFMLKTL
NKLGIDGTYLKIIRAIYDKPTAD
11 LNGQKLE AFSLKTGTRQGCPL
SPLLFNIVLEVLARAVRQEKEIK
GIRLGKEEVKLSLLADDMIVYL
ENPIVSAPNLLKLISNFSKVSGY
KINVQKSQAFLYTNNRQTESQI
MSELPFTIASKRIKYLGIQLTRD
MKDLFKEΝYKPLLΝEIKEDTΝK
WKΝIPCSWVGRIΝIMKMAILPK
VIYRFNAIPIKLPMTFFTELEKTT
LKFIWNQKRARIAKSILSQKNK
AGGITLPDFKLYYKATVTKTA
WYWYQNRDIDQWNRTEPSEIT
PHIYNYLIFDKPEKMSIIDTGGW
9141 39509 9200 3083 MGKKQNRKTGNSKTQSASPPP
KERSSSPATEQSWMENDFDELR
EEGFRRSNYSELWEDIQTKGKE
VENFEKNLEECITRITNTEKCLK
ELMELKTKARELREECRSLRSR
CDQLEERVSAMEDEMNEMKRE
GKFREKRIKRNEQSLQEIWDYV
KRPNLRLIGVPESDVENGTKLE
NTLQDIIQENFPNLARQANVQI
QEIQRTPQRYSSRRATPRHIIVR
FTKVEMKEKMLRAAREKEIQT
TIREYYKHLYANKL
Figure imgf001184_0001
Figure imgf001185_0001
Figure imgf001186_0001
Figure imgf001187_0001
Figure imgf001188_0001
Figure imgf001189_0001
Figure imgf001190_0001
Figure imgf001191_0001
Figure imgf001192_0001
Figure imgf001193_0001
Figure imgf001194_0001
Figure imgf001195_0001
Figure imgf001196_0001
Figure imgf001197_0001
Figure imgf001198_0001
Figure imgf001199_0001
Figure imgf001200_0001
Figure imgf001201_0001
Figure imgf001202_0001
Figure imgf001203_0001
Figure imgf001204_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9273 39641 A 9333 1365 STETLSPLPATPGEVTLRHLDV
VSAEQQMFSGLVEKIQVTGSEG
ELGIYPGHAPLLTAIKPGMIRIV
KQHGHEEFIYLSGGILEVQPGN
VTVLADTAIRGQDLDEARAME
AKRKAEEHISSSHGDVDYAQAS
AELAKAI AQLR VIELTKKAM * H
RLEKHKSQSGNRLAFFCACDPS
9274 39642 9334 601 MLSAVGYQP/TLAEEMGVLQE
RIPSTKTGSITSVQAVYVPADDL
TDPSPATTFAHLDATVVLSRQI
ASLGIYPAVDPLDSTSRQLDPL
VVGQEHYDTARGGQFILARYQ
ELESLLVAILGMNDLSEEDKLVV
ARARKIQRFLSQPFFVAEVFTGS
PGKYVSLKDTIRGFKGIMEGEY
DHLPEQAFYMVGSIEEAVEKA
9275 39643 B 9335 681 1071
9276 39644 A 9336 203 2189 VASSARRGLPVHGPAPLQGPGN
SQDRL/ESGTNRCGLRGPLVKG
GKGGRFPSARL1QTVKKGGTPS
RDHA\IEPSGY\SVSAGVGE\RTR
KGSVFAHEMPDCNVIDKVSLV
YGQMTEPPGNCMRVALSGLTM
AEKFRDEGRDVLLFVDNIYRYA
LAG*ERFATVGR*ASN/AVGIQP
TLAEKMGVLQERITSTKTGSITS
VQAVYVPANDLTDPSPATTFA
HLDATVVLSRQIASLGIYPVAVD
PLDSTSRQLDPLVVGQEHYDTA
RGG*SI\LQRF\QELKDFMALP/G
LSDELSEEDKLVVARARKIQRF
LSQPFFVAEVFTGSPGKYVSLK
DTIRGFKGIMEGEYDHLPEQAF
YMGDMAMTYHLDVVSAEQQ
MFSGLVEKIQVTGSEGELGIYS
GHAPLLTAIKPGMIRIVKQHGH
EEFIYLSGGILEVQPGNVTVLAD
TAIRGQDLDEARAMEAKRASG
KVKPSIMQSLTPEKIAAILPLQF
FPQPADFTGPYVMLTSRRNNRA
LSRCDDQTLMPGLAIRGIRHRH
QQAGVRILLNNAIEHVVDGEK
VELTLQSGDTLQADVVIYG1GIS
ADEPLAREANLDTANGIVIDEA
CRTCDPAIFAGGDVP1TSSLINV
VDEIIHDILLTQPVATRYCIVKVI
FEAVMILRDSGGAPFCSNGMAT
Figure imgf001206_0001
Figure imgf001207_0001
Figure imgf001208_0001
Figure imgf001209_0001
Figure imgf001210_0001
Figure imgf001211_0001
Figure imgf001212_0001
Figure imgf001213_0001
Figure imgf001214_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possiblc nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9325 39693 9385 1852 SKQKPMNVGLSETQNGGMSQE
AVGNIKVTKSPQKSTVLTNGEA
AMQSSNSESKKKKKKKRKMV
NDAEPDTKKAKTENKGKSEEE
SAETTKETENNVEKPDNDEDES
EVPSLPLGLTGAFEDTSFASLCN
LVNENTLKAIKEMGFTNMTEIQ
HKSIRPLLEGRDLLAAAKTGSG
KTLAFLIPAVELIVKLRFMPRNG
TGVLILSPTRELAMQTFGVLKE
LMTHHVHTYGLIMGGSNRSAE
AQKLGNGINIIVATPGRLLDHM
QNTPGFM YKN LQCL VIDE ADRI
LDVGFEEELKQIIKLLPTRRQTM
LFSATQTRKVEDLARISLKKEPL
YVGVDDDKANATVDGLEQGY
VVCPSEKRFLLLFTFLKKNRKK
KLMVFFSSCMSVKYHYELLNYI
DLPVLAIHGKQKQNKRTTTFFQ
FCNADSGTLLCTDVAARGLDIP
EVDWIVQYDPPDDPKEYIHRVG
RT ARG\LNG RG H ALLI LRPEELG
FLRYLKQSKVPLSEFDFSWSKIS
DIQ\SQLEKLIEK\NYFLHKSA\Q
EAYKSYIRAYDSHSLKQIFNVN
NLNLPQVALSFGFKVPPFVDLN
VNSNEGKQKKRGGGGGFGYQ
KTKKVEKSKIFKHISKKSSDSRQ
Figure imgf001216_0001
Figure imgf001217_0001
Figure imgf001218_0001
Figure imgf001219_0001
Figure imgf001220_0001
Figure imgf001221_0001
Figure imgf001222_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, --possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9358 39726 9418 1875 EPEKQQLYDIPASPKKAGLHPP
DSQASGQGVPLISVTTLRRGGY
STLPNPQKSEWIYDTPVSPGKA
SVRNTPLTSFAEESRPHALPSSS
STFYNPPSGRSRSLTPQLNNNVP
MQKKLSLPEIPSYGFLVPRGTFP
LDEDVSYKVPSSFLIPRVEQQN
TKPNIYDIPKATSSVSQAGKELE
KAKEVSENSAG\QIPHGSPDGQ/
RSPSPEPDRLSGSSSDSRASIVSS
CSTTSTDDSSSSSSEESAKELSL
DLDVA KETVM A LQHKVVSS VA
GLMLFVSRKWRFRDYLEANID
AIHRSTDHIEESVREFLDFARGV
HGTACNLTDSNLQNRIRDQMQ
TISNSYRILLETKESLDNRNWPL
EVLVTDSVQNSPDDLERFVMV
ARMLPEDIKRFAS1VIANGRLLF
KRNCEKEETVQLTPNAEFKCEK
YIQPPQRETESHQKSTPSTKQRE
DEHSSELLKKNRAN1CGQTLPN
LEEKDKPILEQRLDENKDLGTM
NPGPLIPQPSSQQTPERKPRLSE
HCRLYFGALFKAISAFHGSLSSS
QPAEIITQSKLVIMVGQKLVDT
LCMETQERDVRNEILRGSSHLC
SLLKDVALATKNAVLTYPSPAA
LGHLQAEAEKLEQHTRQFRGT
Figure imgf001224_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9363 59731 9423 504 1862 RMLEATSKMRKGVPSSIQ/SSQ
GNSYTLIPSAKSDNLSDSSHSEI
SSRSSIVSNCSVDSMSAALQDE
RCSSQALAVPESTGALEKTEHA
SGIGDHSQHGPGWTLLKPSLIK
CLAVSSSVSNEEISQEHIIIEAAD
SGSWEVGTSCSSSSHDNFQSLP
NPKSWDFLNSYRHTHLDDPIAE
VEPTDSEPYSCSKSCSRTCGQC
KGSLERKSWTSSSSLSDTYEPN
YGTVKRRVLESTPAESSEGLDP
KDATDPVYKTVTSSTEKGLIVY
CVTSPKKDDRYREPPPTPPGYL
GISLADLKEGPHTHLKPPDYSV
AVQRSKMMHNSLSRLPPASLSS
NLVACVPSKIVTQPQRHNLQPF
HPKLGDVTDADSEADATLGNK
CKLLYMSDVRNPVPAVKDVLV
QKYTSFNFFHPQEALDRRREEL
ALPEGASGEACLSTRSLKRSGV
QALAQQTHSRNFLPEKQPFSAQ
AAGILHGSINHNTRGPCLLEGT
KMMTGVLYRAYLLCIAYSRDLI
VDEQIMNGLRKAI
Figure imgf001226_0001
Figure imgf001227_0001
Figure imgf001228_0001
Figure imgf001229_0001
Figure imgf001230_0001
Figure imgf001231_0001
Figure imgf001232_0001
Figure imgf001233_0001
Figure imgf001234_0001
Figure imgf001235_0001
Figure imgf001236_0001
Figure imgf001237_0001
Figure imgf001238_0001
Figure imgf001239_0001
Figure imgf001240_0001
Figure imgf001241_0001
Figure imgf001242_0001
Figure imgf001243_0001
Figure imgf001244_0001
Figure imgf001245_0001
Figure imgf001246_0001
Figure imgf001247_0001
Figure imgf001248_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λniino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9516 59884 A 9576 131 1066 SHSSSAAAHPLGLLPIAAQSLA
VFIEPWSQASSRGQLQ/VSAPGI
WGIGVATQKVNLNQIPLGRDM
HSLVMRNDGALYHNNEEKNRL
PANSLPQEGDVVLCWSNNGHS
LCKEESLSPILPTSDGEERRRKR
KKRKRKKRKQKQQQQQEAAA
PAE A DAE ATA AE AA VEE AAAR
KEEEEEGEEGEGNGEEEEEKAA
AAARGDWSQQEKACKSSLQAI
LFGHAKLEPGRLPRPTQCRVEA
TVLVGTNSPLDNPLWSISWSLW
QKPVGTSQQQPLGLWTREFPLE
GHLLTRN ETFTEATPLIPEIGML
SQMMSEEHI
9517 39885 9577 2363 ELQRDIEKHSTGVASVLNLCEV
LLHDCDACATDAECDSIQQATR
NLDRRWRNICAMSMERRLKIE
ETWRLWQKFLDDYSRFEDWLK
SSERTAAFPSSSGVIYTVAKEEL
KKFEAFQRQVHECLTQLELINK
QYRRLARENRTDSACSLKQMV
HEGNQRWDNLQKRVTSILRRL
KHFIGQREEFETARDSILVWLTE
MDLQLTNIEHFSECDVQAKIKQ
LKAFQQEISLNHNKIEQIIAQGE
QLIEKSEPLDAAIIEEELDELRR
YCQEVFGRVERYHKKLIRLPLP
DDEHDLSDRELELEDSAALSDL
HWHDRSADSLLSPQPSSNLSLF
ASLQPLRSER\SGRDTPG\SVDSI
PLVEWDHDYDLSRDLESAMSRA
LPSEDEEGQDDKDFYLRGAVG
LSDVMIPESPEAYVKLTENAIK
NTSGDHSALESQIRQLGKALDD
SRFQIQQTENIIRSKTPTGPELDT
SYKGYMKLLGECSSSIDSVKRL
EHKLKEEEESLPGFVNLHSTET
QTAGVIDRWELLQAQALSKEL
RMKQNLQKWQQFNSDLNSIW
AWLGDTEEELEQLQRLELSTDI
QTIELQIKKLKELPESCWDHR\K
AIILSINLCSPEFTQADSKESRDL
QDRLSQMNGRWDRVCSLLEE
WRG LLQDA LMQCQGFHEMSH
GLLLMLENIDRRKNEIVPIDSNL
DAEILQDHHKQLMQIKHELLES
Figure imgf001250_0001
Figure imgf001251_0001
Figure imgf001252_0001
Figure imgf001253_0001
Figure imgf001254_0001
Figure imgf001255_0001
Figure imgf001256_0001
Figure imgf001257_0001
Figure imgf001258_0001
Figure imgf001259_0001
Figure imgf001260_0001
Figure imgf001261_0001
Figure imgf001262_0001
Figure imgf001263_0001
Figure imgf001264_0001
Figure imgf001265_0001
Figure imgf001266_0001
Figure imgf001267_0001
Figure imgf001268_0001
Figure imgf001269_0001
Figure imgf001270_0001
Figure imgf001271_0001
Figure imgf001272_0001
Figure imgf001273_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Ammo acid sequence ( X=Unkno n, NO: of peptide hod in USSN location ot first codon for last ammo acid :Stop codon, /=possιb!e nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possιble nucleotide insertion) sequence
9677 40045 A 9737 1665 MAGACWLMAFQYWVLKLLVV
KSA/SWPRQSSLKWNWNFKGC
QDLETWGKKETRKFYGYRRNE
EAQKYSCRIPRKPD*WCHCWQ
W**WLMAFQYWVLKLLVVKS
AYGLVKALSNGIGISKVAKTW
KPGGKKRLENFMAIEEMKKHR
STHVGFPENLTNGATAGNGDD
GLIPPRKSRTPESQQFPDTENEE
YHRDFSKLCLGSAANGVVHVE
VKMCVYNGKGERQEEKMKGLI
SMNLVVEEEMKKHESNNVGLL
ENLTNGVTAGNGDNGLIPQRKS
RTPENQQFPDNESEEYHSLGDK
SKTSFQNSNNNNNKQQEQQQQ
NPTFSNTRKLTKLYKAPIPPSIIL
SGCPNINDSNWQEIEHGMQTA
GLPTRPLSHGLQQKGAAFRCLG
CKCSEPFTGSLILQKAKTNTQK
WQATYPKSQNEQLVPSVGKSY
RCSTPAQPMKTAVGHKPCKAT
GAELPKALGAQPLHPCALDVG
QGFKKGNFGAVGLNGLLGLEF
HGVSGVLLVGPGDGGLISEGVV
REDLMCGVWSAGTWSVGTAE
RCLEKPGALHVIEGPLDSWDGP
VMPNGPVKSRQSSCLDGPGRC
CSEILTGQSHGNKKPARASSKS
SQSINDRPLAVLTNQYQCEQLA
SERQPSSNSCSIRIDSA
9678 40046 9738 1 15 RRTARGPQQTSTDGLGTSPCRN
ASEHTADILELSTLIV*RRQ*ICL
CLDFLYY/LIPEGRCFCRACDLV
HNFVDTIILQQHRTLTSQGVDDF
LKAKATFKASDFIDALVLSKVA
SPRILIQGGRMGA*DQVSDQAS
VYGRCSSSGAEAAKRPKSAWL\
LLGYDSRASGLY*ILYNLTKHP
D/YRECCQP VMQES* S * RRTAR
GPQQTSTDGLGTSPCRNASEHT
ADILELSTLIV
9679 40047 9739 506 MVIVGLAAGVLLVGPGDGGLIS
EGVVREDLMCGVWSAGTWSV
GTAERCLEKPGALHVIEGPLDS
WDGPVMPNGPVKNHKGEQQE
VPSKHPQMALEICLCLDFLYY/L
I PEG RCFCRACDL VHNF\DTI ILQ
QHRTLTSQGVDDFLKAKATFK
ASDFIDALVLSKVASPGS
Figure imgf001275_0001
Figure imgf001276_0001
Figure imgf001277_0001
Figure imgf001278_0001
Figure imgf001279_0001
Figure imgf001280_0001
Figure imgf001281_0001
Figure imgf001282_0001
Figure imgf001283_0001
Figure imgf001284_0001
Figure imgf001285_0001
Figure imgf001286_0001
Figure imgf001287_0001
Figure imgf001288_0001
Figure imgf001289_0001
Figure imgf001290_0001
Figure imgf001291_0001
Figure imgf001292_0001
Figure imgf001293_0001
Figure imgf001294_0001
Figure imgf001295_0001
Figure imgf001296_0001
Figure imgf001297_0001
Figure imgf001298_0001
Figure imgf001299_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod n USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9831 40199 9892 121 2320 EM AFQKA VKGTI LVGGG ALAT
VLGLSQFAHYRRKQMNLAYVK
AADCISEPVNREPPSREAQLLTL
QNTSEFDILVIGGGATGSGCAL
DAVTRGLKTALVERDDFSSGTS
SRSTKLIHGGVRYLQKAIMKLD
IEQYRMVKEALHERANLLEIAP
HLSAPLPIMLPVYKWWQLPYY
WVGIKL\YDLVARKANCLKSSY
VLSKSRALEHFPMLQKDKLVG
AIVYYDGQHNDA\RMNLAIALT
AA\RYGAATANYMEVVSL\LKK
TDP\QTGK\VHVSGARCKGCPST
GQEFDVRAKCVINATGPFTDSV
RKMDDKDAAAIVCQPSAGVHIV
MPG Y YSPESMG LLDPATSDGR
VIFFLPWQKMTIAGTTDTPTDV
THHPIPSEEDINFILNEV\RNYLS
CDVEVRRGDVLAAWSGIRPLV
TDPKSADTQSISRNHVVDISESG
LITIAGGKWTTYRSMAEDTINA
AVKTHNLKAGPSRTVGLFLQG
GKDWSPTLYIRLVQDYGLESEV
AQHLAATYGDKAFEVAKMAS
VTGKRWPIVGVHLVSEFPYIEA
EVKYGIKEYACTAVDM1SRRTR
LAFLNVQAAEEALPRIVELMGR
ELNWDDYKKQEQLETARK\FL
YYEMGYKSRSEQLTDRSEISLL
PSDIDRYKKRFH KFDADQKGFI
TIVDVQRVLESINVQMDENTLH
EILNEVDLNKNGQVELNEFLQL
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9832 40200 9893 2020 MLKPLMGGGAHRRAAAGARK
WDGTAIIASTSGLEVLWVTTSR
SKTVESVGLAKPGQHNDARMN
LAIALTAARYGAATANYMEVV
SLLKKTDPQTGKVRVSGARCK
DVLTGQEFDVRAKCVINATGPF
TDS VRKM DDKD AAAI\CQA\SA
GVHIVMPGYYSPESMGLLDPAT
SDGRVIFFLPWQKMTIAGTTDT
PTDVTHHPIPSEEDINFILNEVR
NYLSCDVEVRRGDVLAAWSGI
RPLVTDPKSADTQSISRNHVVDI
SESGLITIAGGKWTTYRSMAED
TIΝAAVKTHΝLKAGPSRTVGLF
LQGGKDWSPTLYIRLVQDYGL
ESE V AQH LAATYGDKAFE VAK
MASVTGKRV/WPIVGVRLVSDF
PYIEAEVKYGIKEYACTAVDMT
SRRTRLAFLΝLQAAEEALPRLL
ELMGRELΝWDDYKKQEQLET
ARKFLYYEMGYKSRSEQLTDR
SEISLLPSDIDRYKRRCHKFDAD
QRGFMTVVEVQRVLESIΝVQM
DEΝTLHEILΝEVDLΝKΝGQVEL
ΝEFLQLMSAIQKGRVSGSRLAI
LMKTAEEΝLDRRVPIPVDRSCG
GLYFCVRΝWWVLGLTDFKΝE
AADPSGVKLQTFTVSVTALKV
ARLELFVSPGGLVVLLGSGVKL
QIFAPΝMAMHΝKAAPPQIPDTR
RELAELVKRKQELAATLAKFG
9833 40201 9894 593 ISSAVEFETSQLSGLWTEQSAA
RSRGEΝIVCGPQGPRG/GPWVP
AEPTGSPVVSEEPLDLLPTLDLR
QEMPPPRVFKSFLSLLFQGLSVL
LSLAGDVLVSMYREVCSIRFLF
TVVSLL\ΝLFLSAFWLGLLYLVS
PLENEPKEMLTLSEYHERVRSQ
GQQLQQLQAELDKLHKEVSTV
RAANSERGAKLVFQRLNEGFV
Figure imgf001303_0001
Figure imgf001304_0001
Figure imgf001305_0001
Figure imgf001306_0001
Figure imgf001307_0001
Figure imgf001308_0001
Figure imgf001309_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknovvn, NO: of peptide hod n USSN location of first codon for last amino acid Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9859 40227 A 9920 868 NSSPPFPFPRIVVGATLHSKETS
SYRLGSQPSHQVPTPSPRPRVFT
PQSSPAMPLAPSH PSPYQG PRT
QNISDYRAPGPQAIQPLPLSPGV
RPGPQDSWKEAPAPRGNLQRN
KLPETFMPPAPITGPS/IM/RPHP*
ATRDSSLTAPLSPV*VMLPQEF
QENS\PAGDKDNVFPSLGKPPSP
PCVQVMVDGPHSSR/PCAEGD
WVQVPGVLTAEKGRLLISPTLL
*WEQCDCESLGTSP*GSHEKDS
LPLWPFSLLLYQQLQHLPPEKM
ERKELPPEHQSLKSSFEALLQRC
9860 40228 9921 307 MKLKELERPAVQAWSPASQYP
LYLATGTSAQQLDSSFSTNGTL
EIFEVDFRDPSLDLKHRGVLSA
LSRFHKLVWGSFGSGLLESSGV
IVGGGDNGMLILYNVTHILSSG
KEPVIAQKQKHTGAVRALDLN
PFQPPEDIKALSWNRQAQHILSS
AHPSGKAVVWDLRKNEPIIKVS
DHSNRMHCSGLAWHPDIATQL
VLCSEDDRLPVIQLWDLRFASS
PLKVLESHSRGILSVSWSQADA
ELLLTSAKDSQILCRNLGSSEPC
PRLVFISQVTTESEFLMRSAELQ
EALGSGNLLNYCQNKSQQALL
QSEKMLWQFLKVTLEQDSRMK
FLKLLGYSKDELQKKVATWLK
SDVGLGESPQPKGNDLNSDRQ
QAFCSQASKHTTKEASASSAFF
DELVPQNMTPWEIPITKDIDGLL
SQALLLGELGPAVELCLKEERF
ADAIILAQAGGTDLLKQTQERY
LAKKKTKISSLLACVVQKNWK
DVVCTCSLKNWREALALLLTY
SGTEKFPELCDMLGTRMEQEGS
RALTSEARLCYVCSGSVERLVE
CWAKCHQALSPMALQDLMEK
VMVLNRSLEQLRGPHGVSPGP
ATTYRVTQYANLLAAQGSLAT
AMSFLPRDCA/PAARDIYAPSTN
YCSSYEPHP*ATRDSSLTAPCLQ
SASAPAT*EDGKEGAAPRASVL
EEQL*GASPTLLPVCN*LKDKK
Figure imgf001311_0001
Figure imgf001312_0001
Figure imgf001313_0001
Figure imgf001314_0001
Figure imgf001315_0001
Figure imgf001316_0001
Figure imgf001317_0001
Figure imgf001318_0001
Figure imgf001319_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9922 40290 9983 2023 MGGGSWAPTGLCPLQSLLAMS
PEKRKLAAQEGQFTEPRPEEPP
KEKLHTLEEFSYEFFRAPEKDM
VSMAVLPLARARGHLWAYSCE
PLRQPLLKRVHANVDLWDIAC
QIFVDILPHQPACTSAVVGVSV
YPQSVSAWSCPSTVLLDSHTUL
RYMGDYPSRQAWPTLELTDQIF
TLALQHPALQDEVYCQILKHLT
HNSDRHSEERGWQLLWLCTGL
FPPSKGLLPHAQKFIDTRRGKLL
APDCSRRIQKVLRTGPRKQPPH
HVEVEAAEQNVSRICHKIYFPN
DTSEMLEVVANTRVRDVCDSI
ATRLQLASWEGCSLFIKISDKVI
SQKEGDFFFDSLREVSDWVKK
NKPQKEGEEASVELGEGVAGW
VEGALAASVRPMPIHPSIRSRVS
LGCGNNQALGPGGAPVTLPYQ
VYFMRKLWLNISPGKDVNADT
ILHYHQELPKYLRGFHKCSRED
AIHLAGLIYKAQFNNDRSQLAS
VPKILRELVPENLTRLMSSEEW
KKSILLAYDKHKDKTVEEAKV
AFLKWICRWPTΛFGSAFF/EGEG
KPCPTPGPPTQAASAQPSPSRPS
VNQHTSWQTSEPSYPDVILIAIN
RHGVLLIHP\KTKDLLTTYPFTK
ISSWSSGSTYFHMALGSLGRGS
RLLSETΛSLGYKMDDLLTSYVQ
QLLSAMNKQRGSKAPALAST
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod n USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9923 40291 9984 428 2502 VSKEIVYNKLLPYAERLDAESD
LQLAQIKCNLGRAVQLQELWP
GGLFWTRKLS/TRAGPLALSSLE
LSPSGDPRVVTEPVKYSSRPPCH
FPTGLPGSLPSLYVLRICFSRSV
RVVVFPPGGPWVAQGPRPRGR
ETWGFTMLLRYIRLYGRKFSKE
DHVLFIKLLYELVSIPKLEISMM
QGFARLLINLLKYFPADATAEM
LEEWRPLMCPFDVTMQKAITY
FEIFLPTSLPPELHHKGFKLWFD
ELIGLWVSVQNLPQWEGQLVN
LFARLATDNIGYIDWDPYVPKG
GPSKLVQKHLAGLFNSITSFYH
PSNNGRWLNKLMKLLQRLPNS
VVRRLHRERYKKPSWLTPVPDS
HKLTDQDVTDFVQCIIQPVLLA
MFSKTGSLEAAQALQNLALMR
PELVIPPVLERTYPALETLTEPH
QLTATLSCVIGVARSLVSGGRW
FPEGPTHMLPLLMRALPGVDPN
DFSKCMITFQFIATFSTLVPLVD
CSSVLQERNDLTEVERELCSAT
AEFEDFVLQFMDRCFGLIESSTL
EQTREETETEKMTHLESLVELG
LSSTFSTILTQCSKEIFMVALQK
VFNFSTSHIFETRVAGRMVADM
CRA A VKD WGKPGDL WNLG IQ
WHVPSSEEVSFAFYLLDSFLQP
ELVKLQHCGDGKLEMSRDDIL
QSLTIVHNCLIGSGNLLPPLKGE
9924 40292 9985 5396 AESDLQLAQIKCNLGRAVQLQE
LWPGGLFWTRKLSTYIRLYGR
KFSKEDHVLFIKLLYELVSIPKL
EISMMQGFARLLINLLKKKELL
SRADLELP WRPL YDMVERI LYS
KTEHLGLNWFPNSVENILKTLV
KSCRPYFPADATAEMLEEWRP
LMCPFDVTMQKAITYFEIFLPTS
LPPELHHKGFKLWFDELIGLWV
SVQNLPQWEGQLVNLFARLAT
DNIGYIDWDPYVPKIFTRILRSL
NLPVGSSQVLVPR
9925 40293 9986 477 VREFPGRRFRGCCGALRPRYKR
LVDNIFPEDP\GYVCIAMEALDQ
LLMACHCQSINLFVESFLKMVA
KLLESEKSNLQILGTNSFVKFA
N IEEDTPS YHRSYDFFVSRFSEM
CHSSHDDLEIKTKIRMSGIKSLQ
G V VRKTVNDELQAN I WDPQH SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possiblc nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
9926 40294 A 9987 2371 AGGARLRPARGRPPRLLPPRPG
PCRPPP V PAPT VNERRAPPRAG
WERRSDAGLSRGARPAEMYGV
CGCCGALRPRYKRLVDNIFPED
PEDGLVKTNMEKLTFYALSAPE
KLDRIGAYLSERLIRDVGRHRY
GYVCIAMEALDQLLMACHCQS
INLFVESFLKMVAKLLESEKPN
LQILGTNSFVKFANIEEDTPSYH
RSYDFFVSRFSEMCHSSHDDLEI
KTKIRMSGIKGLQGVVRKTVN
DELQANIWDPQHMDKIVPSLLF
N LQH VEEAESRSPSPLQAPEKE
KESPAELAERCLRELLGRAAFG
NIKNAIKPVLIHLDNHSLWEPK
VFAIRCFKIIMYSIQPQHSHLVIQ
QLLGHLDANSRSAATVRAGIVE
VLSEAAVIAATGSVGPTVLEMF
NVTLLRQLRLSIDYALTGSYDGA
VSLGTKIIKEHEERMFQEAVIKT
VGSFASTLPTYQRSEVILFIMSK
VPRPSLHQAVDTGRTGENRNR
LTQIMLLKSLLQVSTGFQCNNM
MSALPSNFLDRLLSTALMEDAE
IRLFVLEILISFIDRHGNRHKFSTI
STLSDlSVLKLKVDKCSRQDTV
FMKKHSQQLYRffiYLSCKEETN
VQKHYEALYGLLALISIELANE
EVVVDLIRLVLAVQDVAQVNE
ENLPVYNRCALYALGAAYLNLI
SQLTTVPAFCQHIHEVIETRKKE
APYMLPEDVFVERPRLSQNLDG
9927 40295 9988 662 IPGSTISWSPAAARGLSVCRCCR
LHPASAMDLFGDLPEPERSPRP
AAGKEAQKGPLLFDDLPPASST
DSG/SSLQALPGSGGPLLFDDLP
PASSGDSGSLATSISQMVKTEG
KGAKRKTSEEEKNGSEELVEK
KVCKASSVIFGLKGYVAERKGE
REEMQDAHVILNDITEECRPPSS
LITRVSYFAVFDGHGGIRASKF
AAQNLHQNLIRKFPKGDVISVE
Figure imgf001323_0001
Figure imgf001324_0001
Figure imgf001325_0001
Figure imgf001326_0001
Figure imgf001327_0001
Figure imgf001328_0001
Figure imgf001329_0001
Figure imgf001330_0001
Figure imgf001331_0001
Figure imgf001332_0001
Figure imgf001333_0001
Figure imgf001334_0001
Figure imgf001335_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
10002 40370 10063 2480 RPITGAEIVAIINSLPTKKSPGLD
GFTAEFYQRYKEELVPFLLKLF
QSIEKEGILPNSFYEASIILIPKPG
RDTTKKENFRPISLMNIDAKILN
KILANRIQQHIKKLIHHDQVGFI
PGMQGWFNIRKSINVIQHINRT
KDKNHMIISIDAEKAFDKIQQ/R
LHAKNS/RIN*VLM/ETYFKIIRA
IYDKPIANIILNGQKLEAFPLKT
GTRQGCPLSPLLFNIVLEVLAR
AIRQEKEIKGIQLGKEEVKLSLF
ADDMIVYLENPIVSAQNLLKLIS
NFSKVSGYKINVQKSQAFLYTN
NRQTESQIMSELPFTIASKRIKY
LGIQLTRDVKDLFKENYKPLLK
EIKEDTNKWKNIPCSWVGRINI
VKMAILPKVIYRFNAIPIKLPMT
FFTELEKTTLKFIWNQKRAHIA
KSILSQKNKAGGITLPDFKLYY
KATITKTAWYWYQNRDIDQW
NRTEPSEITPHTYNCLIFDKPEK
NKKWGKDSLFNKWCWENWLA
ICRKLKLDPFLTPYTKINSRWIK
DLNVRPKTIKTLEENLGITIQNI
GMGKDFMSKTPKAMATKDKID
KWDLIKLKSFCTAKETTIRVNR
QPTKWEKIFATYSSDKGLISRIY
NELNQIYKKKTNNPIKKWAKD
MNRHFSKEDIYAAKKHMKKCS
SSLAIREMQIKTTMRYHLTPVR
MAIIKKSGNNRCWRGCGEIGTL
LHCWWDCKLVQPLWKSVWRF
10003 40371 10064 317 582 STVCCFLLRFIGGGFRHIPLSVD LWIHYIKILKETLDPGDPETNNT IRGYVEH*Y*NVFYDINNYSDV DNRAHES*AGSLTSIPLLKK
10004 40372 A 10065 41 434 GDLQPTAALWEPLSGLAKAGA
RSLSLQGGVGGEARARTRAAR
GACGPAAAPGGHGLGGHALGA
AGR/RLLALGRLGTCSLPCLSLP
PSPWAPVPPEPPR*VPPHAPGRP
VPSATQRLRSAGAQRRTGRQL
10005 40373 10066 67 459 LRDCKYTHRHCVSSSRFINTPIN
TLCLAQGL*VHQSTLCI*LLWV
GGLGGNFCVQYSVSN* SDGDV
ENLCPLRDCKCTNQHPVKTGH
FGSTNQQDVGWGQIKRIKAGLP
EPASGNSLGSPSTLWELCSLQ
Figure imgf001337_0001
Figure imgf001338_0001
Figure imgf001339_0001
Figure imgf001340_0001
Figure imgf001341_0001
Figure imgf001342_0001
Figure imgf001343_0001
Figure imgf001344_0001
Figure imgf001345_0001
Figure imgf001346_0001
Figure imgf001347_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
10130 40498 10191 272 2722 ETSCPLVRVHYAEAITGRCTAP
EDEGSLGQKPPARRSNNRTEGA
WGKRQLM SSP/STEPRVCLTIEG
QEIDFLLDTGAAFSVLISCPGRL
SSTSVTIQGILGQPVTRYFSHLL
SCΝWETLLFSHAFLLMPERPTG
LLGRDILAKAGAIIYMNMGNKL
PICCPLLEEGINPEVWALEGQFG
RAKNAHPVQIRLKDPTTFPYQR
QY/PLRPEAHKGLQNIVKHLKA
QGLVRKCSSPCNTPILEVQRPRL
VQDLRLIN EAVISL YPVVPNPYT
LLSQIPEEAEWFTVLDLKDAFF
CVPLHSDSQFLFAFEDPTNHTS
QLTWMVLPQGFRDSPHLFGQA
LAKDLGHFSSPGTLVLQYVDDL
LLATS*EALCQQATLDLLNFLA
NQGYKVSMSKAQLCLQQVKY
LGLILAKGTRALSKE*IQPILAY
PRPKTLKQLREFLGITSFCRLWI
PGYSETARPLYTLIKDTQRANT
HLVEWESEAETAFKTLKQALV
QAPGLSLPTGQNFSLYVTERAG
IALGVLTQTRGTTPQPVAHLSK
ETDVVAKGWPHCLRVVAAVA
VLVSEAIKIIQGKDLIVWTTHEV
NGILGAKGSLWLSDNCLLRYQ
A LLLEGPVLQICTCMALNPATF
LPEDGEPIKHDCQQIIVQTYAAR
DDLLEVPLTNPDLNLYTDGSSF
VENGI/RKVSDVTILESKPLPPGT
SAQLAELVALTWALELGKGKR
Figure imgf001349_0001
Figure imgf001350_0001
Figure imgf001351_0001
Figure imgf001352_0001
Figure imgf001353_0001
Figure imgf001354_0001
Figure imgf001355_0001
Figure imgf001356_0001
Figure imgf001357_0001
Figure imgf001358_0001
Figure imgf001359_0001
Figure imgf001360_0001
Figure imgf001361_0001
Figure imgf001362_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last ammo acid *=Stop codon, /=possιble nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possιble nucleotide insertion) sequence
10218 40586 10279 1722 MIISIDAEKAFDKIQQRFMLKTL
NKLGIDGLYLKIIRAIYDKPTAN
IILNGQKLEALPLKTGTRQGCPL
SPLQFNIVLEVLARAIRQEKEIK
GIQLGKEEVKLSLLADDMIVYL
ENPIVSAQNLLKVISNFSKVSGY
KINVQKSQAFLYTNNRQTESQI
MSELPFTIASKRIKYLGIQLTRD
VKDLFKENYKPLLNEIKEDTKK
WKNIPCSWVGRINIVKMAILPK
VIYRFNAIPIKLPMTFFTELEKTT
LKFIWYQKRARITKSILSQRNK
AGDITLPDFKLYYKATVNKTA
WYWHQNRHIDQWNRTKPSEIT
LHIYNYLFFDNPDKNKKWGKD
SLFNKWCWENWLAICRKLKLD
PFLTPYTKINSRWIKDLNIRPKTI
KTLEENLGITIQDIGMGKYFMT
KTPKAMATKAKIDKWDLIKLK
SFCTGKETTIRVNRQPTKWEKIF
ATYSSDKGLISRIYNELKQIYKK
KTNNPIKKWAKDMNRHFSKED
IYAAKKHMKKCSPSLAIREMQI
KTTMRYHLTPT/RLIVIGIEECLR
GKEKLETLYYWNSDTDHQV*W
LGWP*WKSDHKKEQPSF
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possiblc nucleotide insertion) sequence
10219 40587 10280 2076 MDKFLNTYTIPRLNQEEVESLN
RSITGSEIEAIINNLPTKRSPGPD
G FTPEF YQRYKEEL VPFLLKLF
QSIVKEGFLPNSFYEGSIILITKS
GRDTTKKENLRPISLMNINAKIL
NKILANQIQQHIKKLIHHEQVDF
IPGMQGWFNIRKSINVIQHINRT
NDKNHMIISIDAEKAFDKIQQLF
MLKTLNKLGIDGTYLKIIRAIYD
KHTANIILNGQKLEAFPLKTGT
RQGCPLSPLLFNIVLEVLARAIS
QEKEIKGIQLGKEEVKLSLFAD
DMIVYLENPIVSAQNLLKLIGNF
SKVSGYKISVQKSQAFLYTNKR
QTESQIMSELPFTIVSKRIKYLGI
QLTRDVKDLFKENYKPLLNEIQ
DETNKWKN\CSWVGRINIVKM
VILP/KELEKTTLKFIWNQKRAR
IAKTILSQKNKAGGITLPDFELH
YKATVTKTACYLYQNRDIDQR
NRTEPSEIMPHIYNYLIFDKSDK
NKKWGKDSLFNKWCWENWLA
ICRKLKLDPFLPPYTKINSRWIK
DLNVRPKTIKTLEENLGNTIQDI
GMGKDFMSKTPKAMSTKAKID
KWDLIKLKSFCTAKETAIRVNR
EPTEWEKIFAIYSSDKGLISRIYN
EFKQIYKKKTNNPIKKWAKDM
NRHFSKEDIYAANRHMKKCSSS
LAIREMQIKTTQVGGSEEKSAS
ELGNKLIELDQSEEWRGKED
Figure imgf001365_0001
Figure imgf001366_0001
Figure imgf001367_0001
Figure imgf001368_0001
Figure imgf001369_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno>vn,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
10233 40601 10294 2605 M K AEIKMFFETN EN KDTTYQN
LWDAFKAVCRGKFIALNAYKR
KQERSKIDTLTSQLKELEKQEQ
SHSKAGRRQEITKIRAELKEIET
QKTLQKINESRSWFFERINKIDR
PLARLIKKKREKNQIDTIKNDK
GDITTNPTEIQTTIREYYKHLYA
NKLENLEEMDTFLDTYTLPRLN
QEEVGSLNTPITGSEIVAIINSLP
TKKSPGPDGFTAEFYQRYKEEL
PGRVPTKKENFRPISLMNIDAKI
LNKILANRIQQHIKKLIHHDQV
GFIPGMQGWFNIRKSINVIQHIN
RAKDKNHMIISIDAEKAFDKIQ
QPFMLKTLNKLGIDGTYFRIIRA
I YDKPTANIILNGQKLEAFPLKT
GTRQGCPLSPLLFNIVLEVLAR
AIRQEKEIKGIQLGKEEVKLSLF
ADDMIVYLENPIVSAQNLLKLIS
NFSKVSGYNFYVQKSQAFLYT
NNRQTESQIMSELPFTIASKRIK
YLGIQLTRDVKDLFKENYKPLL
KEIKEDTNKWKNIPCSWVGRIN
IVKMAILPKV1YRFNAIPIKLPM
TFFTELEKTTLKFIWNQKRARIA
KSILSQKNKAGGITLPDFKLYY
KATVTKTAWYWYQNRDIDQW
NRTEPSEIMPHIYNYLIFDKPEK
NKQWGKDSLFNKWCWENWLA
ICRKLKLDPFLTPYTKINSRWIK
DLN VRPKTIKTLEENLG ITIQDI
GVGKDFMSKTPKAMATKAKID
10234 40602 B 10295 111 2440
Figure imgf001371_0001
Figure imgf001372_0001
Figure imgf001373_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last a ino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
10241 40609 10302 248 2623 RQWAGVVGRCSHLASWVSSNT
SETGAIRSSTEVDAPDDSMLLST
CDIDLTAARRAWLGCLPTKKSP
GPDGFTAEFYQRCKEELVPFLL
KLFQSI/EKEGILPNLFDEASIILI
PKRGRDTTKKENFRPISLMNID
AKILNKILANRNQQHIKKLIHH
DQVGFIPGMQGWFNICKSINVI
QHINRTKDKNHMIISIDAEKAF
DKIQQPFMLKTLNKLGIDGTYL
KIIRAIYDKPTANIILNGQKLEAF
PLKTGTRQGCPLSPLLFNIVLEV
LARAIRQEKEIKGIQLGKEEVKL
SLFADDMIVYLENPIVSAQNLL
KLISNFSKVSAYKINVQKSQAF
LYTNNRQTESQIMSVFPFTIASK
RIKYLGIQLARNAKDLFKENYK
PLLNEIKEDTKKWKNIPCSWVG
RINI VKM Al LPKVI YRFN A I PIKL
PMTFFTELEKTTLKFIWNQKRA
RIAKSILSQKNKAG\GIT\LPDFK
LYLQGLQ*PKTAWYWY\QNRD
IDQWNRTEPSEITPH1YNYLIFD
KPEKNKQWGKDSLFNKWCWE
NWLATCRKLKLDPFLTPYTKIN
SRWIKDLNVRPKTIKTLEENLGI
TIQDIGVGKDFMSKTPKAMAT
KAKIDKWDLIKLKSFCTAKETA
IRVNRQPTTWEKIFATYSSDKG
LISRI YNELKQI YKKK\TNN PIKK
WAKDMNRHFSKEDIYAANKH
MKKCSPSLAIREMQIKTTMRYH
Figure imgf001375_0001
Figure imgf001376_0001
Figure imgf001377_0001
Figure imgf001378_0001
Figure imgf001379_0001
Figure imgf001380_0001
Figure imgf001381_0001
Figure imgf001382_0001
Figure imgf001383_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
10276 40644 10337 3 2431 GCLQELSVLFARGEPRGPCHNL
LHYMLALIGVPEVFWAFLFHCS
LGLEEEKLAPLIKGEEKIVWLRS
SFYFQHCLVDKSTLLPSSSVSLG
IFHEEKNSGEFPIFAKAVRLALV
PLRYVKLEDRDNWISVDSVTSE
IKLAKLPDFESRYVQNGTYTVK
IVAISEDYPRKTITGTVLINVEDI
NDNCPTLIEPVQTICHDAEYVN
VTAEDLDGHPNSGPFSFSVIDKP
PGMAEKWKIARQESTSVLLQQ
SEKKLGRSEIQFLISDNQGFSCP
EKQVLTLTVCECLHGS\GCREA
QHDSYVGLGPAAIALMILAFLL
LLLVPLLLLMCHCGKGAKGFTP
IPGTIEMLHPWNNEGAPPEDKV
VPSFLPWDQGGSLVGRNGVGG
MAKEATMKRKVSSASIVKGQH
EMSEMDGRWEEHRSLLSGRAT
QFTGATGAIVMTTETTITARATG
ASRDVAGAQAAAVALNEEFLK
NYFTDKAASYTEEDENHTAKD
CLLVYSQEETESLNASIGCCSFI
EGELDDRFLDDLGLKFKTLAEV
CLGQKIDINKEIEQRQKPATETS
MNTASHSLCEQTMVNSENTYS
SGSSFPVPKSLQEANAEKVTQEI
VTERSVSSRQAQKVATPLPDPM
ASRNVIATETSYVTGSTMPPTT
V I LG PSQPQSLI VTERV Y AP AST
L V DQP YANEGT V V VTERVIQPH
GGGSNPLEGTQHLQDVPYVME
10277 40645 10338 354 RPRLTSQDIKKPDC/DGEDAIGR
GFECDGLHLEKIVHFALHLEKN
VNQSLLELHKLATDKNDPHLC
DFIETHYLNEQVKAIKELGDHV
TN M HEMG APDSG V AEYLFDKH
TLGDSDNES
Figure imgf001385_0001
Figure imgf001386_0001
Figure imgf001387_0001
Figure imgf001388_0001
Figure imgf001389_0001
Figure imgf001390_0001
Figure imgf001391_0001
Figure imgf001392_0001
Figure imgf001393_0001
Figure imgf001394_0001
Figure imgf001395_0001
Figure imgf001396_0001
Figure imgf001397_0001
Figure imgf001398_0001
Figure imgf001399_0001
Figure imgf001400_0001
Figure imgf001401_0001
Figure imgf001402_0001
Figure imgf001403_0001
Figure imgf001404_0001
Figure imgf001405_0001
Figure imgf001406_0001
Figure imgf001407_0001
Figure imgf001408_0001
Figure imgf001409_0001
Figure imgf001410_0001
Figure imgf001411_0001
Figure imgf001412_0001
Figure imgf001413_0001
Figure imgf001414_0001
Figure imgf001415_0001
Figure imgf001416_0001
Figure imgf001417_0001
Figure imgf001418_0001
Figure imgf001419_0001
Figure imgf001420_0001
Figure imgf001421_0001
Figure imgf001422_0001
Figure imgf001423_0001
Figure imgf001424_0001
Figure imgf001425_0001
Figure imgf001426_0001
Figure imgf001427_0001
Figure imgf001428_0001
Figure imgf001429_0001
Figure imgf001430_0001
Figure imgf001431_0001
Figure imgf001432_0001
Figure imgf001433_0001
Figure imgf001434_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno» n,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
10622 40990 10684 7232 MPEPPTHSMGSCAAGASLTSTA
PCSTAPSPIDHLRAKECERTAQ
DWQAAPPAALVRDPLAALVRT
WRTFMSSSRIVNTPIGTLCLAQ
GFQWFSVLSQVHASTDQEIQE
MH DEQAΝPQΝ A VGTLD VGL I D
SVCASDSPDRPΝSFVIITAΝRVL
HCΝADTPEEMHHWITLLQRSK
GDTRVEGQEFIVRGWLHKEVK
ΝSPKMSSLKLKKRWFVLTHIPG
LLQEFREDALΝ WG PDEKIFKET
ELVΝDMDKIΝGRIERAE
10623 40991 10685 2146 IDSVCASDSPDRPΝSFVIITAΝR
VLHCΝADTPEEMHHWITLLQR
SKGDTRVEGQEFIVRGWLHKE
VKΝSPKMSSLKLKKRWFVLTH
ΝSLDYYKSSEKΝALKLGTLVL
ΝSLCSVVPPDEKIFKETGYWΝV
TVYGRKHCYRLYTKLLΝEATR
WSSAIQΝVTDTKAPIDTPTQQLI
QDIKENCLNSDVVEQIYKRNPIL
RYTHHPLHSPLLPLPYGDMNLN
LLKDKGYTTLQDEAIKIFNSLQ
QLESMSDPIPIIQGILQTGHDLRP
LRDELYCQLIKQTNKVPHPGSV
GNLYSWQILTCLSCTFLPSRGIL
KYLKFHLKRIREQFPGTEMEKY
ALFTYESLKKTKCREFVLSRDEI
EALIHRQDMTSTVYCHGGGSC
KITINSHTTΛAGEVVEKLIRGLA
MEDSRNMFALFEYNGHVDKAI
ESRTVVADVLAKFEKLAATSEV
GDLPWKFYFKLYCFLDTDNVP
KDSVEFAFMFEQAHEAVIHGH
HPAPEENLQVLAAHATSSIMQG
DYT\LHA\AIPP\LEEVYSLQRLK
ARISQSTKTFTPCERLEKRRTSF
LEGTLRRSF\RTGSVVRQKVEEE
QMLDMWIKEEVSSARASIIDK
WRKFQGMNQEQAMAKYMALI
KEWPGYGSTLFDVEVRTGCHV
LGWAGCWHLRTWITAKFMWR
EDKMEHFALSTSFFRAPKIVPLT
PPFSSQFLFSCVVNASVILGMN
Figure imgf001436_0001
Figure imgf001437_0001
Figure imgf001438_0001
Figure imgf001439_0001
Figure imgf001440_0001
Figure imgf001441_0001
Figure imgf001442_0001
Figure imgf001443_0001
Figure imgf001444_0001
Figure imgf001445_0001
Figure imgf001446_0001
Figure imgf001447_0001
Figure imgf001448_0001
Figure imgf001449_0001
Figure imgf001450_0001
Figure imgf001451_0001
Figure imgf001452_0001
Figure imgf001453_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
10764 41 132 10826 99 2052 NPADGRTGFTLCATGEALSSAT
IAAAERRQALQEGCDDILS/HG
DADMVSMAR/PFLADAELLSK
AQSGRA/DEINTCIGCNQACLD
Q/IFVGKVTSCLVNPRACH/ETK
MPILPAVQKKNLA/VVGAGPAG
LAFA\FNIAKQIPGKEEFYD/TL/
RYYRRM I EVTG VTLKL/NHTVT
ADQLQAFDETILAVG/IVPRTPPI
DGIDHP/KVLSYPDVLRDKAPV/
SN/KVAIIGCGGIGFDTAM/YLS
QPGESTSQNIAGFC/NEWGIDSS
LQQADCDAPAKASKPGQGLGK
TTGWI/HRTTL/LSRGVKMIPGV
S/YQKIDDDGLHVVINGANQGS
DMLRLQGELDSASRIGRRSAAP
KLGSLNDGRCFINCHILDRLPY
KGLVPACSWNGGSLDPLEGRP
DEPHVEVLGIPPAAMLDQAPRL
ASTLNGCLGVAGPYEDERTQE
GLPGPRDTAAEEVLGVQTGGA
GSRGRVRGPQPADLPSASGPGA
ACWSMSPPPDQPPCVLAHGFFR
RTPTSTQQRPAEQCFHSPAPLEL
PLSSTSSILSFAEYGNAMAWLY
RGKILPSGTPDDLKAQSANDGN
PDPTMEKAFIQLIHDGIRSIAMS
NPILSCRRYGRYQEWCLTIPAV
FISAPYVRRYVRFHEHQEGGGV
RQEATVREIPLAIAVIPNSRTPC
10765 41133 10827 525 DAMVSSWNRAAPNTIPTAAKA
GGNYLSSLLVGSEARRHGYQE
GIALDVNGYISEGAGENLFEVK
DGVLFTP/PGFTSSALPGITRDAI
IKL\AKE\LGIEVREQVLSR/EKS
LYLADEVF\MSGT\AGEITPR\RS
GNGYSGLAKGRCGPVTKRIQH
ALQGVYSWVTPGYFESVRCTA
10766 41 134 10828 573 REPAGENGPPVCDGAMLDLKA/
WGSECHQQLTGRDNQQIKRSIY
LLPERAKLAELRLLVIPGQVDY
LQHIEELAAFIKGLGDVPVRLN
AFHAHGVYGEAQSWASATPED
VEPLADALKVRGTCRPDKTRQ
ASHQALCANCRMRRERLIRPTV
RHKPVG M I RRDKRRIRH Y APT A
GCGVNALSGLRFGTDL
Figure imgf001455_0001
Figure imgf001456_0001
Figure imgf001457_0001
Figure imgf001458_0001
Figure imgf001459_0001
Figure imgf001460_0001
Figure imgf001461_0001
Figure imgf001462_0001
Figure imgf001463_0001
Figure imgf001464_0001
Figure imgf001465_0001
Figure imgf001466_0001
Figure imgf001467_0001
Figure imgf001468_0001
Figure imgf001469_0001
Figure imgf001470_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
10872 41240 10934 2358 VPMVVVPPVGAKGNTPATGTT
QGKKAEGTQNQSKKAEGAPNQ
GRKAEGTPNQGKKTEGTPNQG
KKAEGTPNQGKKA\EGTPNQG
KKAEGAHNQGKKVDTTPNQG
KKVEGAPTQGRKAEGAQNQA
KKVEGAQNQGKKAEGAQNQG
KKAEGAQNQGKKAEGAQNQG
KKAEGGQNQGKKTEEAQKQG
KKAEGAQIQGKKNEGAQTQGK
KAEGAQNQGKKNEGAQTQGK
KAEGAQTQGKKADGAQNQGK
KAEGAQNQGKKAEGAQNQGK
KAEGAQNQGKKADGAQNQGK
KAEGAQNQGKKAEGAQNQGT
KAEGAQNQGKKAEGAQNQGK
KAEGAQNQGKKAEGAQNQGK
KAEGAQNQGKKAEGAQNQGK
KAEGAQNQGKKAEGAQNQGK
KAEGAQNQGKKAEGAQNQGK
KAEGAQNQGKKAEGAQNQGK
KVEGAQNQGKKAEGAQNQGK
KAEGAQNQGKKAEGAQNQGQ
KGEGAQNQGKKTEGAQGKKA
ERSPNQGKK\GEGAPIQGKKAD
SVANQGTKVEGITNQGKKAEG
SPSEGKKAEGSPNQGKKADAA
ANQGKKTESAFVQGKNTDVAQ
SPEAPKQEAPAKKKSGSKKKGE
PGPPDADGPLYLPYKTLVSTVG
SMVFNEGEAQRLIEILSEKAGII
QDTWHKATQKGDPVAILKRQL
10873 41241 10935 967 1574 SLKCLIISLSPVHSYPGKRPEVSF
SPINSGQPPNYEMLKEEHEVAV
LGVAYNPAPPTSTVIHIRNETSV
PDHV/VLVPVQHSLHEP\CCLGF
IAFA YSVKSRH RKMVGDLTGA
Q/ALCLHRQVPEHLGPDFGHPH
DHSAHHHPSVDLPSLPSSFLLLF
LFLFLPA LQLLFLSFSLVFFFFFS
FFPLFQRLELWGWGEWDHHST
10874 41242 10936 370 RVATIVDIWEKSISGKKNSQNL FVNPCCLGFIAFAYSVKSRDRK MVGDLTGAQA/LCLHRQVPEH LSPDCGHPHDHSAHHHPGVDL PSLSIDQEASSRPGARALQDGA WASVFPSVQWEQQGW
Figure imgf001472_0001
Figure imgf001473_0001
Figure imgf001474_0001
Figure imgf001475_0001
Figure imgf001476_0001
Figure imgf001477_0001
Figure imgf001478_0001
Figure imgf001479_0001
Figure imgf001480_0001
Figure imgf001481_0001
Figure imgf001482_0001
Figure imgf001483_0001
Figure imgf001484_0001
Figure imgf001485_0001
Figure imgf001486_0001
Figure imgf001487_0001
Figure imgf001488_0001
Figure imgf001489_0001
Figure imgf001490_0001
Figure imgf001491_0001
Figure imgf001492_0001
Figure imgf001493_0001
Figure imgf001494_0001
Figure imgf001495_0001
Figure imgf001496_0001
Figure imgf001497_0001
Figure imgf001498_0001
Figure imgf001499_0001
Figure imgf001500_0001
Figure imgf001501_0001
Figure imgf001502_0001
Figure imgf001503_0001
Figure imgf001504_0001
Figure imgf001505_0001
Figure imgf001506_0001
Figure imgf001507_0001
Figure imgf001508_0001
Figure imgf001509_0001
Figure imgf001510_0001
Figure imgf001511_0001
Figure imgf001512_0001
Figure imgf001513_0001
Figure imgf001514_0001
Figure imgf001515_0001
Figure imgf001516_0001
Figure imgf001517_0001
Figure imgf001518_0001
Figure imgf001519_0001
Figure imgf001520_0001
Figure imgf001521_0001
Figure imgf001522_0001
Figure imgf001523_0001
Figure imgf001524_0001
Figure imgf001525_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
11425 41793 A 1493 1955 FVSCQAPWSHPPAQLSPVGADA
MTLTLSVLICLGLSVGPRTCVQ
AGTLPKPTLWAEPASVIARGKP
VTLWCQGPLETEEYRLDKEGLP
WARKRQNPLEPGAKAKFHIPST
VYDSAGRYRCYYETPAGWSEP
SDPLELVATGFYAEPTLLALPSP
VVASGGNVTLQCDTLDGLLTF
VLVEEEQKLPRTLYSQKLPKGP
SQALFPVGPVTPSCRWRFRCYY
YYRKNPQVWSNPSDLLEILVPG
VSRKPSLLIPQGSVVARGGSLTL
QCRSDVGYDIFVLYKEGEHDL
VQGSGQQPQAGLSQANFTLGP
VSRSHGGQYRCYGAHNLSPRW
SAPSDPLDILIAGL1PDIPALSVQ
PGPKVASGENVTLLCQSWHQID
TFFLTKEGAAHPPLCLKSKYQS
YRHQAEFSMSPVTSAQGGTYR
CYSAIRSYPYLLSSPSYPQELVV
SGPSGDPSLSPTGSTPTPGPEDQ
PLTPTGLDPQSGLGRHLGVVTG
VSVAFVLLLFLLLFLLLRHRHQ
SKHRTSAHFYRPAGAAGPEPKD
QGLQKRASPVADIQEEILNAAV
KDTQPKDGVEMDARAAASEAP
\QDVTYAQLHSLTLRREATEPPP
S\QER£PPAEPSIYAPLAIHLAHG
GPRSHTQQKETQRLQKARELPP
VDTNEPQPAWDP
Figure imgf001527_0001
Figure imgf001528_0001
Figure imgf001529_0001
Figure imgf001530_0001
Figure imgf001531_0001
Figure imgf001532_0001
Figure imgf001533_0001
Figure imgf001534_0001
Figure imgf001535_0001
Figure imgf001536_0001
Figure imgf001537_0001
Figure imgf001538_0001
Figure imgf001539_0001
Figure imgf001540_0001
Figure imgf001541_0001
Figure imgf001542_0001
Figure imgf001543_0001
Figure imgf001544_0001
Figure imgf001545_0001
Figure imgf001546_0001
Figure imgf001547_0001
Figure imgf001548_0001
Figure imgf001549_0001
Figure imgf001550_0001
Figure imgf001551_0001
Figure imgf001552_0001
Figure imgf001553_0001
Figure imgf001554_0001
Figure imgf001555_0001
Figure imgf001556_0001
Figure imgf001557_0001
Figure imgf001558_0001
Figure imgf001560_0001
Figure imgf001561_0001
Figure imgf001562_0001
Figure imgf001563_0001
Figure imgf001564_0001
Figure imgf001565_0001
Figure imgf001566_0001
Figure imgf001567_0001
Figure imgf001568_0001
Figure imgf001569_0001
Figure imgf001570_0001
Figure imgf001571_0001
Figure imgf001572_0001
Figure imgf001573_0001
Figure imgf001574_0001
Figure imgf001575_0001
Figure imgf001576_0001
Figure imgf001577_0001
Figure imgf001578_0001
Figure imgf001579_0001
Figure imgf001580_0001
Figure imgf001581_0001
Figure imgf001582_0001
Figure imgf001583_0001
Figure imgf001584_0001
Figure imgf001585_0001
Figure imgf001586_0001
Figure imgf001587_0001
Figure imgf001588_0001
Figure imgf001589_0001
Figure imgf001590_0001
Figure imgf001591_0001
Figure imgf001592_0001
Figure imgf001593_0001
Figure imgf001594_0001
Figure imgf001595_0001
Figure imgf001596_0001
Figure imgf001597_0001
Figure imgf001598_0001
Figure imgf001599_0001
Figure imgf001600_0001
Figure imgf001601_0001
Figure imgf001602_0001
Figure imgf001603_0001
Figure imgf001604_0001
Figure imgf001605_0001
Figure imgf001606_0001
Figure imgf001607_0001
Figure imgf001608_0001
Figure imgf001609_0001
Figure imgf001610_0001
Figure imgf001611_0001
Figure imgf001612_0001
Figure imgf001613_0001
Figure imgf001614_0001
Figure imgf001615_0001
Figure imgf001616_0001
Figure imgf001617_0001
Figure imgf001618_0001
Figure imgf001619_0001
Figure imgf001620_0001
Figure imgf001621_0001
Figure imgf001622_0001
Figure imgf001623_0001
Figure imgf001624_0001
Figure imgf001625_0001
Figure imgf001626_0001
Figure imgf001627_0001
Figure imgf001628_0001
Figure imgf001629_0001
Figure imgf001630_0001
Figure imgf001631_0001
Figure imgf001632_0001
Figure imgf001633_0001
Figure imgf001634_0001
Figure imgf001635_0001
Figure imgf001636_0001
Figure imgf001637_0001
Figure imgf001638_0001
Figure imgf001639_0001
Figure imgf001640_0001
Figure imgf001641_0001
Figure imgf001642_0001
Figure imgf001643_0001
Figure imgf001644_0001
Figure imgf001645_0001
Figure imgf001646_0001
Figure imgf001647_0001
Figure imgf001648_0001
Figure imgf001649_0001
Figure imgf001650_0001
Figure imgf001651_0001
Figure imgf001652_0001
Figure imgf001653_0001
Figure imgf001654_0001
Figure imgf001655_0001
Figure imgf001656_0001
Figure imgf001657_0001
Figure imgf001658_0001
Figure imgf001659_0001
Figure imgf001660_0001
Figure imgf001661_0001
Figure imgf001662_0001
Figure imgf001663_0001
Figure imgf001664_0001
Figure imgf001665_0001
Figure imgf001666_0001
Figure imgf001667_0001
Figure imgf001668_0001
Figure imgf001669_0001
Figure imgf001670_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
13345 43713 13425 1307 1992 DKLNPVAHQKAYLAQSSQCHP
QNARLVQHTQIRILPISEPPSNRI
FACWGKPAWTACCN/FSQGQA
VKGNQLLPVSLVKRKTTLAPNT
QTASPRALADSLMQLARQVSR
LESGQDFADFGTTIKQDFRLLG
QTSVDRLLQLSQGQAVKGNQL
LPVSLVKRKTTLAPNTQTASPR
ALADSLMQLARQVSRLESGQD
FADFGTTIKQDFRLLGQTSVDR
LLQLSQGQAVKGNQLLPVSLV
KRKTTLAPNTQTASPRALADSL
MQLARQVSRLESGQDFADFGT
TI KQDFRLLGQTS VDRLLQLSQ
GQAVKGNQLLPVSLVKRKTTL
APNTQTASPRALADSLMQLAR
QVSRLESGQLKHKEVESPNRPI
TSSEIQVLIKSLPIKKSPGPDGFT
AEFYERYKEELVPFLLKLFQTIE
QERLLPNSFYEASIILTSKPSRD
ATKKDNFRPTSLMNIYAKILNK
IGQTESSSTSKSLFSTIKSVSSPE
CKAGSTYANQDFADFGTTIKQ
DFRLLGQTSVDRLLQLSQGQA
VKGNQLLPVSLVKRKTTLAPNT
QTASPRALADSLMQLARQVSR
LESGQETTPWSPFSDPVLAFDY
KVFMAFVYNVVRVSQRMVVA
GCISAFNDKLLNDLICFPVAPVS
SKQIFPSDVKVAEPDHSCVSSY
RIGSMAPNLTLSEMPRHNYFPA
GNSVDHDNEFEL
13346 43714 13426 560 844
13347 43715 B 13427 200 1519
13348 43716 13428 950 1230 CSWHDRFPDWKAGRILPISEPPS
NRIFACWGKPAWTACCN/FSQG
QAVKGNQLLPVSLVKRKTTLA
PNTQTASPRALADSLMQLARQ
VSRLESGQDFADFGTTIKQDFR
LLGQTSVDRLLQLSQGQAVKG
NQLLPVSLVKRKTTLAPNTQTA
SPRALADSLMQLARQVSRLESG
QDFADFGTTIKQDFRLLGQTSV
DRLLQLSQGQAVKGNQLLPVS
LVKRKTTLAPNTQTASPRALAD
SLMQLARQVSRLESGQDFADF
GTTIKQDFRLLGQTSVDRLLQL
SQGQAVKGNQLLPVSLVKRKT
TLAPNTQTASPRALADSLMQLA
RQVSRLESGQ
Figure imgf001672_0001
Figure imgf001673_0001
Figure imgf001674_0001
Figure imgf001675_0001
Figure imgf001676_0001
Figure imgf001677_0001
Figure imgf001678_0001
Figure imgf001679_0001
Figure imgf001680_0001
Figure imgf001681_0001
Figure imgf001682_0001
Figure imgf001683_0001
Figure imgf001684_0001
Figure imgf001685_0001
Figure imgf001686_0001
Figure imgf001687_0001
Figure imgf001688_0001
Figure imgf001689_0001
Figure imgf001690_0001
Figure imgf001691_0001
Figure imgf001692_0001
Figure imgf001693_0001
Figure imgf001694_0001
Figure imgf001696_0001
Figure imgf001697_0001
Figure imgf001698_0001
Figure imgf001699_0001
Figure imgf001700_0001
Figure imgf001701_0001
Figure imgf001702_0001
Figure imgf001703_0001
Figure imgf001704_0001
Figure imgf001705_0001
Figure imgf001706_0001
Figure imgf001707_0001
Figure imgf001708_0001
Figure imgf001709_0001
Figure imgf001710_0001
Figure imgf001711_0001
Figure imgf001712_0001
Figure imgf001713_0001
Figure imgf001714_0001
Figure imgf001715_0001
Figure imgf001716_0001
Figure imgf001717_0001
Figure imgf001718_0001
Figure imgf001719_0001
Figure imgf001720_0001
Figure imgf001721_0001
Figure imgf001722_0001
Figure imgf001723_0001
Figure imgf001724_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
13950 44318 14032 2685 MGDFNTPLSTLDRSTRQKVNK
DTQELNSAPHQADLIDIYRTLH
PKSTEYTFFSAPHHTYSKTDHIL
GSKALLSECKRTEIITNYLSDDS
AIKLELRIKNLTQNRSTTWKLN
NLLLDDYWVHNEMKAEIKMFF
ETNENKDTTYQNLWDAFKAVC
RGKFVALNAHKRKQGRSKIDT
LTSQLKELEKQEQTHSKASRRQ
EITKIRAELKEIETQKTVQKINES
RSWFFERINKIDRQLARLIKKKR
EKNLIDAIKNDKGDITTDPTEIQ
TTIREYYKHLYANKLENLEEM
DKFLDTYTLPRLNQEEVESLNR
PITGSEIVAIINSLTTKKSPGPDG
FTAEFYQRAIRQEKEIKGIQLGK
EEVKLSLFADDMIVYLENPIVS
AQKLISNFSKVSGYKINVQKSQ
AFLYTNNRQTESQIMSELPFTIA
SKRIKYLGIQLTRDVKDLFKEN
YKPLLKEIKEDTNKWKNIPCSW
VGRINIVKMAILPKVIYRFNAIPI
KLPMTFFTELKKTTLΝFIWΝQK
\RAHIAKS/VLSQKΝKAGGITLP
DFKLYYKATVTKTAWYWYQN
RDTDQWNRTEPSEIMPRIYNYL
IFDKPEKNKQWGKDSLFNKWC
WKNWLAICRKLKLDPFLTPYT
KINSRWIKDLNIRPKTIKTLEEN
LGITIQDIGMGKDFMSKTPKAM
ATKAKIDKWDLIKLKSFCTAKE
TTNRVNRQPTKWEKIFATYSSD
13951 44319 14033 845 1616 ARAEVKLSLFADDMIVYLENPII
* ARAEVKLSLFADDMI VYLENP
IISAQNLLKLISKFSKVSRYKINV
QKSQAFLYTNNRQTESQIMSEL
PFTIATKRIKYLGIQLTRDVKDL
FKENYKPLLNEIKEDTNKWKNI
PCSWIGRINIVKMAILPKVIYRF
SAIPIKLPMTFFTELEKKNWLAI
CRKLKLDFFFIPYTKINSRWIKD
LNVRPKTMKTLEESLGNTIQDI
GIGKDFMTKTPKAMATKA/KKS
FCTAKETTIRVNRQPTEWEKIF
AIYPSDKGLIS
Figure imgf001726_0001
Figure imgf001727_0001
Figure imgf001728_0001
Figure imgf001729_0001
Figure imgf001730_0001
Figure imgf001731_0001
Figure imgf001732_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /^possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possiblc nucleotide insertion) sequence
13990 44358 14072 2156 MGKKQSRKTGNSKNQSASPPP
KERSSSPAMEQSWMENDFDDL
REEGFRQSNYSELKEEVRTHGK
EVKNLEKKLDEWLTRITNAEKS
LKDLMELKTTARELHDECTSLK
RVSVTEDQMNEMKREEKFREK
RIKRNEQSLQEIWDYVKRPNLH
LIGVPESDRENGTKLENTLQDII
QENFPNLVRQANIQIQEIQRTPQ
RYSSRRATPRHIIVRFTKVEMK
EKMLRAAREKGRVTHKGKPIR
LTADLLAETLQARREWGPIFNI
LKEKNFQPRISYPAKLSFIDRST
RQKVNKDTQELNSALHQADLI
DIYRTLHPKSTEYTFFSAPHHTY
SKIDHIVGSKALLSKCKRTEIIT
NCLSDHSAIKLELRIKKLTRNHS
TTWKLNNLLLNDYWVHNEMK
A EIKMFFETNENKDKTYQNL W
DTFKTVCRGKFIALNAHKRKQE
RSKIDTLTSQLKGLEKQEQTHS
KASRRQEITKIRAELKEIETQKT
LQKINESRSWFFEKINKIDRPLA
RLIKNKREENQIDAIKNDKGDIT
TDPTEIQTAISEYYKHLYANKL
ENLEEMDKFLDTYTLPRLNQEE
VESLNRPITGSEIEAIINSLPTKK
SPGPDG FM RRILPEGQGGAGTI
PSETVPINRKRGNPP*/PHFM/SA
SIMLIPKPGRDTTKKENFRPISL
MNIDAKI/RQ*NTGKPNPAAHQ
KAYPP*SSGLHPWDARLVQLTQ
Figure imgf001734_0001
Figure imgf001735_0001
Figure imgf001736_0001
Figure imgf001737_0001
Figure imgf001738_0001
Figure imgf001739_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
14026 44394 14108 146 2616 LAAVGMASHCFRQRRQSQSPA\
GLRGAGSAYSPA\EGARSSHGQ/
PPWRGRRRGPGDPVRGAARCR
RPRAAAQPARPAARPPARPRPA
P*AACCE/VPSAANSVAPSAAM
*SSAGRDGASSQRSRRPHAAPP
RSSALPANGK/PPSPVTRAARTC
GARSSNTGA*SAARPKHSAASS
RHPPPAA/MPLPARHA PGGTSG
RPLPTPQGSAPTSSASPRGPATP
PPAGPRAPPPSPRRAAAPRPPGR
QLSPAPGTPAAGGGGPPFPALR
*SGSVFGRQLVGDGERGRACG
ARGSAAVPALRTFVVSPSTVVA
GAACPLAVALPAPDPLPCSE*A
PLRSG*VPQPASLFCSQFRPQA
WQHP*LLRAPCPRLGAAVPCV
WCHSRALDSATPRWM*G*RLP
AVWL\PAFPSPPFNQAFVPPCQQ
NCSLQNRQTLSASAKIGTEEWD
IAIKIPENVEVTLELGNEQRLEE
FGGLRRRQEDEGKFGTPTDWL
NGCDQNADSNMDSEGQGEDG
GCSFGGAGLRGSGWGDQTRET
FLALKANLRWPFRSSAQSQTSQ
EGAAGARSGLKGGKKNPLKLP
KKQAKKMNKAFKQKQEKERK
KHKELKAKASWKGLRPQVEFK
NLAERRGEAAVPGCPVVAGHS
WRLLSLARGCAKRARQLLFPSE
KLLAALREGSTRGGGAASQPQ
RKLRVCMRVLDSSVEDFHMDG
14027 44395 14109 885 MIISIDTEKALDKILHPFILKTLN
KLGIDGTYLKIIRALYDKPTANI
IMNEQKLEAFPLKSNTKQGCPF
LPLLFNIVLEVLARAIKQEKEIK
SIQIGREEVKLSLFADDMIVYLE
NPIISAPNLLKLISNFSKVSGYKI
NVQKSQAFLYTSNRQTEIQMM
SELPFTIATKGIKYLGI\HPLTRD
VKDLFKENSKPLL/KELEKTTLN
FIWNQKRAHIAKTMLSKKNKA
GGVMLPDFKLYYKAAVTKTA
WLGDDQVFLIPRGHISDYHMG
KNLGQYPAFQDRGPCGFPQCIV
PLVYRD
Figure imgf001741_0001
Figure imgf001742_0001
Figure imgf001743_0001
Figure imgf001744_0001
Figure imgf001745_0001
Figure imgf001746_0001
Figure imgf001747_0001
Figure imgf001748_0001
Figure imgf001749_0001
Figure imgf001750_0001
Figure imgf001751_0001
Figure imgf001752_0001
Figure imgf001753_0001
Figure imgf001754_0001
Figure imgf001755_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /^possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
14116 44484 14198 560 MAALLLVGASWL/LLLLVGAS
WLLARVLAW/TYAFYDNCHRL
QCFQQ/PPKRNCF*GHLSLCGA/
TEEDMRLMEDL/VPLFRDVQL
WWLGSFYPVL/SLVHPTFTAPV
LQASGG*PGFTTTMGLSFKRAK
WQRLA/SEGS VRLEMFEHI SLM
TLDSLQKCIFSFDSHCQMVSPG
SCTTSRGPRIPGALPTGGARAPE
NGDPKEIE
141 17 44485 14199 175 2225 RWSSVSFASGMPSAPCCWGHP
QCPSAAQPTGPSWAGCRGLTM
PNGQASPAS*VPPGAFPFLCIKQ
PLWENLMTGVGKFSREAATLE
GPTLPTR/GVGPQCRQIADLPLE
RTEGWRGD\GNPVRGVRQKRK
QLVAGETAASLLPCQASGESCS
VP*QARGPPVLFLPQ\WGRRWA
KGCASRSCF*PDFRPSHAVPGA
REGAGLAAAPKRVARLSPAVPP
ACC\GLQGGSSSAAQAL*PQLS
GPRLPREK*GA/PCWRGGPRCC
PQNSAPETLEYVEPRGTGAPGT
THTGLPSPSQGCRGAEGW/WGL
PGLPGPRAGSSPQPRGRAWRPR
ARTPPPASWAAKGRGRPLAAG
GLARPRDPGPGSGVAPAGSLRP
HSSGRGWGRRRLRARPTGGNR
GSEWPR/RPPGYPGARPGAPSVP
RPQP/PAA*TRPLPVGCGARPFH
PVPATVWPHAGPP/SGMGRRRG
RYPRPASARPAGRCGAGSAEPG
RGTDKGGDGGEGKTKRGGG/L
GGG AG APKAAPG AAEPMGG R
GAGAGGGGA\R*GGGGGGGSG
TTGSGSSSRPRPPRTRAPPETDP
APRPRPGLVRAAGGGAASGSG
TGRGAEGCGVRAARGVPASLR
AALARGAQARVLVSGPAPGWA
EPRPAGGARGVGVWGPVLAGP
GRGPRGGAGRGPRALSRGCAA
*AWTPGRGAGQRGPGSGRGSG
Figure imgf001757_0001
Figure imgf001758_0001
Figure imgf001759_0001
Figure imgf001760_0001
Figure imgf001761_0001
Figure imgf001762_0001
Figure imgf001763_0001
Figure imgf001764_0001
Figure imgf001765_0001
Figure imgf001766_0001
Figure imgf001767_0001
Figure imgf001768_0001
Figure imgf001769_0001
Figure imgf001770_0001
Figure imgf001771_0001
Figure imgf001772_0001
Figure imgf001773_0001
Figure imgf001774_0001
Figure imgf001775_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
14235 44603 14318 2340 MFVWRNVEGHSAAVFSWYSIP
FLTPPCSHTRPSNLPVTQWPPTR
ENNLPSWQPLLMSVHQAQSLS
ALRKEQDSSSEKDGRSPNKSDK
DHIRWPMSGAHDLQQAAPGPG
GAHQGHPNQDNRTVSQMLSER
WYTLGPNEMQKYNLAFQNVE
ARIRYVIVRGNEQGFFRMHHLR
GVSSLQLGRRRPGPGTYRLEW
DMITIDARYRCSGLRSPPDPDH
LKSVDLHRNEHFFLQKARWEIH
LVLQATPSQIQLLHVAFVLAKQ
PLRCRKQTLFLISWGTKEKNV
ASEGRNPVRPLRGHRWLPASPP
AAEHFAGGADRKSRLSRAPRSA
ATATPANEQRAVADVGQWAR
ARGGRVRAAGRSGSCALRRSR
CRRRRRRHRRHRYRGRAEPLA
PRRAASGQPGKLRARRGINCNA
SSIPMLQWRIKSISKHCQMAR
VAGEGSLPLSHAQLTHCVFGN
KQVQRTIAYLPPQQQGVQGFL
KNTMSNGYEDHMAEDCRGDIG
RTNLIVNYLPQNMTQDELRSLF
SSIGEVESAKLIRDKVAVGLLSL
CVPIRNVEIVASTSWWSLPLAP
YVPRGIRRSQPSTLMDDLSCRS
HQAAAPFTASVLLYISGRVKAH
TAAQGFNHIPEEGVSQEAGSG
WSQAWKDGWLTNVFMFSCTG
HSLGYGFVNYVTAKDAERAIN
TLNGLR PPVKNH*GRPLQLGFL
14236 44604 14319 701 1050 LIVHFCLQVGCGVGNTVFPILQ
TN/NTNSEYDPSRCFAFVHDLC
DEEKSYPVPKSSLDIIILIFVLSAI
VPDKMQKAINRLSRLLKPGGM
MLLRDYGRYDMAQLRFKKGIL
KVLNPNH
14237 44605 14320 701 1519 LIVHFCLQVGCGVGNTVFPILQ
TNNDPGLFVYCCDFSSTAIELV
QTNSEYDPSRCFAFVHDLCDEE
KSYPVPKGSLDIIILIFVLSAIVP
DKMQKAINRLSRLLKPGGMVL
LRD YG RYDM AQLRFKKGQCLS
GNFYVRGDGTRVYLCHTRCNW
TRFSPTAGLEKPQKPWDRRLQ
VNRGKTTDNVPGLDSVQILQAP
SVQHQLRGYLLPTRCKPVWS
GLFLKKKIVALGW/RCL*SQPL
RRLRRGGSIE/HQQSNLGKIVRD
PVSESNDKIKRI
Figure imgf001777_0001
Figure imgf001778_0001
Figure imgf001779_0001
Figure imgf001780_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
14269 44637 14352 2380 MMWSVLLQPVVLAAVERQGE
ALEFLIGVPESDGENGTKLENT
LQDIIQEDFPNLARQAHVQIQEI
QRTPQRYSLRRATPRHIIVRFTK
VEMKEKMLRAAREKGRVTLK
GKPIRLTADLSAETLQARRDRG
PIFNILKEKNFQPRISYPAKLSFT
SEGEIKYFTDKQMLRDFVTTRP
ALKELLKEALNMERNNRNFFG
VTFAFFVGGQSLGKALQETSIPF
LREEDLEKTSILETGNGDWEAT
RGLPEECAGMRERDCQQNEKY
QVPQFDQSTIKNIESAKGLDVW
DSWPLQNADGTVAEYNGYHV
VFALAGSPKDADDTSIYMFYQ
KVGDNSIDSWKNAGRVFKDSD
KFDANDPILKDQTQEWSGSATF
TSDGKIRLFYTDYSGKHYGKQS
LTTAQVNVSKSDDTLKINGVED
HKTIFDGDGKTYQNVQQFIDEG
NYTSGDNHTLRDPHYVEDKGH
KYLGLIPTYDISSNIYLISKSCLW
QANFRLNSALAQSPAITNFQFR
GHSLKTLDLLTMKNLDSKVNII
PVIAKADTVSKTELHKFKIKFM
SELVSNGVQIYQFPTDDDTIAK
VNAAMNGQLPFAWGSMDEIK
VGNKMVKARQYPWGWQVEN
ENHCDFVKLREMFICTNMEDL
REQTHTRUYELYRVRLQLEEMG
FTDWAPKNKPASVQETYEAKR
HEFHGERQRKEEEMKQMFVQR
14270 44638 14353 105 328
14271 44639 14354 107 676 IRPDSLPSASPAQDFYAASSNTF
PA/DRWSALTKIYEDQTSHNIPL
SQ/TLIQSKALTLFNSVKAEKDE
EAAEEKFEVTRWHIRGLIDGV
VEADIVEALQEFGLISYVMVMS
KKRQALVEFEDVLEACNAVNY
TADNQIHIAGHPAFVNYSTSQKI
SRPGNSDDPEREQCASLYHPEP
HLFDHHGCSLHYR
14272 44640 14355 221 648 LD WLRRV VG I PSLRWTC ATL V
GTRRAARNRCVNVWECGGLKE
ITY*SVEGLCRA/EKPKPLALLE
ETCKLPERQVVNAKEKFLEEMK
SATPVNT*MIRKRNCIITDMEK
V*AVWIEDQTSHNLPLSQSLIES
RALTLFNYVMA
14273 44641 14356 122 317
Figure imgf001782_0001
Figure imgf001783_0001
Figure imgf001784_0001
Figure imgf001785_0001
Figure imgf001786_0001
Figure imgf001787_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
14304 44672 14390 723 PVLRGGIQEFAYIKALYERKFL
VPKPIDYNRHAVDMELINGYPL
C*IHHVEDPASVYDEAMELIVE
LANHGLIHGDFNEFNLILDESD
H ITMIDFPQMDSTSH PN AE WYF
DRDVKCIIDFFMKRFSYESELFP
TFKDIRREDTLDVEVSASGYTK
EMQADDELLHPLGPDDKNIETK
EGSEFSFSDGEVAEKAEVYGSE
NESERNCLEESEGCY\SDHLETL
NK*RKTVYQKRVLMHGVLK
14305 44673 14391 1889 MGSKVKEVEDDSQMSDLVRG
MNAFSKTGSGGVEQGRTGQGG
TNGHIKKAKARDRDRRFKKTP
QRKVGNDNLQLPLRWTEQHAE
THIMNFCSKNYNRNIPGKPRESI
DTLNEAACRGRLSKTARKLWS
EAQDQRSFQATEMQAFLFSAK
RIAAEKKTIIRRRDLCSIWGKLL
PRPGSQSVPAQIPENLQEQEERA
KWLLNGRSKFFPQTTSTLGKVT
SNKTGVKEPELESRQPNLLKVE
MGMKNHEIVPGSLIASIASLKH
GGCNKVLRELVKHKLIAWERT
KTVQGYRLTNAGYDYLALKTL
SSRQWESVG\NQMGVGK\ESYT
YIVANEEGQQFALKLHRLGRTS
FRNLKNKRDYHKHRHNVSWL
YLSRLSAMKEFAYMKFRTLEIR
D YFYWVKINVLTG PCSPKRGSS
GEFVPCLLQLLLGFGDCWHSFG
LWHITPVSVSMLTLLPSLLLVK
QKKPTLDSRMGQFDFWELQL
YLCVLEREVIREYPPDGQIFGRY
FDRDVKCIKDFFMKRFSYESEL
FPTFKDIRREDTLDVEVSASGY
TKEMQADDELLHPLGPDDKNIE
TKEGSEFSFSDGEVPEKAEVYG
SENESERNCLEESEGCYλSDHLE
TLNK*RKTVYQKRVLMHGVLK
Figure imgf001789_0001
Figure imgf001790_0001
Figure imgf001791_0001
Figure imgf001792_0001
Figure imgf001793_0001
Figure imgf001794_0001
Figure imgf001795_0001
Figure imgf001796_0001
Figure imgf001797_0001
Figure imgf001798_0001
Figure imgf001799_0001
Figure imgf001800_0001
Figure imgf001801_0001
Figure imgf001802_0001
Figure imgf001803_0001
Figure imgf001804_0001
Figure imgf001805_0001
Figure imgf001806_0001
Figure imgf001807_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
14435 44803 14521 2616 TTYQNLWDAFKAVCRGKFIAL
NVHKRKQERSKIDTLTSQLKEL
EKQEQTHSKASRRQEITKIRAEL
KEIETQKTLQKINESRSWFFERI
NKIDRPLARLIKKKREKNQIDAI
KNDKGDITTNPTEIQTTIREYYK
HLYANKLENLEEMHKFLDTYT
LPRLNQEEVESLNRPITGAEIVA
IINSLPTKKSPGPDGFTGEFYQR
YKEELVPFL/LKLFQSIEKEG M
NIDAKILNKILANRIQQHIKKLI
HHDQVGFIPGRQGWFNICKSIN
VIQHINRAKDKNHMIISIDAEKA
FDKIQQLFMLKTLNKLGIDGTY
FKIIRAIYDKPTANIILNGKKLEA
FPLKTGTRQGCPLSPLLFNIVLE
VLARAIRQEKEIKGIQLGKEEV
KLSLFADDM I V YLEΝPI VSAQΝ
LLKLISNFSKVSGYKINVQKSQ
AFLYTNNRQTESQIMSELPFTIA
SKRIKYLGIQLTRDVKDLFKEN
YKPLLKEIKEDANKWKNIPCSW
VGRINIVKMAILPKVIYRFNAIPI
KLPMTFFTELEKTTLKFIWNQK
RAHITKAILSQKNKARGITLPDF
KLYYKATVTKTAWYWYQNRD
IDQWNRTQPSEITPHIYNYLIFD
KPDKNKQWGKGSLFNKWCWE
NWLAICRKLKLDPFLTPYTKIN
SRWIKDLNVRPKTTKTLEENLG
ITIQDIGMGMDFMSKTPKAMAT
KDKIDKWDLIKLKSFCTAKETT
Figure imgf001809_0001
Figure imgf001810_0001
Figure imgf001811_0001
Figure imgf001812_0001
Figure imgf001813_0001
Figure imgf001814_0001
Figure imgf001815_0001
Figure imgf001816_0001
Figure imgf001817_0001
Figure imgf001818_0001
Figure imgf001819_0001
Figure imgf001820_0001
Figure imgf001821_0001
Figure imgf001822_0001
Figure imgf001823_0001
Figure imgf001824_0001
Figure imgf001825_0001
Figure imgf001826_0001
Figure imgf001827_0001
Figure imgf001828_0001
Figure imgf001829_0001
Figure imgf001830_0001
Figure imgf001831_0001
Figure imgf001832_0001
Figure imgf001833_0001
Figure imgf001834_0001
Figure imgf001835_0001
Figure imgf001836_0001
Figure imgf001837_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possib!e nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
14635 45003 A 14721 696 1791 VWIALKSENITCELLLFKMQTQ
CQVLVYLFPIFQPSFLGKRCPIS
TDPQNLTDVSIFLLLEVSGDPEL
QPSPCWAVPCPCAWVTVLGNL
LIILAISPDSHLHTPMYFFLSNLS
LA\DIGFTSTTVPK\MIVDI\QSHS
RVISYAGCLTQMSLFAIFGGME
E\DMLLS VMA YD WFV A ICHPL
YHSAIMNLCSVGFLVLLSFFFFV
LSLSDSQLHΝLIALQMTCFKDV
EIPΝFFWEPSQLSHLACCDTFTΝ
ΝIIMYFPTAIFGFLPISGTLFSYY
KIVSSILRVSSSGGKYKAFSTCG
SHLSWC*FYGRGVGGYLSSDD
VSSSPRKGAVASVMYTWTPM
LΝPFIYSLRΝRDIKSVLRRPTEH
CLISKK
14636 45004 14722 1971 MTRKVTLLLRTWRGGCGRLA
MGMEATPQGSLHPLFQEHWIR
AEDFGVLSPMHPΝSVQGVDDM
IRLGDLΝEAGMVHΝLLIRYQQ
HKIYVSLPSPVSTGEPLTGAΝLL
DREKKMMHRGHCPIALTEGVS
GPDΝPEGRARAKAKAKARAEL
PLPGAEGTDHLLPSQTYTGSILV
AVNPFQVLPLYTLEQVQLYYSR
HMGELPPHVFAIAΝNCYFSMK
RΝKRDQCCIISPQΝRVGAQTDT
VLKVFQHLLISLICKTTLQACCI
PSHPYLSSAPGPPPLTTQCRSYS
DSLLGPQLFGDVCYΝCSHVIEG
DGEALCSLPHPTCSPGAWVLQL
VSPLΝKAWCVSCFSCSTCΝSKL
TLKΝKFVEFDMKPVCKRCYEK
FPLELKKRLKKLSELTSRKAQP
KATDLΝSAEGPLAQLGAVGA/V
TGITHEPHIPQVEΝGSQQLDDV
PALGV\EΝWRIFMAERMEAQG
GQRLLLSTMHEEYEFVSPSSVAI
AELVALFLEGLKERSIFAMALQ
DRKATEGGPVGLTKKQGLLAS
ENWTLGQNDRTGKTGLVPWH
CLYTIHTVTKHSDSCCTAVPTL
LEASSGCIFMSYKKHITLTPKSA
DSNKRPSTLGSQFKQSLDQLMK
ILTNCQPYFIRCIKPNEYKKPLG
FSDSFML1KLIRIDKVQYDSFSYI
Figure imgf001839_0001
Figure imgf001840_0001
Figure imgf001841_0001
Figure imgf001842_0001
Figure imgf001843_0001
Figure imgf001844_0001
Figure imgf001845_0001
Figure imgf001846_0001
Figure imgf001847_0001
Figure imgf001848_0001
Figure imgf001849_0001
EQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, O: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
14691 45059 14777 2091 MKPDDNSTQNEGGQSFEERHE
TLQRVKFRGGPKTAPGRKLIAF
LGQMNSGTSGGQRVPGEEELM
AGGATCLPNTEAHQPIRMKNL
HTPAFRLCPLTLQDNEADAFW
HQAFVDWGMPTGKKPQTLGLT
YKTTHDLSMQNS/EGGADSPAS
VALRPSAA/APPVPASPQMLPLE
LVILGW/PLDKCIPAQWSKEG
QTNSYTYNPEPQMYAPSST/AS
YFEAPGGAQ\MVGITMDV/GGS
PIVSSAGAYLIHGG/MDSTRHSL
AHTSRSSPARSFRSNSKAPSSRK
FCC*LPTLSSKTL/SLFSKNLLRK
LPLGKHSPDSVQWPWIYGLLV
AGSESLASLLFPKASSSGETLGF
ATGQLAGESRLPSLLAFILVPID
VAWGKEQLVTQTTVYRVQGS
GQHLNKAWQEPALVNICGMGE
KVDVQVGLCQHRAHLDGWNS
SRAASVWPGETLTHHQDQCQA
NLPPGIETSI LGTKG ARYPSPQE
QGGGRVWLERLDPRG/AVSGT
DL*CLPQSDVPPGRA*SGGTGD
KS I STHDKSGRGNSKYHY/YGI R
LKPDSPLNRLQED/TQYMAMR
QQPMHQKPRWCRAELGGAGSP
L\GMEDVSHVFPEFPAPD/LGSF
LLQDGVTLHDVKAL/QLVYRR
HC/EDW/MNLQFHYIEKLWLS
FWN/SKASSSDGPTSLPASDEDP
EGAVLP/KDKLISLCQCDPILRW
14692 45060 14778 94 3006 RTRSLTRKAMAEHAPRRCCLG
WDFSTQQVKWAVDAELNVFY
EESVHFDRDLPEFGHVLDVHG
VHVHKDGLTVTSPVLMWVQA
LDIILEKMKASGFEFSQVLALSG
AGQQHGSIYWKAGAQQALTSL
SPDLRLHQQLQDCFSISDCPVW
MDSSTTAQCRQLEAAVGGAQA
LSCLTGSRAYEFNLVCDRKHLK
DTTQSVFMAGLLVGTLMFGPL
CDRIGRKATILAQLLLFTLIGLA
TAFVPSFELYMALRFA\GL
Figure imgf001851_0001
Figure imgf001852_0001
Figure imgf001853_0001
Figure imgf001854_0001
Figure imgf001855_0001
Figure imgf001856_0001
Figure imgf001857_0001
Figure imgf001858_0001
Figure imgf001859_0001
Figure imgf001860_0001
Figure imgf001861_0001
Figure imgf001862_0001
Figure imgf001863_0001
Figure imgf001864_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
14771 45139 14857 1402 3004 TEPKTKTT*LSA*MQKGPLTKF
NNPSC*KLLIN/IVLEVLARAIRQ
EKEIKGIQVGKEEVKLSLFADD
MIVYLENPIVSAQNLLKLISNFS
KVSGYKINVQKSQAFLYTNNR
QTESQI MSELPFTI ASKRIK YLG I
QLTRDVKDLFKENYKPLLNEIK
EDTNKWKNILCSWVGRINIVK
MAILPKVIYRFNAIPIKLPMPFFT
ELEETTLKFIWNQKRARIAKAIL
SQKKKAGGITLPDFKLYYKATV
TKTAWYWYQNTDIEQWNRTEP
SEIMLHIYNYLIFDQPEKNKQW
GKDSLFNKWC WEN WLAICRKL
KLDPFLTPYTKINSRWIKDLNV
RPKTIKTLEENLG ITIQDISMGK
DFMSKTPKAKATKAKIDKWDL
IKLKSFCTAKETTI RVNRQPTK
WEKIFATYSSDKGLISRIYNELK
QIYKIKTNNPIEKWVKDMNRHF
SKEDIYAAKKHMKKCSSSLAIR
EMQIKTTMRYHLTAVRMAIIKK
SGNNSLIVGASMDTSLLSGPIPY
DIQHFDRFGSVTVRRGCAIEFY
Figure imgf001866_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
14774 45142 14860 5019 5574 RETRGEVKNSKNQSTSAPPKDC
SSLADPKKIQTTIREYYKHLYPN
KLENQEEMDKFLDTYTLPRLN
QEEVESLNRPITGSEIVAIINSLP
TKKSPGPDGFTAEFYQRYKEEL
VPFLLKLFQSIEKEGILPNSFYE
ASIILIPKLGRDTTKKENFRPISV
MNIDVKILNKILANRIQQHIKKL
IHHDQVGFIPGMQGWFNIHKSI
NVIQHINRTKDKNHMIISIHAEK
AFDKIQQHFMLKTLNKLGIDGT
YLKIIRAIYDKPTANIILNGQKL
EAFPLKIGTRQGCPLSPLLFNIV
LEVLARAIRQEKEIKGIQLGKEV
VKLSLFADDMIVYLENPIVSAQ
NLLKLISNFGKVSGYKINVQKS
QAFLYTNNRQTESQIMSELPFTI
ASKRIKYLGIQLTRDVKDLFKE
NYKPLLNEIKEDTNKWKNIPCS
WVGRINIVKMAVLPKLPMTFFT
ELEK\TTLKFIWNQKRARIAKSI
LSQKNKAGGITLPDFKLYYKAT
ATKTAWYWYQNRDLDQWNRT
EPSEITPHIYNYLIFDKPDKNKQ
WGKDSLFNKWCWENWLAICR
KLKLDPFLTPYTKINSRWIKDL
NIRPKTIKTLEENLGITIQDIGMG
KDFMSKTPKAMATKAKIDKW
DLIKQESFCTAKETTIRVNRQPT
KWEKIFATYSSDKGLISRIYSEL
KQIYKKKTNNPIKKWAKDMNR
HFSKEDIYAAKKHMKKCPSSLA
14775 45143 14861 1895 6140
14776 45144 14862 2875 MKAEIKVFFETNENKDTTYQNL
WDTFKAVCRGKFIALNAHKRK
QERSKIDTLTSQLKELEKQEQT
HSKASRRQEITKIRAELKEIQTQ
KTLQKINESRSWFFERINKIDRS
LARLIKKKREKNQIDTIKNDKG
DITTDPTEIQTTIREYYKHLYAN
KLENLEEMDKFLDTYTLPRLNQ
EEVESLNRPITGAEIVAIINSLPT
KKSPGPDGFTAEFYQSWAETQP
KKENFRPISLMNIDAKILNKILA
KRIQQHIK
Figure imgf001868_0001
Figure imgf001869_0001
Figure imgf001870_0001
Figure imgf001871_0001
Figure imgf001872_0001
Figure imgf001873_0001
Figure imgf001874_0001
Figure imgf001875_0001
Figure imgf001876_0001
Figure imgf001877_0001
Figure imgf001878_0001
Figure imgf001879_0001
Figure imgf001880_0001
Figure imgf001881_0001
Figure imgf001882_0001
Figure imgf001883_0001
Figure imgf001884_0001
Figure imgf001885_0001
Figure imgf001886_0001
Figure imgf001887_0001
Figure imgf001888_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
14905 45273 14992 473 2385 HSFLSFIVLMYLTMDLTRSQKH
GWASSPSHGTSPWPVRLRTSLA
KPCKWQVSLCWWLSASPGSSF
SLVRSSLLIIVPHAESVSRRPSLL
SRVGLFPARY*HRVWQRSPSPQ
TRDHLIKRNPSQDV*RRQQPRG
TRVSNλKCSLHTSALGIGRQPSC
DATL*GQQGLP/VK*KRQQPAP/
MSQQQTGAALEVLARAIRQEK
EIKGIQLGKEEVKLSLFADDMI
VYLENPTVSAQNLLKLISNFSK
VSG YKIN VQKSQAFL YTNNRQ
TESQIMSELPFTIASKRITYLGIQ
LTRDAKDLFKEΝYKPLLKEIKE
DTNKWKNIPCSWVGRINIVKM
A I LPKEPG WARSLVFGARGLQG
QGRCGQSRGGGGSCLGGGGST
EΝWQHREPSARKTGSTEΝRRH
REPAAPRTGSTEΝQHREPAAPR
TGTEΝRNREPSARKTGSTEΝRR
HREPAAPRTGSTEΝQHREPAAP
RTGTENRHQEPAAPRTGGTENR
QHREPAAPRTGTENRQHREPAP
RTGSTENRQHREPAPRTGSTEN
RQHRESAAPRTGSTENRHQEPA
APRTGTENRHREPSAPRTVGTE
NRRHRELVAPRTGGTENRPREP
PAGTRPRHTPMRGGSEGALFH
QRRFLEPTPDREQLEGIRYPPVA
Figure imgf001890_0001
Figure imgf001891_0001
Figure imgf001892_0001
Figure imgf001893_0001
Figure imgf001894_0001
Figure imgf001895_0001
Figure imgf001896_0001
Figure imgf001897_0001
Figure imgf001898_0001
Figure imgf001899_0001
Figure imgf001900_0001
Figure imgf001901_0001
Figure imgf001902_0001
Figure imgf001903_0001
Figure imgf001904_0001
Figure imgf001905_0001
Figure imgf001906_0001
Figure imgf001907_0001
Figure imgf001908_0001
Figure imgf001909_0001
Figure imgf001910_0001
Figure imgf001911_0001
Figure imgf001912_0001
Figure imgf001913_0001
Figure imgf001914_0001
Figure imgf001915_0001
Figure imgf001916_0001
Figure imgf001917_0001
Figure imgf001918_0001
Figure imgf001919_0001
Figure imgf001920_0001
Figure imgf001921_0001
Figure imgf001922_0001
Figure imgf001923_0001
Figure imgf001924_0001
Figure imgf001925_0001
Figure imgf001926_0001
Figure imgf001927_0001
Figure imgf001928_0001
Figure imgf001929_0001
Figure imgf001930_0001
Figure imgf001931_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
15164 45532 15254 2423 MNGKNKT YQEPTVCQALHW
TLARIGKEEPRRGQRGEGLLCS
RERQPRGLSSPGTAAPAAPAAP
TPPALPGHRTRSGSPEGPEPILR
RGGSADAPGRVKTWVNRPAAF
VGKCLGDGSADTACDQAMKG
QRGSNRKRKTSHSLTFPTSSDM
HAGGLSAAHPCAHSRAPPPASP
SPYKGSLGTRTDGLTDSRADSA
ASGARALGVDGQPQPSPPRRAP
PRPVGADGSSRGPQPGRRRVER
RGPATQPGDSRALPEPRGVPAV
HPAGSGSEWERPPPAAPSPEHR
DKMLPGLRRLLQASARRPQPFP
EAARWVWSPCFSSGTLCPSFQS
ERPLLPFQGASFPGTTLHQRLCP
VAALHGSGKGQKWQQSHCYA
AAVMEEPPAPGRSSESLQGPLS
WAGGAPASACLLLMLLALPLA
APSCPMLCTCYSSPPTVSCQAN
NFSSVPLSLPPSTQRLFLQNNLI
RTLGARHLWVQPAHPVALLQQ
PLHHLPGHFPPL\QALEELDLGD
NRHLRSLEPDTFQGLERLQSLH
LYRCQLSSLPGNIFRGLVSLQYL
YLQENSLLHLQDDLFADLANLE
PTSFLHGNRLRLLTEHVFRGLG
SLDRLLLHGNRLQGVHRAAFR
GLSRLTILYLFNNSLASLPGEAL
ADLPSLEFLRLNANPWACDCR
ARPLWAWFQRARVSSSDVTCA
TPPERQGRDLRALREADFQACP
15165 45533 B 15255 47 482
15166 45534 A 15256 636 MRDPNTKRSRGFGFVTCATVE
EVDAAMNARPCKWGRTVEPK
RAVSREDSQRPEDTEELHLRDY
FEQYGKIEVIEIMTDQGSGKKR
GFAFVTFDNHDSMDKTVIQKY
HTVNGHNCEARKALSKQEMAR
ASSSQRGRSGSGNFGGGRGGGF
GGNDNFGRGGNFSGHGGFGGS
HGGGGYGG/SGDGYNGFGNDG
GGGSYNDFVNYNNQSSHFGPM
Figure imgf001933_0001
Figure imgf001934_0001
Figure imgf001935_0001
Figure imgf001936_0001
Figure imgf001937_0001
Figure imgf001938_0001
Figure imgf001939_0001
Figure imgf001940_0001
Figure imgf001941_0001
Figure imgf001942_0001
Figure imgf001943_0001
Figure imgf001944_0001
Figure imgf001945_0001
Figure imgf001946_0001
Figure imgf001947_0001
Figure imgf001948_0001
Figure imgf001949_0001
Figure imgf001950_0001
Figure imgf001951_0001
Figure imgf001952_0001
Figure imgf001953_0001
Figure imgf001954_0001
Figure imgf001955_0001
Figure imgf001956_0001
Figure imgf001957_0001
Figure imgf001958_0001
Figure imgf001959_0001
Figure imgf001960_0001
Figure imgf001961_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
15341 45709 15431 129 444 WGGPGRQPDGGRVPGRCGPRR
PATWESPAACRPRPQSRHHSFE
*GGSRSGSSHKRPCPSSPAVLRA
RDQWLHTSFEATEGDSSHEDR
STPLARRCLCSHRSGEA
15342 45710 15432 2446 METSLMVQVCGGRDCNALLPG
NLCSQPTQVLPLEVDGDDGPG
GLPLNAAPPSSVPPVPPSEALPP
PACPSAPAPWRSIISRLFGTSPA
AEAAHPPPGGALDFGALRTSSC
LSQSHGHIPGGPGLGLGAVRVP
VPSEEDPASLFTEPVPAAEAPAT
VQSVEDFVANDRLDRSFLEDM
TPARDEKKVGAKAAQQDSDTG
RASRAPEQQIPHGGH WRGH SRS
YERFRPPDMVSAGPGPSDGEAL
GGNPIVAGFQDDVDLEDQPRGS
PPPPAGPVPSQDITLSSEEEAEV
ADPPKGPAPAPQQCSEPETKWS
SIPASKPQRGTAPMTAAASPWP
GGASVHTGLEKCSSTRPPAEIEP
GKGEQASSSESDPEGPIAAQML
SFVMDDPDFESEASDTQCRAVR
RVLPAQRAENAYGSAFQPGAL
KPECRPTLKGGPGPVPSKAGKK
TEATSPRPTGGASPLAHGATVG
AAWGLGMASWPIGSSLAPTRA
RTREPRLQRRVGSELPGDPCSG
SGASCRQVCARGRATSVRLPAL
ESLWPLSSDGSAAAREGLEGPR
LRQAAAPGGPGPPLPPLSRPPSP
ARRMRGGAAVAGRGAAAPLE
GRVRLGVGFLAETAVRAGAGA
LAAADAPPGSRTPRARPLVAAG
SGAAGGRCPVPSRRQREAPSPL
GGAGVDLQLECLTSEKAGVGA
ATGGVLEALARAIRQKKEIEEIQ
Figure imgf001963_0001
Figure imgf001964_0001
Figure imgf001965_0001
Figure imgf001966_0001
Figure imgf001967_0001
Figure imgf001968_0001
Figure imgf001969_0001
Figure imgf001970_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
15406 45774 15496 1605 WQLPH PPAAPS ARQI LRACQLS
SVTAVAQSCLYGKQLC/GLTTG
NIAGAGLLFFGGGIGGTILYAK
WASHFRERVEKTIPYSDKLFEV
VLGSAPYNVPLPKKSTQSGPLK
ISSVSEVTKEFKQPASQLQKQK
GDTPASATAGDTVLVPGVQHE
ESLKTDHPAIDEGKPTPALSEEA
SSSSIRERPCEEIAACLAQEEKQ
EQVKTESLAKSFEDALRQTANV
TLQAIAAQNTVVQAVNAHSNIL
KAAMDNSEIEGKKKSAKWHTV
QGALKEHRKAVDEAANALLKA
KEEVQAAQFESKLVSQYHELV
GQAQDDFKRQLDSITPEVLPGW
KGMSVSDPADKLSTDDLNSLIA
RAHRHIDQLNRELAEQKATEK
QHITLALEKQTLEEKVQEQELK
YEFEQNLSEKLSEQELQFHHLS
QEQVHNFTLDINTAYARLRGIE
HAVQSHAVAEEEARKAYQLW
LSVEALKYSMKTSSAEMPAVPL
GSAVEAIKANCSDNEFTKALAT
AIPPESLTHGVYSEETLRVRFYA
VQKLARRVAMID
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
15407 45775 15497 89 2490 PKCCFFCLLSGETAAAQSCLCG
KWLRPLRPCRRYSTSGSSGLT
TGKIAGAGLLFVGGGIGGTILY
AKWDSHFRESVEKTIPYSDKLF
EMVLGPAAYNVPLPKKSIQSGP
L* I SS VSEVM\KESNRPASQ\LQK
QK\GDTPASATAPTEAAQIISAA
GDTLSVPAPAVQPEESLKTDHP
EIGEGKPTPALSEEASSSSIRERP
PEEVAARLAQQEKQEQVKIESL
AKSLEDALRQTASVTLQAIAAQ
NAAVQAVNAHSNILKAAMDNS
EIAGEKKSAQWRTVEGALKER
RKAVDEAADALLKAK*LLMDF
NKEELEKMKSVIENAKKKEVA
GAKPHITAAEGKLHNMIVDLD
NWKKVQAAQSEAKWSQYH
ELWQARDDFKRELDSITPEVL
PGWKGMSVSDLADKLSTDDLN
SLIAHAHRRIDQLNRELAEQKA
TEKQHITLAL\EKQKL\EEKRAF\
DSAVAKAFRNIHRKLKYRA*TG
QKR*KEVRDAME\NEMRTQLR
RQAAAHTDHLRDVLRVQEQEL
KSEFEQNLSEKLSEQELQFRRLS
QEQVDNFTΛLDINTAYARLRG I E
QAVQSHAVAEEEARKAHQLW
LSVEALKYSMKTSSAETTSMPL
GSAV\EAI\KANCSDNEFTQALT
AAIP\PESLTRGWTVKRPLGAR
F\YAVQKL\ARRVAMIDETRNSL
YQYFLSYLQSLLLFPPQQLKPPP
15408 45776 15498 1212 MPKKRQALVEFEDVLGACNAV
NYAADNQIYIAGHPAFVNYSTS
QKISRPGDSDDSRSVNSVLLFTI
LNPIYSITTPTRLNVFKNDQDT
WDYTNPNLSGQGDPGSNPNKR
QRQPPLLGDHPAEYGGPHGGY
HSHYHDEGYGPPPPHYEGRRM
GPPVGEYGPHADSPVIMVYGL
DQSKMNCDRVFNVFCLYGNVE
KVKISLKKQSPGGRPMGEEWL\
DGYAVDRAITHLNNNFMFGQK
LNV/CVGAQAREGSRGTGERK
GGEWGPAEEHSEA E VLTHTEM
GCGSVSKQPAIMPGQSYGLEDG
SCSYKDFSESRNNRFSTPEQAA
KNRIQHPSNVLHFFNAPLEVTE
ENFFEICDELGVKRPSSVKVFSG
KSERSSSGLLEWESKSDALETL
GFLNHYQMKNPSΓNLVT
Figure imgf001973_0001
Figure imgf001974_0001
Figure imgf001975_0001
Figure imgf001976_0001
Figure imgf001977_0001
Figure imgf001978_0001
Figure imgf001979_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
15459 45827 15549 1214 MEEEIAALVIDNGSGMCKAGF
AGDDAPRAVFPSIVGRPRHQGV
MVGMGQKDSYVGDEAQSKRG
ILTLKYPIEHGIVTNWDDMEKI
WHHTFYNELRVAPEEHPVLLTE
APLNPKANREKMTQIMFETFNT
PAMYVAIQAVLSLYASGRTTGI
VMDSGDGVTHTVPIYEG\YALP
HAILRLDLAGRDLTDYLMKILT
ERGYSFTTTAEREIVRDIKEKLC
YVALDFEQE\MATAA\SSSSLEK
SYELPDSPRSSTNWQLSGFRLPL
RHSFQPSFLG\MESCG\IHETTF\
ΝSIMK\CDVDIRKDLY\AΝTVLS
GRTPPCTLGIADRMQKEITALA
PSTMKIKIIAPPERKYSVWIGGF
HPWAWLSHLSSKMWVQQARS
MTESGPLHSSTANGFLGGLLTL
VAFTPFLWTKP
15460 45828 15550 2355 MKQITFAPRNHLLTNTNTWTPD
SQWLVFDMRTSGASFTGETIER
VNIPTGYVEKPNKQCRHSVTRY
GHGGHSDHTPSLTVATVATQIT
PRHSGHGSHCDRNLSLATATA
ATLITPHHLLWPRWPLGSHSVT
WCGHGGHSDHTPSPTTATAVT
RITSRHPLRPQQSLGSHPVTHYS
HGSHSDCIPVTRYAHGGHSDHT
PSLAMATAATQMALCHSLWPR
QPLGYLVNKIRRKRCTDGVSPS
GKAPVFDTGIPWFESRYPSHLL
RVSGSPPGYWGIAKRHQILNVS
NMFGKFKTNLLGYRQAVRHRI
LIPAFRGSNPRTPANLFKTLTLIS
NMRALENDFFNSPPRKTVQFG
GTVTEVLLKYKTAIEKRLEKLA
VDPHRDRAQMLKVENVQQAW
QQWTNKLPPARREDEDVKEIR
WMIEELRVSYFAQQLGKNSRA
ESML WSFHKA A I EAARTNIIQA
QTRVEAAQATERRIAADIDDSE
LKAPRDGRVQYRVAKPGEVLA
AGGRVLNMVDLSDVYMTFFLP
TEQAGTLKLGG E ARLILDA APD
LRIPATISFVASVAQFTPKTVET
SDERLKLMFRVKARIPPELLQQ
HLEYVKTGLPGVAWSFLYSINQ
TICLRLDSIEAKLQALEATCKSL
EEKLDLVTNKQHSPIQVPMVA
GSPLRTTQMCNKVR/CVNP*AT
WPPPVPQPTTQQYQGLDAGA
Figure imgf001981_0001
Figure imgf001982_0001
Figure imgf001983_0001
Figure imgf001984_0001
Figure imgf001985_0001
Figure imgf001986_0001
Figure imgf001987_0001
Figure imgf001988_0001
Figure imgf001989_0001
Figure imgf001990_0001
Figure imgf001991_0001
Figure imgf001992_0001
Figure imgf001993_0001
Figure imgf001994_0001
Figure imgf001995_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
15556 45924 15646 2002 MLESAGNQDIEDVDIVLRQIQV
KNRSTTEVGQNGNLQVEVSYT
QAHIAPRPLASQGTWPHFVSSA
PQPCFQVQNHSKYQNLYYSGFI
RETEPISKGDVGEWLSAGKGES
SAMFASEQEISKDEQGTPVLGS
FYWEVDSPRKESSQAWAPGQE
WI KLERDTTEEKMFEQLKPIEP
VQKTLPWVGEVAATLQEAMK
RDCWREARVKKKPVTFEDVAV
NFTQEEWDCLDASQRVLYQDV
MSETFKNLTSVEGKKKELREQ
HPSLRDEGTSDDKVFLACRGA
GQCPLSAPAGTMDRTRVLQAS
QAGPP/FFFATTVGKCFSRRSYL
YSHQFVHNPKLTNSCSQCGKLF
RSPKSLSYHRRMHLGERPFCCT
LCDKTYCDASGLSRHRRVHLG
YRPHSCSVCGKSFRDQSELKRH
QKIHQNQEPVDGNQECTLRIPG
TQAEFQTPIARSQRSIQGLLDVN
HAPVARSQEPIFRTEGPMAQNQ
ASVLKNQAPVTRTQAPITGTLC
QDARSNSHPVKPSRLNVFCCPH
CSLTFSKKSYLSRHQKAHLTEP
PNYCFHCSKSFSSFSRLVRHQQ
THWKQKSYLCPICDLSFGEKEG
LMDHWRGYKGKDLCQSSHHK
CRVILGQWLGFSHDVPTMAGE
EWKHGGDQSPPRIHTPRRRGLR
EKACKGDKTKEAVSILKHK
15557 45925 15647 1048 MPSLAELFEGSSKAFGAAASQL
NFPVASGEMLKFKYGVLRNPLH
AVAAEPIASRTSRLNLFFQGKPP
FMTQQQMSPLSREGILDALFVL
FEECSQPALMKIKHVSNFVRKY
SDTIAELQELQPSAKDFEVRSLV
GCGHFAEVQWREKATGDIYA
MKVMKKKALLAQEQVSFFEEE
RNILSRSTSPWIPQLQYAFQDK
NHLYLVMEYQPGGDLLSLLNR
YEDQLDENLIQFYLAELILAVH
SVHLMGYVHRDIKPENILVDRT
GHIKLVDFGSAAKMNSNKMVN
AKLPIGTPDYMAPEVLTVMNG
DGKGTYGLDCDWWSVGVIAY
EMIYGRSPFAEGTSARTFNNIM
Figure imgf001997_0001
Figure imgf001998_0001
Figure imgf001999_0001
Figure imgf002000_0001
Figure imgf002001_0001
Figure imgf002002_0001
Figure imgf002003_0001
Figure imgf002004_0001
Figure imgf002005_0001
Figure imgf002006_0001
Figure imgf002007_0001
Figure imgf002008_0001
Figure imgf002009_0001
Figure imgf002010_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
15659 46027 15750 2319 MRIDGAQELTLALPFASGVACR
DVKPGPRRRPLRYRCFQKPSNR
LGSASLGAGFWAGRRDAKSGH
DDDPSGALFSVTMDRDLEQAL
DRAENIIEIAQQRPPRRRYSPRA
GKTLQEKLYDIYVEECGKEPED
PQELRSN VNLLEKL V RRESLPC
LLVNLYPGNQGYSVMLQREDG
SFAETIRLPYEERALLDYLDAEE
LPPALGDVLDKASVNIFHSGCV
IVEVRDYRQSSNMQPPGYQSRH
ILLRPTMQTLAPEVKTMTRDGE
KWSQEDKFPLESQLILATAEPL
CLDPSVAVACTANRLLYNKQK
MNTDPMEQCLQRYSWPSVKPQ
QEQSDCPPPPELRVSTSGQKEE
RKVGQPCELNITKAGSCVDTW
KGRPCDLAVPSEVDVEKLAKG
YQSVTAADPQLPVWPAQEVED
PFRHAWEAGCQAWDTKPNIMQ
SFNDPLLCGKIRPRKKARQKSQ
KSPWQPFPDDHSAWLRPGSETD
AGRAVSQAQESVQSKVKGPGK
MSHSSSGPASVSQLSSWKTPEQ
PDPVWVQSSVSGKGEKHPPPRT
QLPSSSGKISSGNSFPPQQAGSP
LKRPFPAAAPAVAAAAPAPAPA
PAAAPALAAAAVAAAAGGAAP
SHSQKPSVPLIKASRRRPAAGRP
TRFVKIAPAIQVRTGSTGL/IGH
QRGGPSPGSPGISGSGLQSSGGP
LPDARPGSSAGCPAPLCLG\QQP
Figure imgf002012_0001
Figure imgf002013_0001
Figure imgf002014_0001
Figure imgf002015_0001
Figure imgf002016_0001
Figure imgf002017_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
15694 46062 A 15785 610 2986 QTERNSININRNGHQHQNPICRS
PSSKTKGSIKPQRSGRNQSRKA
ENSKDQRTSSPTKDCSSSPATE
QRWMENDFDELTEV/VLQKPG
RDTT\KQKEKFRPISLMNIDVKI
LNKILANRIQQHIKKLIHHDQV
GFISGMQGWFNICKSINVIHHIN
RTNDKNHMIISIDAEKAFDKIQ
QPFMLKTLKKLGIDGTYLKIIRA
IYDKSTASIILNGQKLEAFPLKD
RTRQGCPLSPLLFNIALEVLARA
IRQEKDIKCIQLGKEKVRLSLFA
EDMIVYLENPIVSAPNLFKLISN
FSKVSGYKINVQKSQVFLYINN
RQRESQIMNEFPFTIARRRIKYL
GIQLTRDVKDLFKENYKPLLKE
IKEDTNKWKNMPCSWIGRINIM
KMAILAKVIYRFΝAIPIKLPMTF
FTELEK\TTLKCI*ΝQKRAHIAK
PILSQKNKAGGITLPDFKLHYK
AAVTKTAWYWYQNRDIDQRN
RTEPTEIMLHIYNHLIFDKPDKN
KKWGNDSLFNKWCWENWLAI
CRKLKRDPFLTPYTKINSRWIE
DLNIRPETIKTLEENLGNTIQDIA
MGKDFMSKTPKAMATKAKIDK
WDLIKLKNFCTAKETTIRVNRQ
PTEWEKIFAIYSSDKGIISRIYNE
LKQIYKKK\NNPIKKWAKDINR
HFSKEDIQAASKHVKKHSTSRIF
MQIKTTLGYHLTPVRMATIKKS
KNNRCWCGCGEKGTLIHCW/M
15695 46063 15786 906 MVNQKRYHIPGGTAEISGTIKK
LKNAGWIPTTSPLNSPIWPVQ
KTDGSWRMTMDYHKLNQWT
PIAVAVPYVVALLEQINTFSCT
WYAAIDLANTFFSFSVHKA/HK
K*FAFSRQGQQYTFTVHP\YDN
YSPFERQLLACYWASVETEHLT
MGYQVTIQLELPIMNWVLSDPS
SHKLGHAKQHSIIKWKWYIRD
RARAGPEGTARIHRSRNQGVEV
EVAPLTITPSDPLARFLLPVPTIL
HSAGLEVLVPEGGTLPLGDTTT
IPFNWKLRLPPGHFRLLLPLNQ
QAKKGVTVFAGVIEAIKMK
Figure imgf002019_0001
Figure imgf002020_0001
Figure imgf002021_0001
Figure imgf002022_0001
Figure imgf002023_0001
Figure imgf002024_0001
Figure imgf002025_0001
Figure imgf002026_0001
Figure imgf002027_0001
Figure imgf002028_0001
Figure imgf002029_0001
Figure imgf002030_0001
Figure imgf002031_0001
Figure imgf002032_0001
Figure imgf002033_0001
Figure imgf002034_0001
Figure imgf002035_0001
Figure imgf002036_0001
Figure imgf002037_0001
Figure imgf002038_0001
Figure imgf002039_0001
Figure imgf002040_0001
Figure imgf002041_0001
Figure imgf002042_0001
Figure imgf002043_0001
Figure imgf002044_0001
Figure imgf002045_0001
Figure imgf002046_0001
Figure imgf002047_0001
Figure imgf002048_0001
Figure imgf002049_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
15899 46267 15990 875 MRMEAGEAAPPAGAGGRAAG
GWGKWVRLNVGGTVFLTTRQ
TLCREQKSFLSRLCQGEELQSD
RDETGAYLIDRDPTYFGPILNFL
RHGKLVLDKDMAEEGVLEEAE
FYNIGPLIRIIKDRMEEKDYTVT
Q\VPPKHVYRVL\QCQEEELTQ
MVSTMSDGWRFEQLVNIGSSY
NYGSEDQAEFLCWSKELHSTP
NGLSSESSRKTKSTEEQLEEQQ
QQEEE V EE VE VEQ VQ VE ADAQ
EKGSRPHPLRPEAELAVRASPR
PLARPQSCHPCCYKPEEPGCEA
PDHLQGLGVPI
15900 46268 15991 2448 MRRGETRSRRPPQCWAFAHW
SRSSRRGEFFSFDRWKKHRSEV
RESAQVPTTKREPVLFQHKINIK
IVIALVDILFALGLGNLVCVSPV
QLFSVQTDSRATGDQRLPSWT
AQRYGTRTLGPDGFTERCPYGP
SPESLLCSALPWLRDLGMPVAE
APQVAGGQGDGGDGEEAEPEG
MFKACEDSKRKARGYLRLVPL
FVLLALLVLASAGLEMLSDPGS
RSYGSGLLPAWAKCPKAPHQR
SLIPPDYAPGVNHPPPQERCSM
GLRTQGTLGRHRVLSAQQVAF
SVSLLMAKIAFAKQKSDDIIPDF
KLFMVTLVHRPNRLSARVLTM
VPVLPAPCPMLLPLLEGYKAEV
MVSQVYSGSLRVLNRHFSQDL
TRRESSAFRSETAKAQKMLKEL
ITSTRLGTYYNSSSVYSFGEGPL
TCFFWFILQIPEHRRLMLSPEW
QALLVEELLSTVNSSAAVPYRA
EYEVDPEGLVILGQYCEWKPPG
RRMQTPRPAMRMEAGEAAPPA
GAGGRAAGGWGKWVRLNVG
GTVFLTTRQTLCREQKSFLSRL
CQGEELQSDRDETGAYLIDRDP
TYFGPILNFLRHGKLVLDKDMA
EEGSQVWTVTSGGWGDRNAV
LTGMGRALSLFLWLIQYLKFCL
VRAWVLEEAEFYNIGPLIRIIKD
RMEEKDYTVTQVPPKHVYRVL
QCQEEELTQMVSTMSDGWRFE
Figure imgf002051_0001
Figure imgf002052_0001
Figure imgf002053_0001
Figure imgf002054_0001
Figure imgf002055_0001
Figure imgf002056_0001
Figure imgf002057_0001
Figure imgf002058_0001
Figure imgf002059_0001
Figure imgf002060_0001
Figure imgf002061_0001
Figure imgf002062_0001
Figure imgf002063_0001
Figure imgf002064_0001
Figure imgf002065_0001
Figure imgf002066_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
15984 46352 16075 2111 MNTPLQSAHTHLTLVEFIAYLV
SDSPFPAASTLSLPTCNFDPCPT
KLGAGTMPPAAAGGQRRPHAA
PPAAYPHRNKVRAAEARTSRA
VRREYEAGDEASGRRGRGGRM
KSVQNARADRGMWVRERRAV
VRGQENTWVFAAAASPESDFL
QIVLRDIAREMVPEYKSDLQPR
TQMDAKKPRKCDLTPFLVLKA
RKKQKFTSAKHLWVQRKGKG
VEDHADWQGQIWVGVEGKSA
EGQGLNLTGAFNRSPENKVSPQ
GPALNALEIDSSYLDPQALALP
QGWQGLLFPPLLGTELVGSKLS
VQIQKPPSNIKNSRMTQVFHKN
TSVTSLPFVDTKGKKNTVSFPHI
SKKVLLKSSLLYQVSADGLDPE
KAGERPGFSPKPVLERPRIVGKS
TVAAEEENQAHNQMPASELKA
SEIPFHPSIKTQDPKAEEKSPKK
QKVTLTAAEALKLFKNQLSPYE
QSEILGYλPELWFLGLEAKKLDT
APEKFSKTSFDDEHGFYLKVLH
DHIAYRYEVLETIGKGSFGQVA
KCLDHKNNELVALKIIRNKKRF
HQQALMELKILEALRKKDKDN
TYNWHMKDFFYFRNHFCITFE
LLGINLYELMKNNNFQGFSLSI
VRRFTLSVLKCLQMLSVEKIIH
CDLKPEΝIVLYQKGQASVKVID
FGSSCYEHQKGHLPSRLRSAVT
TTSMILΝKVLRFSEPKLPHLQFE
Figure imgf002068_0001
Figure imgf002069_0001
Figure imgf002070_0001
Figure imgf002071_0001
Figure imgf002072_0001
Figure imgf002073_0001
Figure imgf002074_0001
Figure imgf002075_0001
Figure imgf002076_0001
Figure imgf002077_0001
Figure imgf002078_0001
Figure imgf002079_0001
Figure imgf002080_0001
Figure imgf002081_0001
Figure imgf002082_0001
Figure imgf002083_0001
Figure imgf002084_0001
Figure imgf002085_0001
Figure imgf002086_0001
Figure imgf002087_0001
Figure imgf002088_0001
Figure imgf002089_0001
Figure imgf002090_0001
Figure imgf002091_0001
Figure imgf002092_0001
Figure imgf002093_0001
Figure imgf002094_0001
Figure imgf002095_0001
Figure imgf002096_0001
Figure imgf002097_0001
Figure imgf002098_0001
Figure imgf002099_0001
Figure imgf002100_0001
Figure imgf002101_0001
Figure imgf002102_0001
Figure imgf002103_0001
Figure imgf002104_0001
Figure imgf002105_0001
Figure imgf002106_0001
Figure imgf002107_0001
Figure imgf002108_0001
Figure imgf002109_0001
Figure imgf002110_0001
Figure imgf002111_0001
Figure imgf002112_0001
Figure imgf002113_0001
Figure imgf002114_0001
Figure imgf002115_0001
Figure imgf002116_0001
Figure imgf002117_0001
Figure imgf002118_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
16390 46758 16487 2388 QGRLVAKPPRPARSTAADSSTP
RDMLGLRPPLLALVGLLSLGCV
LSQECTKFKVSSCRECIESGPGC
TWCQKLNFTGPGDPDSIRCDTR
PQLLMRGC A ADDI MDPTSL AET
QEDHNGGQKQLSPQKVTLYLR
PGQAAAFNVTFRRAKGYPIDLY
YLMDLSYSMLDDLRNVKKLGG
DLLRALΝEITESGRIGFGSFVDK
TVLPFVΝTHPDKLRΝPCPΝKEK
ECQPPFAFRHVLKLTΝNSΝQFQ
TEVGKQLISGΝLDAPEGGLDA
MMQ VA ACPEEIG WRΝ VTRLL V
FATDDGFHFAGDGKLGAILTPΝ
DGRCHLEDΝLYKRSΝEFDYPS
VGQLA\HKLAEΝΝIQPIFAVTSR
M VKTYEKLTEI IPKS A VGELSE
DSSNWHLIKNAYNK\LSSRVFL
DHNALP\DTLKVTYDSFCSNGV
THRNQ\PRGDC\DGVQINVPITF
QVKVTATECIQEQSFVIRALGFT
DIVTVQVLPQCECRCRDQSRDR
SLCHGKGFLECGICRCDTGYIG
KNCECQTQGRSSQELEGSCRKD
NNSIICSGLGDCVCGQCLCHTS
DVPGKLIYGQYCECDTINCERY
NGQVCGGPGRGLCFCGKCRCH
PGFEGSACQCERTTEGCLNPRR
VECSGRGRCRCNVCECHSGYQ
LPLCQECPGCPSPCGKYISCAEC
LKFEKGPFGKNCSAACPGLQLS
NNPVKGRTCKERDSEGCWVAY
16391 46759 A 16488 353 1205
16392 46760 16489 11 440 FRALTEGDTQLNWNIVSFPVAE
ELSHHENLVSFLETVNQPHHQN
VSVPSNNVHAPYSSDKEHMCT
WYFDDCMSIHQCKISC\ESMG
SHPΝIRWVSLMACCECIGPECID
YGSKTVKCMNCMF* RRQMQT
KAT**ΝΝRYKKLKK
Figure imgf002120_0001
Figure imgf002121_0001
Figure imgf002122_0001
Figure imgf002123_0001
Figure imgf002124_0001
Figure imgf002125_0001
Figure imgf002126_0001
Figure imgf002127_0001
Figure imgf002128_0001
Figure imgf002129_0001
Figure imgf002131_0001
Figure imgf002132_0001
Figure imgf002133_0001
Figure imgf002134_0001
Figure imgf002135_0001
Figure imgf002136_0001
Figure imgf002137_0001
Figure imgf002138_0001
Figure imgf002139_0001
Figure imgf002140_0001
Figure imgf002141_0001
Figure imgf002142_0001
Figure imgf002143_0001
Figure imgf002144_0001
Figure imgf002145_0001
Figure imgf002146_0001
Figure imgf002147_0001
Figure imgf002148_0001
Figure imgf002149_0001
Figure imgf002150_0001
Figure imgf002151_0001
Figure imgf002152_0001
Figure imgf002153_0001
Figure imgf002154_0001
Figure imgf002155_0001
Figure imgf002156_0001
Figure imgf002157_0001
Figure imgf002158_0001
Figure imgf002159_0001
Figure imgf002160_0001
Figure imgf002161_0001
Figure imgf002162_0001
Figure imgf002163_0001
Figure imgf002164_0001
Figure imgf002165_0001
Figure imgf002166_0001
Figure imgf002167_0001
Figure imgf002168_0001
Figure imgf002169_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
17049 47417 17154 999 3009 ADSGCKL/PSSC*HVLCA/WVTY
MVMAAILHKLDQLDASRGRAI
PFCGQRGRFKYEILVKTGWGR
GSGTTAHVGIMLYGVDSRSGH
RHLDGDRAFHRNSLDIFRIATP
HSLGSVWKIRVWHDNKGLSPA
WFLQHVIVRDLQTARSAFFLVN
DWLSVETEANGGLVEKEVLAA
SDAALLRFRRLLVAELQRGFFD
KHIWLSIWDRPPRSRFTRIQRAT
CCVLLICLFLGANAVWYGAVG
DSAYSTGHVSRLSPLSVDTVAV
GLVSSVWYPVYLAILFLFRMS
RSKVAGSPSPTPAGQQVLDIDS
CLDSSVLDSSFLTFSGLHAEVRT
LLGVLGWAGGPAALAQLGLKT
LCTSQQAFVGQMKSDLFLDDS
KSLVCWPSGEGTLSWPDLLSDP
SIVGSNLRQLARGQAGHGLGPE
EDGFSLASPYSPAKSFSASDEDL
IQQVLAEGVSSPAPTQDTHMET
DLLSSLCAEVHTALPLHCSCLQ
DSVQWHAVQGLPAFCRGDPVT
QE AQETA RPAAQNL VLIPLHA A
PHQAVAEIDALYDVYLDVIDK
WGTDDMLFLGDFNADCSYVR
AQDWAAIRLRSSEVFKWLIPDS
ADTTVGNSDCAYDRIVACGAR
LRRSLKPQSATVHDFQEEFGLD
QTQASGPWGGAGLGSPPPGRT
QAAGRALAISDHFPVEVTLKFH
17050 47418 17155 242 363 TKRQENGRKKQDRTPTEQQLR VK*KSRSTKKRRKQSRSES
17051 47419 B 17156 12879
17052 47420 17157 596 NLLFPVRSERGPHRPTPLSWLW
LRPCELNKLAAGGLAAGSSRA
GALYLLIYIPRRPARAAPLCPAA
ATGAPGASAGSASPASARPPGL
ARAAPGPLRGRRPRSPRTPFCV/
WLGVSLLLSVYQHYFAEPFESFQ
VCCRHCGILCLTIRVHFPCLSAN
PFHFSQLHLGPRPGKACHSVGL
QIYNSQAHSHPSQQERQLFC
17053 47421 17158 235 FFFVLYNLFLFVITSIPLQKVSIL TQRQL VAEEN VKCCTVVPEPQ* * *LSNYPIFVEMGFCHVGPGWSR TPTPKSVSR
Figure imgf002171_0001
Figure imgf002172_0001
Figure imgf002173_0001
Figure imgf002174_0001
Figure imgf002175_0001
Figure imgf002176_0001
Figure imgf002177_0001
Figure imgf002178_0001
Figure imgf002179_0001
Figure imgf002180_0001
Figure imgf002181_0001
Figure imgf002182_0001
Figure imgf002183_0001
Figure imgf002184_0001
Figure imgf002185_0001
Figure imgf002186_0001
Figure imgf002187_0001
Figure imgf002188_0001
Figure imgf002189_0001
Figure imgf002190_0001
Figure imgf002191_0001
Figure imgf002192_0001
Figure imgf002193_0001
Figure imgf002194_0001
Figure imgf002195_0001
Figure imgf002196_0001
Figure imgf002197_0001
Figure imgf002198_0001
Figure imgf002199_0001
Figure imgf002200_0001
Figure imgf002201_0001
Figure imgf002202_0001
Figure imgf002203_0001
Figure imgf002204_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno>vn,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
17421 47789 17527 1578 MAKKEQLLSAAPSKTNAEGQEI
PSGVYTTRALGFKHKTGQPFGQ
TPSYPQEFILYTNDTWNASKTE
PFTPLERGLKPGSQESSGWHLV
GTPLGRSFQRKEQAAIFAVLQP
PLVIPRQTGSGVDLQQTPTDLQ
KRVLTVRRKTNSININKKDDHA
KTTSKGHQQQRPKVDKSTKMR
KNQCKKATNSKNQNVSSPPKD
HNSSTAKEQNWMENEFDKLTE
VGFRSSQFDQVEERISDIEDKLN
EIKHEDKI REKRMKKKEQSLQE
IWEYVKRPNLRLIGVPESDGEN
GTKLENTLQDTIQENFPNLARQ
ANIQIQEIQRTPQRYSSRRATPR
HII/E*IHQG*NEGKTVKGSQRE
K*GQEIPSGVYTTRALGFKHKT
GQPFGQTPSYPQEFILYTNDTW
NASKTEPFTPLERGLKPGSQESS
GWHLVGTPLGRSFQRKEQAAIF
AVLQPPLVIPRQTGSGVDLQQT
PTDLQKRVLTVRRKTNSININK
KDDHAKTTSKGHQQQRPKVDK
STKMRKNQCKKATNSKNQNVS
SPPKDHNSSTAKEQNWMENEF
DKLTEVGFRSSQFDQ VEERI SDI
EDKLNEIKHEDKIREKRMKKKE
QSLQEIWEYVKRPNLRLIGVPE
SDGENGTKLENTLQDTIQENFP
NLARQANIQIQEIQRTPQRYSSR
RATPRHIIGRFTKVEMKEKRLR
AVREKSRVTHKGKLIRLTADLS
17422 47790 17528 738 1323 ATGGHSNQRQRS*KLSKKFRR
MYN*NNQYREVLKGADGAEN
QGSRTTETLCM*RMRQSLQFVD
SDLVRHQRKHSGEKPYECCEC
GKAFRGSSELIRHRRIHTGEKPY
ECGEYKSYECIACGKAFGRSST
LIENQRIHTGEKPYECNECGKSF
NQSSALTQHQRIHTGEKPYECS
ECRKTFRHRSGLMQHQRTHTR
17423 47791 17529 656 MAGKPEYDKTISTSIVLNALNA
LGVSAEASGRNDLWKTVEGD
RKVSGSAYRETKDRGFHHGTL
LLNADLSRLANYLNPDKKKLA
AKGITSVRSRVTNLTELLPGITH
EQVCEAITEAFFAHYGERVEAE
IISPNKTPDLPNFAETFARQSSW
EWNFGQAPAFSHLLDERFTWG
GVELHFDV/GKRPYHPRPGVYR
QPQPRAAGSPRRTTARLPFLSL
Figure imgf002206_0001
Figure imgf002207_0001
Figure imgf002208_0001
Figure imgf002209_0001
Figure imgf002210_0001
Figure imgf002211_0001
Figure imgf002212_0001
Figure imgf002213_0001
Figure imgf002214_0001
Figure imgf002215_0001
Figure imgf002216_0001
Figure imgf002217_0001
Figure imgf002218_0001
Figure imgf002219_0001
Figure imgf002220_0001
Figure imgf002221_0001
Figure imgf002222_0001
Figure imgf002223_0001
Figure imgf002224_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /^possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
17635 48003 17741 2561 MKAEIKTFFETNKNKDTTYQNL
WDTFKA VCRGKFIALNAHKRK
QERSKIDTLTSQLKELEKQEQT
YSKASRRQEITKIRAELKEIETQ
KTLQKINESRSWFFEKINKIDRP
LTRLIKKKREKNQIDSIKNDKG
DITTDPTEIQTTIREYYKHLYAN
KLENLEEMDKFLDTYTLPRLNQ
EEVESLNRPITGSEIEAIINSLPT
KKSPGPDGFTAEFYQRYKEELV
PFLLKLFQSIEKEGILPNSFYEAS
IILITKPGRHTTKKNFRPISLMNI
GAKILNKILANRIQQHIKKLIHH
DQVGFIPGMQG WFNIRKSINVI
QHINITNDKNHMIISIDAENAFD
KIQQRFMLKTLNKLGIDGMYL
KIIRAIYDKPTATQNLLKLIGN F
SKVSGYKINVQKSQAFLYTNNS
QTESQIMTELPFTIASKRIKYLGI
QLTRDVKDLFKENYKPLLSEIK
EDTKKGKNIPCSWVGRINIMK
MDILPKVIYTFNAIPIKLPMTFFT
EVEKTTLKFICNQKRARIAKSFL
SQKNKAGGITPPDFKLYYKATV
TKTACYWYQNRDIDQWNRTEP
SEIMQHIYNYLIFDKPDKNKQW
GKDSLFNKWCWENWLAICRKL
KLDSFLTPYTKINSRWIKDLNIR
PKTIKTLEENLGSIIQDIGMGKD
FMSKTPKAMATKAIIDEWDLIK
LKSFCTAKETTMRVNRQPTEW
EKIFATYSSDIGLISRIYNELKQI
17636 48004 17742 5042 MTGSNSHITILTLNINGLNSAIK
RHRLASWIKSQDPSVCCIQETH
LMCRDTHRLKIKGWRKIYQAN
GKQKKAGVAILVSDKTDFKPT
KIKRDKEGHYIMVKGSIQQEEL
TILNIYAPNTGAPRFIKQVLSDL
QRDLDSHTLIMGDFNTPLSILDR
STRQKVΝKDTQELΝSALHQAD
LIDIYRTLHPKSTEYTFFSAPHH
TYSKIDHIVGSKALLSKCKRTEI
ITΝYLSDHSAIKLELRIKΝLTQS
RSTTWKLΝΝL
17637 48005 17743 101 471 MLSRILASRΝRMRFAVEWRA
VRE/RVGΝDFIIIYRLSMLDLVE
DGGTFAETVELAQAVEAAGATI
INTGIGWHEARIPTIATPVPRGA
FSWVTRKLKGHVSLPLVTTΝRI
ΝDPQVADDILSSTH
17638 48006 17744 161 ill
Figure imgf002226_0001
Figure imgf002227_0001
Figure imgf002228_0001
Figure imgf002229_0001
Figure imgf002230_0001
Figure imgf002231_0001
Figure imgf002232_0001
Figure imgf002233_0001
Figure imgf002234_0001
Figure imgf002235_0001
Figure imgf002236_0001
Figure imgf002237_0001
Figure imgf002238_0001
Figure imgf002239_0001
Figure imgf002240_0001
Figure imgf002241_0001
Figure imgf002242_0001
Figure imgf002243_0001
Figure imgf002244_0001
Figure imgf002245_0001
Figure imgf002246_0001
Figure imgf002247_0001
Figure imgf002248_0001
Figure imgf002249_0001
Figure imgf002250_0001
Figure imgf002251_0001
Figure imgf002252_0001
Figure imgf002253_0001
Figure imgf002254_0001
Figure imgf002255_0001
Figure imgf002256_0001
Figure imgf002257_0001
Figure imgf002258_0001
Figure imgf002259_0001
Figure imgf002260_0001
Figure imgf002261_0001
Figure imgf002262_0001
Figure imgf002263_0001
Figure imgf002264_0001
Figure imgf002265_0001
Figure imgf002266_0001
Figure imgf002267_0001
Figure imgf002268_0001
Figure imgf002269_0001
Figure imgf002270_0001
Figure imgf002271_0001
Figure imgf002272_0001
Figure imgf002273_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
18261 48629 18368 2810 MKQTVYIASPESQQIHVWNLN
HEGALTLTQVVDVPGQVQPMV
VSPDKRYLYVGVRPEFRVLAY
RIAPDDGALTFAAESALPGSPT
HISTDHQGQFVFVGSYNAGNVS
VTRLEDGLPVGWHWDGLDR
LARQLIKNLATLKAGNARWN
SNAMSFLAQKGTPHNIVFVDPP
FRRGLLEETINLLEDNGWLADE
ALIYVESEVENGLPTVPANWSL
HREKVAAFAGAEPYHYLNHTL
GFPRVGLRRELKKAQDSYW
18262 48630 A 18369 2279 2563 MNFGASLNSPTPRSYLSLGGEK
KAPVWEHTLGVPSRRPAAGKL
KKAPERYWGGEHPPVKKWLG
G*ARKCRPRPWETL*KSHLRRR
PSLGPWAGGL
18263 48631 18370 481 1020
18264 48632 18371 998 MNENFGTQICVNNRLSFTDCSG
YVTRYDHDRFGQVTAVHREEG
LSQYRAYDSRGQLIAVKDTQG
HETRYEYNAAGDLTTVIAPDGS
RNGTQYDAWGKAICTTQGGLT
RSMEYDAAGRVIRLTSENGSHT
TFRYDVLDRLIQETGFDGRTQR
YHHDLTGKLIRSEDEGLVTHW
HYDEADRLTHRTVNGETAERW
QYDERGWLTDISHISERHRVTV
HYGYDSKGRLASEHLTVHHPQ
TOELLWQHETRLAYKRTGLAN
RCIPDTCPPVNAYLGTACCQHE
ICDTPWLSYRDPFTGKSWLSGR
YELHKRIYGITAKERKDVDVW
HPDVRFFELYDENNELRGSFYL
DLYARENKRGGAWMDDCVGQ
MRKADGSLQKPVAYLTCNFNR
PVNGKPALFTHDEVITLFHEFG
HGLHHMLTRIETAGVSGISGVP
WDAVELPSQFMENWCWEPEA
LAFISGHYETGEPLPKELLDKM
LAAKNYQAALFILRQLEFGLFD
FRLHAEFRPDQGAKILETLAEIK
KLVAWPSPSWGRFPHAFSHIF
AGGYAAGYYSYLWADVLAAD
AFSRFEEEGIFNRETGQSFLDNI
LSRGGSEEPMDLFKRFRGREPQ
L*HTWELPRPFHREILAFRPL*I
TQAYLRHHR*RA*RC*CLASGC
Figure imgf002275_0001
Figure imgf002276_0001
Figure imgf002277_0001
Figure imgf002278_0001
Figure imgf002279_0001
Figure imgf002280_0001
Figure imgf002281_0001
Figure imgf002282_0001
Figure imgf002283_0001
Figure imgf002284_0001
Figure imgf002285_0001
Figure imgf002288_0001
Figure imgf002289_0001
Figure imgf002290_0001
Figure imgf002291_0001
Figure imgf002292_0001
Figure imgf002293_0001
Figure imgf002294_0001
Figure imgf002295_0001
Figure imgf002296_0001
Figure imgf002297_0001
Figure imgf002298_0001
Figure imgf002299_0001
Figure imgf002300_0001
Figure imgf002301_0001
Figure imgf002302_0001
Figure imgf002303_0001
Figure imgf002304_0001
Figure imgf002305_0001
Figure imgf002306_0001
Figure imgf002307_0001
Figure imgf002308_0001
Figure imgf002309_0001
Figure imgf002310_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
18609 48977 18718 2107 IDMIFTPGPPSTPKHKKSQKGSA
FTFPSQQSPRNEPYVARPSTSEI
EDQSMMGKFVKVERQVQDMG
KKLDFLVDMHMQHMERLQVQ
VTEYYPTKGTSSPAEAEKKEDN
RYSDLKTIICNYSETGPPEPPYSF
HQVTIDKVSPYGFFAHDPVNLP
RGGPSSGKVQATPPSSATTYVE
RPTVLPILTLLDSRVSCHSQADL
QGPYSDRISPRQRRSITRDSDTP
LSLMSVNHEELERSPSGFSISQD
RDDYVFGPNGGSSWMREKRYL
AEGETDTDTDPFTPSGSMPLSST
GDGISDSTVPLFLSSEILQKQVG
QSITSMLGFLSRGPSMKLCMGL
ACVLSLWNTVSGIKGEAKKEK
GMTFLPTTDSKKFFSLLSVTSYS
SFAFHKFSVAVYNISNLKTVDP
AKFPTRYCYCLNNRTNDLSDFT
ALLVDIIGNSTSYLTEIFKSTSIL
SVNQSNESDCIFICVMTGKSGR
NLSDFWEIEEKYPIINYTFTSGL
SGVLALLLTQSLFGGLFTRTRM
KFGAVTRIGGPPLGNQSPSSCSL
LHEKDPPTTSGPQTDQPKKHLT
NFKSAARPTFLGQGQVPLNPFS
FTLS/EQVLLS*AARTPQSLISTP
QPLISVPQSLISVPQPLLYFSGG
QEPPPPPLLCVSSLFSRLASFTM
GAFTHGTQTPSPTKATAPRYPQ
TGDLSAEWPFTAGEEPVLVPRP
18610 48978 18719 828 1063 RHSQAAED*HCPIALE/VPQTIID AELRVTLTVEGYSLPPGCLRGL MLSVSGFSRSTVQAVSGSTVLG SGGQWFSCASV
Figure imgf002312_0001
Figure imgf002313_0001
Figure imgf002314_0001
Figure imgf002315_0001
Figure imgf002316_0001
Figure imgf002317_0001
Figure imgf002318_0001
Figure imgf002319_0001
Figure imgf002320_0001
Figure imgf002321_0001
Figure imgf002322_0001
Figure imgf002323_0001
Figure imgf002324_0001
Figure imgf002325_0001
Figure imgf002326_0001
Figure imgf002327_0001
Figure imgf002328_0001
Figure imgf002329_0001
Figure imgf002330_0001
Figure imgf002331_0001
Figure imgf002332_0001
Figure imgf002333_0001
Figure imgf002334_0001
Figure imgf002335_0001
Figure imgf002336_0001
Figure imgf002337_0001
Figure imgf002338_0001
Figure imgf002339_0001
Figure imgf002340_0001
Figure imgf002341_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno>vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
18795 49163 18904 2074 ERSRPVALRAAVPRMNPSTPSY
PTASLYVGDLHPDVTEAMLYE
KFSPAGPILSIRICRDLITSGSSN
YAYVNFQHTKDAEHALDTMNF
DVIKGKPVRIMWSQRDPSLRKS
GVGNIFVKNLDKSINNKALYDT
VSAFGNTLSCKWCDENGSKG
YGFVHFETHVEAVAERAIEKMNG
MLVLNDRKVFVGRFKSRKEREA
ELGARAKEFTVNVYIKNFGEDM
DVDGRLKVDLFGKFVGPALKCEK
LMTDESGKSKGFGFVSVFERHE
DAQKAVDEMNGKELNGKQIY
VGRAQKKVVERQTELKRKFEQ
MKQDRITRYQGVNLVYVKNLD
DGVIDDEVRLRKEFSPFGTIQLVP
KVMDGGLVAAKGLGFVCFSSP
EVEATKAVTEMNFVRIVATEPLVY
VALAVQRKEERQAHLTNQYYG
RGMVASVTEAVPNPVUQPLPGQ
APPFRFTFMGRFSHRFRNRAVAY
YPVPSQIAQTKTKFLRWTGQGA
RTVHPFVQNMPGCYPAQLAPRPP
I*GTMVRPASSQVVPRVVMPTQRV
VANTSNTDKWVPRSCSLQAAA
AYVLVRPAPFPQYKYAAGVRNS
VQQHLVNAQPQVTMQQPCCSMV
PRVQGTLWTCLPMVGHLPPPSR
SKSQMLG WNGLFPS YSKPMHP
YSLLVKSLGVMLLEIDNSELVLH
MLVESPVESLPF* RVDEA V AGTTS
PPQAKRGLAQKAG*QVPTGVP
18796 49164 18905 988 MAEDGEEAEFHFAALYISGQW
PRLRADTDLQRLGSSAMAPSRK
FFVGGNVWKMNGRVKRRLGELI
GTLNAAKVPADTEWCAPPTA
YIDFVARQKVLDPKIAVAAQNCY
KIVTNVG A FTGEISPGVMIKDCGQ
PWWLGHSERRHVFVGESDELIV
GQKVAVHALAEGLVGVIRLHLG
RS*DERGSLGITVEKWFEVQTKV
IRR*T*KDWKQ/VWLGLMSPV
WGLWLLARLATPQQAVQEVHR
RSSRGWLVKSNVSVDAVASEPPV
TIYGVGSVVTGATLQRSLAKPSP
DVDGLPLWGGASLQARNSWDI
IQCHNNEPPSHLSLPLRCQAQG
LKQTQKPK
Figure imgf002343_0001
Figure imgf002344_0001
Figure imgf002345_0001
Figure imgf002346_0001
Figure imgf002347_0001
Figure imgf002348_0001
Figure imgf002349_0001
Figure imgf002350_0001
Figure imgf002351_0001
Figure imgf002352_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
18853 49221 18962 143 2159 LTHSSTWLGRSCNHDRRQMGS
KVMSYWAGRGTCAQETPIYT
TIRYLEDKEQYGGSLGVDSVLA
RLFSGASVLFISGPPFSSYLSGVP
PGHPMEDSMVDMVDMSPLVRPQ
NYLFGCELKVADKDYHFKVDVN
DENEVHQLSLRVTVNLGGLVAK
G*VAHLLKAEAIEFTEASPNLK
VTTWATVLKMVSVPAQRVFPLG
GLLKITPTRWVLKGLKVWVQG
PVHISGTAL* * L WEGRCQSPEDE
EVEEDVNRVLFMKPKGRGLKH
MFGDLVCSWKLAAIETQSSSSE
EIVPSPPSPPPLPRIYKPCFVCQD
KSSGYHYGVSACEGCKGFFRRS
IQKNMVYTCHRDKNCIINKVTR
NRCQYCRLQKCFEVGMSKESV
RNDRNKKKKEVPKPECSESYTL
TPEVGELIEKVRKAHQETFPAL
CQLGKYTTNNSSEQRVSLDIDL
WDKFSELSTKCIIKTVEFAKQLP
GFTTLTIADQITLLKAACLDILIL
RICTRYTPEQDTMTFSDGLTLN
RTQMHNAGFGPLTDLVFAFAN
QLLPLEMDDAETGLLSAICLICG
DRQDLEQPDRVDMLQEPLLEA
LKVYVRKRRPSRPHMFPKMLM
KITDLRSISAKGAERVITLKMEI
PGSMPPLIQEMLENSEGLDTLS
GQPGGGGRDGGGLAPPPGSCSP
SLSPSSNRSSPATHSP
18854 49222 18963 748 1075 ILPTSLFFLFCFVFFVCF*DRVLL
LSPGVWSAVARSWLYCNLSLRG
FKGFSCLSLLSNWDYRCTPLRS
ANFVFL/CRDRVSPCWPTSVSNS
*PQVVIHPPWPPKVLGITRV
18855 49223 18964 674 GRRRLLRDAEGPEETVRLWPA
ARAAMDAAEVEFLAEKELVTII
PNFSLDKIYLIGGDLGPFNPGVLP
VEVPLWL/ARLNLKQRQKVCRL
LPPEWMDVEKLEKMRDHEVRK
EETFVTPMPKPFTTWELTVKLLVL
NHAVSDNIPKGRTEFRDPGSRVF
WGHSY*AKFRVFGVDSFVVRQQ
EAHAKLD*LGPWMGDQQPAVT
FLTQAVLNPHVTNSRTNPPSPLE
STSVLRTS
Figure imgf002354_0001
Figure imgf002355_0001
Figure imgf002356_0001
Figure imgf002357_0001
Figure imgf002358_0001
Figure imgf002359_0001
Figure imgf002360_0001
Figure imgf002361_0001
Figure imgf002362_0001
Figure imgf002363_0001
Figure imgf002364_0001
Figure imgf002365_0001
Figure imgf002367_0001
Figure imgf002368_0001
Figure imgf002369_0001
Figure imgf002370_0001
Figure imgf002371_0001
Figure imgf002372_0001
Figure imgf002373_0001
Figure imgf002374_0001
Figure imgf002375_0001
Figure imgf002376_0001
Figure imgf002377_0001
Figure imgf002378_0001
Figure imgf002379_0001
Figure imgf002380_0001
Figure imgf002381_0001
Figure imgf002382_0001
Figure imgf002383_0001
Figure imgf002384_0001
Figure imgf002385_0001
Figure imgf002386_0001
Figure imgf002387_0001
Figure imgf002388_0001
Figure imgf002389_0001
Figure imgf002390_0001
Figure imgf002391_0001
Figure imgf002392_0001
Figure imgf002393_0001
Figure imgf002394_0001
Figure imgf002395_0001
Figure imgf002396_0001
Figure imgf002397_0001
Figure imgf002398_0001
Figure imgf002399_0001
Figure imgf002400_0001
Figure imgf002401_0001
Figure imgf002403_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19199 49567 19308 2597 MAMFMLLAAALLSTFLTNDVA
LYTWPLTITLKRLCEIPYNRLII
FEGAGSQRWFATDANWQPAK
YSYLGTFWSFVCRIYCPNGTAG
WRNDADAPAPVLAMQYHTGV
QTPEWKPRLVWSCLGLYIVFLT
ALEFKQELWGLVIVAAGFALL
ARRWLSVDWTLLLVFMAMFI
DVHLLTQLPALQGVLGNVISNV
PSTILLLNYVPPSLLLVWAVNV
GGFGFLTCRHARIAAPADLKDG
LAVHEKRETRSLTEKTGKMLLP
KLGVGSRECGGNVKEVGIRAIE
EFG PYKINGDKEI MRRMDDLLQ
GFDAQHRMKLPAQPTSLATKS
ARRVRYPLFCPSIFNGFFEEFIFH
RFLAEQALEFFNLLHGGSKFRG
RNNLFSGGDSSRKNQRSTRTHC
HHPSPKIYSASAKEPWVLATNL
PVEIRTPKQLVNIYSKRMQIEET
FRDLKSPAYGLGLRHSRTSSSE
RFDIMLLIALMLQLTCWLAGV
HAQKQGWDKHFQANTVRNRN
CKMATLKEKLIAPVAEEEATVP
NNKITVVGVGQVGMACAISILG
KSRAMEHTTAGAPYTPRQQTC
QCNMMSLADELALVDVLEDKL
KGEMMDLQHGSLFLQTPKIVA
DKDYSVTANSKIVWTAGVRQ
QEGESRLNLVQRNVNVFKFIIP
QIVKYSPDCIIVIWSNPRDIVLTY
GTWKLSGLPKHRVIGSGCNLDS
19200 49568 19309 435 777 RWLATHYLTVKLSLLPNLAVT CQPKREQNITSNESTDC*VIVTS TKSDSLYTVGMLALSVRAIRCP CRLVDWADIPEQKQLMVLRAS VALHG RSVTL YEKAFPAFRAM FKRSS
Figure imgf002405_0001
Figure imgf002406_0001
Figure imgf002407_0001
Figure imgf002408_0001
Figure imgf002409_0001
Figure imgf002410_0001
Figure imgf002411_0001
Figure imgf002412_0001
Figure imgf002413_0001
Figure imgf002414_0001
Figure imgf002415_0001
Figure imgf002416_0001
Figure imgf002417_0001
Figure imgf002418_0001
Figure imgf002419_0001
Figure imgf002420_0001
Figure imgf002421_0001
Figure imgf002422_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19304 49672 19417 2481 MRSGDLKFGKSAAWPSGQKEK
LVFRGKIEEGLEYLHKKKPSAN
SQDNGKEASKAFHGPLQQPLLS
QALGPDREEWFPGQAPWPHCC
VHPQDTAGCIPWPAPAMAQR
CTGTAWVTASEVAPSLDCFHIV
LSQQVYRIHRPTSSLHPQYGKA
TRHTTPAVSMRAAAGAEPCKA
TGADWPKTLGAQPSHPCTMDV
GQGFKKDDSGAALAPMAALM
GWAGIECLWLFHTEGVNCWP
WDLGEKNGLLGQPRGPAAVCS
LRTLLPASQQPQLLLRPWLKDA
QVQLGSLLQRVQAIGLGGFYIV
LSQFIDPSKASYPPWVKNSRHS
TAAQPMRAAAGAEHCKATGA
ELPKALGAQPSCPCVLDVGQGF
KKNDFGAVGLNNWPAGFWNV
MGTCKSRLCFVLLSGKKSSFRV
EVLTQCLDSHTLETYQFYNREV
AFEDVLESSQTGLWQKPQTDR
LTKRGRGEATRNLRGLVPSMP
RSLLPSTAQPTAWKCQRGPSPY
THQDMALIPSPTARCLSPAKEP
KQEEVGEKSLLPDPTLPLTDPR
LTGSTEQAHAEGLAALMSALR
VSHLQGRGGWTLVDSQLGVI
AVSSTQFNKGPSYRLLADVQN
RLLPKYDSQKEAELRSWIKGFT
GLSI RPDFQKGLKDGIILCTLVN
KLQPGSVPKINVASV*NWH*LEN
LSNFLKAMVSYGM/NPVDLFEA
Figure imgf002424_0001
Figure imgf002425_0001
Figure imgf002426_0001
Figure imgf002427_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possiblc nucleotide insertion) sequence
19331 49699 19444 2274 MLKPLMGGGAHRRAAAGARK
WDGTAIIASTSGLEVLWVTTSR
SKTVESVGLAKPGQHNDARMN
LAIALTAARYGAATANYMEW
SLLKKTDPQTGKVRVSGARCK
DVLTGQEFDVRAKCVINATGPF
TDSVRKMDDKDAAAICQPSAG
VHIVMPGYYSPESMGLLDPATS
DGRVIFFLPWQKMTIAGTTDTP
TDVTHHPIPSEEDINFILNEVRN
YLSCDVEVRRGDVLAAWSGIR
PLVTDPKSADTQSISRNHWDIS
ESGLITIAGGKWTTYRSMAEDT
INAAVKTHNLKAGPSRTVGLFL
QGGKDWSPTLYIRLVQDYGLE
SEVAQHLAATYGDKAFEVAKM
ASVTGKRWPIVGVRLVSEFPYI
EAEVKYGIKEYAC/DCCGYDFT
SYSPGLS/EMSRQQRKPYPGLL
N*/CGRELNWDDYKKQEQLET
ARKFLYYEMGYKSRSEQLTDR
SEISLLPSDIDRYKRRCHKFDAD
QRGFMTWEVQRVLESINVQM
DENTLHEILNEVDLNKNGQVEL
NEFLQLMSAIQKG RVSGSRL Al
LMKTAEENLDRRVPIPVDRSCG
GLYFCVRNWWVLGLTDFKNE
AADPSGVKLQTFTVSVTARKGS
VDPKNSGAQLASPSGSRTGAA
GGAACQSRALRLHSSALGWSM
GLGAVEHGAVLIGEARAAQEP
MEGVGGSGMAGCRSQVLPRG
19332 49700 19445 439 1542
19333 49701 19446 418 CLSLPPTPWAPVRPEPPRR/CAT
PCSTAPSPIDHPRAEECERTVQD
WRAAPPAAPVRDPLETGKIDQE
IHKYNTPGFTGCLSRVQFNQIAP
LKAALRQTNASAHVHIQGELV
ESΝCGASPLTLSPMSSATDPWH
LDHLDS
19334 49702 19447 1 1 1 215 GTCWELE*RKGKPIFWGEIKAS CRΝLHKLQGAKC
19335 49703 19448 740 959 DLYPHHLSWQSRΝAARKRKS* RGQRPIPAIS*GRVSQKQQQΝVΝ TSTPSHCPRLPYECRPLLHGAQS HRPPKG
19336 49704 19449 260 508
Figure imgf002429_0001
Figure imgf002430_0001
Figure imgf002431_0001
Figure imgf002432_0001
Figure imgf002433_0001
Figure imgf002434_0001
Figure imgf002435_0001
Figure imgf002436_0001
Figure imgf002437_0001
Figure imgf002438_0001
Figure imgf002439_0001
Figure imgf002440_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19404 49772 19517 2293 MARSEAAAAGPGPGPPGLAAR
ERRGGRGCVRGCERASVRVWS
VALRRARRAPARPYVARCRRV
RRPARVAIGCGGVSSPAGVSGC
ARVSPERPPLRLCVWALSRAHG
GAVCRAGPGPDSLKRGLTSHPP
AFPLHHGSRTLLLGPLSPPPAAL
PFLLLLWAGHLVAS/ACPGRCIC
QNV/SPTLTMLCAKTGLLFVPP/
SI DRRWELRLTDNFVRRHFAN
MTSLV/QLTLSRNTIGQVAAGA
F/VDLRALRALHLDSNRLAGC/
GGDQLRGLGNLRHLI/VGNNQI
RRVESAAFD/DLLSTVEDLDLS
YNNLEA/LPWEAVGQMVNLNT
LT/LDHNLIDHIAEGTFVQLQSW
AVRAQAAHPLTVSFGGNPLHCN
CE/LLWLRRLTREDDLETCASP
N/NLTDRYFWSIPEEESCTRCQR
FCG*ASLVAAELTSNSVLIRW/S
TQRPVPGIRMYQVQYN/SSVDD
SLVYRKYSLSTYMNPTRVMIPST
SQTFLVNDLAAGRAYDLCVLVA
VYDDGATALPATRWGCVQFT
TAGDPAPCRPLRAHFLGGTMII
AIGGVIVASVLVFIVLLMIRYKV
YGDGDSRRVKGSRSLPRVSHV
CSQTOGAGTGAA/AGPGPAGPG
PLRGAARGGVP/WLPPPSPSRPR
PWRPR/LASAEPEVVLGRSLGG
SATSLCLLPSEETSGEESRAAVG
PRRSRSGALEPPTSAPPTLALVP
19405 49773 B 19518 97 255
Figure imgf002442_0001
Figure imgf002443_0001
Figure imgf002444_0001
Figure imgf002445_0001
Figure imgf002446_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19449 49817 19562 2518 AAGEWPQRPRAAVAAAGAAA
RQRLPGSRQVAAGAPQPAGGG
DPSQRVPPGAEILIKVQVYVSGE
LVPLARASVDVFGNRTLLAAG
TTDSEGVATLPLSYRLGTWVLV
TAARPGFLT SVPWRVDKLPL
YASVSLYLLPERPATLILYEDLV
HILLGSPGARSQPLVQFQRRAA
RLPVSSTYSQLWASLTPASTQQ
EMRAFPCLPGH*GVSSSGNGSW
LELMPLTAVSVHLLTGNGTEVP
LSGPIHLSLPVPSETRALTVGTSI
PAWRFDPKSGLWVRNGTGVIR
KEGRQLYWTFVSPQLGYWVA
AMASPTAGLVTITSGIQDIGTYH
TIFLLTILAALALLVLILLCLLIY
YCRRRCLKPRQQHRKLQLSGPS
DGNKRDQATSMSQLHLICGGP
LEPAPSGDPEAPPPGPLH SAFSS
SRDLASSRDDFFRTKPRSASRPV
AAEPSGARGGESAGLKGARSA
EGPGGLEPGLEEH RRGPSG AAA
FLHEPPSPPPPFDHYLGHKGAA
EGKTPDFLLSQSVDQLARPPSL
GQAGQLIFCGSIDHLKDNVYRN
VMPTLVIPAHYVRLGGEAGAA
GVGDEPAPPEGTAPGPARAFPQ
PDPQRPQMPGHSGPGGEGGGG
GGEGWGAGRAAPVSGSVTIPV
LFNESTMAQLNGELQALTEKK
LLELGVKPHPRAWFVSLDGRS
NSQVRHSYIDLQAGGGARSTD
19450 49818 19563 206 1 126 LESGSHIYAFSHESLLLHVASAS
V*TSFSDFATSVCHSPLHCQLH
HALIPHGKGECSSVVSGIVATGF
GATGFLGRCWNHLGRVGSQIP
QAIAQVSKEAEVGKFIHVAHLN
VN IKSSSRYLRKKA VGEKA VR
DTFPEAIIVKLLDIFGREDRFLN
YFANMCWFGAIPLVSLGWKAV
KQPVYWDVSKGIVNAVKDPD
ANGKTFAFVGPNRYLLFDLVK
YIFAVAHSKGWVARDFEISPFE
SWTTRDKVERIHITDMTLSHLP
SLEDLGIQATPLELKAIEVLRLH
HTYCWLSAEIEDVKLAKTVNI
Figure imgf002448_0001
Figure imgf002449_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19463 49831 A 19578 144 722 KKGLNHHHVEQNNTRPRPRSG
PQRATSNVVFAMFDVQSQIQEFK
EAFNMIDPDQDGFHRPGKICND
MLASLGVKNPT*WSILDAMMNE
VAPGPPSNFTPWFPHHGFGEEV*
MATRS*KMVIRKCPLTCFE*KK
PLGTIVGEDYLVRELLTTHGGIGF
TDEEVDELYREAPI**KRGISITS
EFTRI LKVHG AKDKDD
19464 49832 19579 429 VACLREHKQRCERATRSLLREL
LQVRARVQLQGSELRQLQQEA
RPAAQAPEKEAPEFSGLQNQM
QALDKRLVEVREALTRLRRRQ
VQQEAERRGAEQE/DRPQTGQA
DRLAAAGGAGPGGGLRRPAEE
PRGQQPEGGPGGGQN
19465 49833 19580 2454 RFPAGVAPRRAMANVSKKVSW
SGRDRDDEEAAPLLRRTARPGG
GTPLLNGAGPGAARQSPRSALF
RVGHVMSSVKLDDELLEPVDMD
PPHPFPKEIPHNEKLLSLKYESL
DYDNSENQLFLEEERRTNHTAF
RTVEIKRWVICALIGILTGLVAC
FIDIWENLAGLKYRVIKGNIDK
FTEKGGLSFSLLLWATLNAAFV
LVGSVIVAFIEPVAAGSGIPQIK
CFLNGVKIPHWRLKTLVIKVS
GVILSWGGLAVGKEGPMIHSG
SVIAAGISQGRSSSLKRDFKIFE
YLRRDTEKRDFVSAGAAAGVS
AAFGAPVGGVLFSLEEGASFW
NQFLTWRIFFASMISTFTLNFVL
SIYHGNMWDLSSPGLINFGRFD
SEKMAYTIHEIPVFIAMGWGG
VLGAVFNALNYWLTMFRIRYI
HRPCLQVIEAVLVAAVTATVAF
VLIYSSRDCQPLQGGSMSYPLQ
LFCADG E YNSM A AAFFNTPEKS
WSLFHDPPGSYNPLTLGLFTL
VYFFLACWTYGLTVSAGVFIPS
LLIGAAWGRLFGISLSYLTGAAI
WADPGKYALMGAAAQLGGIV
RMTLSLTV I MMEATSN VTYGFP
IMLVLMTAKIVGDVFIEGLYDM
HIQLQSVPFLHWEAPVTSHSLT
AREVMSTPVTCLRRREKVGVIV
DVLSDTASNHNGFPWEHADD
TQPARLQGLILRSQLIVLLKHK
19466 49834 B 19581 508 4413
Figure imgf002451_0001
Figure imgf002452_0001
Figure imgf002453_0001
Figure imgf002454_0001
Figure imgf002455_0001
Figure imgf002456_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, V=possible nucleotide insertion) sequence
19509 49877 19624 4994 7008 QWQQKPKLTNDNLKFIWNQKR
DRIAKTILSQKNKAGGITLPDFK
LYYKATVTKTAWYWYQNRHI
DQWNRTEPSETI PHIYNHLNFD
KPDKNKKWGKDSLFNKWCWE
NWLAICRKLKLDPFLTPYTKIN
SRWIKHLNVRPKTIKILEENLGN
TIQDIGMGKDFMTKTPKAIATK
PVN*QMGSNILNGQKLEAFPLKT
GTRQGCPLSPLLFNIVLEVLAR
AIRQEKEIKGIQLGKEEVKLSLF
ADDMIVYLENPIVSAQNLLKLIS
NFSKVSGYKINVQKSQAFLYTN
NRQTESQI MSELPFTI ASKRIKY
LGIQLTRDVKDLFKENYKPLLK
EIKEDTNKWKNMPCSWVGRINI
VKMAILPKVIYRFNAIPIKLPMA
FFTELEKTTLKFIWΝQKRARIA
KSILSQKΝKAGGITLPΝFELYY
KATVTKTAWYWYQΝRYIDQW
ΝRTEPSEITPHIYΝYLIFDKPGK
NKQWGKDSLFNKWCWENWLA
ICRKLQLDPFLTPYTKINSRWIK
DLNVRPKTIKTLEEDLGITIQDI
GMGKDFMSKTPKAMATKDKID
KWDLIKLKSFCTAKETTIRVNR
QPTKWEKIFATYSFDKGLISRIY
NELQQIYKKKTNNPIKKWAKD
MNRHFSKEDIYAAKKHMKKYS
SSLAIREMQIKTTMRYHLTPVR
MATIKNSGNNRQ
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19510 49878 19625 764 GSSEKRKFKTGNQKLSMEVEK
NQQMRSLSLYQQRVEGSKCAV
RRKGGSPKPKAFLAAAEKFLN
VPVMRSASHEQLVLTGDPNKT
LPGSKSRLSGTDSPRACSTFPIW
ATKIVTEMQVQLFAACRVQLIK
HPRPSAARTAMEVLESGEQGV
LQWDRKLSELSEPGDGEALMY
HTACQGHEEQVWEEEALQLGL
RSGILLNHWKRQLVGGAGILRS
AQLHSPYHHWQMISLVSPSPAV
MAVMENDDICMVLCVKVMDE
PNHILDAISEHQIPSLEDQWFSE
CALWTSKAYENLVERHILGPHC
RPTELETLGISEDDIDHNPYLDF
PGVAGLGSRRNVKERGWLSEM
KKEGDIFMKIQG*AICPTSGPAE
GLPPPTPVPHVPDRPCGT*GP/PP
SPGSDSSVPACGPTAARPAPRSP
APPWRCGPRWAEDVLLTGLCK
QQIAEPAFRLQFWWPI
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19511 49879 A 19626 1430 MNGTAEGQHPWVHSTFNGEPQ
SLSSSVRVKNQGPRDQAELGFQ
GNGSTVPAAQARTIARARGSGS
GGGDALQGGGAGAGPGATKR
TARLLARGLRRAALRPDGRGV
LGAGAGSSQRRGGGRRKMETH
ISCLFPELLAMIFGYLDVRDKG
RAAQVCTAWRDAAYHKSVWR
GVEAKLHLRRANPSLFPSLQAR
GIRRVQILSLRRSLSYVIQGMAN
IESLNLSGCYNLTDNGLGHAFV
QEIGSLRALNLSLCKQITDSSLG
RIAQYLKGLEVLELGGCSN1TN
TGLLLIAWGLQRLKSLNLRSCR
HLSDVGIGHLAGMTRSAAEGLP
GPGAAHATGLPEAHRSFSKAHL
PRADGPEAPQPQLLWGNLGRW
PPAPVAHGQPAHAQPCAPCDNI
SDTGIMHLAMGSLRLSGLDVSF
CDKVGDQSLAYIAQGLDGLKS
LSLCS/RRGLRRAALRPDGRGV
LGAGAGSSQRRGGGRRKMETH
ISCLFPELLAMIFGYLDVRDKG
RAAQVCTAWRDAAYHKSVWR
GVEAKLHLRRANPSLFPSLQAR
GIRRVQILSLRRSLSYVIQGMAN
IESLNLSGCYNLTDNGLGHAF/R
AGDRLPARSQPEPLQADH*QQP
GPHSPVPQGP/ARCWSWEVAAT
SPTLAFCSSPGVCSASRALTSAA
AATFRMWVDRAPGRHDAQRGG
GVLPGPGAAHATGLPEAHRSFS
19512 49880 19627 293 1520 TTMDGSHSAALKLQQLPPTSSS
SAVSEASFSYKENLIGALLAIFG
HLWSIALNLQKYCHIRLAGSK
DPRAYFKTKTWWLGLFLMLLG
ELGVFASYAFAPLSLIVPLSAVS
VIASAIIGIIFIKEKWKPKDFLRR
YVLSFVGCGLAVRGVTYLLVTF
APNSHEKMTGENVTRHLVSWP
FLLYMLVEIILFCLLLYFYKEKN
ANNIWILLLVALLGSMTWTV
KAVAGMLVLSIQGΝLQLDYPIF
YVMFVCMVATAVYQAAFLSQ
ASQMYDSSLIASVGYILSTTIAIT
AGAIFYLDFIGEDVLHICMFAL
GCLIAFLGVFLITRΝRKKPIPFEP
YISMDAMPGMQΝMHDKGMTV
QPELKASFSYGALEΝNDΝISEIY
APATLPVMQEEHG SRS ASG VPY
RVLEHTKKE
Figure imgf002460_0001
Figure imgf002461_0001
Figure imgf002462_0001
Figure imgf002463_0001
Figure imgf002464_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19544 49912 19659 2707 MPGPHPASASCPPRLFTPRRRAF
CPLRICAPLAAAAFCPPRFLPHT
RRRAFLLLCRRGFLPPPCLGGG
L A AAFFPH PPLPFLRPHRRGFLP
HRRRGFLLPPRLGFLPAAAFYPP
PPRVFARRGFVPPSPPRLFARRA
FCHTPPAAAFSPSAAAAFCPTA
DTAFCTLAALAFCRRGFLPPRC
RGFLPLRHRRSFLPHRRRGFLPT
AAFCHRG FLLATAF YPPRLFAP
RCHGFLPPCRSDFFAAAAFCPR
RRGFLPLSPPLFAPAAVAFCPRH
RGFLLPRLFAPVTAAFCHRDFL
PPPPRIFVPAATALRAGAADSA
ASSTGVLARAALRGPCGLHRV
ALPVEAMGEKKEGGGGGDAA
ATEGGTGAAASRALQQCGQLQ
KLIVIFIGSLCGLCTKCAVSNDL
TQQEIQTPEPAKPAKPASQTRQ
PSQPSHPASQASQTSQPAKQPA
SQASQPAKPAKPPSQPSQPSHR
ASQASQATQPERPASQPASEPS
QPSLPASQASQDRQPAKPAKPD
SH PRQPSQP YQPSQPASQASQA
SQPAKPSRQPSQPRQPSQLASQ
ASQSTQPLKPAKSPVQQSQPSQ
PASPDSQASQASQATQPAKPAK
PPSQAAQPAKPAKPPSQPACQP
PSQRAKPAKPARQPSQPSQSSPP
ASQPSQACQPSQPGSQASQASQ
AAKPGMPAKPARQPSQPSQPAS
QARHASQAIQPAKPAG*PSQTS
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last A ino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19545 4991 : 19660 1223 2286 RQCFKATEQPPAIHHPSSQPSRQ
PSSQPP1YPSPICGRKSGIQPKFQ
LNG*MVNG*MDEVSRERHEWP
CNYQVRPKAGEKQPGAEANRE
TRPEGGTP/RGERGRQGPATRR
QKSTPEAGPSDLEKQIARQEAP
AGTDEIERQQARPH*AP*KIPVE
WVNGEWLNGRMPGDPHEWQC
NYQPAIQPAIQPANQLSIHNLW
TQFRHSTKIPVEWVNGEWLNG
RMPGKHHEWQCNYQPAIQPAI
QPANQLSIHNLWTQFRHSTKIP
VEWVNGEWLNGRMPEDHHEW
QCNYQPAIQPAIQPASQLSIHN
MWTQFRHSGQIPVEWVNGSRL
HGRQQGIFMHGNATSSHSPSIQ
PAIQPAIQPANQLSIHNLWTQFR
HSTKIPVEWVNGEWLNGRMQR
NIMNGN ATI SHSPSIQPAIQPAIQ
PVNHLSIHNLWTQFRHSTKIPV
EWVNGEWLNGRMPGKNHEW
QCNYQPAIQPAIQPANQLSIHNL
WTQFRHSTKIPVEWVNGEWLN
GRMPGDHHEWQCNYQIAPML
QSNRTAASHSPSIQPAIQAAIQS
ATHLSIPNLWTQVRHSTKIPVE
WVNGEWLNGRMPGRDMNGH
ATIRYGPKPEKNNRAPRQTEKQ
GQKGAHQRREGKARAGNEETE
KYSRSRAIRPRKTDSASRSSRGH
RRNRKAASAATLGPIEDTNQNK
LHVRDTGLEVKAISIQGPGTLM
19546 49914 19661 39 599 GASPNQGQNRPHARQRAPPQ/G
/PPGEPERRAPLPSGHGEPCRHR
PPPFPQPP/AGTQKPLLQGPGGG
*PAENAPTAALGSPAPPRGCQA
APPPRSGAG RPDLPTLAGPRPAP
AVPPPSAAPPPPPSGAPSR/PAAG
RQRLSGVSSGPSLGWW*VGRG
RGLPAFAQIAGHQVGPRRRRTP
AGRKPRSPAGPR
19547 49915 19662 39 599 GASPNQGQNRPHARQRAPPQ/G
/PPGEPERRAPLPSG HGEPCRHR
PPPFPQPP/AGTQKPLLQGPGGG
*PAENAPTAALGSPAPPRGCQA
APPPRSGAGRPDLPTLAGPRPAP
AVPPPSAAPPPPPSGAPSR/PAAG
RQRLSGVSSGPSLGWW*VGRG
RGLPAFAQIAGHQVGPRRRRTP
AGRKPRSPAGPR
Figure imgf002467_0001
Figure imgf002468_0001
Figure imgf002469_0001
Figure imgf002470_0001
Figure imgf002471_0001
Figure imgf002472_0001
Figure imgf002473_0001
Figure imgf002474_0001
Figure imgf002475_0001
Figure imgf002476_0001
Figure imgf002477_0001
Figure imgf002478_0001
Figure imgf002479_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19617 49985 19732 92 2049 LTALPPAVSPGGHSSSSSSRTIS
GIT*ELQR/LLQAWPGPWQED
VADAEECAGRCGPLMDCRAFH
YNVSSHGCQLLPWTQHSPHTR
LRHSGRCDLFQEKGEWGYMPT
LRNGLEENFCRNPDGDPGGPW
CHTTDPAVRFQSCGIKSCRVAA
CVWCNGEEYRGAVDRTESGRE
CQRWDLQHPHQHPFEPGKFLD
QGLDDNYCRNPDGSERPWCYT
TDPQIEREFCDLPRCGSEAQPRQ
EATSVSCFRGKGEGYRGTANTT
TAGVPCQRWDAQIPHQHRFTPE
KYACKDLRENFCRNPDGSEAP
WCFTLRPGMRVGFCYQIRRCT
DDVRPQDCYHGAGEQYRGTVS
KTRKGVQCQR/WSLE/TG*DYP
ATTSLPLPPRFTFTSEPHAQLEE
N FC/RDPDGDSHGP WC YTMDP
RTPFDYCALRRCDQVQFEKCG
KRVDRLDQRCSKLRVAGGHPG
NSPWTVSLRNWQGQHFCGGSL
VKEQWILTARQCFSSCHMPLTG
YEVWLGTLFQNPQHGEPGLQR
VPVAKMLCGPSGSQLVLLKLE
RYVDNLGGWTKCEIAGRGETK
GTGNDTVLNVALLNVISNQEC
NIKHRGHVRESEMCTEGLLAPV
GACEGDYGGPLACFTHNCWVL
KGIRIPNRVCARSRWPAVFTRV
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19618 49986 19733 2296 HSAGQRSPSNDFQVLRGTELQH
LLHAWPGPWQEDVADAEECA
GRCGPLLLSRAFFIYNVSSHGCQ
LLPWTQHSPHSRLWHSGRCDLF
QEKGEWGYMPTLRNGLEENFC
RNPDGDPGGPWCHTTDPAVRF
QSCGIKSCRVAACVWCNGEEY
RGAVDRTESGRECQRWDLQHP
HQHPFEPG KFLDQGLDDN YCR
NPDGSERPWCYTTDPQIEREFC
DLPRCGSEAQPRQEATSVSCFR
GKG EG YRGTANTTTAG VPCQR
WDAQIPHQHRFTPEKYAC/KVR
WAGGRALGRAAAGETGGG*SR
A*GWRLAG*G*VERLLRDLREN
FCWNLDGSEAPWCFTLRPGTR
VGFCYQIRRCTDDVRPQDCYH
GAGEQYRGVTVVSKTRKGVQCQ
RWSAETPHKLQ*VPGAPGPAR
ALTLGRHALMSGTRAWKWLR
LSLPRFRSRSRLGSRLTSEPHAQ
LEENFC/RDPDGDSHGPWCYTM
DPRTPFDYCA LRRCADDQPPSIL
DPPPDQVQFEKCGKRVDRLDQ
RRSKLRVAGGHPGNSPWTVSL
RNR*GTTACLPQRGAEWSSVV
MPLGAGNLSLPPEVLARRWQV
*HLSQESVPCPNSPLL*VQGQHF
CGGSLVKEQWILTARQCFSSVCE
PPLCLGTQSHPTFPFPQAS*QVS
LGATD*ESQMT*QSFSPSHMPL
TGYEVWLGTLFQNPQHGEPGL
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19619 49987 19734 727 3012 PAPTHAHSAGQRSPLNDFQVLR
GTELQHLLHAWPGPWQEDVA
DAEECAGRCGPLMDCRAFHYN
VSSHGCQLLPWTQHSPHTRLRR
SGRCDLFQKKDYVRTCIMNNG
VGYRGTMATTVGGLPCQAWS
HKFPNDHKYTPTLRNGLEENFC
RNPDGDPGGPWCYTTDPAVRF
QSCGIKSCREAACVWCNGEEY
RGAVDRTESGRECQRWDLQHP
HQHPFEPG KFLDQGLDDNYCR
NPDGSERPWCYTTDPQIEREFC
DLPRCGSEAQPRQEATTVSCFR
GKGEGYRGTANTTTAGVPCQR
WDAQIPHQHRFTPEKYACKDL
RENFCRNPDGSEAPWCFTLRPG
MRAAFCYQIRRCTDDVRPQDC
YHG AG EQYRGTVSKTRKG VQC
QRWSAETPHKPQFTFTSEPHAQ
LEENFCRNPDGDSHGPWCYTM
DPRTPFDYCALRRCADDQPPSIL
DPPVDQVQFEKCGKRVDRLDQR
CSKLRVAGGHPGNSPWTVSLR
NWQGQHFVCGGSLVKEQWILTV
ARVQCFSSVCHMPLVTGY*GMVG
ATFFQNPQHGEPSLVQRVQ*AK
MLCGPSGSQLVVLLKLERSVTL
N/HRVWALIILPVPEWYWPPGTV
KCEIAGWGETVKGTGVNDTVVLN
VALLNVISNVQECNIKHRGRVRE
SEMVCTEGVLLAPVGACESDYG
GPVLACFTVHNCWVVLEGIIIPNVR
19620 49988 19735 66 234 WCSAWHSWLPHSPSGPAASYP PAACRSAPQWPDP*R*CPRTHR RRSPWRAPWHCP
19621 49989 19736 500 GRGGGGVRRCARPAPGRPPAA
ARRRGDQRGGRLGRVHVPGKTF
CCGICGRGFGRRETLKRHERIH
TGEKPHQCPVCGKRFRESFHLS
KHHWHTRERPYKCELCGKVF
GYPQSLTRHRQVHRLQLPCAL
AGAAGLPSTQGTPGACGPGAS
GTSAGPTDGLSYACSD
Figure imgf002483_0001
Figure imgf002484_0001
Figure imgf002485_0001
Figure imgf002486_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /^possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19634 50002 19749 1725 MELKTKARELREECRSLRSRCD
QLEERPNLRLIGVPESDGENGT
KLENTLQDI1QENFPNLAKQVN
VQIQEIQRTPQRYSLRRATPRHII
VRFTKVEMKEKMLRAAREKGR
VTLKGKPIRLTADLSAETLQAR
REWGPIFNILKEKNFQPRISYPA
KLSFISEGEIKSFTDKQMLRDFV
TTRPALQELLKEALNMERNNRS
WFFEKINKIDRPLARVIKKKRE
KNQIDAIKNDKEDITTNPTEIQTI
IREYYKHLYPNKLENLEEMDTF
LDTYTFPRLNQEEVESLNTSITG
SEIVAIIS/NSIPTKKSPGPDGSTA
EFYQVMLEVLARAIRQEKEIKGI
QLGKEEVKLSLFADDMIVYLEN
PIVSAQNLLKLISNFSKVSGYKI
NVQKSQAFLYTNNRQTESQIMS
ELPFTIASKRIKCLGIQLTRDVK
DLFKENYKPLLKEIKEDTNKW
KNIPCSWVGRINIVKMAILPKVI
YRFΝAIPIKLPMTFFTDLEKTTL
ΝFIWΝQKRARITKSILSQKΝKA
GGITLPDFKLYYTATVTKTAW
YWYQΝ RWYWYQΝRDIDQWΝ
RTEPSEITPHVYΝYLIF
SEQ ID SEQ ID NO- Met SEQ ID NO: Nucleotide Nucleotide location of last Ammo acid sequence ( X=Unknown, NO- of peptide hod in USSN location of first codon for last ammo acid *=Stop codon, /=possιble nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possιble nucleotide insertion) sequence
19635 50003 19750 1719 MHNTDGNRFLSHWGYRQALSI
SKPASASLHPSSKTKPLGTQSKT
WAKRNREHGKKERSSSPAME
QSWMENDFDELREEGFRRSNY
SELREDIQTKGKEVENFEKNLE
ECITRITNTEKCLKELMELKTK
ARELREECRSLRSRCDQLEERR
KQERSKIDTLTSQLKELEKQEQ
THSKAGRRQEITKIRAELKEIET
QKTLQKINESRSWFFERINKIDR
PLARLIKKKREKNQIDTIKNDK
GDITTDPTEIQTTIREYYKHLYA
NKLENLEEMDTFLDTYTLPRLN
QEEVESLNRPITGSEIVAIINSLP
TKKSPGPDGFTAEFYQRYKEEL
HINRAKDKNHMIISIDAEKAFD
KIQQPFMLKTLNKLVLEVLARA
IRQEKEIKGIQLGKEEVKLSLFA
DDMIVYLENPIVSAQNLLKLLS
NFSKVSGYKINVQKSQAFLYTN
NRQTESQIMSELPFTIASKRIKY
LG IQLTRDVKHLFKEN YKPLLK
EIKEDTNKWKNIPCSWVVGRINI
VKMAILPKVIYRFNAVPIKLPM
TFFTELEKTTLKFIWNQKRACIA
KSILSQKNKAGGITLP
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkπovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possiblc nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19636 50004 19751 2133 WRRI YQANGK* KNK/QKKAG V
VILVSDKTDFKPTKIKRDKEGH
YIMVKGSIQQEELTVLNIYAPN
TGAPRFMKQVLRDLQRDLDPH
TTIMGDFNTPLSTLDRSARQKV
NKDIQELNSALHQADLINIYRIL
HPKSTEYTFISAPHRTYSKIDHI
VGRKALLRK YKRTEI ITDCLSD
HSAIKLELRIKKLTQNSSTTWK
LNNLLLNDYWIHNKTKAEIKM
CFETSENKDTTYQNLWDTCKA
VCREKFIALNAHKRKQERSKID
TLTSQLKE/LEKQEQTHSKASRR
KSRRNG * I PGHIHPPKTKPGRI*
VPE*TNNRV*N*GNN**LTNQK
KFRTRRIHSQILPEHSAGSSGQG
NQAGERNKGYSIRKRGSQIVPV
CR*HDCIFRKPHHLSPKSP*AVK
QLQQSLRIQNQRAKITSSPIHQ*
QTNREPNHE*TFIHNCFKENKIP
RNPTYKGCEGPIQGELQTTAQQ
NKRGHKQMEEHSMLMDRKNQ
YHENG HS AQGNL* IQCHPHQAT
NDFLHRIGKNYFKVHMEPKKSP
HCQVNPKPKEQSWRHHAT*LQ
TI LQG YSNQNSM VLVPKQTYRP
MEKNRGLRNNTTHLRPSSL*QT
*QKQEMGKGFPI**MVLGKLAS
HM*KAETGSLPYTLYKN*FKM
D*RLKC*T*NHKNLRRKPRQYH
SGHRHEQGLYV*NTKSNGNKS
QN*QMGSN*TKELLHSKRNYH
19637 50005 19752 1314
Figure imgf002490_0001
Figure imgf002491_0001
Figure imgf002492_0001
Figure imgf002493_0001
Figure imgf002494_0001
Figure imgf002495_0001
Figure imgf002496_0001
Figure imgf002497_0001
Figure imgf002498_0001
Figure imgf002499_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkπown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19696 50064 19811 50 2237 RSLKAFICSILSMDKQSRDPPVK
MQGSITTPGSIALAQAQAQAQV
PAKAPLAGQVSTMVTTSTTTTV
AKTVTVTRPTGVSFKKDVPPSI
NTTNIDTLLVATDQTERIVEPPE
NIQEKIAFIFNNLSQSNMTQKEE
LVNNTELVQSYRQQIGNWNQ
ANLQLFWNMYNSRRDLDINRP
GTVPNAKTLRCPVMLWGDNA
PAEDGVVECNSKLDPTTTTFLK
MADSGGLPQVTQPGKLTEAFK
YFLQGMGYMPSASMTRLARSR
TASLTSASSVDGSRPQACTHSE
SSEGLGQLNLKFEIEVLCKNLA
LDINELKPGNLLKDKDRLKNLD
EQLSAPKKDVKQPEELPPITTTT
TSTTPATNSTCTATVPPQPQYS
YHDINVYSLAGLVPHITLNPTA
HPQLKQCVRQAIERAVQELVHP
WDRSIKIAMTTCEQIVRKDFA
LDSEESRMR1AAHHMMRNLTA
GMAM1TC/RE/PLLMSISTNLKN
SFASALRTASPLQREMMDQAA
AQ AQDNCFLL/CKVGGVDPK
QLAVYEEFARNVPGFLPTNDLS
QPTGFLAQPMKQAWATDDVA
QIYDKCITELEQHLHAIPPTLAM
NPQAQALRSLLEVWLSRNSRD
AIAALGLLQKSMENGLNYMAV
AFAMQLVKILLVDERSVAHVT
EADLFHTIETLMRINAHSRGNA
PEGMDITTRQKVNKKREGFND
19697 50065 19812 3777 KNPEFNKMVLNETYRNIKVLLT
SDKAAANFSDRSLLKNLGHWL
GMITLAKNKPILHTDLDVKSLL
LEAYVKGQQELLYWPFVAKV
LESSIRSWFRPPNPWTMAIMN
VLAELHQEHDLKLNLKFEIEVL
CKNLALDINELKPGNLLKDKDR
LKNLDEQLSAPKKDVKQPEELP
PITTTTTSTTPATNTTCTATVPP
QPQYSYHDIΝVYSLAGLAPHIT
LΝPTIPLFQAHPQLKQCVRQAIE
RAVQELVHPWDR
Figure imgf002501_0001
Figure imgf002502_0001
Figure imgf002503_0001
Figure imgf002504_0001
Figure imgf002505_0001
Figure imgf002506_0001
Figure imgf002507_0001
Figure imgf002508_0001
Figure imgf002509_0001
Figure imgf002510_0001
Figure imgf002511_0001
Figure imgf002512_0001
Figure imgf002513_0001
Figure imgf002514_0001
Figure imgf002515_0001
Figure imgf002516_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn, NO: of peptide hod n USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19837 50205 19953 644 1795 RPSKSQTMLNSLSTALIISKIYFL
PHT*HIYFLLPGSFCYTHSLLSLP
QLPLFL/CLDLNAASHIIPDTTPD
PHDCISLIHLTFTFPQISFFPVSHP
DHTWFIDGSSTRPNVTTHQQRQ
AML*YLPHLSLRLPLCLPPLPLS
KPN*LP*LKPINSPYNSPILPVQK
PDKS YRLVQDLRLINQI VLPI HP
WPNPYTLLSSIPSSTTHYSMLD
LKDAFFTIPLHPSSQPLFAFTWT
DPDTHQSQQLTWAVLPQGFRD
SPHYFSQALSHDLLSFHPSASHL
IQYIDDLLLCSPSFESSQQDTLL
LLQCLFSKGYRVSPSKAQISSPS
VTYLGIILHKNTHALPADRWPS
CLRAAAAAALILVEALKITNYA
QLTLYSSHNFQNLFSSSHLTHIL
SAPRLLLLYSLFVESPTITIVPGL
DLNAASHIIPDTTPDPHDCISLIH
LTFTFPQISFFPVSHPDHTWFID
GSSTRPNHHSPAKAGYAIVSST
SIIEATALPPSTTSQQAELVTLT
QALTLAKGLRINIYTDSKYAFHI
LQHHAVIWAERGFLTTQGSSIIN
ASLIKTLLKTTLLPKEAGVIHCK
GHQKASDPIALGNAYADKVAR
QAASSPTSVPHSQFFSFASVTPT
YSPTETSTYQSLPTQGKWFLDQ
GKYLLPASQAHSILSSFHNLFH
VGYKPLDRLLEPLISFPSWKSIL
KEITSQCSICYSTTPQGLFRPPPF
PTHQARGFAPAQDWQIDFTHM
19838 50206 19954 52 641 RPSKSQTMLNSLSTALIISKIYFL
PHT*HIYFLLPGSFCYTHSLLSLP
QLPLFL/CLDLNAASHIIPDTTPD
PHDCISLIHLTFTFPQISFFPVSHP
DHTWFIDGSSTRPNVTTHQQRQ
AML*YLPHLSLRLPLCLPPLPLS
KPN*LP*LKPSLLQKDYASISILI
LNMPFIFCSTMQSYGLKEVSSL
HKGPPSLMPL
19839 50207 19955 380 TTPDPHDCISLIHLTFTPFPHISFF RVPHPYHIWFIDGSSTRPNVATH QQRQAML*YLPHLSLRLLLCPP PLPLSKPN*LP*LKPSLLQKDYA SIFILTLNMPFISCTTMQEVSSLH KGPLSLIPL
Figure imgf002518_0001
Figure imgf002519_0001
Figure imgf002520_0001
Figure imgf002522_0001
Figure imgf002523_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19883 50251 19999 3210 MVKGSIQQEELTILNIYAPNTG
ALRFIKQVLRDLQRDLDSHTIIM
GDFHTPLSTLDRSTRQKVNKDI
QELNSALHQEDLIDIYRTLHPKS
TEYTFFSAPHHTYSKIDHIVGSK
ALLSKCKRTEIITNCLSDHSAIK
LELRIKNLTQNRSTTWKLNNLL
LNDYWVHNEMKAEIKMFFETN
ENKDTTYQNLWDTFKAVCRGK
FIALNAHKRKQERSKIDTLTSQL
KELEKQEQTHSKASRRQEITKIR
AELKEIETQ
19884 50252 20000 2052 MGDFNTPLSTLDRSMRQKVNK
DTQELNSALHQADLIDIYRTLH
PKSTEYTFFSAPHHTYSKIDHIV
GSKALLSKCKRTEIITNYLSDHS
AIKLELRIKN LTQNRSTTWKLN
NLLLNDYWVHNEMKAEIKMFF
ETNENKDKTYQNLWDAFNAV
CRGKFVALNAHKRKQEGSKID
TLTSQLKELEKQEQTHSKASRR
QEITKVRAELKEIETQKTLQKIN
ESRSVARLIKKKREKNQI/DAIKN
DKRDITIDPTEIQTTIREYYKHL
Y ANKLEN LEEMDKFLDTYTLP
RLNQEEVESLNRPITGSEIVAIIN
SLPTKKSPGPEGFTAEFYQRYK
EELVPFLLKLFQSTEKEGILPNS
FYEASIILIPKPGRDTTKKENFRP
ISLMNIDAKILNKILAKRIQQHI
QKLIHHDQVGFIPGMQGWFNIC
KSINVIQHINRTKDKNHMIISID
AEKVFDKIQQRFMLKTLNKLV
LEVLARAVRQEKKIKGIQLGKE
ELKLSLFADDMIVYLENPIVSA
QNLLKLISNFSKVSEYKINVQKS
QAFLYTNNRQTESQIMSELPFTI
ASKRIKYLGIQLTRDVKDLFKE
ΝYKPLLKEIKEDVΝKWKΝIPCS
WVGRIΝIVKMAILPKVIYRFΝAI
PIKLPMTSFTELEKTTFKFI WΝ H
KGARIAKSILSQKNKAGGITLPD
FKLYYKATVTKTAWY SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19885 50253 20001 2314 MDKFLDTYTLPRLNQEEVESLN
RSITGSEIVAILNSLPTKKSPGPD
GFTAEFYQRYKEELVPFLLKLF
QSIEKEGILPNSFYEASIILIPKPG
RDTSKKENFRPIPLMNIDAKILN
KILAKRIQQHIKKLIHHDQVGFI
PGMQGWFNIRKSINVIQHTNRA
KDKNHMIISIDAEKAFDKIQQLF
MLKTLNKLGIDGMYLKIRRAIY
DKPTANIILNGQKLEAFPLKTGT
RQGCPLSPLLFNIVLEVLARAIR
QEKEIKGIQLGKEEVKLSLFAD
DMIVYLENPIVSAQNLLKLISNF
SKVSGYKINVQKSQAFLYTNNR
QTESQIMSELPFTLASKRIKYLG
IQLTRDVKDLFKENYKPLLNEI
KEDTNKWKNIPCSWVGRINIVK
MAILPKVIYRLNAIPIKLPMPFF
TELEKTTLKFIWNQKRARVAKS
ILSQKNKAGGITLPDFKLYYKAI
VTKTAWYWYQNRDIDQWNRT
EPSEITPHINNYLIFDKPDKNKQ
WGKDSLFNKWCWENGLAICRK
LKLDPFLTPYTKINSRWIKDLN
VRPKTRKTLEENLGITIQDIGMG
KDFMSKTPKAMATKDKIDKW
DLIKLKSFCTAKETTIRVNRQPT
KWEKIFATYSSDKGLISRIYNEL
KQI YKKKTNNPI KK WAKDMΝR
HFSKEDIYAAKKHMKKCSSSLA
IREMQIKTTMRYHLTPVRMAII
KKSGΝNVRLFTVAKTWIQPKCA
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19886 50254 20002 2685 MGDFNTPLSTLDRSTRQKVNK
DTQELNSAPHQADLIDIYRTLH
PKSTEYTFFSAPHHTYSKTDHIL
GSKALLSECKRTEIITNYLSDDS
AIKLELRIKNLTQNRSTTWKLN
NLLLDDYWVHNEMKAEIKMFF
ETNENKDTTYQNLWDAFKAVC
RGKFVALNAHKRKQGRSKIDT
LTSQLKELEKQEQTHSKASRRQ
EITKIRAELKEIETQKTVQKINES
RSWFFERINKIDRQLARLIKKKR
EKΝLIDAIKΝDKGDITTDPTEIQ
TTI RE YYKHL YAΝKLEΝLEEM
DKFLDTYTLPRLΝQEEVESLΝR
PITGSEIVAIIΝSLTTKKSPGPDG
FTAEFYQRAIRQEKEIKGIQLGK
EEVKLSLFADDMIVYLENPIVS
AQKLISNFSKVSGYKINVQKSQ
AFLYTNNRQTESQIMSELPFTIA
SKRIKYLGIQLTRDVKDLFKEN
YKPLLKEIKEDTNKWKNIPCSW
VGRINIVKMAILPKVIYRFNAIPI
KLPMTFFTELKKTTLNFIWNQK
VRAHIAKS/VLSQKNKAGGITLP
DFKLYYKATVTKTAWYWYQN
RDTDQWNRTEPSEIMPRIYNYL
IFDKPEKNKQWGKDSLFNKWC
WKNWLAICRKLKLDPFLTPYT
KINSRWIKDLNIRPKTIKTLEEN
LGITIQDIGMGKDFMSKTPKAM
ATKAKIDKWDLIKLKSFCTAKE
TTNRVNRQPTKWEKIFATYSSD
Figure imgf002527_0001
Figure imgf002528_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19906 50274 20022 5316 MVKGSIHQEELTILNIYAPNTG
APRFIKQVLSDLQRDLDFHTLI
MGDFNTPLSTMDRSTRQKVNK
DIQELNSALHQADLIDIYRTLHP
KSTEYTFYSAPHHTYSKIDHILG
SKALPSKCKRTDI ITN YLSDHS A
IKLELRIKNLTQNHSTTWKLNN
LLLNDYWVHNEMKAEIKMFFE
TNENKDTTYENLWDAFKAVCR
GKFIALNAHKRKQERSKIDTLT
SQLKELEKQEQTHSKASRRQEI
TKIRAEQINKIDR
19907 50275 20023 2712 WRKI YQ ANGKQKKAG VA I L VS
DKTDFKPTKIKRDKEGHYIMVK
GSIQQEELTILNIYAPNTGAPTFI
QQVLSDLQRDLDSHTLIIGDFN
TPLSTSDRSTRQKVNKDTQELN
SALHQADLIDIYRTLHPKSTEYT
FFSAPHHTYSKIDHILG SKALLS
KCKRTEIITNYLSDHSAIKLELM
IKNLTQNHSTIWKLNNLLLNDY
WVHKEMKAEIKMFFETOENKD
TTYQNLWDTFKAVCRGKFIAL
NAHKRKQERSKIDTLTSQLKEL
EKQEQTLSKASRRQEITKIRAEL
KEIETQKTLQKINESRSRFFERIN
KIDR*VTLARVIKKKGEKΝQIDTI
KNDKGDITTDPTEIQTTIRESYK
HLYANKLENLEEMDKFLDTYT
LPRLNQEEVESLNRPITGSEIVAI
VNSLPTKKSPGPDGFTAEFYQR
YKEELVPFLLKLLQSIEKEGILP
HSFYEASIILIPKPGRDTTKKVEN
FRPISLMNIDAKILNKILANQIQ
QHIKKLIHHDQVGFIPGMQGWF
NTCKSINVIQHINRTKDKNHMII
SIDAEKAFDKIQQPFMLKTLNK
LVLEVLARAIRQEKEIKGIQLGK
EEVKLSLFADDMIVDLENPIVS
AQNLLKLISNFSKVSGYKINVQ
KSQAFLYNNNRQTESQIMSELA
FTIASKRIKYLGIQLTRDVKDLF
NENYKPLLNEIQEDTNKWKNIP
CSWVGRINIVKMAILPKVIYRF SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknovvn,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19908 50276 A 20024 4461 MVASRSSRVGVFYGGQNSVDG
VSRRSDGSLNELCLYRPPTGAY
LAYDTLDVLPSGQFTVNTPPID
VNGKSLLALLWEHTSLLTSMG
GGRWAMQPLGQLGQLEHLGD
RRDQVLDRSGRSRRLDAAQHL
RHRPDRRPQGMVGVLTNQKEP
RTRWIHSRILPEVQGGTGTVPSE
TIPIDRKEGILPNLFDEASIILIPK
RGRDTTKKENFRPISLMNIDAKI
LNKILANRNQQHIKKLIHHDQV
GFIPGMQGWFNICK
19909 50277 B 20025 1183 4247
19910 50278 20026 2671 MVKGSIQQEELTILNIYAPNTG
APRFIKQVLSDLQRDLDSHTLI
MGDFNTPLSTLDRSTRQKVNK
DTQELNSALHQADLIDINRTLH
PKSTEYTFFSAPHHTYSKIDHIV
GSKALLSKCKRTEIITNYLSDHS
AIKL ELRIKNLTQSRSTP WKLN
NLLLNDYWVHNEMKAEIKMFF
ETNKNKDTTYQNLWDAFKAV
CRGKFIALNAYKRKQERSKIDT
LTSQLKELEKQEQTHSKASRRQ
EITKIRAELKEIETQKTLQKINES
RSWFFERINKIDRPLARLIKKKR
EKNQIDTIKNDKGDITTDPTEIQ
TTIREYYKHLYANKLENLEEM
DTFLDTYTLPRLNQEEVESLNR
PITGSEIVAIINSLPTKKSPGPDG
FTAEFY/PESYL*QTHSQYHTEW
AKTGSIPFENWHKTGMPSLTAP
1QHSVGSSGQGNQAGEGNKGY
SIRKRGSQIVPVCR*HDCLSRKP
HRLSPKSP*ADKQLQQSLRIQN
QCTKITSILIHQKQTNREPTHE*
TPIHNCFKENKIPRNPTYKGCEG
PLQGELQTTAQRNKRGHKQME
EHSMLMGRKNQYRENGHTAQ
GNLQIQCHPLQATODFLHRIGK
NYFKVHMEPKKSPHRQVNPKP
KEQSWRHHTT*LQTILQGYSNQ
NSVMVLVPKQRYRSMEQNRAL
RNNAAYLHYSDVLV*QT*EKQA
WGKGIPVIYKMVALVGKLASVPM*
Figure imgf002531_0001
Figure imgf002532_0001
Figure imgf002533_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19928 50296 20044 2269 MSPDIPQVSPEVGLYPVANHSSI
WPTCHSLSEEDTEAQRNHELD
TVKALREDAQVLKIPNQSPKDT
LVTALQLAQKGTKAAAIRLGA
KVGQKQSRQAKSSPALWPGL
LSFPRARFFEWQSPGLPNTAAM
ENGTGPYGEERPREVQETTVTE
GAAKIAFPSANEVFYNPVQEFN
RDLTCAVITEFARIQLGAKGIQI
KVPGEKDTQKVWDLSEQEEE
KVELKESENLASGDQPRTAAV
GEICEEGLHVLEGLAASGLRSIR
FALEVPGLRSWANDASTRAV
DLIRRNVQLNDVAHLVQPSQA
DARMLMYQHQRVSERFDVIDL
DPYGSPATFLDAAVQAVSEGG
LLCVTCTDMAVLAGNSGETCY
SKYGAMALKSRACHEMALRIV
LHSLDLRANCYQRFWPLLSIS
ADFYVRVFVRVFTGQAKVKAS
ASKQALVFQCVGCGAFHLQRL
G KASG VPSGRAKFS AACGPPVT
PECEHCGQRHQLGGPMWAEPI
HDLDFVGRVLEAVSANPGRFH
TSERIRGVLSVITEELPDVPLYY
TLDQLSSTIHCNTPSLLQLRSAL
LHADFRVSLSHACKNAVKTDA
PASALWDIMRCWEKECPVKRE
RLSETSPAFRIVLSVEPRLQANFT
VIREDVANPSSRQRGLKRFQAVNP
EAVNWGPRPRSRPVGGKAADEA
MEERVRRLLQNKRKEPPEDVVA
19929 50297 20045 1578 2356 PLRRWACNHKLRQNAHQLSSTI
HCNTPSLLQLRVSALLHADFRVV
SLSHACKVNAVKTDAPASALWV
DIMRCWEKECPVKRERLSETSP
AFRILSVEPRLQANFTVIREDANP
SSRQRGLKRFQANPEANWGPR
PRARP/GVSGGKVGVGLSQGKK
G*RADGVPPTPSPSHRGKAADE
AMEERRRLLQNKRKEPPVEDVA
QRAARLKTFPCKRFKEGTCQRG
DQCCYSHSPPTPRVSADAAPDC
PETSNQTPPGPGAAAGPGID
Figure imgf002535_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19937 50305 20053 2322 MAEGKGEAGTFFTGQQDRLLF
VFFIITIWGLFRCSVFSSSSSSSS
LSSSSSSSSSSSLSSSFLHLFFIFIF
VFFVFIFFVFIFVFVFAFFFIFFVF
VVFFVFFFFVLLVEILMRPTISIR
GQKLKISDEMSKDCLSILYNTC
VCTEGVTKRLAEKNDFVIFLFT
LMTSKKTFLQTATLIEDILGVK
KEMIRLDEVPNLSSLVSNFDQQ
QLANFCRILAVTISEMDTGNDD
KHTLLAKNAQQKKSLSLGPSA
AEINQAALLSIPGFVERLCKLAT
RKVSESTGTASFLQELEEWYT
WLDNALVLDALMRVANEESEH
NQASIVFPPPGASEENGVLAHTS
ARTVQLPQSMKIMHEIMYKLEV
LYVLCVLLMGRQRNQVHRMIA
EFKLIPVGVLNNLFDKLIWRKHS
ASALVLHGHNHYVCDCSPDITLV
KIQFLRLLQSFSDHHENKYLLL
NNQELNELSAISLKANIPEVEAV
LNTDRSLVCDGKRGLLTRLLQ
VMKKEPAESSFRFWQARAVEE
FPSRGTTSYADQMFLLKRVGLL
EHILYCIVDSVECKSRDVLQSYF
DLVLGELVMKFNVDAFKRFNKYI
NTDAKFQVFLKVQIVNSSLVDSN
MLVRCVTLSLDRFENQVDMKV
AEVLSECRLLAYISQVVPTQMSF
LFRLINIIHVQTLTQENVSCLNTV
SLVILMLVARRKERLPLVYLRLLV
QRVMEHSKKYPGLLVLNNFHNL
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19938 50306 20054 237 2033 TSPTPLPLPATMPKTISVRVTTM
DAELVEFAIQPNTTGKQLFDQV
VKTIWLEGKFGFFGLQNQDTVK
GFSTVWLKLNRKVD*PRNVRKE
VSPLLFKVFRAKFYPVEDVSEELIQ
DITQRLFVFLQVKEGILNDDIYCP
VPETAVLLASYAVQSKYGDFNK
EVHKSGYLAGDKLLPQRVLEQ
HKLNRDVQWEERIQVWVHEEHR
GML/RGEDAVLEISERLLQDLE
MYGVNYFSIVKNKKGSELWLG
VDALGLNIYEQNDRLTPKIGFP
WVSEIRNISFNDKKFVVIKPIDKK
APDFVFYAPRLRINKRILALCM
GNHELYMRRRKPDTIEVQQMK
AQAREEKHQKQMERAMLENE
RKKREMAEKEKEKIEREKEEL
MERLKQIEEQTKKAQQELVEEQ
TPRAPGTLSQERKRAQSEAEKL
AKERQEAEEAKEALLQASRDQ
KKTQEQLALEMA ELTARISQLE
MARQKKESEAVEWQQKAQMV
QEDLEKTRAELKTAMSTPHVA
EPAENEQDEQDENGAEASVADL
RA* SVMAKDRSEEERTTEVAEKN
ERVQKHLKALTSELANARDES
KKTANDMIHAENMRLGRDKY
KTLRQIRQGNTKQRIDEFESM
Figure imgf002538_0001
Figure imgf002539_0001
Figure imgf002540_0001
Figure imgf002541_0001
Figure imgf002542_0001
Figure imgf002543_0001
Figure imgf002544_0001
Figure imgf002545_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
19980 50348 20096 1982 QRDNLEAEAVAPCTAAARAVA
PDLLRLCSVRPSRSAPEDRQQP
CLFSHSEAGTAFIQTQQLHAAV
MADTF/LWEHMCRLVDIDSPPIT
ARNTGVIICVTIGPGLPDQVETLK
EDD*SLGMKCGSVLNFSSWEL
HGVTMRETIQGMWRTANGKA
FAFLTPFLLPGPLAVGFRTLKGP
EIPNWGSSKGNGTVAEVGA*RK
GATLKFTLVDNAYMVEKCVDENI
LVWLDYKNICKVVVEVGVSKIYV
DDGLISLQGKQKVGADFVLVTEV
ENGGSLGIVMKGVNLPVGAAVD
FACLCRRKDIQDLKFGVVEVQDV
DMVFVASFIPQSILMFHEVRKVL
GVEKGKDHQGLSSKIRESMRGF
RRFDVEILVEASDGI/LWVARG*F
*ALRIPAEKVVFVLAQKMIDLDRF
NRAVGKPVVICATQMLESMIKKP
RPTRAVEGSDVVANVAVLDWKPT
CIMLSWEKQPKGDYPLVEAVVR
MQHLV1AREARWLPSYHVLQLFV
EELRPPGAQLTSDPTEATAVGVA
VEASFKCCVSGAIIVVLTKSGKVC
SPRLARYRPRAPIIAVTRNSPGK
LVRAHLVYRGIFPGVVCQGPDSR
RAWA*DIGVLRVKLCHGMFGK
ARGFFQERGDWHCA*PEWRP
WLPAPPNNMRWPCCRDGPRG
PSSSRPCSHPFFPKPIPFRPSNGL
VELHSGAVNVATG
Figure imgf002547_0001
Figure imgf002548_0001
Figure imgf002549_0001
Figure imgf002550_0001
Figure imgf002551_0001
Figure imgf002552_0001
Figure imgf002553_0001
Figure imgf002554_0001
Figure imgf002555_0001
Figure imgf002556_0001
Figure imgf002557_0001
Figure imgf002558_0001
Figure imgf002559_0001
Figure imgf002560_0001
Figure imgf002561_0001
Figure imgf002562_0001
Figure imgf002563_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
20109 50477 20225 1970 MWKGFTILDSFKNISDSWEDVR
ISTLTGVWKKLIVTLMDDLEEF
KTSVEEITLTQRKLFFVMESPV
GPISPPQPPSVSAWNKPLTSFGS
APSSEGAKNGQESGLEIGTDTIQ
FGAPASNGNENEWPVLSEKSA
DKIPEPKEQRQKQPRAGPIKAQ
KLPDLSPVDNKVEHKPGPIGKER
SLKNRKVKDAQQVEPEGQEKP
SPATVRSTDPVTTKETKAVSEM
STEIGTMISVSSAEYGTNAKESV
TDYTTPSSSLPNTVATNNTKME
DTLVNNMESARKAWENSPNVR
EKGSPVTSTAPPIATGVSSSASG
PSTANYNSFSSASMPQIPVASVT
PTASLSGAGTYTTSSLSTKSTTT
SDPPNICKVKPQQLQTSSLPSAS
HFSQLSCMPSLIAQQQQNPQVY
VSQSAA/ATLTTACLS*WSTAQI
PAFYMDTSHLFNTQHARLAPPS
LAQQQGFQPGLSVRSQPAFMQS
SLSQPSWLSGTAIHNFPTVQH
QEL AKAQSGL AFQQTSNTQPI PI
LYEHQLGQASGLGGSQLIDTHL
LQVTVPLPASQLSLPNFGSTGQ
PLIALPQTLQPPLQHTTPQAQA
QSLSRPAQVSQPFRGLIPAGTQ
HSMIATTGKMSEMELKAFGSGI
DIKPGTPPIAGRSTTPTSSPVSGL
LLQVRTASPAK
20110 50478 20226 260 8485 VSTEMSEKSGQSTKAKDGKKY
ATLSVLFNTSKGKSLETQKTTVA
ARHGLQSLGKVGISRVRMPPPC*
PPKFLKPE*QRANDPNVNIVPK
DGTGWASKQEQHEEEKVTPEVP
PAVQPKPGVAAPPEVAPAPKSW
ASNKQGGQGDGIQVNSQFQQE
FPSLQAAVDQEKKVEKETNDDN
YGP*PSLRPPNVACWRDGGKA
AGSPSSSDQDEKLPGQDESTAG
TSEQNDILKVVEKRIACGPPQA
KLNGQQAALASQYR
20111 50479 A 20227 330 485 RQGDRVSLCCLGRS*TPGLK*F SHLDLLNCWDYRCETVHLAEI AQVSEKQI
20112 50480 20228 1 189 1413
Figure imgf002565_0001
Figure imgf002566_0001
Figure imgf002567_0001
Figure imgf002568_0001
Figure imgf002569_0001
Figure imgf002570_0001
Figure imgf002571_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno>vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
20148 50516 20264 2519 MAPSSRSGAGNTSTPGGCARAR
APPGNAGVHSGTQTGAVRDTA
FISPSSPGLFLSSEPPQPGQRAGF
LQPSCCGAYYLYPWLRERQSSI
LWSGGAAAPQRKSGPLGRSDQ
PPPSPHWGGFQKRCRHRIAQL
FAVGALVCCLVLLHLSAMISAS
RAAAARLVGAAASRGPTVAAR
HQDSWVNGLSHEAFRLVSRRDY
VASVEAIKGAWGIVDLGTTNSCV
AVMEGNQAKVVLENAEGAKTT
PSWAFTADGERLGGMPAKRQ
AVTNPNNTFYATKRLIGRRYDD
PEVQKDIKΝVPFKIVRASΝGDA
WVEAHGKLYSPSQIGAFVLMK
MKETAEΝ YLG HTAKΝ A VITVP
AYFΝDSQRQATKDAGQISGLΝ
VLRVIΝEPTAAALAYGLDKSED
KVIAVYDLGGGTFDISILEIQKG
VFEVKSTNGDTFLGGEDFDQAL
LRHIVKEFKRETGVDLTKDΝM
ALQRVREAAEKAKCELSSSVQT
DIΝLPYLTMDSSGPKHLΝMKLT
RAQFEGIVTDLIRRTIAPCQKA
MQDAEVSKSDIGEVILVGGMT
RMPKVQQTVQDLFGRAPSKAV
ΝPDEAVAIGAAIQGGVLAGDV
TDVLLLDVTPLSLGIETVHGAVF
SKLIΝRKTNIPTKKSQVFSTAAD
GQTQVEIKVCQGEREMAGDΝK
LLGQFTLIGIPPAPRGVPQIEVTF
DIDAΝGIVHVSAKDKGTGREQ
20149 50517 A 20265 168 LERERRKRSRKELLΝFYAWQH
RESKMEHLR/QLRKKFEEDKQR
IELLRAQRKFRPY
Figure imgf002573_0001
Figure imgf002574_0001
Figure imgf002575_0001
Figure imgf002576_0001
Figure imgf002577_0001
Figure imgf002578_0001
Figure imgf002579_0001
Figure imgf002580_0001
Figure imgf002581_0001
Figure imgf002582_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
20207 50575 20323 270 2945 NTSSFKLSPTTGPPGRYLTTTGA
RIWNRLARYLVSEGSGLGAIEV
MGTQDPGNMGTGVPASEQISC
AKEDPQVYCPEETGGTKDVQV
TDCKSPEDSRPPKETDCCNPED
SGQLMVSYEGKAMGYQVPPFG
WRICLAHEFTEKRKPFQANNVS
LSYMIKHIGMGLRYLQWRDRK
THVEKKTPFIDMNKSVPPIQSYF
VVPLGGLRGGTSTRGWRGQFCR
WQLNPGMYQHRTV1ADQFTVC
LRVREGQTVYQQVLSLVERPSVL
RSWNWGLCGYFAFYHALYPRA
WTVYQLPGQNVTLTCRQITPIL
PHDYQDSSLPVGVFVWDVENE
GDE ALDVSIM FSMRN/GTG W W
RRCPRGFVGMSPSVWSVGVQT
VRGLLLHSSNPSKPLHDGLWLA
RSSRAATTVNPH/CTAFDPGVRT
GAAVVWQDLLQDGQLDSPTGQ
RTPTQKGVGIAGAVCVSSKLRP
RGQCRLEFSLAWDMPRIMFGA
KGQVHYRRYTRVFGAGVGDAP
PALSHYALCRYAEWEERISAW
QSPVVLDDRSLPAWYKSALFNE
LYFLADGGTVWLEVLEDSLPEE
VRAETSVTSAPTLRDYRRFG*LE
GQEYVRMYNTYDVHFYASFALI
MLWPKLELSLQYDMALATLRE
DLTRRRYLMSGVMGTCEKEGT
FIPHDIGGPRWMEPWASAFNAY
VLIHDTADWVKDLΝLKFVACRFI
20208 50576 20324 123 359
20209 50577 20325 259 KPVGEKDTWSGMRTTGQLRPA HGVRLKAΝKDSLYKPILRQKK HFNSLHIPKALQKALPFKΝKPK TQAKAGKVPKDRRR*TVSΝITV
20210 50578 20326 297 585
20211 50579 20327 182 374 PDFRI A AT* V VLDLDKSIKI VKK LKLTGFPYKIFKNTSFIKACISIY SHIYKSPYCLRKGMKYL
Figure imgf002584_0001
Figure imgf002585_0001
Figure imgf002586_0001
Figure imgf002587_0001
Figure imgf002588_0001
Figure imgf002589_0001
Figure imgf002590_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
20249 50617 A 20365 160 2810 GLRVPLSPSSPRSRLRSGPLRLP
GADSGSGPKAVCSPPFIVAPTG
RGYCGDHESSFGAMEEPGATP
QPYLGLLLEELRRWAALPEG
MRPDSN L YGFP WEL VIC A A V V
GFFAVLFFLWRSFRSVRSRLYV
GREKKLALMLSGLIEEKSKLLE
KFSLVQKEYEGYEVESSLKDAS
FEKEATEAQSLEATCEKLNRSN
SELEDEILCLEKELKEEKSKHSE
QDELMADISKRIQSLEDESKSL
KSQVAEAKMTFQIFPMNEERLK
IAIKDALNENSQLQESQKQLLQ
EAEVWKEQVSELNKQKVTFED
SKVHAEQVLNDKESHIKTLTER
LLKMKDWAAMLGEDITDDDN
LELEMNSESENGAYLDNPPKG
ALKKLIHAAKLNASLKTLEGER
NQIYIQLSEVDKTKEELTEHIKN
LQTEQASLQSENTHFENENQKL
QQKLKVMTELYQENEMKLHR
KLTVEENYRLEKEEKLSKVDEK
ISHATEELETYRKRAKDLEEEL
ERTIHSYQGQIISHEKKAHDNW
LAARNAERNLNDLRKENAHNR
QKLTETELKFELVLEEDPYALDV
PNTAFGREHSPYGPSPLGWPSS
ETRAFLSPPTLLEGPLTLSPLLP
GGGGRGSRGPGNPLDHQITNER
GESSCDRLTDPHRALSDTGFLS
PPWDQDRRMMFPPPGQSYPDS
ALPPQRQDRFCSNSGRLSGPAE
20250 50618 20366 131 461
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
20251 50619 20367 1761 MARGYGATVSLVLLGLGLALA
VIVLAWLSRHQAPCGPQAFAH
AAVAADSKVCSDIGRVAILQQQ
GSPVDAS1AALVCTSVVNPQSM
GLGGGVIFTIYNVTTGKVEVIN
ARETVPASHAPSLLDQCAQALP
LGTGAQWIGVPGELRGYAEAH
RRHGRLPWAQLFQPTIALLRGG
HVEAPVLSRFLHNSILRPSLQAS
TLRQLFFNGTEPLRPQDPLPWP
ALATTLETVATEGVEVFYTGRL
GQMLVEDIAKEGSQLTLQDLA
KFQPE VVDA LEVPLGD YTL YSP
PPPAGGAILSFILNVLRGFNFST
ESMARPEGRVNVYHHLVETLK
FARGQRWRLGDPRSHPKLQNA
SRDLLGETLAQLIRQQIDGRGD
HQLSHYSLAEAWGHGTGTSHV
SVLGEDGSAVAATSTIVNTPFGA
MVYVSPRTGIILNNELLDLCERC
PRGSGTTPSTVSGDRVGGAPGR
CWPPVPGEPSPSSMEPSILΓNKA
QGSNLVIGCAGGELIISAVAQAI
MSKLWLGFDLRAAIAAPILHVN
ΝKGCVEYEPΝFSQEVQRGLQD
RGQNQTQRPFFLNWQAVSQE
GACVYAVSDLRKSGEAAGY
Figure imgf002593_0001
Figure imgf002594_0001
Figure imgf002595_0001
Figure imgf002596_0001
Figure imgf002597_0001
Figure imgf002598_0001
Figure imgf002599_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
20285 50653 20402 190 2189 AFCPAGTRSAPCPRPRTMTMTL
HTKASGMALLHQIQGNELEPLN
RPQLKIPLERPLGEVYLDSSKPA
VYNYPEGAAYEFNAAAAANA
QVYGQTGLPYGPGSEAAAFGS
NGLGGFPPLNSVSPSPLMLLHPP
PQLSPFLQPHGQQVPYYLENEP
SGYTVREAGPPAFYRPNSDNRR
QGGRERLASTNDKGSMAMESA
KETRYCAVCNDYASGYHYGV
WSCEGCKAFFKRSIQELPTLC*I
GWYWYQNRDIDQWNRTEPSEI
I PHI YN HLIFDKRDKNKK WGKD
SLFNK*CWENWLAICRKLKLD
HFLTPYTKINSRWIKDLNVRTK
TIKTLEENLGNTIQDIGMGKDF
MSKTPKA MATKAKI DKWDLIK
LKSFCSLLGYLKTENTAKETTIR
LNRQPTEWEKIFAIYSSDKGLIS
RIYKELKQIYKKK/GNPINKWV
KDMNRHFSKEDIYAANRHMKK
CSSSLAIREMQIKTTMRYHLTP
VRMVIIRKSGNDRCWRGCGEIG
TLLHCWWDCKLVQPLWKTVW
RFLRDLQLEIPFDPAIPLLGIYPK
DYKSCCYKDTCTRMFIAALFTI
AKT WNQPKYPTTIG *IKKM WH I
YTMEYYAAIKKDEFI/SLVGTW
MKLETIILSKLTQEQKTKHHMF
SFITGSRTMRTHGCREGNITHW
GLLRDSVNMVTQAH
20286 50654 20403 907 RVPAMATSSFALPRILGAGARA
PSRWLGFLGKATPRPARPSRRT
LGSATALMIRESEDSTDFNDKIL
ΝEPLKHSDFFΝVKELFSVRSLF
DARVHLGHKAGCRHRFMEPYI
FGSRLVDHDIIDLVEQTATHLQLA
LΝFTAHMAYRKGIILFISRΝRQF
SYLIEΝMARDCGEVYAHTRYFM
GGMLTNARLLYGPTVRLPDLVII
FLHTLΝNIFEPHVAVRDAAKM
ΝIPTVGIVDTNCΝPCLITYPVPG
ΝDDSPLAVHLYCRLFQTAITRA
KEKRQQVEALYRLQGQKEPGD
QGPAHPPGADMSHSL
20287 50655 B 20404 70 1477
Figure imgf002601_0001
Figure imgf002602_0001
Figure imgf002603_0001
Figure imgf002604_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possiblc nucleotide insertion) sequence
20315 50683 20432 1791 MPGARTSSSGASENHRARGQG
GGPQGVGRMAEGKAGGAAGL
FAKQVQKKFSRAQEKVLQKLG
KAVETKDERFEQSANNFYQQQ
AEGHKLYKDLKNFLSAVKVMH
ESSKRVSETLQEIYSSEWDGHE
ELKAIVWNNDLLWEDYEEKLV
ADQAVRTMEIYVAQFSEIKEVRI
AKRGRKLVDYDSARHHLEAVQ
NAKKKDEAKTAKAEEEFNKAQ
TVFEDLNQELLEELPILYNSRIG
CYVTIFQNISNLRDVFYREMSK
LNHNLYEVMSKLEKQHSNKVF
WKGLSSSSRRSLVISPPVRTAT
VSSPLTSPTSPSTLSLKSESESVS
ATEDLAPDAAQGEDNSEIKELL
EEEEIEKEGSEASSSEEDEPLPA
CNGPAQAQPSPTTERAKVSQEE
VLPSSTTPSPGGALSPSGQLSSS
ATE V VLRTRT ASEG SEVRPKKT
ASIQRTSAPPRRPPPPRATASPRP
CSGNIPSSPTASGGGSAVTSPRAS
LGTGTASPRTSLEIV* PNPEVPPEK
PVRTPEAKENENIHNQNPEELC
TSPTLMTSQVASEPGEAKKME
DKEKDNKLISADSSEGQDQLQV
SMVPENNNLTAPEPQEEVSTSE
Figure imgf002606_0001
Figure imgf002607_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
20324 50692 20441 582 2595 PPPPVRRRDPGPAVRPAGAMA
KKSAENGIYSVSGDEKKGPLIA
PGPDGAPAKGDGPVGLGTPGG
RLA VPPRVETWTRQM DFI MSC V
GFAVGLGVNVWRFPYLCYKNG
GGVFLIPYVLIALVGGIPIFFLEIS
LGQFMKAGSINVWNICPLFKGL
GYASMVIVFYCNTYYIMVLAW
GFYYLVKSFVTSTLPWATCGNY/
TWKHSPTCVEIFVRHEDCVANAS
LANLTCDQLANRVRSPVVIEFWE
NKVLKLSGGLEVVPGALNWEVV
TLGLLACWVLVYFCVWKGVK
STGKVIVYFTA/THYPYVVLWL
LVRGVLLPGALDGIIYYLKPDW
SKLGSPQVWIDAGTQIFFSYAIG
LGALTVLGSYNRFNNNCYKDA
IILALINSGTSFFAGFWFSILGF
MAAEQGVHISKVAESGPGLAFI
AYPRAVTLMPVAPIVWAALFFF
MLLLLGLDSQFVGVEGFITGLL
DLLPASYYFRFQREISVALCCA
LCFVIDLFMGVTDGGMYVFQLF
DYYSASGTTLLWQAVFWECW
VAWVYGADRFMDDIACMIGY
RPCPWMKRVCWSFFTPLVCVMGI
FILNWYYEPLVYKQN/TYVYP
WWGEVAMGVWPFALSVSMLFVP
LVHLLGLPSLRAKGHPWLKRW
QHVLTVQPIWGLHPL/EELPRFRD
ADVQGTLTHP*TPVSESSKWVV
20325 50693 A 20442 363 MGKRGLILLIEACIAIAQPLQRW
FWPPRAGMPVTYD/HSNSAAG
SPPV*NARLCSPGEKVLCRSQT
GNFPHTPQEQAVPSAWNALPV
QP* SH/GPSPML VPCRCVCPMET
RVQGSASPGGQG
20326 50694 20443 747 1389 LGAVQLCPPPCRPCHRAALPGG
LFPPPRPPHSPQAGPLHPQWTQ*
PGSVGSPTQLSPEASRPQWPPT
AVGSPRAGVHPGHGQA*SADISS
AKQWTQVGPAGQGGSKFGKT
KVPWQSGPLGWDPGARAGQG
VG*ETRGPSLQLAWSDGEGV/P
GWRGSYRGHRRGP*PPAPWPW
SGGHPGSPMEWPCPGTELQMR
NRAGSRDLHSLGRAPPAATAPS
Figure imgf002609_0001
Figure imgf002610_0001
Figure imgf002611_0001
Figure imgf002612_0001
Figure imgf002613_0001
Figure imgf002614_0001
Figure imgf002615_0001
Figure imgf002616_0001
Figure imgf002617_0001
Figure imgf002618_0001
Figure imgf002619_0001
Figure imgf002620_0001
Figure imgf002621_0001
Figure imgf002622_0001
Figure imgf002623_0001
Figure imgf002624_0001
Figure imgf002625_0001
Figure imgf002627_0001
Figure imgf002628_0001
Figure imgf002629_0001
Figure imgf002630_0001
Figure imgf002631_0001
Figure imgf002632_0001
Figure imgf002633_0001
Figure imgf002634_0001
Figure imgf002635_0001
Figure imgf002636_0001
Figure imgf002637_0001
Figure imgf002638_0001
Figure imgf002639_0001
Figure imgf002640_0001
Figure imgf002641_0001
Figure imgf002642_0001
Figure imgf002643_0001
Figure imgf002645_0001
Figure imgf002646_0001
Figure imgf002647_0001
Figure imgf002648_0001
Figure imgf002649_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
20869 51237 20988 2577 MPPNRHRMRGETTLWNPGCDH
AGIATQVWEKKLWREQGLSR
HQLGREAFLQEVWKWKEEKG
DRIYHQLKVKLGSSLDWDRACF
TMDPKLSVAAVTEAFVGLHEKG
IIYRSTAFVΝWPCTLΝSAISDIE
VDKKELTGRTLLSVPGYKEKV
EFGVLVSFVAYKVVQGSD*/DDE
EVWAΝKLGSKTVMLGDVAVA
VPPPKDTRVYQHLVKGKNVIHPVF
LSRSLSPLSSDEFVDMDFVGTGA
VKDHPPRHDQNDYEVGQRHGL
EAISIMDSRGAVPIKCASAFPGPA
AGLRPGKRCWWR*RNRGLFRG
IEDNPMWPLCNRSKDWEPLL
RPQWYVRCGEMAQAASAAVT
RGDLRILPEAHQRTWHAWMD
NIREWCISRQLWWGHRIPAYFV
TVSDPAVPPGESEDLSVFYPGT
LLETGHDILFFWVARMVMLGL
KLTGRLPFREVYLPCPSCENAH
GRKVMSKSLGNVIDPLDVHL/Y
GISLQGLHNQLLNSNLDPSEVE
KAKEGQKADFPAGVPECGTDA
LRFGLCAYMSQGS*K*TWNVN
RIMG * RQFCNKL WKCTKF ALR
GLGKGFVPSVDTSNSAGHESLA
DR* IRNLLTE AVRLSNQGFQA Y
DFPAVTTAQYSFWLYELCDVY
LECLKPVLNGVDQV/AQAECAR
QTLYTLPWTLGLRLLSPFMPFV
TEELFQRLPRRMPQAPPSLCVT
20870 51238 A 20989 4083 SQRGEHVGAGCRGGDLRELGS
PGEPPTVSARSTPPPGPLGGSQP
RPAPGCEVDSPAQSDHLPPVFP
LRSDLLITMSTLYVSPHPDAFPS
LRALIAARYGEAGEGPGWGGA
HPRICLQPPPTSRTSFPPPRLPAL
EQGPGGLWVWGATAVAQLLW
PAGLGGPGGSRAAVLVQQWVS
YADTELIPAACGATLPALGLRS
SAQDPQAVLGALGRALSPLEE
WLRLHTYLAGEAPTLADLAAV
TALLLPFRYVLDPPAR
Figure imgf002651_0001
Figure imgf002652_0001
Figure imgf002653_0001
Figure imgf002654_0001
Figure imgf002655_0001
Figure imgf002656_0001
Figure imgf002657_0001
Figure imgf002658_0001
Figure imgf002659_0001
Figure imgf002660_0001
Figure imgf002661_0001
Figure imgf002662_0001
Figure imgf002663_0001
Figure imgf002664_0001
Figure imgf002665_0001
Figure imgf002666_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21025 51393 21 151 1536 MGGGEQGTLRTAENQGLQVCP
GEAGSSLGFSGNTPTNGLPARG
DQTSYRDFWSGPQFPDTSLRTA
APGAREQCGTHGGEGSTLEGSR
PDVPQDLFSPTLGFYGSTYPELE
QGMLGGH*G/PAENQGLQVCP
GEAGSSLGFSGNTPTNGLPARG
DQTSYRDFWSGPQFPDTSLRTA
APGAREQCGTHGGEGSTLEGSR
PDVPQDLFSPTLGFYGSTYPELE
QGMLGGHVEASREYQMLSHV
GΝPDYPHALATEQATSPSTESG
SAGFKGVALAMLSKITMCVRIG
ILVFQQDQGSMCKAFSTTGMG
LVTYLLTCRAWSRLQCGASYR
AHDVΝVLPGCSGPAALLLGDW
FSLHSSDAVLPAQGQGMLSQG
EADACITLPHLFPSREWFLLGL
EAGGLIPGAEGPPEDMGLVRGP
PAVAAGPPPRWLSPEFCLLQYS
EEEGCKEPLWLKVEAEVSTLYR
FMPTGFTRGPFΝIEHREWGΝTT
ATWTRMISΝHΝKHSQRGRKMT
ARLPCGGAPHFSDRTDDKWRR
SSLLTRSGCQTERLLTSQTEPLP
AGGSPHFSΝGETGQRRSSPPQR
GCDRPQALLTSKTEQQRKSASH
ISYDERPGKDAPHFLDGMVAG
KRRSSLSRLGSQA
Figure imgf002668_0001
Figure imgf002669_0001
Figure imgf002670_0001
Figure imgf002671_0001
Figure imgf002672_0001
Figure imgf002673_0001
Figure imgf002674_0001
Figure imgf002675_0001
Figure imgf002676_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21070 51438 21197 2466 MLSRLFRMHGLFVASHPWEVI
VGTVTLTICMMSMNMFTGIAG
LFTIFSSFVFSTWIHFLDKELTG
LNEALPFFLLLIDLSRASTLAKF
ALSSNSQDEVRENIARGMAILG
PTFTLDALVECLVIGVGTMSGV
RQLEIMCCFGCMSVLANYFVF
MTFFPACVSLVLELSRESREGR
PIWQLSHFARVLEEEENKPNPV
TQRVKMIMSLGLVLVHAHSRW
IADPSPQNSTADTSKVSLGLDE
NVSKRIEPSVSLWQFYLSKMIS
MDIEQVITLSLALLLAVKYIFFE
QTETESTLSLKNPITSPWTQKK
VPDNCCRREPMLVRNNQKCDS
VEEETGINRERKVEVIKPLVAET
DTPNRATFVVGNSSLLDTSSVL
VTQEPEIELPREPRPNEECLQIL
GNAEKGAKFLSDAEIIQLVNAK
HIPAYKLETLMETHERGVSIRR
QLLSKKLSEPSSVLQYLPYRGYV
NYSL W* WG ACCEN VIGYMPIP
VGVAGPLCLDEKEFQVPMATT
EGCLVASTNRGCRAIGLGGGAS
SRVLADGMTRGPWRLPRACD
SAEVKAWLETSEGFAVIKEAFD
STSRFARLQKLHTSIAGRNLYIR
FQSRSGDAMGMNMISKGTEKA
LSKLHEYFP*NCRF*AVVSGNYC
TDKKPAAINWIEGRGKSWCET
VI PAKV VREVLKTTTEAMIEVN
INKNLVGSAMAGSIGGYNAHA
21071 51439 21 198 236 3022 GEGSFRSATALTGARLSVGAGV
RWPLVRSGGSKDSVATMLSRL
FVRMHGLFVASHPWEVILGTVT
LTICMMSMNMFTGNNKICGWN
YECPKFEEDVLSSDIIILTITRCIA
ILYIYFQFQNLRQLGSKYILGIA
GLFTIFSSFVFSTWIHFLDKELT
GLNEALPFFLLLIDLSVRASTLA
KFALSSNSQDEVRENIARGMAI
LGPTFTLDALVECLVIGVGTMS
GVRQLEIMCCFGCMSVLANYF
VFMTFFPAC
21072 51440 A 21199 450 CKSRGSNLRVHFKNTRETAQAI
KGMHIRTATKYLKDVTLQKQC
VPFRRYNGGVGRCAQAKQWG
WTQGRWPKKSAEFVLVIEHIQV
NKAPKMRRRTYRAHGRINPYM
SSPCHIEMILTEKEQIVPKPEEEV
AQKKKISQKKLKKQKLMARE
Figure imgf002678_0001
Figure imgf002679_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /^possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21094 51462 A 21221 2157 MGFPLRPSPAHLSSGRQQFPAK
EERLLLSLTQESQSEEIDCNDKD
LFKAVDAALKKYNSQNQSNNQ
FVLYRITEATKTVGSDTFYSFK
YEIKEGDCPVQSGKTWQDCEY
KDAAKAATGECTATVGKRSST
KFSVATQTCQITPGGWFILWHC
PGGKLTISDYPNSSRVLHCTQC
VQGLNRRTRRHKTCPPLQAAS
QIEERIGFEGRPGLESFRAPCLL
ASPFPCLEKGAEGPWTAQYDC
LGCVHPISTQSPDLEPILRHGIQ
YFNNNTQHSSLFMLNEVKRAQ
RQAAP/GREFSSMDTFEGYEGA
EDMEKLAGPFPVG*QSRNLFLQ
IRFPGEERKHPGRKRRTR*GSN
GDRQHQEGACWGVHAG*LLR
GRRLVSRRGRGARGPPPRRAAL
PAVAPAPRLQSARIPERGDGDD
GVRRWGHREAIGS*GQS/LS/CN
GIIILIKEERLPCPFHCISLMKRPP
GFSPFRSSRIGEIKEETTVSPPHT
SMAPAQDEERDSGKEQGHTRR
HDWGHEKQRKHNLGHGHKHE
RDQGHGHQRGHGLGHGHEQQ
HGLGHGHKFKLDDDLEHQGGH
VLDHGHKHKHGHGHGKHKNK
GKKNGKHNGWKTEHLASSSED
STTPSAQTQEKTEGPTPIPSLAK
PGVTVTFSDFQDSDLIATMMPPI
SPAPIQSDDDWIPDIQIDPNGLSF
NPISDFPDTTSPKCPGRPWKSVS
21095 51463 21222 207 424 HHINCVLLYLQSHLRSVCEYQGV RPPKAGAEPASEVREVSLTN W A ESSTPRHIAPTTSASNLEMEGQE ERWDRI
Figure imgf002681_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21100 51468 21227 846 3269 GFMEVLRDP1KKNSSESKPAQS
GFSRGNSPLSCPESVEASPAVNE
KSVYSTHNYGTTQRHGCRGLP
YADHNYGAPPPPTPPASPPVQTI
IPRSDLNGLPSPVEERCGDSPNS
EGETVPTWCPCGLSQDGFLLNC
DKCRGMSRGKVIRLHRRKQDN
ISGGDSSATESWDEELSPSTVLY
TATQHTPTSITLTVRRTKPKKR
KKSPEKGRAAPKTKKIKAFREG
SRKSLRMKNSPSEAQNLDENTT
EGWENRIRLWTDQYEEAFTNQ
YSADVQNALEQHLHSSKEFVG
KPTILDTINKTELACNNTVIGSQ
MQLQLGRVTRVQKHRKILRAA
RDLALDTLIIEYRGKVMLRQQF
EVNGHFFKKVPYPVFVLFLLQNL
NGVEMCVDARTFGNDARFIRR
SCTPNAEVRHMIADGMIHLCIY
AVSAITKDAEVT1AFDYEYSNC
NYKVDCACHKGNRNCPIQKRN
S*CYRTATPTTSSKPTHHWSRD
*T*KSVRRKELEMEQQNEASEEN
NDQQSQEVPEKVTVSSDHEEV
DNPEEKPEEEKEEVIDDQENLA
HSRRTREDRKVEAIMHAFENLE
KRKKRRDQPLEQSNSDVEITTT
TSETPVGEETKTEAPESEVSNSV
SNVTIPSTPQSVGVNTRRSSQA
GDIAAEKLVPKPPPAKPSRPRPK
SRISRYRTSSAQRLKRQKQANA
QQAELSQAALEEGGSNSLVTPT
21101 51469 21228 197 CRPPSPRPVRAARPARPQASFSR
EPPPRRMAP*SKRRRQSARGPV
RPSRVWARIKAKPYSCCCQL
21102 51470 21229 488 GSYDINRQDTFQKDRQRVGPSG
ASQKTEGFLLRLLYSGLIIAHCS
LEILGRNDPPASASKVAETTDS
GSLSLANT*PPTPR/VTSQTANA
TSQAAPALPLGSVPGRRAADPR
PLLAWAPAPPGPGTSRNTHPAP
DGLDMLVPIGDGVLGGNCFRN
QCKSVSTS
21103 51471 21230 124 293
Figure imgf002683_0001
Figure imgf002684_0001
Figure imgf002685_0001
Figure imgf002686_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21126 51494 21253 2775 QALARRAVQAEKMAPPSTREP
RVLSATSATKSDGEMVLPGFPD
ADSFVKFALGSWAVTKASGG
LPQFGDEYDFYVRSFPGVFQAFC
ETQGDR/LCFQCMSRVMQVPM
GCVRSNVI KDRSKVVTELEDKFVD
LLVVDANVDVILERVGILLDEAS
GVNKNQQPVLPAGLQVPKTW
VSSWNRKAAEYGKKAKSETFRL
LHAKNIIRPQLKFREKIDNSNTP
FLPKIFHSNPMVQVKPLPQALSK
ERRERPQDRPE
21 127 51495 21254 2668 QALARRAVQAEKMAPPSTREP
RVLSATSATKSDGEMVLPGFPD
ADSFVKFALGSWAVTKASGG
LPQFGDEYDFYRSFPGFQAFCE
TQGDRLLQCMSRVMQYHGCRS
NIKDRSKVTELEDKFDLLVDAN
DVILERVGILLDEASGVNKNQQ
PVLPAGLQVPKTWSSWNRKA
AEYGKKAKSETFRLLHAKNIIR
PQLKFREKIDNSNTPFLPKIFIKP
NAQKPLPQALSKERRERPQDRP
EDLDVPPALADFIHQQRTQQVE
QDMFA H PYQ YELNHFTPADA V
LQKPQPQL YRPI EETPCHFISSL
DELVELNEKLLNCQEFAVDLEH
HSYRSFLGLTCLMQISTRTEDFII
DTLELRSDMYILNESLTDPAIVK
VFHGADSDIEWLQKDFGLYW
NMFDTHQAARLLNLGRHSLDH
LLKLYCNVDSNKQYQLADWRI
RPLPEEMLSYARDDTHYLLYIY
DKMRLEMWERGNGQPVQLQV
VWQRSRDICLKKFIKPIFTDESY
LELYRKQKKHLNTQQLTAFQL
LFAWRDKTARREDESYGYVLP
NHMMLKIAEELPKEPQGIIACC
NPVPPLVRQQINEMHLLIQQAR
EMPLLKSEVAAGVKKSGPLPSA
ERLENVLFGPHDCSHAPPDGYP
11 PTSGS VPVQKQASLFPDEKED
NLLGTTCLIATAVITLFNEPSAE
DSKKGPLTVAQKKAQNIMESFE
21128 51496 21255 98 286 HARPRHGPRLRCEVGLRCAPG YAGASARPTCECGDTRYCPQDP NQT* SRPVPII AHRYCLTG
21129 51497 21256 1020 1280
21130 51498 21257 153 3804
Figure imgf002688_0001
Figure imgf002689_0001
Figure imgf002690_0001
Figure imgf002691_0001
Figure imgf002692_0001
Figure imgf002693_0001
Figure imgf002694_0001
Figure imgf002695_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vπ, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possiblc nucleotide insertion) sequence
21 174 51542 21301 1 16 1559 PRTARPSCAHARTVRAVATRRR
RVHFRDLSAAPSGLSVLRPRPT
EPVRVDCPESANPPVRARPDSP
DPRMFELLAASDARINGPFCAP
RVLLAPLLGAFVTVSKIVSKIAL
SPK* SRRSHLLRRNGSQ WVVKV
HVELKEHNGVHITGIDVWAPKSD
RICH CLGADRNCLMVWSVQKVD
GVWKPNPG*FLRNLIGGGNFL*
KWVPPRRTKFGCGGSGARLISV
CYFESENDWWVSKHIKKPIRST
VLSLDWHPNNVLLAAGSCDFK
CRVFSAYIKEVDEKPASTPWGS
KMPFGQVLMSEFGGVSGTGVGW
VHGVSFSVASGVDRLGWGSATD
STVVSVA*CLQKWQVSTLKTVE
FLAAP*WSFVSREPASGLPAH
DCVCPMLFNYG*PRAALTFRLP
KLDIPKVQSIQRNMSAHGNAFG
NMVDKRATTEDVRNTALVETLHV
QNSITQVLYLLRWDKQDVCRKF
CTVTGIVDGEPWTIWDFQDPRSL
PSQGLRIM
Figure imgf002697_0001
Figure imgf002698_0001
Figure imgf002699_0001
Figure imgf002700_0001
Figure imgf002701_0001
Figure imgf002702_0001
Figure imgf002703_0001
Figure imgf002704_0001
Figure imgf002705_0001
Figure imgf002706_0001
Figure imgf002707_0001
Figure imgf002708_0001
Figure imgf002709_0001
Figure imgf002710_0001
Figure imgf002711_0001
Figure imgf002712_0001
Figure imgf002713_0001
Figure imgf002714_0001
Figure imgf002715_0001
Figure imgf002716_0001
Figure imgf002717_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Ammo acid sequence ( X=Uπknown,
NO: of peptide hod in USSN location of first codon for last ammo acid *=Stop codon, /=possιble nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possιble nucleotide insertion) sequence
21290 51658 21418 1584 TQKNTAVRAHRLWGREKISRK
EGAGPTPQESRRIGARSGALMT
RKGRGMQEVEELSETGKWKQ
HRVGEALRVWLVWAGSLG/D
RRGLVLGRRGYYGAVPAVEMP
ARALLPRRMGHRTLASTPALW
ASIPCPRSELRLDLVLPSGQSFR
WREQSPAHWSGVLADQVWTL
TQTEEQLHCTVYRGDKSQASRP
TPDELEAVRKYFQLDVTLAQL
YHHWGSVDSHFQEVAQKFQG
VRLLRQDPIECLFSFICSSNNNIA
RITGMVERLCQAFGPRLIQLDD
VTYHGFPSLQALAGPEVEAHLR
KLGLGYRARYVSASARAILEEQ
GGLAWLQQLRESSYEEAHKAL
CILPGVGTKVADCICLMALDKP
Q A VPVDVHM WH I AQRD YS WH
PTTSQAKGPSPQTOKELGNFFR
SLWGPYAGWAQAGLLGNAFD
GHQLLRPLIFCQDHLREGPPIGR
GDSQGEELEPQLPSSLSSIPYGF
CDHCWTKDVDDPPLVTHPSPG
SRDGHMTQAWPVKWSPLATV
IGHVMQASLLAL
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21291 51659 21419 837 2785 WTRMLFAVSCNDGECQNSNVI
LFCDMCNLAVHQECYGVPYIPE
GQWLCRRCLQSPSRAVDCALC
PNKGGAFKQTDDGRWAHWC
ALWIPEVCFANTVFLEPIDSIEHI
PPARWKLTCYICKQRGSGACIQ
CHKANCYTAFHVTCAQQAGLY
MKMEPVRETGANGTSFSVRKT
AYCDIHTPPGSARRLPALSHSE
GEEDEDEEEDEGKGWSSEKVK
KAKAKSRIKMKKARKILAEKR
AAAPWSVPCIPPHRLSKITNRL
TIQRKSQFMQRLHSYWTLKRQ
SRNGVPLLRRLQTHLQSQRNCD
Q VGRDSEDKN WALKEQLKS W
QRLRHDLERARLLVELIRKREK
LKRETIKVQQIAMEMQLTPFLIL
LRKTLEQLQEKDTGNIFSEPVPL
SEVPDYLDHIKKPMDFFTMKQ
NLEAYRYLNFDDFEEDFNLIVS
NCLKYNAKDTIFYRAAVRLRE
QGGAWRQARRQAEKMGIDFE
TGMHIPHSLAGDEATHHTEDA
AEEERLVLLENQKHLPVEEQLK
LLLERLDEVNASKQSVGRSRRA
KMIKKEMTALRRKLAHQRETG
RDGPERHGPSSRGSLTPHPAAC
DKDGQTDSAAEESSSQETSKGL
GPNMSSTPAHEVGVGEPQFCSP
KRTRRQ/HGPPKRPGRPPKNRES
QMTPSHGGSPVGPP/RAPHHEFP
ASAQAG*EPQFCSPKRTRRQLD
21292 51660 A 21420 2625 MENKITGYTTVDISQWHRKEHF
EAFQSVAQCTYNQTVQLDITAF
LKTVKKNKHKFYPAFIHILARL
MNAHPEFRMAMKDGELVIWD
SVHPCYTVFHEQTETFSSLWSE
YHDDFRQFLHIYSQDVACYGE
NLAYFPKGFIENMFFVSANPWV
SFTSFDLNVANMDNFFAPVFTM
GKYYTQGDKVLMPLAIQRDSE
DKNWALKEQLKSWQRLRHDL
ERARLLVELIRKREKLKRETIKV
QQIAMEMQLTPFLILLRK
Figure imgf002720_0001
Figure imgf002721_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21302 51670 21430 1660 AGVLASHPNSTDVHIINLSLTFH
GQELLSDTKLELNSGRRYGLIG
LNGIGKSMLLSAIGKREVPIPEH
IDIYHLTREMPPSDKTPLHCVM
EVDTERAMLEKEAERLAHEDA
ECEKLMELYERLEELDADKAE
MRASRILHGLGFTPAMQRKKL
KDFSGGWRMRVALARALFIRPF
MLLLDEPTNHLDLDACVWLEE
ELKTFKRILVLVSHSQDFLNGV
CTNIIHMHNKKLKYYTGNYDQ
YVKTRLELEENQMKRFHWEQD
QIAHMKNYIARFGHGSAKLAR
QAQSKEKTLQKMMASGLTERV
VSDKTLSFYFPPCGKIPPPVIMV
QNVSFKYTKDGPCIYNNLEFGI
DLDTRVALVGPNGAGKSTLLK
LLTGELLPTDGMIRKHSHVKIG
RYHQH LQEQLDLDLSPLEYMM
KCYPEIKEKEEMRKIVIGRYGLT
GVKQQVSPIRNLSDGQKCRVCL
AWLAWQNPHMLFLDEPTNHL
DIETIDALADAINEFEGGMMLV
SHDFRLIQQVAQEIWVCEKQTI
TKWPGVDILVAYKEHLKSKLVDV
Figure imgf002723_0001
Figure imgf002724_0001
Figure imgf002725_0001
Figure imgf002726_0001
Figure imgf002727_0001
Figure imgf002728_0001
Figure imgf002729_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21351 51719 21479 2421 QAGAMASGSVAECLQQETTCP
VCLQYFAEPMMLDCGHNICCA
CLARCWGTAETNVSCPQCRETF
PQRHMRPΝRHLAΝVTQLVKQL
RTERPSGPGGEMGVCEKHREPL
KLYCEEDQMPICWCDRSREHR
GHSVLPLEEAVEGFKEQIQΝQL
DHLKRVTRLKFTDFLAQGEQA
RAELLSLTHMEREKIVWEFEQL
YHSLKEHEYRLLARLEELDLAI
YΝSIΝGAITQFSCΝISHLSSLIAQ
LEEKHQQPTRELLQDIGDTLSK
AERIVRIPEPWITPPDLQEKIHIFA
PKCLFLTESLKQFTEKMQSDME
KIQELREAQLYSGGSIVGGHEP
GEPRGIKAGYGTCΝCFPEEEKC
FCEPEDIQDPLCDELCRTVIAAA
VLFSFVVSVLLSAFCIHCYHKF
AHKPPISSAEMTFRRPAQAFPVS
YSSSGARRPSLDSMEΝQVSVDA
FKILEDPKWEFPRKΝLVLGKTL
GEGEFGKWKATAFHLKGRAG
YTTVAVKMLKEΝASPSELRDL
LSEFΝVLKQVΝHPHVIKLYGAC
SQDGPLLLIVEYAKYGSLRGFL
RESRKVGPGYLGSGGSRΝSSSL
DHPDERALTMGDLISFAWQISQ
GMQYLAEMKLVHRDLAARΝIL
VAEGRKMKI SDFGLSRDVYEE
DPYVKRSQGRXPVKWMAIESLF
DHIYTTQSDVWSFGVLLWEIVT
LGGΝPYPGIPPERLFΝLLKTGH
Figure imgf002731_0001
Figure imgf002732_0001
Figure imgf002733_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21358 51726 21486 2589 MSPVTLWVRTSSRGEDRCGMC
TPGRDTGFAP1SGMCSKYRSCTI
NEDTGLGLAFTMPMSLDTKHIP
NFGKVHGYIMRHHKNEDTIPW
RRNTKKEEARGTQPLEADDQN
PGWEDGKPGSLENEGRRSHRG
NPAPDMPQPPKEDLFI LPDEYK
SCLRHKRSLLRSHRNEELNVET
LVWDKKMMQNHGHENITTY
VLTILNMERILKKTSFTIIHPSA
MERCPVLNLQIRDSLWNHCFSL
ATTVHRFHPVGRHALKKMYKL
PSAYINVSITKYTQRFALTYISC
VFHRNAAFVAVPGEGRCCAGV
DVPAPMSLRSPGPHWPQEHGA
LWISCTVRGMASCLGQLCRVS
MRVRANIGKVQKPRLRISNGK
GLHTQSELSGSAYLEKGWRGA
APGLDSRNEKGTEGARDSGTPG
RQSQTLQRGPPGPVPGSSKIAPQ
TLTRQGSMTASRKLVLLLPEFC
ERTGEQSGQLYPPSPLLVEWV
MPAGTSSTVDPLLPELCRRWER
QALSLCRLYRQRTSVSIKALCD
HTETRSVALSTCQGLECNSSPA
RKQNWMENEFEELTEVGFRKS
VITKFSKLKEHILTHRKEAKNLE
KRLDEWLTRITSVEKSLNDLME
LKTTVRKVLHEAYTSFNSRFDQ
VEERILVIEHQINEIKQEDKVRE
KRVKRNEQSLQEIWDCVKRPN
LCLIGVPESDGENGTKLENTVLQ
21359 51727 A 21487 260 1292 RLCAFNKRMTFQFNFTIEDHLE
NELTPIRDGALTLDSSKELSVSE
SQKGEERDRKCSAEQFDLPQD
HLWEHKSMENAAPSQDTDSPL
SAASSSRNLEPHGKQPSLRAAK
EHAMPKDLKKMLENKVIETLP
GFQHVKLSVVKTILLKENFPGE
NIVSKSFSSHSDLITGVYEGGLK
IWECTFDLLAYFTKAKVKFAG
KKVLDLGCGSGLLGITAFKGGS
KEIHFQDYNSMVIDEVTVLPNV
VANSTLEDEENDVNEPDVKRC
RKPKVTQLYKCRFFSGEWSEFC
KLVLSSEKLFVKYDLIVLTSETIY
NPDYYSNLHQTFLRLLSKNGRV
LLASKAHVQRNSLSRL
Figure imgf002735_0001
Figure imgf002736_0001
Figure imgf002737_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possiblc nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21390 51758 21518 1321 VKSRAALLGPGA/LSPLEVSRRR
PWTERRLGSVRGDRLRERSRPA
VNRTPRMSRVPSGAPRGTPSRR
WWMMKCWDHQPLG*IPAPHV
GPLAG/IRKREWSDESEEEPEKE
LAPEPEETWVVEMLCGLKMKL
KQQRVSSILPEHHKDFNSQLAP
GVNPSPPHRSFCWKRKMEWW
DQSEESLEEEPRKVLAPEPEEIW
VAEMLCGLKMKLKRRRVSLVL
PEHHEAFNRLLEDPVIKRFLAW
DKDLRVSDKYLLAMVIAYFSR
AGFPS WQ YQRIHFFL A L YLAND
MEEDDEDSKQNIFHFLYRKNRS
RIPLLRKRWFQLGHSMNPRARK
NRSRIPLLRKRRFQLYRSTNPRA
RKΝRSRIPLLCKRRFQLYRSMΝ
SRARKΝRSQIVLFQKRRFHFFCS
MSCRAWVSPEELEEΝTGPRGD
VDFQQELYSΝAΝGRHQAGGEE
Figure imgf002739_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21396 51764 A 21524 727 2953 HVSGYDDVPLGTLSLDIWPPAL
FSWSFAAFFFTPPIEAVLKSFSC
LLVWYIGMAQPKQERVARARH
QRSETARHQRSETA/KTPTLGN
RQTPTLGNRQTPRLGIHARPRR
RATTSLLTLLRAFGKKTQS/STE
PAKAIKPIDRKSVHQICSGQWL
SLSTAVKKIVENSLDAGATNID
LKLKDYGMDLIEVSGΝGCGVE
EEΝFKGLTLKHHTSKIQEFADL
TRVETFGFRGEALSSLCALSDV
TISTCHVSAKVGTRLVFDHDGK
IIQKTPYPHPRGTTVSVKQLFST
LPVRHKEFQRΝIKKAQR/PQAR/
CYRCS*PΝVSPGPGMAAASTLS
A*PRSQSSYFISCKA/EDNAICPI
AEQKAEGKFSSRYRNHDSAEN
RRDGSMTRPRTDWTLKRACTE
KRRPQHASSFEWRGHSLCWDR
EQRDAETLKMLLDSGLSVQKK
TKDRTETRFGEMGQILGKIMMS
HQPQPQEEQSPQRSTSGYPLQE
WDDEVSGPSAPGVDPSPPRRS
LGWKRKRECLDESDDEPEKEL
APEPEETWVAETLCGLKMKAK
RRRVSLVLPEYYEAFNRLLEDP
VIKRLLAWDKDLRVSDKIPSEP
TILGASPKTLPPASRICIRPSNTP
PPRNFHMSTVTPMLSYLANDM
EEDDEAPKQNIFYFLYEETRSHI
PLLRELWFQLCRYMNPRARKN
CSQIALFRKYRFHFFCSMRCRA
Figure imgf002741_0001
Figure imgf002742_0001
Figure imgf002743_0001
Figure imgf002744_0001
Figure imgf002745_0001
Figure imgf002746_0001
Figure imgf002747_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possiblc nucleotide insertion) sequence
21445 51813 A 21573 59 2384 PRPGSRSGLCRRAGERGAVRAG
GMSRRTRCEDLDELHYQDTDS
DVPEQRDSKCKVKWTHEEDEQ
LRALVRQFGQQDWKFLASHFP
NRTDQQCQYRWLRVLNPDLVK
GPWTKEEDQKVIELVKKYGTK
QWTLIAKHLKGRLGKQCRERW
HNHLNPEVNKSCWTEEEDRIIC
EAHKVLGNRWAEIAKMLPGRT
DNAGKNHWNFTIQK/RKGDTG
GFLREYKD* KPP V YLPLVELEDK
DGLQSAQPTEGQRSLLTNWPSV
PPTIKEEENVSEEELAAATTSKE
QEPIGTDLDAVRVTPVEPLEEFPVK
REDQEGSPPETD/SLPYKWWE
AVNLLIPAWSVSLYEALDLIGSV
DPDAWCDLSRFDLPEEPSIEDSI
NNSVLVQLQASVHQQQILPPRQP
SAVLVPSVTEYRLDGHTISDLSR
SSRGELIPISPSTEVGGVSGIGTPP
SVLKRQRKRRVALSPVTENSTS
LSFLDSCNSLTPKSTPVKTLPFS
PSQFLNFWNKVQDTLELESPLA
DYPPQVCSQKVWTTPLHRVDK
TPLHQKHAVAFVTPDQKYSMD
NTPHTVPTPFKNALEKYGPLKPL
PQTPHLVEEDLKEVLRSEAGIELI
IEDDIRPEKQKRKPGLRRSPIKK
VRKSLALDIVDEDVKLMMSTL
PKSLSCRTTAPSNVSSTVLTLSGIK
EDNSLVLNQGLLAGPSPRKAAC
VPEGPESHFHGQLAPYVPVPGK
21446 51814 21574 190 889 SLPRRLESPGRLAMAARGVIAP
VGES/LCRYAEVPCSPSAKRPDA
DVDQQRLVRSLIAVGQGVAAL
AFAGVRYALVRIWKPLEVQVITET
AKGRFQTPRFSSY* * GG/FLNI I *
LRREAVGLILGVSPVSAGKAKIRT
VAHRVRVMILESPQIKGGSPYVA
AKINEAKDVLLETTTKRLMLKD
HTEGKKNEGTSKKKKSPAKYS
KTWSFLIFYMGFDHSLIFRPLSC
ITNKMLIVLLFIIF
Figure imgf002749_0001
Figure imgf002750_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21458 51826 21586 929 MDPSIYFLPSKREGHLQAYIRDS
VPEDPLILIFFSASSDQRPWLKA
PHLPALRGFRGGGAGGGAQSR
TTEDYVAWAKGNSPTSPSPRSR
GGCSAKLSGPRRPPRRPGRLFN
MAASANSVSRAWRTESPEGLE
SLHSTSLSAISVSRPEGGAELSD
TADTMGFGDLKSPAGLQWPTI
TWRTRATSRGMCHHKQMWQY
LKPCPAHRLPTCVMPYV/GYNH
IKSYEKEKASLPGVKKALGKYG
PADVEDTTGSGATDSKDDDDID
LFGSDDEEESEEAKRLRE/DTSC
TI*IKESQKTCTCCQVFHLTRCE
21459 51827 21587 2405 MALRWRSVRAVLCSLHVACE
KVGMQIPRFCYHERLSVAGNC
RMCLVEIEKAPKDQSMMFGND
RSRFLEGKRAVEDKNIGPLVKT
IMTRCIQCTRCIRFASEIAGVDD
LGTTGRGNDMQVGTYIEKMFM
SELSGNIIDICPVGALTSKPYAFT
ARPWETRKTESIDVMDAVGSNI
WSTRTGEVMRILPRMHEDINE
EWISDKTRFAYDGLKRQRLTEP
MVRNEKGLLTYTSWEDALSRV
AGMPPALALQSAEIAASARPPP
RLGSEEPLCLAAHRLGCEEPLC
LAASLESEERLCPAAIPPGNLRY
VFISSAKTDKYSKLAPVEWVTV
EKVPENVEVTLELGNRQSRDR
VSRIPGNAIVIWFTGFKWKKTV
TTLWFLNLAIADFIFLLFLPLYIS
YVAMNFHWPFGIWLCKANSFT
AQLNMFASVFFLT/GDQPGPLY
PLDPSCLISSASNPQELSDCHYI
HLAFGFSNWRSCPVLPGHCGV
Q*SYSLL*QFSEA*S*PHFDQAP
CSDLGEIYHWLSLPFANNEYLL
LVSHLQGEEAKHPDLQ*AFLDN
SGCGCGLCGLLDSLSPV* HLG A
HHSPQ*LFPPCDAGWNPPLHWF
GIPQ*LLEPHPLCPN**EVPSSLP
VLSC*DTQVHTVGSQLFWVQVL
KEAKKPMVVLGSSALQRNDGA
AILAAVSSIAQKIRMTSGVTGD
WKVMNILHRIASQVAALDLGY SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21460 51828 21588 2065 MLRIPVRKALVGLSKSPKGCVR
TTATAASNLVIRVFVDGQSLMVV
EPGTTVLQACEVRIGMQIPRFCY
HERLSVAGNCRMCLVEIEKAPK
DQSMMFGNDRSRFLEGKRAVE
DKNIGPLVKTIMTRCIQCTRCIR
FASEIAGVDDLGTTGRGNDMQ
VGTYIEKMFMSELSGNIIDICPV
G ALTSKPY A FTARP WETRKTES
IDVMDAVGSNIWSTRTGEVM
RILPRMHEDINEEWISDKTRFA
YDGLKRQRLTEPMVRNEKGLL
TYTSWEDALSRVAGMLQSFQG
KDVAAIAGGLVDAEALVALKD
LLNRVDSDTLCTEEVFPVTAGA
GTDLRSNYLLNTTIAGWEEAD
WLLVGTNPRFVEAPLF*C*EFR
KSWVLHNDVLKVALIRAVPVDLV
TYTYDHLGVDSPKILQVDIASGK/
SHPFSQVLKEAKKPMVWLGSS
ALQRNDGAAILAAVSSIAQKIVR
MTSGVTGDWKVMNILHRIASQ
VAVLVDLGYKPGVEAIRKNPPK
VLFLLGADGGCITRQDLPKDCF
IIYVQGHHGDVGAPIADVILPGA
AYTEKSATYVNTEGRAQQTKV
AVTPPGLAREDWKIIRALSEIAG
MTLPYDTLDQVRNRLEEVSPNL
VRYDDIEGANYFQQANELSKL
VNQQLLADPLVPPQLTIKDFYM
TGEVTEINEALAENPGLVNKSR
YEDGWLIKMTLSNPS
Figure imgf002753_0001
Figure imgf002754_0001
Figure imgf002755_0001
Figure imgf002756_0001
Figure imgf002757_0001
Figure imgf002758_0001
Figure imgf002759_0001
Figure imgf002760_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkπown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21514 51882 21642 2230 MMRTSGGTRDNESSSDDERER
ENRLSDEKSNEYANLVGGERY
EPDEENPMLGFRGAGRYVSDSF
LRLCSSLEIVKARETVALIQGLK
HVDLLADDTALRALIPQRADRFP
MNAVRSTLKEEEKLVNSSDWG
YVDAQAFARWRPEYGYFAKQA
GFTVKTSASLAALWQWNQIG
GKERYFFGNILWQTRALMDRAI
GHKLAKGRPEREYLQTGDAVD
SWKVIWEPEKQLTLLFGMKAP
GLGRLCFSLEDKGDYRTIDVRA
FWHPHGNRAEREKPRFQPYSRR
KSFIGQNMLLPMSIMPPVFRRVI
CPLGVARLGKFRDPVYLVCVFT
RQIFRYYTSGDEFLQINTTGGED
IDDIDDIKLFVYEESYGISKESH
WREAINAKAMGAMTLNWQEK
RWQRFFNSEEPGNIEPVYMLEK
VENQNHAKWEVHNFTMGYQR
QVTEDTYEYLLLNWGVPVPD
NIHPMTKDEAGVWSWRTPILK
GNLYEYFFNVDGVRSIDTGTA
MTNPQRQVNSSMILVPGSYLDT
RSVAHGDLIAITYHSNALQSER
QMYVWTPPGYTGMGEPLPVLY
FYHGFGDTGRSAIDQGRIPQIM
DNLLAEGKIKPMLWIPDTETD
AKGIIPEDFVPQERRKVFYPLNA
KAADRELMNDIIPLISKRFNVR
KDADGRALAGLSQGGYQALVS
GMNHLESFGWLATFSGVTTTT
Figure imgf002762_0001
Figure imgf002763_0001
Figure imgf002764_0001
Figure imgf002765_0001
Figure imgf002766_0001
Figure imgf002767_0001
Figure imgf002768_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21559 51927 21687 2339 MSGGQTEKQAQLIRDFQPDMI
MVTPSYCLNLIEELERQLGGDA
SGCSLRVGVFGAEPWTQAMRK
EIERRLGITALDIYGLSEVMGPG
VAMECLETTDGPTIWEDHFYPE
IVNPHDGTPLADGEHGELLFTT
LTKEALPVIRYRTRDLTRLLPGT
ARTMRRMDRISGRSDDMLVLR
RRHLCRPSACQLCRPPSLPATD
RVLELFAQSGAQWLVILGDVL
NHGPRNALPEGYAPAKVAERL
NEVAHKVIAVRGNCDSEVDQM
LLHFPITAPWQQVLLEKQRLFL
THGHLFGPENLPALNQNDVLV
YGHTHLPVAEQRGEIFHFNPGS
VSIPKGGNPASYGMLDNDVLS
VIALNDQSIIAQVAINPAPEKEG
FLIMEQRRLASTEWVDIVNEEN
EVIAQASREQMRAQCLRHRAT
YIWHDGMGKILVQRRTETKDF
LPGICATSKVRKLLCKSAALLP
VSELLLVLLTPVFVGSAVIGKL
GAFAIHIDGITVKRRLVRNTVL
QFCGRSRQPLEYVSAGDTGPVL
RALKRMMAMRHYMRSQTVKG
VTDTRAIDEVALSVAQVEEMY
RYLAIANYEDRFVIPTSHREMA
GDAFAERNGCGFTFGDGCHGS
DSKFNLFNSSRIDAINITEILKVI
GLLMEYPDELLWECKEDALALI
RRDAPMLTDFTHNLLNAPLLD
KQAEWCEVFDRGRTTSLLLFEH
21560 51928 21688 391 631
21561 51929 21689 2779 3348 LMQLHNGLPVAGLRQFCQQLV
IQRLYFFTTATAHQCTRHMCHR
HRSLLTAHRQMALCFQYVDDL
HHAFADVLINNACYPLLMGEQ
GVFIYNGTQQLLHWVIIDSRQD
HGFGGL*KSRIVCERRRL*SQH
GQFVFRHHVPDHRIDIAFNNAL
*TLVIREISSVPASVGVGIDFPGG
AATWAGPVVIKPGQS
Figure imgf002770_0001
Figure imgf002771_0001
Figure imgf002772_0001
Figure imgf002773_0001
Figure imgf002774_0001
Figure imgf002775_0001
Figure imgf002776_0001
Figure imgf002777_0001
Figure imgf002778_0001
Figure imgf002779_0001
Figure imgf002780_0001
Figure imgf002781_0001
Figure imgf002782_0001
Figure imgf002783_0001
Figure imgf002784_0001
Figure imgf002785_0001
Figure imgf002786_0001
Figure imgf002787_0001
Figure imgf002788_0001
Figure imgf002789_0001
Figure imgf002790_0001
Figure imgf002791_0001
Figure imgf002792_0001
Figure imgf002793_0001
Figure imgf002794_0001
Figure imgf002795_0001
Figure imgf002796_0001
Figure imgf002797_0001
Figure imgf002798_0001
Figure imgf002799_0001
Figure imgf002800_0001
Figure imgf002801_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21782 52150 21910 52 1719 IGQWDVDTCFTRIHLVFVNFD
NDTGSIHVLNDTTTFSYRSYTG
VNGNSTFHTSTNQRLISAQSRN
GLTLHVRTHQCTVGVIVFQERD
QGRTDGYHLLGGYVHWNLV
AAEQAGFAFATASYQVFYEVA
FFIQVGVRLGDNWAFFDSRQV
VNFVSYNTVGHFTIRSLKEAVF
VSLCVHGQRVDQTDVRTFRGF
DWTDATWSRVYVSNFEACTF
TGQTAWAECGDTTFVRNLRQR
WLVHKLRQLAGTEELFHCCG
SQDPQQPWGWNLRVNPLAQTT
GHAHE AQRKQ WH KGNEEADD
PEPERAFTPGFVQFETERFRPPV
GHPCEAAEDHAADDHVMEVR
NQEQAVMQNEVRARYGQQHA
SHPAHREGDDKTDGPQHRRME
HDAALIHGEQPVEDLHPGRDG
DNHCGDTKEGVYVRARTHRKE
WQPNDERQHGDTDGCPYQRG
ITKQTFTREGCRNFGEHTENRQ
DQDVNFRVAPGPNQVDVHHH
VATHIVGKEMGAQIAIQGQHAT
YRQNGKDL*SNRGVASGNPQC
RHRAHRRATDCQTVIKQPVFFG
Figure imgf002803_0001
Figure imgf002804_0001
Figure imgf002805_0001
Figure imgf002806_0001
Figure imgf002807_0001
Figure imgf002808_0001
Figure imgf002809_0001
Figure imgf002810_0001
Figure imgf002811_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21865 21993 145 1929 VSGVIEIADGSRRRKAAALTES
DYRVLVGELDDEQMAALSRLG
NDYRPTSAYERGQRYASRLQN
EFAGNISALADAECDNLKTCHT
SHGSVMAETAVINHKKRKNSP
RIVQSNDLTEAAYSLSRDQKRM
LYLFVDQIRKSDGTLQEHDGIC
EIHVAKYAEIFGLTSAEASKDIR
QALKSFAGKEWFYRPEEDAG
DEKGYESFPWFIK/RPSRGLYSV
HINPYLIPFFIGVLQNRFTQFRLS
ETKEITNPYAM RLYESLCQYRK
PDGSGIVSLKIDWIIERYQLPQS
YQRMPDFRRRFLQGFCRFRNH
HQTGFSPAGANQRGPLAATLSG
PGGEGQSAVARLTGEKKNHPG
AQYANRLSPRVGRFINAAGTTG
FPTWKAGSERNAINDDVTYAIK
PTCWPGLDIIPSCLALHRIETEL
MGKFDEGKLPTDPHLMLRLAIE
TVAHDYDVIVIDSAPNLGIGTIN
WCAADVLIVPTPAELFDYTSA
LQFFDMLRDLLKNVDLKGNSN
GSQSPWMEEQIRDAWGSMVLK
NWRETDEVGKGQIRMRTVFE
QAIDQRSSTGAWRNALSIWEPE
CNEISIGVSLDQDGGSNSVLRK
Figure imgf002813_0001
Figure imgf002814_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno>vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possiblc nucleotide insertion) sequence
21880 52248 22008 1894 EGEDRGLPRTMGAALGTGTRL
APWPGRACGALPRWTPTAPAQ
GCHSKPGPARPVPLKKRGYDV
TRNPHLNKGMAFTLEERLQLGI
HGLIPPCFLSQDVQLLRIMRYY
ERQQSDLDKYIILMTLQDRNEK
LFYRVLTSDVEKFMPIVYTPTV
GLACQHYGLTFRRPRGLFITIHD
KGHLATMLNSWPEDNIKAVW
TDGERILGLGDLGCYGMGIPVG
KLALYTACGGVNPQQCLPVLL
DVGTNNEELLRDPLVYIGLKHQ
RVHGKAYDDLLDEFMQAVTD
KFGINCVLIVQFEDFANANAFRLL
NKYRNKYCMFNDDIQGTASVA
VAGILAALRITNNKLSNHVFVF
QGAGEAAMGVIAHLLVMALEVK
EGVPKAVEATRKIWVMVDFVKGL
IVKGRSHLNHEKEMFAQDVHPE
SNSLDEWRLVKPTAIIGVAAIA
EAVFTEQI LRNM ASFVRRAPI IF A
LSNPPRKAECTAVEKCYRVTEG
PRGFFASGKSFLRSVDL/ESMGK
TFIPGGRGNNA*RVPRGWQLG
VHSPGGDPGH I PVDEIFLPDSRA
KLPQEVSEQHLVSQGRLYPVPLS
TVIRDVSLRIAI KVLDY A YKHNL
WSYYPEPKDKEAFCKIPGSYTP
DYDSFYT/VDSYIWAQGKAMN
21881 52249 22009 322 567
21882 52250 22010 338 603
21883 52251 22011 127 417
21884 52252 22012 253 424
21885 52253 22013 124 1235 YLCLLPKGQGDRALSRTSQWP
QMSQSQACGGSEQIPGIDIQLN
RKYHTTRKLSTTKRFPTACWR
KKVGAFTKIIEAMGFTGPLKYS
KWEDLRLRALRIVY*LCGRKLD
FEGFFSKVCSVMPDTFNSWFLINP
YFHVWMCLVRMKQEGRSGKY
MCRIIVHFMWEDVQQRGRVM
GVNPYILKKNMILMTNHFYAVA
ILGYDEGILSDDHGLAAALWK
NLLVNVRKCEDPRHLELLVEYVR
KQIQYLDSVMNGEDLLLTGEVS
WRPLVEKNPQSILKPHSPTYND
RGTFDGLGPPHGPPAGFRGTSR
EEKCLVGPRTPARKWPELDLLN
KHLSKYLAPICVEFPLSVPQESD
LLGYRAWENPLALPGCPSSLRG
Figure imgf002816_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /^possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21890 52258 A 22018 1806 MRGNTAGGPGDEKPDSTHPPW
EGTLQEAQEMKKWRFMWTS
GGPDPQHWRGSGLYGQLVAQ
MLHIDVAQVYMDSWWPRCST
LAWLRFIWTAGGPDAPHWLRS
GLGGQLVAWMLHVGVAQVYV
DSWWPSCSASMWLSFMEHKQ
CLTLGYEVALFLQHFLSRAFVL
VWVPRKEKAQVLRQGLSLTFR
ENPGVCVMSSYRPWLITAAAD
MQPSKPGGTGSYCSYWSPTDN
TPRLAAACTWNLGSSHGSAPG
VRTFIEAGFSPPFLAPKALALTL
SDSDSRWGPNPVYAQPQP/PEP
AWGPNPV*GQPQPRNRPGVPEP
CVCSTPTAEPPWGPNPVSTPTA
EPAWGPNPVYA/PTPTAEPPWG
PNPV*GQPQP/REPAWGPNPVY
GQPQPRNRPGSVPPQGNLYCSA
ALGATDPSQSWHQCSQNLLPA
DKGAKREAPW/PRPPSGEVDHQ
VWHLHGWPLPTTAPPLLQGVW
RVEQTPASHHRQPRPRQSSPST
KAPPVITVNQGPASHHRQPRPR
QSSPSTKAPPVITVNQGPASHH
RQPRPRQSSPST/TGPASHHRQP
RPRQSSPSTKAPPVITVNQGPAS
HHRQPRPRQSSPSTKAPPVITVN
QGPASHHRQPRPRQSSPSTKAP
PVITVNQGPASHHRQPRPRQSSP
STKAPPVITVNQGPASHHRQPR
PRQSSTLTKAPPVITVNQSPASH
Figure imgf002818_0001
Figure imgf002819_0001
Figure imgf002820_0001
Figure imgf002821_0001
Figure imgf002822_0001
Figure imgf002823_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21920 52288 22048 357 1579 QRRPRPFPSQGISMTECFLPPTSS
PSEHRRVEHGSGLTRTPSSEEIS
PTKFPGLYRTGEPSPPHDILHEP
S*SNGLMDEKDHGKKKGNFKK
KEKRTEGYAAFQEDSSGDEAE
KLMPRFEEAYGRTTETEKVQEF
QRLLKELPEGNYLLISLFWHM
DHVITKELEAKMNIQNIYIWSP
TVQISNLVLYVFFTHVQELFGN
WLKQVTKTRRWSNVATMPTL
PETQAGFKEEIRIQELLLNCLHQ
DLQGGIKDLSEEERLWEEKEIL
TKQEELLGMEQFLRRQIASEKE
EIECLRAETAEIQSHQQQGLSET
EEYSSESESEDEEELQIILEDLQR
QNEELEIKNNHLNQAIHEESEA
VIEPRMQPWLLQLQPDRAKQQ
AQEDEEPEWRGGAMQTPKNGI
21921 52289 22049 182 2212 VTEKTSSLPSQGISMTECFLPPT
SSPSEHRRVVEHG/SAGLTPGPPA
SEEIQPLPKFPGLYVRTGEPPHPP
P*HPSHEPSLM*CPDDEKDHGK
KKGKFKKKEKRTEGYAAFQED
SSGDEAESPSKMKRVSKGIHVFK
KPSFSKKVREKDFKIKEKPKEEK
HKΕEKHKEEKHKEKKSKDLTVA
ADVVKQWKEKXKXKKPIQEPE
VPQIDVPNLKPIFGIPLADAVER
TMMYDGIRLPAVFRECIDYVEK
YGMKCEGIYRVSGIKSKVDELK
AAYDREESTNLEDYEPNTVASL
LKQYLRDLPENLLTKELMPRFE
EACGRTTETEKVQEFQRLLKEL
PECNYLLISWLIVHMDHVIAKE
LETKMNIQNISIVLSPTVQISNR
VLYVFFTHVQELFGNWLKQV
MKPLRWSNMATMPTLPETQAG
IKEEIRRQEFLLNCLHRDLQGGI
KDLSKEERLWEVQRILTALKRK
LREAKRQECETKIAQEIASLSKE
DVSKEEMNENEEVIN1LLAQEN
EILTEQEELLAMEQFLRRQIASE
KEEIERLRAEIAEIQSRQQHGRS
ETEEYSSESESESEDEEELQIILE
DLQRQNEELEIKNNHLNQAIHE
EREAIIELRVQLRLLQMQRAKA
EQQAQEDEEPEWRGGAVQPPR
DGVLEPKAAKEQPKAGKEPAK
21922 52290 B 22050 34 322
Figure imgf002825_0001
Figure imgf002826_0001
Figure imgf002827_0001
Figure imgf002828_0001
Figure imgf002829_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possiblc nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21954 52322 22082 233 2096 RREDEGNKTSWIHPSPWWLIF
RFFSLFFFKQPLFTMNKLYIGNL
SENAAPSVDLVKSIFKDAKIPVSG
PFLVKTGYAFVDCPVDESWAVLR
AIKAFSGKIELHGKPIEVEHSVP
KRQRIRKLQIRNIPPHLQWEVL
DSLLVQYGVVESCEQVNTDSET
AWNVTYSSKDQARQALDK/L
LDKLNGFQLENFTLKVAYIPDE
MATQQNPFLQPRGRRGLGQRG
SVSRQGSPGSVSKQKPCDSPLRL
LVPTQFVGAIIGKEGATIRNITK
QTQSKIDVHRKENAGAAEKSIT
I PSTPEGTSA ACKSILEIMHKEA
QDIKFTEEIPLKILAFTNNFVGRLI
GKEGRNLKKIEQDTDTKITISPL
QELTLYNPERTITVKGNVETCA
KAEEEIMKKIRESYENDIASMN
LQAHLIPGLNLNALGLFPPTSG
MPPPTSGPPSAMTPPSPQSEQSE
SETVHLFIPALSVRALISKQGQH
IKQLSRFAGASSKIAPVEAPDAK
VRMVMIAGSPEARFKAQGRIY
GKIKEENFVSPKEEVKLEAHIR
VPSFAAGRVIGKGGKTWNELQ
NLSSAEVWPRDQTPDENDQV
WKITGHFYACQVVAQRKI*EFL
TQVRQHQQQKALQSGPPQSRR
Figure imgf002831_0001
Figure imgf002832_0001
Figure imgf002833_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmiπo acid sequence ( X=Unkno n,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possib!e nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21968 52336 22096 20 2482 RVCSSSASTASQAVMADAWEE
IRRLAADFQRAQFAEATQRLSE
RNCIEIVNRL1AQKQLEWHTL
DGKEYITPAQISKEMRDELHVR
GGRVNIVDLQQVINVDLIHIEN
RIGDIIKSEKHVQLVLGQLVIDEN
YLVDRVGQEVNDKLQESGQVTI
SELCKTYDLPGNFLTQALTQRL
AIIIIGHIDLDNRG V I FTEAFVAR
HRARIRGLFSAITRPTAVNSLIS
KYGFQEQLLYSVLDELVNSGRL
RGTWGGRQDKAVFVPDIYSR
TQSTWVDSFFRQNGYLEFDALS
RLGIPDAVSYIKKRYKTTQLLFL
KAACVGQGLVDQVEASVEEAI
SSGTWVDIAPLLPTSLSVEDAAI
LLQQVMRAFSKQASTWFSDT
VWSEKFVINDCTELFRELMHQ
KAEKEMKNNPVHLITEEDLKQI
STLESVSTSKKDKKDERRRKAT
EGSGSMRGGGGGNAREYKIKK
VKKKG RKDDDSDDESQSSHTG
KKKPEISFMFQDEIEDFLRKHIQ
DAPEEFISELAEYLIKPLNKTYL
EWRSVFMSSTTSASGTGRKRT
IKDLQEVEVSNLYNNIRLFGKRG
WKFFVADDTQGWLLPKHLLEV
QCVLISLNLIVFNFVLGFGILMDG
Q*DDPGRPFTSVELRKEIVLSKLS
EETKVALTKLHNSLNEKSIEDFI
FCVLDSAAEACDIMVKRGDKKR
ERQILFQHRQALAEQLKVTEDP
21969 52337 22097 711 VLSLFDGIATGYLVLKELGIKV
GKYVASEVCEESIAVGTVKHEG
NIKYVNDVRNITKKNIEEWGPF
DLVIGGSPCNDLSNVNPARKGL
YEGTGRLFFEFYHLLNYSRPKE
GDDRPFFWMFENWAMKVGD
KRDISRFLECNPVMIDAIKVSAA
HRARYFWGNLPGMNRPVIASK
NDKLELQDCLEYNRIAVRIFGFP
VHYTDVSNMGRGARQKLLGRS
WSVPVIRHLFAPLKDYFAC SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
21970 52338 22098 406 2885 GRGILVNGACSDQSSDSPPILEA
IRTPEIRGGWAVGTGWSGADI
VSGHCRQWSLLSLFQGSGTLSL
AFITWAFRETRTRSESPAVSSHT
SSQSTCGQHST*HT*HARGRQG
RNHVDESPVEFPATRVGSPDAQ
TPARSL*LGDMPHSLTIDLTDDT
EDTHGTPQSSSTPYARLAQDSQ
QGGMESPQVEADSGDGDSSEY
QVDGREGLLPRV/I*GP*TRGKTS
YAMWSWKATSKRQAMSGMR
WVQWFGDGKFSEVSPGKGKGF
CRPEAVPACLPLNKLVSYRKA
MYHALEVTWDEGMRAKP*EQ
G*SKRPTELEDQLKPMLEWAH
GGFKPTGIEGLKPNNTQPGGNE
SPWQHPLPLLVGPLLGRVRLKT
NCYNNGKDRGDEDQSRGDCW
VPGIMGQMGGGRCRSGIDVIPL
TPWLGL*ACLVCQTCRVSPPTT
ALDLPPLASSΝSPLLPCEGRELL
LCSΝTSCCR*ALGPVGWMWVG
PQGSYVPAV*EPWSCYMCLPQ
RCHGVLRRRKDWNVRLQAFFT
SDTGLEYVSHRLPPLPPQIPGAQ
GVGKLWFDGIATGEFGEHLETL
LSCHNRVARESEESIAVGTVKH
EGNIKYVNDVRNITKKNVRAV
CTLRASSLPVFFPSPPHPNPARK
GLYGEHPSLWQSLESLCHLTTG
PGAGNK/RGLGWRFQMNPALS
SFLSTVGVQRDISRFLEVRESGD
21971 52339 22099 729 896
Figure imgf002836_0001
Figure imgf002837_0001
Figure imgf002838_0001
Figure imgf002839_0001
Figure imgf002840_0001
Figure imgf002841_0001
Figure imgf002842_0001
Figure imgf002843_0001
Figure imgf002844_0001
Figure imgf002845_0001
Figure imgf002846_0001
Figure imgf002847_0001
Figure imgf002848_0001
Figure imgf002849_0001
Figure imgf002850_0001
Figure imgf002851_0001
Figure imgf002852_0001
Figure imgf002853_0001
Figure imgf002854_0001
Figure imgf002855_0001
Figure imgf002856_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Uπkno>vn, NO: of peptide hod n USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22078 52446 22206 1894 QSLADLLQSQPSITLTSQIWPSP
A*GFWSWGNLSVTTPSKVPATP
TRTIAATPIQTLPQSQTAPKRID
TPSLEEPSDLEELEQFAKTFKQR
RIILGFTVQGDAGLAMVKLYGN
DFSPTTIFRFEALNLSFKNMCKL
KPLLEKWLNDAENLSSDSSLSS
PSALNSPGIEGLSRRRKKRTSIE
TNIRVALEKSFLENQKPTSEEIT
MIADQLNMEKEVIRVWFCNRR
QKEKRINPPSSGGTSSSPIKAIFP
SPTSLVATTPSLVTSSAATTLTV
SPVLPLTSAAVTNLSVTASTSEA
SSASETSTTQTTSTPLSSPLGTSQ
VMVTASGLQTAAAAALQGAA
QLPANASLAAMAAAAGLNPSL
MAPSQFAAGGALLSLNPGTLSG
ALSPALMSNSTLATIQALASGG
SLPITSLDATGNLVFANAGGAP
NIVTAPLFLNPQNLSLLTSNPPG
GFTIEISNNNSTMVMTGMRIQIG
TQAIERAPSYIEIFGRTMQLNLS
RSRWFDFPFTREEALQADKKLN
LFTVRKFQNIRHTLFKCENKNT
FQQCLLLEQDSSCGTGFCSMTT
YPVMLLSSEQPVQSLSVRQCVI
LLDAAAVISGGMGAVCLLLTH
LCPLCYAGLLSEITDPESFVKDL
LYSVP
Figure imgf002858_0001
Figure imgf002859_0001
Figure imgf002860_0001
Figure imgf002861_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22092 52460 22220 1908 MKY IQDFQREKQEFERNLARFR
EDHPDLIQNAKKSDIPEKPKTPQ
QLWYTHEKKVYLKVRPDATTK
EVKDSLGKQWSQLSDKKRLK
WIHKALEQRKEYEEIMRDYIQK
HPELNISEEGITKSTLTKAERQL
KDKFDGRPTKPPPNSYSLYCAE
LMANMKDVPSTERMVLCSQQ
WKLLSQKEKDAYHKKCDQKK
KDYEVELLRFLESLPEEEQQRV
LGEEKMLNINKKQATSPASKKP
AQEGGKGGSEKPKRPVSAMFIF
SEEKRRQLQEERPELSESELTRL
LARMWNDLSEKKKAKYKARE
AALKAQSERKPGGEREERGKLP
ESPKRAEEIWQQSVIGDYLARF
KNDRVKALKAM EMTWNNME
KKEKLMWIKKAAEDQKRYERE
LSEMRAPPAATNSSKKMKFQG
EPKKPPMNGYQKFSQELLSNGE
LNHLPLKERMVEIGSRWQRISQ
SQKEHYKKLAEEQQKQYKVHL
DLWVKSLSPQDRAAYKEYISN
KRKSMTKLRGPNPKSSRTTLQS
KSESEEDDEEDEDDEDEDEEEE
DDENGDSSEDGGDSSESSSEDE
SEDGD/EGGADRAQWEGRGVL
PGGPPGDSTSFLTPQNEEDDED
EDDDEDDDEDEDNESEGSSSSS
SSSGDSSDSDSN
22093 52461 22221 129 2365
22094 52462 22222 2730 PSLRFAISGFVWRLAAPASAAA
ELRLDCSARRAPRCPRPALPSTR
SAEGAVLCAFRREEQQAAATA
AAAAAAATAAAAAAPAPPPRP
EEEPLPPREGAAAVPGRAGEGA
ATQSQGGSRWSGKPGGRAAPA
RPRRWLDSWRMNGEADCPTDL
EMAAPKGQDRWSQEDMLTLLE
CMKNNLPSNDSSKFKTTESHM
DWEKVAFKDFSGDMCKLKWV
EISNEVRKFRTLTELILDAQV*HF
KNPYKGVKKLKKHPDFPK
22095 52463 22223 446 RPSPPTSSM/PGLWGYSRPCSSG
VPASALQA*SPAFSPASQRPPTT
SWWEVP*EEGRAVPVWEGRIRA
VGPAPAS*PSDLPAWSCRSWR£
TRGVSPAEDAMLLRAVEPLEGC
QRPLLDSEPP/SPARTSAPAASA
SPPAPGGPPCLARPLPR
Figure imgf002863_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkπown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possiblc nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22105 52473 22233 2027 EESDCLTEYEEDAGPDCSRDEG
GSPEGASPSTASEMDVSLHFVL
WGCLHVYQRMIDKAEDVCLFV
AQPGELVGQLAVLTGEPLIFTL
RAQRDCTFLRISKSDFYEYDSP
KLECGWHSEDPNPWTSPSTGL
NPLFLELCRHMVFQRLGQGDY
VFRPGQPDASIYWQDGLLELC
LPGPDGKECVVKEWPGDSVN
SLLSILDVITGHQHPQRTVSARA
ARDSTVLRLPVEAFSAVFTKSA
RTITGCVWRYVRASASYCPYLP
PLCDPKDGHLLVDGCYVNNVP
GSLWRYVRASMTVRRATCPRCA
TPRTGTYSWMAATSTICQASGR
PHHPHTQAPPAPHTRTRRTSPA
AWVPKRSSPLTWGARMRRTSA
PTGTACPAGGCCGSG*IPGLTR*
RFQTWLKSSPAWPTCPVCGS*R
LSSPAPTASTCARPSTASRPWTL
GSSTRSILRTTRRMSTAVTTTAT
PAPIITPST*GTNGTGCAPQIRPA
EPSPGARTSWNPSLSATFAPEEF
VASP*LDVPICRMGQCCSLPGG
ATLP*GPCGPQCMPGNWPPER
TAARYRQTTAYQTPACPTPPRG
QRPT/CVPTVSSLQAGLQGGPRS
DFDMAYERGRISVSLQEEASGG
SLAAPAREPREQPAGACEYSYC
EDESATGGCPFGPYQGRQTSSIF
EAAKQELAKLMRIE
Figure imgf002865_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
221 i : 52481 22241 2232 MQDTEKDDNNNDEYDNYDEL
VAKSLLNLGKIAEDAAYRARTE
SEMNSNTSNSLEDDSDKNENLG
RKSELSLDLDSDWRETVDSLK
LLAQGHGWLSENMNDRNYA
DSMSQQDSRNMNYVMLGKPM
NNGLMEKMVEESDEEVCLSSL
ECLRNQCFDLARKLSETNPQER
NPQQNMNIRQHVRPEEDFPGRT
PDRΝYSDMLΝLMRLEEQLSPRS
RVFASCAKEDGCHERDDDTTS
VΝSDRSEEVFDMTKGΝLTLLE
KAIALETERAKAMREKMAMEA
GRRDΝMRSYEDQSPRQLPGED
RKPKSSDSHVKKPYYGKDPSRT
EKKESKCPTPGCDGTGHVTGL
YPHHRSLSGCPHKDRVPPEILA
MHESVLKCPTPGCTGRGHVΝS
NRNSHRRDIQVFNLQKQEMIDR
LESLSGCPIAAAEKLAKAQEKH
QSCDVSKSSQASDRVLRPMCFV
KQLEIPQYGYRNNVPTTTPRSN
LAKELEKYSKTSFEYNSYDNHT
YGKRAIAPKVQTRDISPKGYDD
ANGYCKDPSPSSSSTSSYAPSSS
SNLSCGGGSSASSTCSKSSFDYT
HDMEAAHMAATAILNLSTRCR
EMPQNLSTKPQDLCATRNPDM
EVDENGTLDLSMNKQRPRDSC
CPILTPLEPMSPQQQAVMNNRC
FQLGEGDCWDLPVDYTKMKPR
VRIDEDESKDITPEDLDPFQEALE
22114 52482 22242 727 4372 LWACVLPGVTLPPFLGEKTSCK
MEVDTEEKRHRTRSKGVRVPV
EPAIQELFSCPTPGCDGSGHVSG
KYARHRSVYGCPLAKKRKTQD
KQPQEPAPKRKPFAVKADSSSV
DECDDSDGTEDMDEKEEDEGE
EYSEDNDEPGDEDEEDEEGDRE
EEEEIEEEDEDDDEDGEDVEDE
EEEEEEEEEEEEEEENEDHQMN
CHNTRIMQDTDVLDDNNNDEY
DNYDELVAKSLLNLGKIAEDA
AYRARTESEMNSNTSNS
Figure imgf002867_0001
Figure imgf002868_0001
Figure imgf002869_0001
Figure imgf002870_0001
Figure imgf002871_0001
Figure imgf002872_0001
Figure imgf002873_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22150 52518 22278 1871 MEDNLISINKIKILLAVSDGEIDE
TFSLKQLMFNSVPVQNEDGSFN
FEGVKAEFRPGTQTQEYIKGME
DSSSEVTVNREVTTDNPYTISVT
NKTLSAIRIKMFMPRVAYELKVT
VIKMAFELSTRCSKRLM/DGSFE
TVLTDVIEGKTMSGYDRSRRV
NLPNFNNQVIFRWRKTPDSND
SNWDAIQVRSYAEVIDAKFRY
PLTGLLFVEFDSKMFPNQLPTISV
IRKRWKIVNVPSNYDPESRTYN
GNWDGTFKKAWTNNPAWVLY
DLMINQRYGLDQKELGISVDK
WALYEAAQYCDQMWLTAKA
ARNLATFVT/CIIQSQTDAYKW
VRDICSIFRGMSFWNGESISVIID
RPREPAYIFTNDNWNGDFSYTV
SQ ARRACTRRAT* CLMMSKTCI
NRTLSQYSI/V/EGHSTVRVNNVT
SITAIGVAHVEARPTDCGRWILK
TNLRSTTVNFATGLEGMIPTIGD
WAISDNFWSSNLTMNLSGRLQ
EVSGSQIFLPFRVDARAGDFNIV
NKPDAKPVKRTISSVSADGKTI
EINIGFGFPVKPNTVFAIDRTDL
ALQQYWTKIDKGDDDEEFTVS
KSRRWSTILISTMKLITGLTSTT
DRQASLSQIR/CPKPENVKVCPQS
RESSRG
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22151 52519 22279 5307 5305 CRSSQNTQETSPDRALLVNLRP
SQGPLLALWVPAEGVGGKGHC
LPPRSFFRMDNQLVPAVSDGEI
DETFSLKQLMFNSVPVQNEDGS
FN FEG VKAEFRPGTQTQEYIKG
MEDSSSEVTVNREVTTDNPYTI
SVTNKTLSAIRIKMFMPRVAYEL
KVTVIKMAFELSTRCSKRLM/D
GSFETVLTDVIEGKTMSGYDRS
RRVNLPNFNNQVIFRWRKTPD
SNDSNWDAIQVRSYAEVIDAK
FRYPLTGLLFVEFDSKMFPNQL
PTI SI RKRWKI VN VPSNYDPESR
TYNGNWDGTFKKAWTNNPAW
VLYDLMINQRYGLDQKELGISV
DKWALYEAAQYCDQMWLTA
KAARNLATFVT/CIIQSQTDAYK
WRDICSIFRGMSFWNGESISVII
DRPREPAYIFTNDNWNGDFSY
TVSQARRACTRRAT*CLMMSKT
CINRTLSQYSI/V/EGHSTVRVNN
VTSITAIGVAHVEA/KANRRGRW
ILKTNLRSTTVNFATGLEGMIPT
IGDWAISDNFWSSNLTMNLSG
RLLEVSGSQIFLPFRVDARAGD
FIIVNKPDGKPVKRTISSVSADG
KTIEINIGFGFPVKPNTVFAIDRT
DLALQQYWTKIDKGDDDEEF
TVSKSRRWSTILISTMKLITGLTS
TTDRQASLSQIR/CPKPENVKVCP
QSRESSRG
22152 52520 22280 412 PAAVA*VKTEPETPGPSCLSQE
GQTAVVKTEESSELGNYVIKIDH
LETIQQLLTAWKKIPLITAKVW
QRAMTMRKVLQEILEKNPRFH
HLTPLKTKHIAHWCRCHGYTPP
DPESLRNDGDSIEDVLTQIDSEP
22153 52521 22281 513 IFACWGKQAWPACCNSLRGQA
VKGNQLLPVSLVKRKTTLAPNT
QTASPRALADSLMQLARQVSR
LE/QRAKREPENEEEVDILSLSE
PVKINIKKEQEEKQEEVKFYLPP
TPGSEFIGDVTQKRLEEVDSTFG
DDLVQGFSGGISRRYGEKPGVQ
DLEVEPDSVTEMLQFYD
Figure imgf002876_0001
Figure imgf002877_0001
Figure imgf002878_0001
Figure imgf002879_0001
Figure imgf002880_0001
Figure imgf002881_0001
Figure imgf002882_0001
Figure imgf002883_0001
Figure imgf002884_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22205 52573 22333 8875 EDGSLLQEVGIICLARGPGCGPE
TPLYSPATGLALQLQGPAAMG
ALWSWWILWAGATLLWVGKSG
EDWISGPTTVFSQPGLTQEASV
DLKNTGREEFLTAFLQNYQLA
YSKAYPRLLISSLSESPASVSILS
QADNTSKKVTVRPGESVMVNI
SAKAEMIGSKIFQHAWIHSDY
AISVQALNAKPDTAELTLLRPIQ
ALGTEYFVLTPPGTSARNVKEF
AWAGAAGASVSVTLKGSVTF
NGKFYPAGDVLRVT
22206 52574 22334 2214 ISATDGPLAPCHGLVPPAQYFQ
GCLLDACQVQGHPGGLCPAVA
TYVAACQAAGAQLGEWRRPDF
CPLQCPAHSHYELCGDSCPVSC
PSLSAPEGCESACREGCVCDAG
FVLSGDTCVPVGQCGCLHDGR
YYPLGEVFYPGPECERRCECGP
GGHVTCQEGAACGPHEECRLE
DGVQACHATGCGRCLANGGIH
YITLDGRVYDLHGSCSYVLAQ
VCHPKPGDEDFSIVLEKNAAGD
LQRLLVTVAGQWSLAQGQQV
TVDGEAVALPVAVGRVRVTAE
GRNMVLQTTKGLRLLFDGDAH
LLMSIPSPFRGRLCGLCGNFNG
NWSDDFVLPNGSAASSVETFG
AAWRAPGSSKGCGEGCGPQGC
PVCLAEETAPYESNEACGQLRN
PQGPFATCQAVLSPSEYFRQCV
YDLCAQKGDKAFLCRSLAAYT
AACQAAGVAVKPWRTDSFCPL
HCPAHSHYSICTRTCQGSCAAL
SGLTGCTTRCFEGCECDDRFLL
SQGVCIPVQDCGCTHNGRYLPV
NSSLLTSDCSERCSCSSSSGLTC
QVAAGCPVPGVRVCEVKAEARVN
CWVATRGLVCVLSVGVANLVTTF
DGARGATTSPGVYELSSRCPGL
QNTI P WYRV VAEVQICHGKTE
AVGQVHIFFQDGMVTLTPNKG
VWVNGLRVDLPAEKLASVSVS
RTPDGSLLVRQKAGVQVWLGA
Figure imgf002886_0001
Figure imgf002887_0001
Figure imgf002888_0001
Figure imgf002889_0001
Figure imgf002890_0001
Figure imgf002891_0001
Figure imgf002892_0001
Figure imgf002893_0001
Figure imgf002894_0001
Figure imgf002895_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22269 52637 22397 2382 QTEMTIEGLQPTVEYVVSVYAQ
NPSGESQPLVQTAVTNIDRPKG
LAFTDVDVDSIKIAWESPQGQV
SRYRVTYSSPEDGIHELFPAPDG
EEDTAELQGLRPGSEYTVSWA
LHDDMESQPLIGTQSTAIPAPTD
LKFTQVTPTSLSAQWTPPNVQL
TGYRVRVTPKEKTGPMKEFNLA
PDSSSVWSGLMVATKYEVSV
YALKDTLTSRPAQGWTTLEN
VSPPRRARVTDATETTITISWRT
KTETITGFQVDAVPANGQTPIQ
RTIKPDVRSYTITGLQPGTDYKI
YLYTLNDNARSSPWIDASTAI
DAPSNLRFLATTPNSLLVSWQP
PRARITGYIIKYEKPGSPPREW
PRPRPGVTEATITGLEPGTEYTI
YVIALKNNQKSEPLIGRKKTDE
LPQLVTLPHPNLHGPEILDVPST
VQKTPFVTHPGYDTGNGIQLPG
TSGQQPSVGQQMIFEEHGFRRT
TPPTTATPIRHRPRPYPPNVGEEI
QIGHIPREDVDYHLYPHGPGLN
PNASTGQEALSQTTISWAPFQD
TSEYIISCHPVGTDEEPLQFRVP
GTSVTSATLTGLTRGATYNIIVE
ALKDQQRHKVREEWTVGNSV
NEGLNQPTDDSCFDPYTVSHYA
VGDEWERMSESGFKLLCQCLG
FGSGHFRCDSSRWCHDNGVNY
KIGEKWDRQGENGQMMSCTCL
GNGKGEFKCDPHEATCYDDGK
22270 52638 22398 48 7500 GATMACCAAQGEERVTPCASG
KRGPAATTSLVLWIPSVPPPVPF
PTLWPPPSWRRQPPGGIRRDFS
RRLRREANLVATCLPVRASLPH
RLNMLKGPGPGLLLLAVLCLG
TAVPSTGASKSKRQAQQMVQP
QSPVAVSQSKPGCYDNGKHYQ
INQQWERTYLGNVLVCTCYGG
SRGFNCESKPEAEETCFDKYTG
NTYRVGDTYERPKDSMIWDCT
CIGAGRGRISCTIANRCHEGGQS
YKIGDTWRRPH ETGG YM
22271 52639 22399 403 NSTNKRRGFCFITFKEVEMEKKY
HΝVGLSKCEIKVAMSKEQYQQ
QQQWGSRGGFAGRARGRGGG
PSQΝWΝQGYSΝYWΝQGYGΝY
GYΝSQGYGGYGGYDYTGYNΝ
YYGYGDYSΝQQSGYGKVSRRG
GHQΝSYKPY
Figure imgf002897_0001
Figure imgf002898_0001
Figure imgf002899_0001
Figure imgf002900_0001
Figure imgf002901_0001
Figure imgf002902_0001
Figure imgf002903_0001
Figure imgf002904_0001
Figure imgf002905_0001
Figure imgf002906_0001
Figure imgf002907_0001
Figure imgf002908_0001
Figure imgf002909_0001
Figure imgf002910_0001
Figure imgf002911_0001
Figure imgf002912_0001
Figure imgf002913_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Uπknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22372 52740 22500 90 2166 SQHFGRLRRVDYLRTSKMQTSS
SRSVHLSEWQKNYFAITSGICT
GPKADAYRAQILRIQYAWANS
EISQVCATKLFKKYAEKYSAIID
SDNVESGLNNYAENILTLAGSQ
QTDSDKWQSGLSINNVFKMSS
VQKMMQAGKKFKDSLLEPALA
SWIHKEATVFDLPKFSVCGSS
QESDSLPNSAHDRDRTQDFPES
NRLKLLQNAQPPMVTOTARTC
PTFSAPVGESATAKFHVTPLFG
NVKKENHSSAKENIGLNVFLSN
QSCFPAACENPQRKSFYGSGTI
DALSNPILNKACSKTEDNGPKE
DSSLPTFKTA EQLWVDQQKK
YHQPQRASGSSYGGVKKSLGA
SRSRGILGKFVPPIPKQDGGEQN
GGMQCKPYGAGPTEPAHPVDE
RLKNLEPKMIELIMNEIMDHGP
PVNWEDIAGVEFAKATIKEIW
WPMLRPDIFTGLRGPPKGILLFG
PPGTGKTLIGKCIASQSGATFFSI
SASSLTSKWVGEGEKMVRALF
AVARCQQPAVIFIDEIDSLLSQR
GDGEHESSRRIKTEFLVQLDGA
TTSSEDRILVVGATNRPQEIDEA
ARRRLVKRLYIPLPEASARKQI
VINLMSKEQCCLSEEEIEQIVQQ
SDAFSGADMTQLCREASLGPIS
AV/SQTADIATITPDQVRPIAYID
FENAFITVRPIVSPKDL*LYENW
22373 52741 22501 523 FFFFFPAFKMSKRGVRGWVPSG
CRKFPDFPLGLPVVGTL*SNWCL
TNTVGAQKTLYIVISREGGIKGTG
LTRLPRLPGVGVDMVMAHSSRK
ANPELQKKGYIPAVWIRQRKVI
TVGKRWRCFFIFEDVNAGVIGDV
NKGVEMKGFAITGVPRAKECAD
LWPRDCIQMLGSIAWFSQYIL
Figure imgf002915_0001
Figure imgf002916_0001
Figure imgf002917_0001
Figure imgf002918_0001
Figure imgf002919_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22397 52765 22525 474 2527 SASPLGPYLLPSGAPERGSPGKE
HPEERWTAPPSSSQSAEVLGE
LVLDGTAPSAHHDIPALSPLLPE
EARPKHALPPKKKLPSLKQVNS
ARKQLRPKATSAATVQRAGSQ
PASQGLDLLSSSTEKPGPPGDPD
PIVASEEASEVPLWLDRKESAV
PTTPAPLQISPFTSQPYVAHTLP
QRPEPGEPGPDMAQEAPQEDTS
PMALMDKGENELTGSASEESQ
ETTTSTIITTTVITTEQAPGMQPP
TPMGEGPPIVKHTWILDKCNVA
TLQST*ALGPNEGYIDSSDYPLL
PLNNFLECTYNVTVYTGYGVE
LQVTPGEYPDGELLSIRGVDGP
TLTVLANQTLL V EGQVIRSPTN
TISVYFRTFQDDGLGTFQLHYQ
GRVRPRLFPRRPDSGDVTVMDL
HSGGVAHFHCHLGYELQGAK
MLTCINASKPHWSSQEPICSGM
LQPQPHNATIGRVLSPSYPENT
NGSQFCIWTIEAPEGQKLHLHF
ERLLLHDKDR*SSEGPQTNKSA
LLYDSLQTESVPFEGLLSEGNTI
RIEFTSDQARAASTFNIRFEGEG
PWELYEPYIQNGNFTTSDPTYNI
GTIVEFTCDPGHSLEQGPAIIECI
NVRDPYWNDTEPLCRGERIHN
ASAVAGVVLSPNWPEPYVEGE
DCIWKIHVGEEKRIFLDIQ/FVS
V*NGSVFSQEKVRPTPEWW
22398 52766 22526 131 3125 LVSFLLFQDALPEGDASPLGPY
LLPSGAPERGSPGKEHPEERW
TAPPSSSQSAEVLGELVLDGTA
PSAHHDIPALSPLLPEEARPKHA
LPPKKKLPSLKQVNSARKQLRP
KATSAATVQRAGSQPASQGLD
LLSSSTEKPGPPGDPDPIVASEE
ASEVPLWLDRKESAVPTTPAPL
QISPFTSQPYVAHTLPQRPEPGE
PGPDMAQEAPQEDTSPMALMD
KGENELTGSASEESQETTTSTIIT
TTVITTEQAP
22399 52767 22527 161 352
22400 52768 22528 368 LGSTHASVRINILSVDVLYFIFY
RRDDNKDIDSEKEAAMEAEIKA
ARERAIVPLEARMKQFKDMLL
ERGVRKQSLGEIFLFCISNINMS
C*KVKSKVSAFSTWEKELHKIV
FDPRYLLLNLI
Figure imgf002921_0001
Figure imgf002922_0001
Figure imgf002923_0001
Figure imgf002924_0001
Figure imgf002925_0001
Figure imgf002926_0001
Figure imgf002927_0001
Figure imgf002928_0001
Figure imgf002929_0001
Figure imgf002930_0001
Figure imgf002931_0001
Figure imgf002932_0001
Figure imgf002933_0001
Figure imgf002934_0001
Figure imgf002935_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22502 52870 22630 276 1905 LFLYSSHGQYGCPSCSATDGW
RNGKSTAGHQRYLCSHCRKTW
QLQFTYTASQPAGQ*RTAIATVI
CCNNCQRRHTNSRCHTDNCRR
ADSASSHATSGSSSRGCAGSGR
CGASSTRRCLGPVCCFRGSKRS
WRRQHEGHKDCRKLVFFTNFSFI
HRMDNTAVPPVQLLTGWCVTA
KAPPDISAISALTAVKHGNCSSL
TPLLNPPG SDVI VC AEMDEQ W
GYVGAKSRQRWLFYAYDSLRK
TWAHVFGERTMATLGRLMSL
LSPFDWIWMTDGWPLYESRL
KGKLHVISKRYTQRIERHNLNL
RQHLARLGRKSLSFSKSVELHD
KQQGLDAALIDAPLAVDAQTP
LSELLSHVGQAPCAVPWDED
QQYVGIISKGMLLRALDREGGV
RVPVIYPKRFRTTDVCMPLSLG
TCIREIEAEFPDPIFRRSWYICE
FINTNGAMGIIVNKPLENLKIEGI
LEKLKITPEPRDESIRLDKPVML
GGPLAEDRGFILHTPASNFASSI
RISDNTVMTTSRDVLETLGTDK
QPSDVLVALGYASWEKGQLEQ
EILDNAWLTAPADLNILFKTPIA
DRWREAAKLIGVDILTMPAQD
GTPDWNIIERLLKEWQPDEIIVG
LPLNMDGTEQPLTARARKFAN
RIHGRFGVEVKLHDERLSTVEA
RSGLFEQGGYRALNKGKVDSA
SAVIILESYFEQGY
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possib!e nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22503 52871 22631 399 2337 PEVQGLQQAGAAARRPGVGLA
APRWCEGRLRGSGLQSRLCPG
ASV*ATDGVVRNGKSTAGHQR
YLCSHCRKTWQLQFTYTASQP
AGQ*REKEEKEEVEEEEEEEKK
KRRREGEKEEKEEKEKEKKKK
F KKKKXKKKKKKKKd<-KKKK
KKKKKKKQLSLVTVMSQQCQL
WRNEARTTGACLPSPTVSAAAP
PIVSTTVTAFAAATLAKGDLRS
RASSKQVPLLWLVWVWPHPG
WFVKAGSVALGSSPGCVLELL
CKLLTGWCVTAKAPPDISAISA
LTAVKHGNCSSLTPLLNPPGSD
VIVCAEMDEQWGYVGAKSRQ
RWLFYAYDSLRKTWAHVFGE
RTM ATLG RLMSLLSPFDWI W
MTDGWPLYESRLKGKLHTQER
KARSKQQKDADTAPPIQVGTQ
TPQTVHAARDTRQASPRASKIS
QHRQADGDKRQTRH A E AQAT
RTHTGRRKTIKETATQQAYADE
QTRRTARTHRRTARHTAAHTH
RTRVAARQTTEKDNEQSDTQPP
TQKTPSATATAHTPRRTETTTNI
VQARHTSARERNHTRQHQATR
RETTTNAHTNTPHYHAEHTQSR
SQQQTTSTERPTTDSSTTINNNN
HTPRTVRDERRMTQHKHPSRA
STTTNRTTTTGAYQHTQRDTRD
TRRDTHEDTTTQQPHTTTPTAN
EAAYTAHTTKTDDHRHTADDT
22504 52872 22632 103 313 ATAGCPSCSATEGWRNGKST AGHQRYLCSPCRKTWQLQFTY TASQP/GARIMGVGLNTVLRHL KNSGRSR
22505 52873 A 22633 620 832
Figure imgf002938_0001
Figure imgf002939_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possiblc nucleotide insertion) sequence
22512 52880 22640 747 MIIIDPRYTDTGAGREDEWIPIR
PGTDAALVNGLAYVMITENLV
DQAFLDKYCVGYDEKTLPASA
PKNGHYKAYILATD/GWRTAK
STAGHQRYLCSHCRKTWQLQF
TYTASQPAGQ* R YTDTGAGRE
DEWIPIRPGTDAALVNGLAYV
MITENLVDQAFLDKYCVGYDE
KTLPASAPKNGHYKAYILATDG
WCVPPKAPPDISAISALTAVKH
GNCSSLTPLLNPPGSDVIVCAE
MDEQWGYVGAKSRQRWLFYA
YDSLRKTWAHVFGERTMATL
GRLMSLLSPFDWIWMTDGWP
LYESRLKGKLHVISKRYTQRIE
RHNLNLRQHLARLGRKSLSFSK
SVELHDKVIGHYLNIKHYQ
22513 52881 22641 1722 2463 NPRLAGSEGSCPSCSATDGWR
NGKSTAGHQRYLCSHCRKTWQ
LQFTYTASQPAGQ*RKLTPGCL
FVALKGERFDAHDFADQAKAG
GAGALLVSRPLDIDLPQLIVKD
TRLAFGELAAWVRQQVPARW
ALTGSSGKTSVKEMTAAILSQC
GNTLYTAGNLNNDIGVPMTLL
RLTPEYDYAVIELGANHQGEIA
WTVTHPPRFVTKQWLCFLPDG
LLLFAEARFVIWQMQPAVMAV
DVAAFAHLFNQQQYQRFTGRP
PDSSKLPWKITCNGECVTFHW
WLLGVAVRISRLLPVRVMTAL
EGNYNGTEGISALPFNGIILAHS
NESEWVTFRNNKNNEAFLDRV
YIVKVPYCLRISEEIKIYEKLLN
HIFECWQHFLNNPESLQCYIWA
ANADFREARQVIFKTFFTVGVE
EELGIRQTRTODFLVTGDNLLRI
FRFDVGNEDKVRQQFAWRIH
REVFLVTFHGVNQRFSRHREEF
LFEFCVSVTANLTRCFFGIGFNL
RLTFFVIGDDFPTLTQNFRILIGV
IDGEFRLAHKAVAANHAIGLNA
QNGCRNDFVAHAAGSRALAAD
GRSLEHAPTVETWAGPLCESG
DVFTQQEGGNVETRALPEVKA
AVPPVQLLTGWCVTAKALPDIS
AISALTAVKHGNCSSLTPLLNPP
GSDVIVCAEMDEQWGYVGAKS
RQRWLFYAYDRLRKTWAHRL
Figure imgf002941_0001
Figure imgf002942_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22517 52885 22645 1479 1793 TRY RSCPSCS ATDG V V RNGKST
AGHQRYLCSHCRKTWQLQFTY
TASQPAGQ*RRNACQYRTPAV
VASRQKRWHGDLGMIGSRDV
MLFISDSVGAKELDLIIPRLEDK
SIALLAMTGKPTSPLGLAAKAV
LDISVEREACPMHLAPTSSTVN
TLMMGDALAMAVMQARGFNE
EDFARSHPAGALGARLLNKVH
HLMRRDDAIPQVALTASVMDA
MLELSRTGLGLVAVCDAQQQV
QGVFTDGDLRRWLVGGGALTT
PVNEAMTVGGTTLQSQSRAIDA
KEILMKRKITAAPWDENGKLT
GAIKLQDFYQGRGLFNPSIPRVS
TADAGLANRARRNWVGSSWK
QVGQIILQGGPQNAEKSGQRYT
VSHDRHRDAERQRVKSGLPRY
AECSTYPSCPVALDLAFASHGI
HMHAPGEALRMLVTAMDKAA
DARTKLARLLATKGITHEIQIPD
ISTKEKAQQPIGLNMEQIKPEKQ
DFIKPVIPQAGKHNGVNMSVLR
SLLTAGVLASGLLWSLNGITAT
PAAQASDDRYEVTQQRNPDAA
CLDCHKPDTEAVPPVQLLTGW
CVTAKAPPDISAISALTAVKHG
NCSSLTPLLNPPGSDVIVCAELD
EQWGYVGAKSRQRWLFYAYA
QSPRTTRCCARIRLHALWRRW
Figure imgf002944_0001
Figure imgf002945_0001
Figure imgf002946_0001
Figure imgf002947_0001
Figure imgf002948_0001
Figure imgf002949_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vπ, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possib!e nucleotide insertion) sequence
22551 52919 A 22679 2210 MEQKGEFKNDTLI YEFFH I VAK
PHPLPIKSSTYTTFQSARVTRFF
LDAEKECRCQEGQVEAALIAM
GGTCQGTQRLSTAGESESATH
MREPAVQCRRDSQCVSVLAGR
RRFFQGAASPLHPASVALTQLH
VRWASTATIGQVSGFPGKIRVS
PTTNLGDSDYIFTASHWGLGHY
GPAVHVHSQGGIDERGSNDSG
RLYVTNPCGNVPSWTPNIDRLA
EDGVKLTQHISAASLCTPSRAA
FLTGRYPVRSGMVSSIGYRVLQ
WTGASGGLPTNETTFAKILKEK
GYATGLIGKWHLGLNCESASD
HCHHPLHHGFDHFYGMPFSLM
GDCARWELSEKRVNLEQKLNF
LFQVLALVALTLVAGKLTHLIP
VSWMPVIWSALSAVLLLASSYF
VGALIVHADCFLMRNHTITEQP
MCFQRTTPLILQEVASFLKRNK
HGPFLLFVSFLHVHIPLITMENF
LGKSLHGLYGDNVEEMDWMV
GRILDTLDVEGLSNSTLIYFTSD
HGGSLENQLGNTQ YGG WNGI Y
KGGKGMGGWEGGIRVPGIFRW
PGVLPAGRVIGEPTSLMDVFPT
WRLAGGE VPQDRVI DGQDLL
PLLLGTAQHSDHEFLMHVYCER
VLHAARWHQRDRGTMWKVHF
VTPVFQPEGAGACYGRKVCPCF
GEKWHHDPPLLFDLSRDPSET
HILTPASEPVFYQVMERVQQAV
Figure imgf002951_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22557 52925 22685 53 1966 VGQPRGLRTLEKLTASVAGGA
ANFFGDIERDLGPRDTSRGIRG
GNLGDRGCSSAEMAYASWQR
WSPVE WA RWM WTA VTSSG DS
SLLVLQGDSGKRSSDSEEAFET
PESTTPVKAPPAPPPPPPEV I PEP
EVSTQPPPEEPGWGSETVPVPD
GPRSDSVEGSPFRTQAHSFSGVF
DEDQPIASTGTDILDFDNIELVD
TFQTLEPRASDAKNQEGKVNT
RRKSTDSVPISKSTLSRSLSLQA
SDFDGASSSGNPEAVALAPDAY
STESAKRWPPASGGGRVQNSP
PVGRKTLPLTTAPEAGEVTPSD
SGGQEDSPAKGLSVRLEFDYSE
DKSSWDNQQENPPPTKKIGKKP
VAKMPLRRPKMKKTPEKLDNT
PASPPRSPAEPNDIPIAKGTYTF
DIDKWDDPNFNPFSSTSKMQES
PKLPQQSYNFDPDTCDESVDPF
KTSSKTPSSPSKSPASFEIPASA
MEANAWDGDGLNKPAKKKKT
PLKTMVEDVMSVCSLFDTFRV
KKSPKRSPLSDPPSQDPTPAATP
ETPPVISAWHATDEEKLAVTN
QKWTCMTVDLEADKQDYPQPS
DLSTFVNETKFSSPTEELDYRNF
YEIEYM EKIGSSLPQDDDAPKK
QALYLMFDTSQESPVKSSTVR
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22558 52926 22686 662 2927 HLRRKRNLPRREGLSPALLEET
PLEPAVGPKAACPLDSESAEGV
VPPASGGKRCLFTTAPEAGEVT
PSDSGGQEDSPAKGLSVRLEFD
YSEDKSSWDNQQENPPPTKKIG
KKPVAKMPLRRPKMKKTPEKL
DNTPASPPRSPAEPNDIPIAKGT
YTFDIDKWDDPNFNPFSSTSKM
QESPKLPQQSYNFDPDTCDESV
DPFKTSSKTPSSPSKSPASFEIPA
SAMEANGVDGDGLNKPAKKK
KTPLKTDTFRVKKSPKRSPLSD
PPSQDPTPAATPETPPVISAWH
ATDEEKLAVTNQKWTCMTVD
LEADKQDYPQPSDLSTFVNETK
FSSPTEELDYRNSYEIEYMEKIG
SSLPQDDDAPKKQALYLMFDT
SQESPVKSSPVRMSESPTPCSGSV
SFEETGSPLRNTAAVKNQVHPVP
TRDWAP*PRSSHLAGCQRKSSQ
E/VELGGPWGLGHPFQEGDLKF
TASPRGSFASADALLSRLAHPV
FSL/CGALDYLVEPNFRQKRNPP
LFAQKLQREAVVHPTDVSIVSKT
ALYSRIRTTEVEKPAGLLFQQP
DLVDSALQIARAEIIVTKEREVSE
WKDKYEESRVREVMEMRKIVLA
EYEKTIAQMIEDEQREKSVSVHQ
TVQQLVLEKEQAVLADLNSVEK
VSLADLFRRYEKMKEVLESFRK
NEEVLKRCAQEYLSRVKKEEQ
RYQAVLKVHAVEEKLDRANAEVI
22559 52927 22687 374 1268 AETNMVWYDWMRPSHAQLHS
DYMQPLTEAKAKSKNKVRGV
QQLIQRLRLIKSPAEIERMQIAG
KLTSQAFIETMFTSKAPVEEAFL
YAKFEFECRARGADILAYPPW
AGGNRSNTLHYVKNNQLIKDG
EMVLLDGGCESSCYVSDITRTW
PVNGRFTAPQAELYEAVLEIQR
DCLALCFPGTSLENIYSMMLTLI
GQKLKDLGIMKNTQGK*CL/RR
LLE/QYCPHHVGHYLGMDVHD
TPDMPRSLPLQPGMVITIEPG I YI
PEDDKDAPEKFRGLGVRIEDDV
WTQDSPLILSADCP
Figure imgf002954_0001
Figure imgf002955_0001
Figure imgf002956_0001
Figure imgf002957_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Uπkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /^possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22590 52958 A 22718 2240 MRSDPDTSVNEIETKLSALLGS
ETTGEILFDLLCANGPEWNRFV
TLEMKYGRIMLDTAKIIDEQDV
PTHILSKLTFTLRNHPEGWMK
NFEVLQPLQNSLSGLPLWVSER
ILQQINQLTHYEPVIGIMGKTGA
GKSSLCNALFAGEVSPVSDVAA
CTRDPLRFRLQIGEHFMTIVDLP
GVGESGVRDTEYAALYREQLP
RLDLILWLIKADDRALATDEHF
YRQVIGEAYRHKMLFVISQSDK
AEPTSGGNILSTEQKQNISRKIC
LLHELFQPVHPVCAVSVRLHPV
VALLQQFRTDDERTHRYHYDS
QHRLVDYTRTQYEEPLVESRYL
YDPLGRRVAKRVWRRERDLTG
WMSLSRKPQVTWYGWDGDRL
TTIQN DRSRIQTIYQPGSFTPLIR
VETATGELAKTQRRSLADALQ
QSGGEDGGSVVFPPVLVQMLD
RLESEILADRVSEESRRWLASC
GLTVEQMQNQMDPVTEGTTAW
Y/AEYDEWGNLLNEENPHQLQ
QLIRLPGQQYDEESGLYYNRHR
YYDPLQGRYITQDPIGLKGGW
NFYQYPLNPISDIDPLGLSMWE
VARQMNAIASKLPEGFHIGINFS
ASHIISPTFVDECLNFRDSFTRR
DLNLVLEVTERDLLNVDESLVQ
RLNILHENGFVIALDDFGTGYS
GLSYLHDLHIDYIKIDHSFVGRV
NADPESTRILDCVLDLARKLSIS
Figure imgf002959_0001
Figure imgf002960_0001
Figure imgf002961_0001
Figure imgf002962_0001
Figure imgf002963_0001
Figure imgf002964_0001
Figure imgf002965_0001
Figure imgf002966_0001
Figure imgf002967_0001
Figure imgf002968_0001
Figure imgf002969_0001
Figure imgf002970_0001
Figure imgf002971_0001
Figure imgf002972_0001
Figure imgf002973_0001
Figure imgf002974_0001
Figure imgf002975_0001
Figure imgf002976_0001
Figure imgf002977_0001
Figure imgf002978_0001
Figure imgf002979_0001
Figure imgf002980_0001
Figure imgf002981_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=ρossible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22706 53074 22834 1889 ACRRRAGTWTPPRPRGSGSRA
MAGNCGARGALSAHTLLFDLP
PALLGELCAVLDSCDGALGWR
GLAERLSSSWLDVRHIEKYVDQ
GKSGTRELLWSWAQKNKTIGD
LLQVLQEMGHRRAIHLITNYGA
VLSPSEKSYQEGGFPNILFKETA
NVTVDNVLIPEHNEKGVLLKSS
ISFQNIIEVGTRNFHKDFLIGERG
E/IFGGYQEGKIQTLTYAVKLFK
QEKKMQCKKHWKRFLSELEVL
LLFHHPNILELAAYFTETVEKFC
LIYPYMRNGTLFDRLQCVGDT
APLPWHIRIGILIGISKTIHYLHN
VQPCSVICGSISSANILLVDDQFH
PKLTDFSMAHFRSPVLEHQSCCI
NMPSSVSRKHLWYVMPEEYIRQ
GKLSIKTDVYSFGIVIMEVLTGC
RWLDDPKHIQLRDLLRELMEK
RGLDSCLSFLDKKVPPCPRN FS
AKLFCLAGRCAATRAKLRPSM
DEVV*RALESTQASLYFVAGDPP
TSLKALRGPVSPLFVLEKVTVSIPV
VEDDEVKPNNNLLPS**KALRMK
RWPQKVTPFVECSQPVEVWFLSLE
QKAREARENEEACNMPSSSVCE
ESWFPKYVIVPSVQDLRPL*GK*K
I LSSQAVSGHSCVRTKPVESSCSS
KFSWDEYEQYKKE
Figure imgf002983_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkπo n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22717 53085 22845 2355 MVGSRRISGVIPLPFTRRAGIFS
DPRANMQPLWAPVPPATFVPTS
LPPVPTARPPTSASCHCQQVPC
AGLEVALPGNRTICWLSLWPS
WQLDEDAQSPALESSSIRQFPT
APRSGYPGTLCPSFLFGPRPRTC
EYHSKPVLCAPSSRASPSSATTL
STRTLWGPGAGSHPFGVHNTR
LSPDLCPGKIVLRALKESGAGM
PEQHKDPRVQENPDDQRTVPE
VTGDARSAFWPLRDNGGPSPFV
PRPGPLQTDLHAQSSEIRYNHTS
QTSWTSSSTKRNAISSSYSSTGG
LPGLKQRRGPASSRCQLTLSYS
KTVSEDRPQAVSLGHTRCEKG
ADTAPGQTIAPTASRPHINTSLH
VEDKAISDCRPSRPSHTLSSLAT
GASGGPPVSKAPTMDAQQDRP
KSQDCLGLVAPLASAAEVPAT
APVSGKKHRPPGPLFSSSDPLPA
NSSHSRDSAQVTSMIPAPFTAA
SRDAGMRRTRSAPAAAAAAPP
PSTLNPTSGSLLNVEWMEAPHIS
WPQPQLQHVPRGQ/NVRYNQR
SQTSRTRS/CPQTKCQLQLPQLY
GRPPGSKAEEGVPASSHCQLALS
SSNTVSEDGPQAVSSGHRCENK
AGTAPGQTLAPRGGSPRSQASR
PHINTALHVEDKAISDCRPSRPS
HTLSSLATGASGGPPVSKAPTM
DAQQDRPKSQDSLGLLAPLASA
AEVPSTAPVSGKKHRPPGPLFSS
22718 53086 22846 1200 M PEQDKDPRVQENPDDQRRVP
EVTGDARSAFRPLRDNGGLS/PP
FVPGPGPLQTDLHTQRVLGYRV
TVEDLDREKEAAFQRINSALQV
EDKAISDCRLSRPSHTLSSLATG
TSGLPAVSKAPSMDAQQERHK
SQDCLGLLAPLAPAAEVPSTAP
MSGKKHRPPG PLFSSSDPLPATS
SHSQDSAQVTSLIPAPFPAASM
DAGMRRTSPGTSAPAAAAAPPP
STLNPTLGSLLEWIEALHISGPQ
PQLQQVPRGQNQRSQTSRTSSC
PKRK/CHLQLLQLYGRPPGTKA
EEGVPASSHCQLTLSSSNTVSED
GPQAVSSGHTHCEKTADTAPG
QTLAPRGGFPRSQASRPCRHKF
PLLPRRRGEPLMLPPPLELGYR
VTAEDLDREKEVAFQRIKSALQ
VEDKAI SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \-possible nucleotide insertion) sequence
22719 53087 A 22847 1762 YHSKPVLCAPSSRASPSSATTLS
TRTLWGPGAGSHPFGVHNTRL
SPDLCPGKIVLRALKESGAGMP
EQHKDPRVQENPDDQRTVPE V
TGDARSAFWPLRDNGGPSPFVP
RPGPLQTDLHAQSSEIRYNHTS
QTS WTSSSTKRN A I SSS YSSTGG
LPGLKQRRGPASSRCQLTLSYS
KTVSEDRPQAVSLGHTRCEKG
ADTAPGQTIAPTASRPHINTSLH
VEDKAISDCRPSRPSHTLSSLAT
GASGGPPVSKAPTMDAQQDRP
KSQDCLGLVAPLASAAEVPAT
APVSGKKHRPPGPLFSSSDPLPA
NSSHSRDSAQVTSM I PAPFTA A
SRDAGMRRTRSAPAAAAAAPP
PSTLNPTSGSLLNVEWMEAPHIS
WPQPQLQHVPRGQ/NVRYNQR
SQTSRTRS/CPQTKCQLQLPQLY
GRPPGSKAEEGVPASSHCQLALS
SSNTVSEDGPQAVSSGHRCENK
AGTAPGQTLAPRGGSPRSQASR
PHINTALHVEDKAISDCRPSRPS
HTLSSLATGASGGPPVSKAPTM
DAQQDRPKSQDSLGLLAPLASA
AEVPSTAPVSGKKHRPPGPLFSS
SDPLPATSYHSRDTAQ
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22720 53088 A 22848 292 2052 GSVAWSPFSPSPSSLCLWDLPV
APTLWGPGAGSHPFGVHNTRL
SPDLCPGKIVLRALKESGAGMP
EQDKDLRVQENPDDQRRVPEV
TGDARSA/FLAPAGQWSPLSLC
AQSRASADIPPCPEVRNQI*PDIP
DHLDELMHQPKCHLQLLQLCG
RLAGAKVEEGASGAKEQGRDA
* AGQGTPESQQN PDDQRTVPEV
TGDARSTVRPLRDSGGLSPFVP
RPGPLQTDLHAQSSEIRYNQTS
QTSWTSSSTKRNAISSSYSSTGG
LLGLKQRRGPASSRCQLTLSYS
KTVSEDRPQAVSSGHTRCEKA
ADTAPGQTLAPRGGSPRSQGSR
PRRRKIALLPHR*GELQVEDKAI
LDCRPSWPSHTLSSLATGASGE
PPVSKAPTMDAQQDRPKSQDC
LGLVAPLASAAEIPSTAPVSGK
KHRPPGPLFSSSDPLPATSSHSR
DSAQVTSLIPAAFTAASMDVG
MRRTRPGTSAPAAAAAAPPPST
LNPTSGSLLNVEWMEALHISGPQ
PQLQQVPRGQ/NVRYNQRSQTS
RTRS/CPQTKCQLELPQLYGRPP
GTKGCRHEKNVLCSKLFEGFGF
IFVGFFFFAYMGILQLLII
Figure imgf002987_0001
Figure imgf002988_0001
Figure imgf002989_0001
Figure imgf002990_0001
Figure imgf002991_0001
Figure imgf002992_0001
Figure imgf002993_0001
Figure imgf002994_0001
Figure imgf002995_0001
Figure imgf002996_0001
Figure imgf002997_0001
Figure imgf002998_0001
Figure imgf002999_0001
Figure imgf003000_0001
Figure imgf003001_0001
Figure imgf003002_0001
Figure imgf003003_0001
Figure imgf003004_0001
Figure imgf003005_0001
Figure imgf003006_0001
Figure imgf003007_0001
Figure imgf003008_0001
Figure imgf003009_0001
Figure imgf003010_0001
Figure imgf003011_0001
Figure imgf003012_0001
Figure imgf003013_0001
Figure imgf003014_0001
Figure imgf003015_0001
Figure imgf003016_0001
Figure imgf003017_0001
Figure imgf003018_0001
Figure imgf003019_0001
Figure imgf003020_0001
Figure imgf003021_0001
Figure imgf003022_0001
Figure imgf003023_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vπ, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
22972 53340 A 23105 122 2437 RLELKLPLWTMRFTEMELLGH
LSIFKITGKQKQLTRTAPGPSGQ
LRSNQTTGNSLEPSGQPRSSQTT
GNSLEPSGQPRSSQTTGNSLEPS
GQPRSSQTTGNSLEPSGQPRSSQ
TTGNSLEPSGQPRSSQTTGNSLE
PSGQPRSSQTTGDSLEPSGQLRS
NQTTGDSVQPNNREQPGTVRS
APLQPNNREQPGTVRSAPLQPN
NRGQRPTKQQGTTWNRQVSSA
PTKQQGTARVPSGQPRSNQTTG
NSLEPSSQLRSNQTTGDSVQPN
NREQPGTVRSAPLQPNNRGQRP
TKQQGTARVPSGQPCSΝQTTGΝ
SLEPSSQLRSΝQTTGDSVQPΝN
REQPGTIRSAPLQPΝNRGQRWD
RQVSSAPTKQGTALGPSGVSSAP
TKQQGTALEPSSQLRSΝQTTGΝ
SLEPSGQPRSΝQTTGΝSREPSSQ
LCSΝQTIGΝSQDRQVSSAP/IQT
TGΝSQDRQVSPAPTKQQGTAG
ΝRQVSST/PIQTGΝSVQPΝNREQ
PGTVSGAM/VWVATPVKGPAR
VETSAHKVSAAGLPKKKT/QPG
VTGGAPRLTLΝTVLPKHSRTPS
RHEAGSTQALYHΝGQAQYALS
FPHEAAPMAAFYWTGIEVTTA
LTLEEAPCTVEITTALTLEEPPC
TVFWΝKTERΝTRTLVQDKGSΝ
TKGFVQTATCCFRADCKASRPS
PHPPSPHREWTSDLRHKQVSKF
PGAHRLPGHPELPIQEAWDRGE
22973 53341 23106 605 813 LFVFIΝFREΝPALLRWVAYARVTI
ΝVYPVΝFRPTPKΝSLMGALCGF
GPLIFIYYIIKTERVSIQTRCLVF
22974 53342 23108 273 936 AYSΝKVPSWΝQKAVPSPTKPAG
TLIFDFPAVKTVRΝKFVFFINYP
VSGYTEDLKKFLKPYTLEEQKΝ
LTVCPDGALFEQKGPVYVACQ
SPISLLQACRGMΝDLDFGYSQR
ΝPCILVKMΝRIIGLKSEGVPRID
CVSKΝEDIPΝVAVYPHΝGIIDL
KYFPYYGEKLHVGYLQPLVAV
QVSFAPΝDTGKEVTVECKIDGS
AΝLKSQDDRDKFLGRVMFKIT
22975 53343 23109 104 478 ETEVGGLVEASMSHΝCTTALQ
PGQHSQKEΝLQΝKPIPGREKY*
VSIEFELIYFP/YYCQVGYLQPL
VAVQVSFAPΝΝTGKEVTVECKI
VDGSAΝLKSQVDDRDKFLGVRV
MFQΝHSTVHSMSRVSSTE
Figure imgf003025_0001
Figure imgf003026_0001
Figure imgf003027_0001
Figure imgf003028_0001
Figure imgf003029_0001
Figure imgf003030_0001
Figure imgf003031_0001
Figure imgf003032_0001
Figure imgf003033_0001
Figure imgf003034_0001
Figure imgf003035_0001
Figure imgf003036_0001
Figure imgf003037_0001
Figure imgf003038_0001
Figure imgf003039_0001
Figure imgf003040_0001
Figure imgf003041_0001
Figure imgf003042_0001
Figure imgf003043_0001
Figure imgf003044_0001
Figure imgf003045_0001
Figure imgf003046_0001
Figure imgf003047_0001
Figure imgf003048_0001
Figure imgf003049_0001
Figure imgf003050_0001
Figure imgf003051_0001
Figure imgf003052_0001
Figure imgf003053_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod n USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
23184 53552 23318 3366 WRKIYQANGKQKKAGVAILVS
DKTDFKPTKIKRDKEGHYIMVK
GSIQQEELTILNIYAPNTGAPTFI
QQVLSDLQRDLDSHTLIIGDFN
TPLSTSDRSTRQKVNKDTQELN
S ALHQA DLIDI YRTLHPKSTE YT
FFSAPHHTYSKIDHILGSKALLS
KCKRTEIITNYLSDHSAIKLELM
IKNLTQNHSTIWKLNNLLLNDY
WVHKEMKAEIKMFFETNENKD
TTVYQNL WDTFKA VCRGKFIAL
NAHKRKVQER
23185 53553 23319 2611 MGDFNTPLSTLDRSTRQKANK
DTQELNSALHQVDLIDIYRTLH
PKSTEYTFFSAPHHTYSTTDHIL
GSKALLSKCRRTEIITNYLSDHS
AIKLELRIKNLTQNRSTTWKLN
NLLLNDYGVHNEMKAEIKMFF
ETNENKDTTYQNL WDTFKA VC
RGKFIALNAHKRKQERSKIDTL
TSQLKELEKQEQTHSKASRRQE
ITKΪRAELKEIETQKTLQKVINES
RSWFFERINKIDRPLARLIKKKR
EKNQIDAIKNDKGDITTDPTEIQ
TTIREYYKHLY ANKLENLEEM
DKFLHTYTLPRLNQEEVESLNR
PITGSEIVAIINSLPTKKSPGPDG
FTAEFDQRYKEELVPFLLKLFQ
SIEKEGILPNSFYEASIILIRKPGR
DTTKKENFRPISLMNTDAKILN
KILAKRIQQHIKKLIHHDQVGFI
PGMQGWFNIHKSINVIQHINRA
KDKNHMIISIDAEKAFDKIQQPF
MLKTLSKLGIDGTYFKIIRAIYD
KPTANIILNGQKLEAFPLKTGTR
QGCPLSPLLFN I VLE VLA RAIRQ
EKEIKLISNFSKVSGYKINVQKS
QAFLYTNNRQTESQIMSQLPFTI
ASKRIKYLGIQLTRDVKDLFKE
NYKPLLKEIKEDTNKWKNIPCS
WVGRJNIMKMAILPKVIYRFNA
IPIKLPMTFFTELEKTTFKFIWN
QKRARITKSILSQKNKAGGITLP
DFKLYYKATVTKTAWYWYQN
Figure imgf003055_0001
Figure imgf003056_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
23196 53564 23330 2372 TTNCLSDHSAIKLELRIKKLTEN
HTTTWKLNNLLLNDYWVDNE
MKAEIKMFFEINEKKVDTTSQNL
WDTFKA VCRGKLIALNAHKRK
QERSKIDTLTSQLKKVLEKQEQR
NSKASRKQEITKIRAELKE1ETQ
KKTLQKINESRSWFFEKINKIDG
PLARLIKKKREKNQVDAIKNDK
GDI ITDPTEMQTTIREY YKYLY
ANKLENLEEMDKFLNTYTLQR
LNQEGIESLNRPITGSEIEAIΓNSL
PTKKSPGPDGFTAKFYQRYKEE
LVYPEPRSESECLSNIREFLRGC
GASLRLESIREDRNGRSQKTVH
TEGDMNMN I KKI VKQ AT VLTF
TTALLAGGATQAFAKENNQKA
YKETYGVSHITRHDMLQIPKQQ
QNEKYQVPQFDQSTIKNIESAK
GLDVWDSWPLQNADGTVAEY
NGYHWFALAGSPKDADDTSI
YMFYQKVGDNSIDSWKNAGR
VFKDSDKFDANDPILKDQTQE
WSGSATFTSDGKIRLFYTDYSG
KHYGKQSLTTAQVNVSKSDDT
LKINGVEDHKTIFDGDGKTYQN
VQQFIDEGN YTSGDNHTLRDPH
YVEDKGHKYLVFEANTGTENG
YQGEESLFNKAYYGGGTNFFR
KESQKLQQSAKKRDAELANGA
LGIIELNNDYTLKKVMKPLITSN
TVTDEIERANVFKMNGKWYLF
TDSRGSKMTIDGINSNDIYMLG
23197 53565 23331 2987 APRFIKQVLSDLQRDLDSHTIIM
GDFNTPLSTLDRSTRQKVHKDT
QELNSALHQADLIDIYRTLHPK
STEYTCFSAPHHTYSKIDHIVGS
KALLSKCKRSEIITNCLSDHSAI
KLELRIKKLTQΝRSTTWKLΝNL
LLΝDYWVHΝEMKAEIRMFFET
NENKDTTYQNLWDTFKAVCRG
KFIALNAHKRKQERSKIDTLTS
QLKELEKQEQTHSKASRRQEIT
KIRAELKEIETQKTLVQKINEVSR
SWFFVERTNK
23198 53566 A 23332 339 701 PDKEGNEIWVDMYTVKPSGWT
VRTFDKPRKRFIAFFIAGILFRAI
KNHFLPRETLQCLPYILTGFRRG
QSEYFSIFSNMDLADTVMFL*G
A I RFSDL VN EQIQHTFLIPGETIC
RLSEVV
23199 53567 A 23333 190 502 SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
23200 53568 23334 4472 8032 KKSGNNRLMRVHGKKRFLWQ
KQGVCEVEIQAQRESARDGTD
REDAESPHAEVTVPRGTRTEILT
LWASRDRPASACGVTVPKGST
THPGGMDRVCGVDTAVLHWM
KQVQGISNLEVETGPRRRDMD
EAGSRHSQQTNIGTENQTPHVL
THKWQLNRNSACDKLHKPHSV
PGGVAAGDVGGKTISNPGVCAI
EYQVDGSPHIWEDVNKGVYIEE
DRAPLMERSSSPAMEQSWMEN
DFDELREEGFRRSNYSELRED
23201 53569 23335 2374 MVSISDLVICPPRJdPKVLGLQGP
PGLDSISDPSAGAGFLDWGEIG
MPGPGRAGHQALCKCDCQCLE
KTTTKAPGKMPKSTRSGPVRV
RLADGPNRCAGRLECGMPDAG
EQCVMTTGTSGRHCGLLGTGL
WKGYTDLTIIPPGPGTPPQERTC
QGDYHSGGTWTHSPLETTRRP
GSSSPAIRRLPAQMLLLPARPPH
PRSSSPEAMDPPPPKAPPFPKAE
GPSSTPSSAAGPRPPRLGRHLLI
DAN/G V YPYTYTVQLEEEPRG P
PQREAPPGEPGPRKGYSCPECA
RVFASPLRLQSHRVSHSDLKPF
TCGACGKAFKRSSHLSRHRATH
RARAGPPHTCPLCPRRFQDAAE
LAQHSWGTPRGPLLAAACNCE
VARGRLESPGPERLLHGYGGRE
EEGG WGRA AGGLDRVEG FISS
KAHHYLLIDTQGVPYTVLVTRS
HRGSQGPVGLQARKVLQLPRV
LKGLRVHVHLQRHSITHSEVPQ
DFAGSLDSFQTPGESLRLVFRA
LDTTQSSRISKAEPCLKEEPLSL
GDLPYMHTTLCFCRKRRASPGP
GTLQRGALAWPDWASPRALPV
PSLSSTTRSPAAPLFAVPLSGRT
TQAMA FDG IIFQGQSQRSAGLT
TTSRFLACQRPLRLCAWWASRS
PRCTLRRPVGLRPGVHPRPRLV
YRDLKPENVMASGQPRDRPQP
WFAWPPRPTRFCGGCWTLTPK
Figure imgf003059_0001
Figure imgf003060_0001
Figure imgf003061_0001
Figure imgf003062_0001
Figure imgf003063_0001
Figure imgf003064_0001
Figure imgf003065_0001
Figure imgf003066_0001
Figure imgf003067_0001
Figure imgf003068_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possib!e nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
23255 53623 A 23389 1986 MGIGEGPTTSEGCRESVEYSHQ
KSELPLSGDSQVPGPIGCNCDP
QGSVSSQCDAAGQCQCKAQVE
GLTCSHCRPHHFHLSASNPDGC
LPCFCMGITQQCASSAYTRHLIS
THFAP/GDFQ/GFA/LVNPQRNS
RLTGEFTVEPVPEGAQLSFGNF
AQLGHESFYWQLPETYQGDKG
EAYFARMRRAHQNRTSLSEEQ
LRAAVTAGRIPEPPEGRDWAQR
ASQFSLSYEGFSLLPGSLYYWQ
LPRAFLGDKVAAYGGKLRYTL
SYTAGPQGSPLSDPDVQITGNNI
MLVASQPALQGPERRSYEIMFR
EEFWRRPDGQPATREHLLMAL
ADLDELLIRATFSSVPLAASISA
VSLEVAQPGPSNRPRALEVEEC
RCPPGYIGLSCQDCAPGYTRTG
SGLYLGHCELCECNGHSDLCHP
ETGACSQCQHNAAGEFCELCA
PGYYGDATAGTPEDCQPCACP
LTNPENMFSRTCESLGAGGYRC
TACEPGYTGQYCEQCGPGYVG
NPSVQGGQCLPESSELHFPSVQ
PSDAGVYICTCRNLHQSNTSRA
ELLVTEAPSKPITVTVEEQRSQS
VRPGADVTFICTAKSKSPAYTL
VWTRLHNGKLPTRAMDFNGIL
TIRNVQLSDAGTYVCTGSNMF
AMDQGTATLHVQGSAEAGERS
MYSISVLLKYTS
23256 53624 23390 13319 PASGRLRAARSGRRGASERESG
AGRAMGWRAPGALLLALLLH
GRLLAVTHGLRAYDGLSLPEDI
ETVTASQMRWTHSYLSDDEYM
LADSISGDDLGSGDLGSGDFQM
VYFRALVNFTRSIEYSPQLEDA
GSREFREVSEAWDTLESEYLKI
PGDQWSWFIKELDGWVFVE
LDVGSEGNADGAQIQEMLLRVI
SSGSVASYVTSPQGFQFRRLGT
VPQFPRACTEAEFACHSYNECV
ALEYRCDRRPDCRDMS
Figure imgf003070_0001
Figure imgf003071_0001
Figure imgf003072_0001
Figure imgf003073_0001
Figure imgf003074_0001
Figure imgf003075_0001
Figure imgf003076_0001
Figure imgf003077_0001
Figure imgf003078_0001
Figure imgf003079_0001
Figure imgf003080_0001
Figure imgf003081_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid -Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
23354 53722 23489 990 EKMNTADQARVGPADDGPAPS
GEEEGEGGGEAGGKEPAADAA
PGPSAAFRLMVTRREPAVKLQ
YAVSGLEPLAWSEDHRVSVST
ARSIAVLELICDVHNPGQDLVIH
RTSVPAPLNSCLLKVGSKTEVA
ECKEKFAASKDPTVSQTFMLDR
VFNPEGKALPPMRGFKYTSWSP
MGCDANGRCLLAALTMDNRL
TIQANLNRLQWVQLVDLTEIYG
ERLYETSYRLSKNEAPEGNLGD
FAEFQRRHSMQTP VRME WSG I
CTTQQVKHNNESRDVGSVLLA
VLFENGNIAVWQFQLPFVGKES
ISSCNTIESGITSPSVLFWWEYE
HNNRKMSGLIVGSAFGPIKILPV
NLKAVKGYFTLRQPVILWKEM
DQLPVHSIKCVPLYHPYQKCSC
SLWAARGSYVFWRLLLISKAG
LNLHNSHVTGLHSLPIVSMTAD
KQNGTVYTCSSDGKVRQVIPIF
TDVALKFEHQLIKLSDVFGSVR
THGIAVKPCGAYLAIITTEGMIN
GLHPVNKNYQVQFVTLKTFEE
AAAQLLESSVQNLFKQVDLIDL
VRWKILKDKHIPQFLQEALEKK
IESSGVTYFWRFKLFLLRILYQS
MQKTPSEALWKPTHEDSKILLV
DSPGVMGNADDEQQEEGTSSKQ
WKQGLQERSKEGDVEEPTDVD
SLPTTGVDAGVGREPMGRRNSW
GNSKGKIRSCWEMRLTREHMK
23355 53723 23490 1218 1446 RCTDSQWRSPTPLHLAAQACSL ETTVCLLCSKADYTLSEKRGW MPIHFAAFYDNVCIIIALCRKDP SLLEAEATAE
23356 53724 23491 632 NMAKTYDYLFKLLLIGDSGVG
KTCVLVFRFSEVDAFNSTFIFTIRI
DFKIRTIELDGKRIKLQIWDTAG
QERFRTITTAYYRGAMGIMLVY
DITNEKSFDNIRNWIRNTEEHAS
ADVEKMILGNKCDVNDKRQVS
KGS/GENKLALDYGIKFMETSA
KANINVENAFFTLARDI KAKMD
KKLDGNSPVQGSNQGVKITPDQ
QKRSSFFRCVLL
Figure imgf003083_0001
Figure imgf003084_0001
Figure imgf003085_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
23375 53743 23510 2094 2619 ASPASPAPCSGIPRT*SPYPSPSR TQVAPRARGPAIHAHPGP/RGCT ALPRQGPPTPGSARAPPRIPAHR APTSPTLNFCLASSRQSMPPAPR PPPVPATNPVTPGCGRPPPTLGC RNPQTLRPPPGLPSPALLRRPLP PPARVGMPPHQPPLPTTPACWR PVDWARAEGAGRAVC
23376 53744 2351 1 5050 3317 PARTSPSASGPP**DLISALWTPS QPRAPTRQSLNSLSSSAPSSSTIG PSPSPPAPGPAPPLPWLPPSPSSL PRLSQVSRRPHSH LHPQ
23377 53745 23512 862 2954 AEHTLLVCFLNTSLTTTSTHVES
FGGQLRGSQIGHPRRPNKMPYA
WSIPQDPPLKPERDPGGPLGVS
LSLPVNQYGESEEPMGDDDYD
EYSKELNQYRRSKDSRGRGLSR
GRGRGSRGRGKGMGRGRGRG
GSRGGMNKGGMNDDEDFYDE
DMGDGGGGSYRSRDHDKPHQ
QSDKKGKVICKYFVEGRCTWL
YHTTGNCINGDDCMFSHDPLTE
ETRELLDKVEMTGSSVFDYIHP
GDHSEVLEQLGLRTPTPGPPTPP
SVSSSSSSSSSLADTPEIEASLTK
VPPSSLVQERSFFVRMKSTLTK
RGLHVKASGYKVIHVTGRLRA
HALGLVALGHTLPPAPLAELPL
HGHMIVFRLSLGLTILACESRVS
DHMDLGPSELVGRSCYQFVHG
QDATRIRQSHVDCETHLHPPSL
PTTPQTRASHSLVPGSPSSGLPT
VPPAAPTAPGTPFSIGRAARRLS
DASGLLDKGQVMTGYYRWLQ
RAGGFV WLQS VATV AG SGKSP
GEHHVLWVSHVLSQAEGGQTP
LDAFQLPASVACEEASSPGPEPT
EPEPPTEGKQAAPAENEAPQTQ
GKRIKVEPGPRETKGSEDSGDE
DPSSHPATPRPEFTSVIRAGVLK
QDPVRPWGLAPPGDPPPTLLHA
GFLPPWRGLCTPGTIRYGPAEL
GLVYPHLQRLGPGVPALPEAFYP
PLGLPYPGPAGTRLPRKGD
Figure imgf003087_0001
Figure imgf003088_0001
Figure imgf003089_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknowπ, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
23398 53766 A 23533 2094 RPLDVGGPAAGTPGVLSRPCPS
TAALAPKPFCAAPRPQPDAPAC
AGATGGSCADFDSGVDFVRRR
SSGLWGPQPPLFPVKNYTEMFQ
DPVAFKDVAVNFTQEEWALLD
ISQKNLYREVMLDTFWNLTSIG
KKWKDQNIEYEYQNPRRNFRS
VTEEKVNEIKEDSHCGETFTPV
PDDRLΝFQKKKASPEVKSCDSF
VCGEVGLGΝSSFΝMΝIRGDIGH
KAYEYQEYGPKPCKCQQPKKA
FRYRPSFRTQERDHTGEKPYAL
LKΝDGKKPLFYHSSIQRHMW
HSGDGPYKCKFCGKAFHWLSL
YLIHERTHTGEKPYECKQCGKS
FSYSATHRIHERTHIGEKPYECQ
ECGKAFHSPRSCHRHERSHMGE
KAYQCKECGKAFMCPRYVRRH
ERTHSRKKLYECKQCGKALSSL
TSFQTHIRVMHSGERPYECKTCG
KGFYSAKSFQRHEKTHSGEKPY
KCKQCGKAFTRSGSFRYHERTH
TGEKPYECKQCGKAHERTHTG
EKPYECKLYGKALSRLISFRRH
MRMHTGERPHKCKICGKAFVSS
PSSFQRHERSHTGEKPYKCKQC
GKAFTCFHFVFQYHERTHTGEK
PDGCKQCGKAFRSAKYIRIHGR
THTGEKPYECKQCGKAFHWVS
SFHRHERTHAGEKPYECKHCG
KAFTCSIYIRIHERIHTGEKPYQ
CKECGKAFIRSSYCRKHERTHTI
Figure imgf003091_0001
Figure imgf003092_0001
Figure imgf003093_0001
Figure imgf003094_0001
Figure imgf003095_0001
Figure imgf003096_0001
Figure imgf003097_0001
Figure imgf003098_0001
Figure imgf003099_0001
Figure imgf003100_0001
Figure imgf003101_0001
Figure imgf003102_0001
Figure imgf003103_0001
Figure imgf003104_0001
Figure imgf003105_0001
Figure imgf003106_0001
Figure imgf003107_0001
Figure imgf003108_0001
Figure imgf003109_0001
Figure imgf003110_0001
Figure imgf003111_0001
Figure imgf003112_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possiblc nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
23544 53912 23679 1216 LGNKRAGLRRADGEAPSALPY
CYFLQKDAEAPWLSKPAYDSA
ECRHHAAEALRVAWCLEAASL
SHRPGPRSGLSVSSLRPTRKMA
TTFLAHEKIWFDKFKYDDAERR
FYEQMNGPVAGASRQENGAVT
VILRDIARARENIHKSLAGSSGPV
GASVSGTSGDHGELWRIASLEV
ENQSLRGWQELQQAISKLEAR
LNVLGKELRLAHRGTAPQTQH
VSPMRVQVEPPVAKKPATPAEDD
EDDDIDVLVGRDNEVEEDKEAA
QLVREERVLRVHYAEKKAQESLP
LGGQVPPSLLDVKVPWDDESDM
AQLEACVRSIQ/LGTGRSWGAS
QMVAVGYGVIRKLQIQCWEDD
KVGTDLLEEEITKFEEARARSV
DIASFSTRSEALKCVYVAHVRE
GPCTRSKDLRPAKKKKKK
23545 53913 A 23680 2066 MAAALRAPTQVSAASSRPPLPS
PPETEGRVKGFLQPGPHCPAQC
LCSGDDGSNEDLEQKREVIFPR
PHEAPVAEHRQQTVGLRELVAI
PKRKEETTAIVQVFVAFEDVAI
YFSQEEWELLDEMQRLLYRDV
MLENFAVMASLGCWCGAVDE
GTPSAESVSVEELSQGRTPKAD
TSTDKSHPCEICTPVLRDILQMI
ELHASPCGQKLYLGGASRDFW
MSSNLHQLQKLDNGEKLFKVD
GDQASFMMNCRFHVSGKPFTF
GEVGRDFSATSGLLQHQVTPTI
ERPHSRIRHLRVPTGRKPLKYT
ESRKSFREKSVFIQHQRADSGE
RPYKCSECGKSFSQSSGFLRHR
KAHGRTRTHECSECGVKSFSRK
THLTQHQRVHTGERPYDCSEC
GKSFRQVSVLIQHQRVHTGERP
YECSECGKSFSHSTNLYRHRSA
HTSTRPYECSECGKSFSHSTNLF
RHWRVHTGVRPYECSECGKAF
SCNIYLIHHQRFHTGERPYVCSE
CGKSFGQKSVLIQHQRVHTGER
PYECSECGKVFSQSSGLFRHRR
AHTKTKPYECSECEKSFSCKTD
LIRJTQTVHTGERPYECSVCGKS
FIRKTHLIRHQTVHTNERPYEC
DECGKSYSQSSALLQHRRVHT
GERPYECRECGKSFTRKNHLIQ
HKTVHTGERPYECSECGKSFSQ
Figure imgf003114_0001
Figure imgf003115_0001
Figure imgf003116_0001
Figure imgf003117_0001
Figure imgf003118_0001
Figure imgf003119_0001
Figure imgf003120_0001
Figure imgf003121_0001
Figure imgf003122_0001
Figure imgf003123_0001
Figure imgf003124_0001
Figure imgf003125_0001
Figure imgf003126_0001
Figure imgf003127_0001
Figure imgf003128_0001
Figure imgf003129_0001
Figure imgf003130_0001
Figure imgf003131_0001
Figure imgf003132_0001
Figure imgf003133_0001
Figure imgf003134_0001
Figure imgf003135_0001
Figure imgf003136_0001
Figure imgf003137_0001
Figure imgf003138_0001
Figure imgf003139_0001
Figure imgf003140_0001
Figure imgf003141_0001
Figure imgf003142_0001
Figure imgf003143_0001
Figure imgf003144_0001
Figure imgf003145_0001
Figure imgf003146_0001
Figure imgf003147_0001
Figure imgf003148_0001
Figure imgf003149_0001
Figure imgf003150_0001
Figure imgf003151_0001
Figure imgf003152_0001
Figure imgf003153_0001
Figure imgf003154_0001
Figure imgf003155_0001
Figure imgf003156_0001
Figure imgf003157_0001
Figure imgf003158_0001
Figure imgf003159_0001
Figure imgf003160_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
23867 54235 24005 2502 MEARVERAVQKRQVLFLCVFL
GMSWAGAEPLRYFVAEETERG
TFLTNLAKDLGLGVGELRARG
TRIVSDQNMQILLLSSLTGDLLL
NEKLDREELCGPREPCVLPFQL
LLEKPFQIFRAELWVRDINDHA
SVFLDREISLKILESTTPGAAFLL
ESAQDSDVGTNSLSNYTISPNA
YFHINVHDSGEGNIYPELVLNQ
VLDREEIPEFSLTLTALDGGSPP
RSGTALVRILVLDVNDNAPDFV
RSLYKVQVPENSPVGSMWSV
SARDLDTGSNGEIAYAFSYATE
RILKTFQΓNPTSGSLHLKAQLDY
EAIQTYTLTIQAKDGGGLSGKC
TVWDVTDINDNRPELLLSSLT
SPIAENSPETVVAVFRIRDRDSG
NNGKTVCSIQDDLPFILKPSVEN
FYTLVTEKPLDRERNTEYNITIT
VTDLGTPRLKTEHNITVLVSDV
NDNAPAFTQTSYTLFVRENNSP
ALPIGSVSATDRDSGTNAQVIY
SLLPSQDPHLPLASLVSINADNG
HLFALSPWTTMPLRAFEFRNNS
PALHIGSVSTAETNIQVTYSLLP
PRNPHLPLASLVSINTDNGHLF
ALRSLDYEALQEFEFRVGASDR
GSTALSSEALVRVLVLDANDSS
LFVLFPLQNGSAPCTELMPRAVA
AEPGYLVTKNAWLS*QLLKAT
EPGLFGVWAHN/GTDRLLSERD
TAKHRLMVLVKDNGEPPRSAT
Figure imgf003162_0001
Figure imgf003163_0001
Figure imgf003164_0001
Figure imgf003165_0001
Figure imgf003166_0001
Figure imgf003167_0001
Figure imgf003168_0001
Figure imgf003169_0001
Figure imgf003170_0001
Figure imgf003171_0001
Figure imgf003172_0001
Figure imgf003173_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
23983 54351 24121 2204 IDMIFTPGPPSTPKHKKSQKGSA
FTFPSQQSPRNEPYVARPSTSEI
EDQSMMGKFVKVERQVQDMG
KKLDFLVDMHMQHMERLQVQ
VTEYYPTKGTSSPAEAEKKEDN
RYSDLKTIICNYSETGPPEPPYSF
HQVTIDKVSPYGFFAHDPVNLP
RGGPSSGKVQATPPSSATTYVE
RPTVLPILTLLDSRVSCHSQADL
QGPYSDRISPRQRRSITRDSDTP
LSLMSVNHEELERSPSGFSISQD
RDDYVFGPNGGSSWMREKRYL
AEGETDTDTDPFTPSGSMPLSST
GDGISDSTVPLFLSSEILQKQVG
QSITSMLGFLSRGPSMKLCMGL
ACVLSLWNTVSGIKGEAKKEK
GMTFLPTTDSKKFFSLLSVTSYS
SFAFHKFSVAVYNISNLKTVDP
AKFPTRYCYCLNNRTNDLSDFT
ALLVDIIGNSTSYLTEIFKSTSIL
SVNQSNESDCIFICVMTGKSGR
NLSDFWEIEEKYPIINYTFTSGL
SGVLALLLTQSLFGGLFTRTRM
KFGAVTRIGVDLPWEINP/LSSCS
LLHEKDPPTTSGPQTDQPKKHL
TNFKSAARPTFLGQGQVPLNPF
SFTLSGKSCFPRRQEPPNRLFPH
PNLLSLCPNPLFLCPNPFSTFLE
GKNHHPHPFSVSQVFSLGLPPS
LWEHSPMAHRLRVQPRQQPPDI
HKQVISLQSGHSLLVKSQSWSQ
DPIKFQLPHHHKNTASVCINPSV
23984 54352 24122 2102 2529 ETTMSFIFSLLYPFRMHLSFLIVS
PALRSISRRALAAFPRGRSLDLQ
PAMPEPPTPS/CGLLCVPSVPPMS
AAPCSKVPQSHLFAFMSCNTAK
VCSFTAEASETTSPPGGTNNSR
RATLRAVTLTRRSATLTRRSAA
SHRSSTV
23985 54353 24123 299 567 TAMLLTQSLFGGLFTQTRMKF GAVTQIVADLPWEINPLSSCSLL HEKDPPTSSGPQTDQPKEHLTN FKSGETKEMHFIRGPKTPAPVM
23986 54354 24124 684 822 KVKRQSYSQNTFYYPICSRH*IK LQKLNSGPQTPQQDLINLAIKV
Figure imgf003175_0001
Figure imgf003176_0001
Figure imgf003177_0001
Figure imgf003178_0001
Figure imgf003179_0001
Figure imgf003180_0001
Figure imgf003181_0001
Figure imgf003182_0001
Figure imgf003183_0001
Figure imgf003184_0001
Figure imgf003185_0001
Figure imgf003186_0001
Figure imgf003187_0001
Figure imgf003188_0001
Figure imgf003189_0001
Figure imgf003190_0001
Figure imgf003191_0001
Figure imgf003192_0001
Figure imgf003193_0001
Figure imgf003194_0001
Figure imgf003195_0001
Figure imgf003196_0001
Figure imgf003197_0001
Figure imgf003198_0001
Figure imgf003199_0001
Figure imgf003200_0001
Figure imgf003201_0001
Figure imgf003202_0001
Figure imgf003203_0001
Figure imgf003204_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24243 54611 24381 2208 MLLVSASVNAMSISGQAGKEY
TNIGVGFGTETTGLALSGNWTH
NDDDGDVAGVGLGLNLPLGPL
MATVGGKGVYTNPNYGDEGY
AAAVGGGLQWKIGNSFRLFGE
Y Y YSPDSLSSG IQS YEEANAG A
RYTIC VQVF V V VMLI RNQLFTP
VNTAIDGFATN YAI MPA YA VG
EGVATHGLQFELMRTTGGNTQ
HVFHHPLPLFAMWRYAAEMFP
YQQMCQFVRYHFFNKSFTVFQ
QQNGIEANFIAFQPGRSGGGSA
LGRNVFGFRYQPQCQQDKQYV
QSRIHPEGVGVTQRVEHGQERC
ANDHVQSSGTELMQEGVMSAS
LKNQQGFSLPEVMLAMVLMV
MIVTALSGFQRTLMNSLASRNQ
YQQLWRHGWQQTQLRAISPPA
N WQSGAA MLRV YHSN RLD VL
EALMEFIVERERLDDPFEPEMIL
VQSTGMAQWLQMTLSQKFGIA
ANIDFPLPASFIWDMFVRVLPEI
PKESAFNKQSMSWKLMTLLPQ
LLEREDFTLLRHYLTDDSDKRK
LFQLSSKAADLFDQYLVYRPD
WL AQ WETGHL VEG LGEAQA W
QAPLWKALVEYTHQLGQPRW
HRANLYQRFIETLESATTCPPG
YLRASLYAAIQHALASVREHTT
LWIAHRLSTIVDADTILVLHRG
QAVEQGTHQQLLAAQGRYWQ
MYQLQLAGEELAASKGHAELY
24244 54612 24382 729 DCSPLIDHCDADTILVLHRGQA
VEQGTHQQLLAAQGRYWQMY
QLQLAGEELAASVQPLPNSDRR
GSMKLVTVIIKPFKLEDVREAL
SSIGIQGLTVTEVKGFGRQKGH
AELYRGAEYSVNFLPKAAYTG
KIGDG/KIFV AELQRVIRIRTGEA
DEAALYLWHTATERDGPFRNF
TAGAFRASLPHPARHGRRPILA
APFMPVNPSRLPPRRRHLASGR
SLRFNCCTAGDETCSARSGERR
24245 54613 24383 683 1 161 RTGHCH/SAFSFSTVLTSCSLSRF
SFLAHRLQYLLRVAEPLPNSDR
RGSMKLVTVIIKPFKLEDVREA
LSSIGIQGLTVTEVKGFGRQKG
HAELYRGAEYSVNFLPKVKIDV
AIADDQLDEVIDIVSKAAYTGKI
GDGKIFVAELQRVIRIRTGEADE
Figure imgf003206_0001
Figure imgf003207_0001
Figure imgf003208_0001
Figure imgf003209_0001
Figure imgf003210_0001
Figure imgf003211_0001
Figure imgf003212_0001
Figure imgf003213_0001
Figure imgf003214_0001
Figure imgf003215_0001
Figure imgf003216_0001
Figure imgf003217_0001
Figure imgf003218_0001
Figure imgf003219_0001
Figure imgf003220_0001
Figure imgf003221_0001
Figure imgf003222_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24385 54753 A 24523 891 3240 RYRKSPGSWR/CMSCILTSWITT
MFPI*RLGQWC*NPCVTMAL/L
API DVHLMVKPVDRI VPDFAAA
GASIITFHPEASEHVDRTLQLIK
ENGCKAGLVFNPATPLSYLDY
VMDKLDVILLMSVNPGFGGQS
FIPQTLDKLREVRRRIDESGFDI
RLEVDGGVKVNNIGEIAAAGA
DMFVAGSAIFDQPDYKKVIDE
MRSELAKFEDIRGVAFDLDGTL
VDSAPGLAAAVDMALYALELP
VAGEERVITWIGNGADVLMER
ALTWARQERATQRKTMGKPPV
DDDIPAEEQVRILRKLFDRYYG
EVAEEGTFLFPHVADTLGALQA
KGLPLGL VTN KPTPFVAPLLEA
LDIAKYFSWIGGDDVQNKKPH
PDPLLLVAERMGIAPQQMLFVG
DSRNDIQAAKAAGLDAQARRK
NAGYYYYLRLAKRQMRAGVA
EALRDDLLAAKLNVKLVNAGD
YKFKQI ASEKLLl V VTSTQG EGE
PPEEAVALHKFLFSKKAPKLEN
TAFAVFSLGDSSYEFFCQSGKD
FDSKLAELGGERLLDRVDADV
EYQAAASEWRARWDALKSRA
PVAAPSQSVATGAVNEIHTSPY
SKDAPLVAWRLPNVEGRARAG
GASSFLADRVEEEGEVRVFIEH
NDNFRLPANPETPVIMIGPGTGI
APFRAFMQQRAADEAPGKNWL
FFGNPHFTEDFLYQVEWQRYV
24386 54754 24524 950 2124 MSLISSVIAATWLHVKSYRLSC
GFYACQSLVLVS/IFPTLSCLFA
AEQLLIWSASAFITKVLLVPLIM
TYAARNIPQNIPEKALFGPAMM
ALLAALIVLLCAFWQPVKLPM
ATGLKPALAVALGHFLLGLLCI
VSQRNILRQIFGYCLMENGSHL
VLALLAWRAPELVEIGIATDAIF
AVIVMVLLARKIWRTHGTLDL
TPLLFSLLCFACRKRRLSATRTV
TVLHSLG ITLLLIL AL WVVQTA
ADAGEIFAAGLWLHIDGLGGLF
LAILGVIGFLTGIYSIGYMRHEV
AHGELSPVTLCDYYGFFHLFLF
TMLLWTSNNLIVMWAAIEAT
TLSSAFLVGIYGQRSSLEAAWK
YIIICTVRVAFGLFGTVLVYANA
ASVMPQAEMAIFWSE
Figure imgf003224_0001
Figure imgf003225_0001
Figure imgf003226_0001
Figure imgf003227_0001
Figure imgf003228_0001
Figure imgf003229_0001
Figure imgf003230_0001
Figure imgf003231_0001
Figure imgf003232_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24463 54831 24601 190 MTDFRESDTMSLFDKKHLVSP
ADALPGRNTPMPVATLHAVNG
HSMTNVPDGMEIAIFAMGCFW
GVERLFWQLPGVYSTAAGYTG
GYTPNPTYREVCSGDTGHAEA
VRIVYDPSVISYEQLLRKYRCV
AVALLGYFKCKPILLNPSYKSM
QDDLAFIARFHFKDLKFRRFSL
KADQKSRIYERVLSIAGVNSWK
DKQHQPRLVEHLLVCAESWVA
PRALFDAAIEYLAHQKIAIPAYL
VAFLVLGPLVLSLIPSQMLPSLT
WSSITYSIWIVEAGYQYHSRLR
RFGRVEVNEQIEYLIHNPIRTRA
RAVNFVDNNNRLQAPRILRQDS
WWITRPLPGLTWAIIGSPGILPQ
WKSARVMKQAVAYLEPFIEA
SKEQGKTNGKMVIATVKGDVH
DIGKNIVGWLQCNNYEIVDLG
VMVPAEKILR*KLGGTPRTL*R
SNRVPGPPEDRHSCLFGCV/YW
FLVP*CCLSSLRRCCRR*RGQAS
RIQ*LLSSKLAISTIL/GLRRFGR
VEVNEQIEYLIHNPIR/IARQGG
QLC**QQSASGPTDITAGFMVD
HQTFTGADVGDNRVARNPATG
GQIGARHETGGGLPRTVY*SQQ
RAGQNQRQDGDRHREGRRPRH
Figure imgf003234_0001
Figure imgf003235_0001
Figure imgf003236_0001
Figure imgf003237_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24496 54864 A 24634 1926 MATAKANQEIQEWLGKYGTAR
VKLNVDKDFSLKDSSLEMLYPI
YDTPTNMLFTQGAIHRTDDRTQ
SNIGFGWRHFSGNDWMAAVIT
FIDHDLSRSHTRIGVGAEYWRD
YLKLSANGYIRASGWKKSPDIE
DYQERPANGWDIRAEGYLPAW
PQLGASLMYEQYYGDEVGLFG
KDKRQKDPHAISAEHSPEFRSE
ALKLVNASVLLPQPVNSACMN
HNSTTGAVNSKISRRLLNVNW
RCLPRLHVSNASWQNGMKSWL
SSKRPRHTSEAPEMKYVFIEKH
QAEFSIKAMCRVLRVARSGWY
TWCQRRTRINIAWRRADGSRT
SFVYTSYLAKVNEEWKNNVGT
GSTVKWPIGLGGKGNDGIAAF
VQRLPGAIGYVEYAYAKQNNL
AYTKLISA*NEVCLY*KTSG*V
QHQSNVPR/RSGWPAAAGIRGV
SGGQG*TLL*YAAQMAPELPSS
TPATWRK*TKSGKTTLV/HGST
VKWPIGLGGKGNDG1AAFVQR
LPGAIGYVEYAYAKQNNLAYT
KLISADGKPVSPTEENFANAAK
GADWSKTFAQDLTOQKGEDA
WPITSTTFILIHKDQKKPEQGTE
VLKFFDWAYKTGAKQANDLD
YASLPDSWEQVRAAWKTNIK
DSSVNFDMANVFFRVTPGNFL
VNREASHQRRVFAVRAGVDFI
YRASSDGLRRCRDRRTRFKRFN
24497 54865 24635 1387 1639 ESKPLNLGRGSGFLLTRALSAQ PLWQSPDQHLTPQAGLRSRL*A QRQA*QKTASRW*PKPLWAL VKATSEDISVVL* ESACQC
24498 54866 24636 1570 2516 IEVAMSIWKNNIHWVGQRDW
EVRDFHGTEYKTLRGSSYNSYF
IREEKNVLIDTVDHKFSREFVQ
NLRNEIDLADIDYIVINHAEEDH
AGALTELMAQIPDTPIYCTANAI
DSINGHHHHPEWNFNWKTGD
TLDIGNGKQLIFVETPMLHWPD
SMMTYLTGDAVLFNYDAFGQH
YCDEHLFNDEVDQTELFEQCQ
RYYANILTPFSRLVTPKITEILGF
NLPVDMIATSHGWWRDNPTQ
IVELYLKWAADYQEDRITIFYD
TMVNNTPMMADAIAQGIAVETG
PRLAGENFNVVARNDKHEDPDL
LSFRSERGA
Figure imgf003239_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Uπknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24509 54877 24647 194 2481 GMFRMNLMEPLTQCINAGHEM
TKAIAIAQFNDDSPEARKITRR
WRIGEAADLVGVSSQAIRDAEK
AGRLPHPDMEIRGRVEQRVGY
TIEQINHMRDVFGTRLRRAEDV
FPPVIGVAAHKGGVYKTSVSVH
LAQDLALKGLRVLLVEGNDPQ
GTASMYHGWVPDLHIHAEDTL
LPFYLGEKDDVTYAIKPTCWPG
LDHPSCLALHRIETELMGKFDE
GKLPTDPHLMLRLAIETVAHDY
DVIVIVDSAPNLGIGTINWCAA
DVLIVPTPAELFDYTSALQFFD
MLRDLLKNVDLKGFEPDVRILL
TKYSNSNGSQSPWMEEQIRDA
WGSMVVIKNVVRETNEVGKAA
PMVDSLIARVGVMARGNAITLP
VCGRDVKFTLEVLRGDSVEKTS
RVWSGNERDQELLTEDALDDLI
PSFLLTGQQTPAFGRRVSGVIEI
ADGSRRRKAAALTESDYRVLV
GELDDEQMAALSRLGNDYRPT
SAYERGQRYASRLQNEFAGNIS
ALADAENISRKIITRCINTAKLP
KSWALFSHPGELSARSGDALQ
KAFTDKEELLKQQASNLHEQK
KAGVIFEAEEVITLLTSVLKTSS
ASRTSLSSRHQFAPGATVLYKG
DKMVLNLDSPQSFSPGAGDHA
NSPQSSSPSTGVHINSPQLSSLS
AGVHTISPQSSNPSVGVTALSHP
DLVWVFTLTAHSRPVLVWVFT
24510 54878 24648 1361 1694
2451 1 54879 24649 3852 MKLMETLNHCINAGHEMTKAV
AIAQFNDDSPEARKITRRWRIG
EAADLVGVSSQAIRDAEKAGR
LPHPDMEIRGRVEQRVGYTIEQI
NHMRDVFGTRLRRAEDVFPPVI
GVAAHKGGVYKTSVSVHRAQ
DLALKGLRVLLVEGNAPQGTA
SVYHGWVPDLHIHAEAALLPF
YLGEKDDVTYAIKPTCWPGLDI
IPSCLALPRIEPELMGK/FDEGRS
FMPGPP/HHHPLGRSSQ*QVSNS
LDRAAKGN*KPLCHC
Figure imgf003241_0001
Figure imgf003242_0001
Figure imgf003243_0001
Figure imgf003244_0001
Figure imgf003245_0001
Figure imgf003246_0001
Figure imgf003247_0001
Figure imgf003248_0001
Figure imgf003249_0001
Figure imgf003250_0001
Figure imgf003251_0001
Figure imgf003252_0001
Figure imgf003253_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24595 54963 24733 815 MRVGRKAVGAIWWSSKQELFQ
QLFLRTELEDIGPGIVKRELGRE
GSKDTGSSDDWEQLPDWKTKR
VGNERDQELLIEDALDDLIPSFL
LTGQQTPAFGRRVSGVIEIADG
SRRRKAAALTESDYRVLVGEL
DDEQMAALSRLGNDYRPTSAY
ERGQRYASRLQNEFAGNISALA
DAENISRKIITRCINTAKLPKSV
VALFSHPGELSARSGDALQKAF
TDKEELLKQQASNLHEQKKAG
VIFEAEEVITLLTSVLKTVQSKN
LSHFTWICHAETAVINHKKRKN
SPRIVQSNDLTEAAYSLSRDQK
RMLYLFVDQIRKSDGTLQEHD
GICEIHVAKYAEIFGLTSAEASK
DIRQALKSFAGKEWFYRPEED
AGDEKGYESFPWFIKHWVTISP
QTSAYERGQRYASRCKNEFAG
NISALADAENISRKIITRCINTAK
LPKSWALFSHPGELSARSGDA
LQKAFTDKEELLKQQASN LHE
QKKAGVIFEAEEVITLLTSVLKT
SSASRTSLSSRHQFAPGATVLY
KGDKMVLNLDRSRVPTECKRK
LRPF* SGQRTDTAY VFDPGRDY
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24596 54964 24734 1249 DF A DFGTTIKQDFRLLGQTS V D
RLLQLSQGQAVKGNQLLPVSL
VKRKTTLAPNTQTASPRALADS
LMQLARQVSRLESGQLAISGRI
LFLPMLNSINKLAAPSQPCTEPF
FRRDPTLHPKSEVRLPPKVTTIT
VISNPILITPQEKSVAQCTYNQT
VQLDITAFLKTVKKNKHKFYPA
FIHILARLMNAHPEFRMAMKD
GELVIWDSVHPCYTVFHEQTET
FSSLWSEYHDDFRQFLHIYSQD
VACYGENLAYFPKGFIENMFFV
SANPWVSFTSFDLNVANMDNF
FAPVFTMGKYYTQGDKVLMPL
AIQI RKSDGTLQEHDG ICEIH VA
KYAEIFGLTSAEASKDIRQALKS
FAGKEWFYRPEEDAGDEKGY
ESFPWFIKRAHSPSRGLYSVHIN
PYLIPFFIGLQNRFTQFRLSETKE
ITNPYAMRLYESLCQYRKPDGS
GIVSLPIDWIIERYQLPQSYQRM
PDFRRRFLQVCVNEINSRTPMR
LSYIEKKKGRQTTHIIRISDGTL
QEHDGICEIHVAKYAEIFGLTSA
EASKDIRQALKSFAGKEWFYR
PEEDAGDERGYESFPWFIKRAH
SPSRGLYSVHINPYLIPFFIGLQN
RFTQFRLSETKEITNPYAMRLY
ESLCQYHLCGTLRNSRFCEI H V
AKYAEIFGLTSAEASKDIRQAL
KSFAGKEWFYRPE/VGCRR*K
RL*IFSLVYQTCAQSIQRALQCT
Figure imgf003256_0001
Figure imgf003257_0001
Figure imgf003258_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /^possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24602 54970 24740 398 MVDSLIARVGVMARGNAITLP
VCG RD VKFTLQ VLRGDS VEKT
SRVPPGNERDQEMLTEDALDD
LIPSFLLTGQQTPAFGRRVSGVI
HTADGSRRRKAAALTEKFCIVD
NPPGEIRKRACLFIPTHALNLNR
FEDICYAHQLPDGGLLSARVGV
MARGNPILSPVCGRNVKFTLKV
LRGDSVENPSRVWSGNERAQE
LLTEHALDDLTLSFLLTGQQTP
AFGRRVSGVIEIADGSRRRKAA
ALTESDYRVLVGELDDEQMAA
LSRLGNDYRPTSPYERGQRYAS
RLQNEFAGNISALADAENISRKI
ITRCINTAKLPKSWALFSHPGE
LSARSGNERDQELLTEDALDDL
IPSFLLTGQQTPAFGRRVSGVIH
TADG SRRRKAA/DTYRKIL YR* *
PPGGD*KKSVPFYSNTCAQSQP
24603 54971 24741 1450 3699 RN PLQI PKQQQNEKYQVPQFDQ
STIKNIESAKGLDVWDSWPLQN
ADGTVAEYNGYHWFALAGSP
KDADDTSIYMFYQKVGDNSIDS
WKNAGRVFKDSDKFDANDPIL
KDQTQEWSGSATFTSDGKIRLF
YTDYSGKHYGKQSLTTAQIFQ
MITVFITSLCNTTSSHICLHISWV
TTMHGAMPHPGHIPTAPASLPA
KVKGSGKVCWLAFPLSELTPPS
ALQVNVSKSDDTLKINGVEDH
KTIFDGDGKTYQNVQHFIDEGN
YTCGDNHTLRDPHYVEDKGHK
YLVFEANTGTENGYQGEESLFN
KAYYGGGTNFFRKESQKLQQS
AKKRDAELAΝGALGIIELΝNDY
TLKKVMKPLITSΝTVTDEIERA
ΝVFKMΝGKWYLFTDSRGSKM
TIDVPQAKSTΝWITSYMTΝRG
FFEPEGSKGSITAAVMAKGLSV
ASGΝSTPERRRATAΝEΝVQPRA
IRDAEKAGRLPHPDMEIRGRVE
QRVGYTIEQIΝHMRDVFGTRLR
RAEDVFPPVIGVAAHKGGVYK
TSVSVHLAQDLALKGLRVLLV
EGΝDPQGTASMYHGWVPDLHI
HAEDTLLPFYLGEKDDVTYAIK
PTCWPGLDIIPSCLALHRIETEL
MRKFDEGΝLPTDPHLMLRLAIE
TVAHDYDVI VIDS APΝLG IGTIN
WCAADVLIVPTPAEQLDISVSS
RQFFDVILRDLLKΝVDLKGFEPD SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Uπknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /^possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possiblι- nucleotide insertion) sequence
24604 54972 24742 3899 MVDSLIAPVGVMARGNAITLPV
RATNVKFTLEELRGDSVEKTSR
VWSEKKDRGGQEAEVSGEGSS
QTKSLQKRQHSNGSDFTARGF
HTGQALVRSKGTYWNPTGLGL
SHGTEDPNLIGEELDAGEWDK
NTRKELAVLNVRRDFADFGTTI
KQDFRLLGQTSVDRLLQLSQG
QAVKGNQLLPVSLVKRKTTLA
PNTQTASPRALADSLMQLARQ
VSRLESGQFWVFEGGLRVLVR
ERVTTPKGTVASMYQGWVVPD
24605 54973 24743 268 2617 RMFRMKLMETLVSQCINAGHE
MTKAIAIAQFNDDSPEARKITR
RWRIGEAADLVGVSSQAIRDAE
KAGRLPHPDMEISGRVVKHGVG
YTIEQIYLVTRDVFGTRLPRAEDV
VFPPVIGVAAHKGGVYKTSVSV
HLAQDLALKGLRVLLVEGNDP
QGTASMYHGWVPDLHIHAEDT
LLPFYLGEKDDVTYAIKPTCWP
GVLDIIPSCLALHRIETELMGKFD
EGKLPTDPHLMLRLAIETVAHD
YDVIVIDSAPNLGIGTINWCAA
DVLIVPTPAELFDYTSALQFFD
MLRDLLKTVDLKGFEPDVRILL
TKYSNSNGSQSPWMEEQIRDA
WGSMVLKNVVRETDEVGKGQI
RMRTVFEQAIDQRSSTDTSLSTP
AAPMVDSLIARVGVMARGNAI
TLPVCGRDVKFTLEVLRGDSVE
KTSRVWSGNERDQELLTEDAL
DDLIPSFLLTGQQTPAFGRRVSG
VIEIADGSRRRKAAALTESDYR
VLVGELDDEQMAALSRLGNDY
RPTSAYERGQRYASRLQNEFAG
NISALADAENISRKIITRCINTAK
LPKSWALFSHPGELSARSGDA
LQKAFTDKEELLKQQASNLHE
QKKAGV IFEAEE VITLLTS VLKT
SSASRTSLSSRHQFAPGATVLY
KGDKMALNLDRSRVPAYIIRSY
IRCGRKGFASAGVGGCRGWLN
YAASEQIVLRVHHMRCEIPHRC
Figure imgf003261_0001
Figure imgf003262_0001
Figure imgf003263_0001
Figure imgf003264_0001
Figure imgf003265_0001
Figure imgf003266_0001
Figure imgf003267_0001
Figure imgf003268_0001
Figure imgf003269_0001
Figure imgf003270_0001
Figure imgf003271_0001
Figure imgf003272_0001
Figure imgf003273_0001
Figure imgf003274_0001
Figure imgf003275_0001
Figure imgf003276_0001
Figure imgf003277_0001
Figure imgf003278_0001
Figure imgf003279_0001
Figure imgf003280_0001
Figure imgf003281_0001
Figure imgf003282_0001
Figure imgf003283_0001
Figure imgf003284_0001
Figure imgf003285_0001
Figure imgf003286_0001
Figure imgf003287_0001
Figure imgf003288_0001
Figure imgf003289_0001
Figure imgf003290_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24793 55161 24931 2249 MLPDIARFLPRLANDQIEEFTW
YVNHFTYSNAIKIFCYQMFLAS
TQARASGAPLQRLFRAIGGNA
WERPFRMVFCSAAFSEESSPFR
ETNYEPQLFLGFATDYRFAGW
TLRDVEMGYNHDSNGRSDPTS
RSWNRLYTRLMAENGNWLVE
VKPWYWGNTDDNPDITKYM
GYYQLKIGYHLGDAVLSAKGQ
YNWNTGYGGAELGLSYPITKH
VRLYTQVYSGYGESLIDYNFNQ
TRVGVGAIGVNVAQAEVLNLE
SGAKQVLQETFGYQQFRPGQE
EIIDTVLSGRDCLWMPTGGGK
SLCYQIPALLLNGLTVWSPLNS
LKEKSANPFEFTFAAFAHAFHW
VIQAIGMINTPA DGSTHKQART
SWLPYSSLPVLSVSTQDEKAKK
KMRFMQRSKDSLAKWLSAILP
WI VGL VGLFAVTVIRD YG RAS
EADRQALLEKGNVLIRALESGS
RVGMGMRMHHVQQQALLEEM
AGQPGVLWFAVTDAQGIIILHS
DPDKVGRALYSPDEMQKLKPE
ENSRWRLLGKTETTPALEVYRL
FQPMSAPWRHGMHNMPRCNG
KAVPQVDAQQAIFIAVDASDLV
ATQSGEKRNTLIILYALATVLL
ASVLSFFWYRRYLRSRQLLQDE
MKRKEKLVALGHLAA/GVAHE
IRNPLSSIKGLAKYFAERAPAGG
EAHQLAQHVQIRRFHAHHPAS
Figure imgf003292_0001
Figure imgf003293_0001
Figure imgf003294_0001
Figure imgf003295_0001
Figure imgf003296_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24824 55192 24962 128 2574 QWWMAMVAKWIDIWW
GVEMIRVKVVAIWEVWVEV
VWMVAWWW****WW*/DMG
EARWWWEYGVWEVEPEIILNA
PQQYLLAGIGDTLAKWYEAW
LAPQPETLPLTVRLGINNAQAIR
DVLLNSSEQALSDQQNQQLTQS
FCDWDAIIAGGGMVGGLGDR
FTRVAAAHAVHNGLTVLPQTE
KFLHGTKVAYGILVQSALLGQ
DDVLAQLTGAYQRFHLPTTLA
ELEVDINNQAEIDKVIAHTLRPV
ESIHYLPVTLTPDTLRAAFKKYS
QSCFVIRLGGDFIHQLAEΝHIVI
FVΝΝHΝGTRCQTFQWAISΝCH
TVSLQKFSVRKFLSDLFAIGGR
RGVRPPTEAPGHQKSDΝQAHV
SVSFDGLDLVFFWΝQWAGW
FCFDLFSRKRGRFSVWRΝKGFG
RYCQRLAΝFRDRQAFQLLΝQIH
FTRCKQVDΝGGTEFKATHFLAF
QQCHRTWFPGRPAIKTRVERL
DRYRΝRMEKSΝGQAΝLRVHG
GTWEKLVLQLDSAGMVΝRVSS
VGAPLREPIPTEGKGAKEDTWG
NLSSNDPLLEAFGGQFGIAISMH
QTLATRGGKRNARPTTRPRFHR
GTNPRGIAPHIGTANARYSTGH
CPVALPFGLRGPPLARENVATA
TKGH ESASRETEW A VAKLTSR
VLLICDATHALPIQVLRN DIRVI
AAGNGCTDDLQLEHLPANTDIR
24825 55193 24963 117 335 PGLAPATRRSTPGRFNADAGAD * RRAAGTGAGRAKRRGVGIHR CLADPFLADCRNAGLRGAALC GTAAASGK
Figure imgf003298_0001
Figure imgf003299_0001
Figure imgf003300_0001
Figure imgf003301_0001
Figure imgf003302_0001
Figure imgf003303_0001
Figure imgf003304_0001
Figure imgf003305_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24887 55255 25025 2209 MIGEIRDGETAEIAIKAAQTGHL
VLSTLHTNSTCETLVRLQQMG
VARWMLSSALTLVIAQRLVRK
LCPHCRRQQGEPIHIPDNVWPSP
LPHWQAPGCVHCYHGFYGRTA
LFEVLPITPVIRQLISANTDVESL
ETHARQAGMRTLFENGCLAVE
QGLTTFEELIRVLGMPHGDGAA
MDREENELGVSCIAVPVFDIHG
RVPYAVSISLSTSRLKQLYAGLP
VWCDGSRFFAQKARAIKCKQW
GYVGAKSRQRWLFYAYDRLRK
TWAHVFGERTMATLGRLMSL
LSPFDWIWMTDGWPLYESRL
KGKLHVISKRYTQRIERYNLNL
RQHLARLGRKSLSFSKSVELHD
KAAGGVPVLVRELLKAGLLHE
DVNTVAGFGLSRYTLEPWLNN
GELDWREGAEKSLDSNVIASFE
QPFSHHGGTKVLSGNLGRAVM
KTSAVPVENQVIEAPAVVFESQ
HDVMPAFEAGLLDRDCWWR
HQGPKANGMPELHKLMPPLGV
LLDRCFKIALVTDGRLSGASGK
VPSAIHVTPEAYDGGLLAKVRD
GDIIRVNGQTGELTLLVDEAEL
AAREPHISELSAFTFGTGRGYSA
AFGVKCPFRKRALSDPCYAMR
EVADFLINKGVDGLFYLGTGGE
FSQMNTAQRMALAEEAVTIVD
GRVPVLIGVGSPSTDEAVKLAQ
HAQA YGADGI V A IN PY Y WKV A
24888 55256 25026 3395 MSELPFTIASKRIKYLGIQLTRD
VKDLFKENYKPLLKEIKEDTNK
WKYIPCSWLGRINIVKMAILAK
VIYRFΝAIPIKLPMTFFKELEKT
TLKFIWΝQKGAASRSΝPEPKΝK
LEDHITΝSTIYRLSTKSCSVSGL
KQTLLAESEALTSYSHRVFSAW
DFGLCGDVHVRLRQRIILYELK
GQVPALLLDDGTLLTEGVAIM
QYLADSVPDRQLLAPVΝSISRY
KTIEWLΝYIATELHKELKFSFH
WSTTSHKGLGM
Figure imgf003307_0001
Figure imgf003308_0001
Figure imgf003309_0001
Figure imgf003310_0001
Figure imgf003311_0001
Figure imgf003312_0001
Figure imgf003313_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24926 55294 25064 1368 2451 PAHAAPGGGC/VRSLACRFLWN
MRVTGKSYSTLRRVLSAIWNW
RMMRCALTRALVAFRVR/SSVP
LAAVLAIYARENGAGTMFEPE
AAYDEDTSIMNDEEASADNET
VMSVIDGDKPDHDDDTHPDDE
PPQPPRGGRPGLRGLTLRWFSV
AAQRSDILDAVTLSLKVAALAT
LIALVLGTLAAAALWRRDFFG
KNAISLLLLLPIALPGIVTGLALL
TAFKTINLEPGFFTIWGHATFC
WWFNNVIARFRRTSWSLVEA
SMDLGAΝGWQTFRYWLPΝLS
SALLAGGMLAFALSFDEIIVTTF
TAGHERTLPLWLLΝQLGRPRD
VPVTNWALLVMLVTTLPILGA
WWLTAKATYLEKLKV
24927 55295 A 25065 2072 MSEQLTDQVLVERVQKGDQKA
FNLLWRYQHKVASLVSRYVP
SGDVPDWQEAFIKAYRELTPR
RPYLLRAFYEWLLDΝQLTPHL
WDVTLPGVQVPMEYARDGQI
VLΝIAPRAVGΝLELAΝDEVRFΝ
ARFGGIPRQVSVPLAAVLAIYA
REΝGAGTMFEPEAAYDEDTSI
MΝDEEASADΝETVMSVIDGDK
PDHDDDTHPDDEPPQPPRGGRP
ALRWΝVIVAIIFTTKPTAMGIA
VRFTTAKGETPAIPDVTTTTAVI
GETARASWAKCIGIIITTAVPIV
VKPFAΝISPATTSVPRFFGLPAS
SSHTLIGAIIGIGLTNALLTGSSV
MDALΝLREVTKIFSSLIVSPIVG
LVIAGGLIFLLRRYWSGTKKRD
RIHRIPEDRKKKKGKRKPPFWT
RIALIVSAAGVAFSHGAΝDGQK
GIGLVMLVLVG1APAGFVVΝM
ΝASGYEITRTRDAVTNFAARES
EASVIRHSΝRFIETIΝGGRRCSR
GITLTLHKLRVSAQPLLVAVLΝR
LEQKKPVGLRYDPQAQSLVCLP
TQTRTGWΝLΝGFEVGFRPCVR
LMIYGRSLEAQATASLAAATG
YDSH1FDLFPASASAQIDTDTAV
ILLCHDLΝRELPVLQAAREAKP
F YLG ALG S YRTHTLRLQKLHEL
G WSVRQETTQIRAPVGIFPKARD
AHTLALSVLAEVASVRLHQEE
Figure imgf003315_0001
Figure imgf003316_0001
Figure imgf003317_0001
Figure imgf003318_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmiπo acid sequence ( X=Unkno\vn,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24954 55322 25092 338 2388 CVQEPLITVSWRLFAIPAGMKF
SLFLRCLFCFHLAARYLVLLCQ
LH SQGIKVLDFTGVQSGPSCTQ
MLA WFGAD VIKIERPG VG DVT
RHQLRDIPDIDALYFTMLNSNK
RSIELNTKTAEGKEVMEKLIRE
ADILVENFHPGAIDHMGFTWEH
IQEINPRLIFGSIKGFDECSPYVN
VKAYENVAQAAGGAASTTGF
WDGPPLVSAAALGDSNTGMHL
LIGLLA ALLHRELNMTV A El VE
REEWAGFRARETAALEAVTAP
STVIATGGGIILTEFNRHFMQNN
GIWYLCAPVSVLEDFRQRTME
VLRLAFLSSGILEFFTSLSIALVA
VYFGFSYLGELDFGHYDTGVTL
AAGFLALILAPEFFQPLRDLGTF
YHAKAQAVGAADSLKTFMETP
LAHPQRGEAELASTDPVTIEAE
ELFITSPEGKTLAGPLΝFTLPAG
QRAVLVGRSGSGSRYCPYIΝHR
ΝVRLGPIVTDVPWEΝSGDKSLP
EATAQVISSGSGATKVEDGLGD
LΝKPVSΝQΝLVTGIDTPVYΝAP
SAGSAPFGVLADΝMRYPILHKL
KDRLΝQTWYQ1RIGDRLAYISA
LDAQPDΝGLSVLTYHHILRDEE
ΝTRFRHTSTTTSVRAFNΝQMA
WLRDRGYATLSMVQLEGYVK
ΝKIΝLPARAWITFDDGLKSVS
RYAYPVLKQYGMKATAFIVTT
24955 55323 25093 1810 2399 STFRΝSDDEPYIVRPVAVAWRD
YVPQPARNGLSΝFTGΝLEEPAV
MVΝYFLQGDPYQGMVHFTRFF
HQHPSGLS*WVAGΝIFQILCAΝ
QPVRKRTRDNVQLLHSRGQRG
VFRCSRRVLTGLVDRFGRTQTF
HREAAGEFSGEITGVTDGAWR
HFRLVLTTQAQRAEEARQQAIS
GGTEPSAFPDTLPVDTEYVMNF
24956 55324 25094 577 MPVSTAVATVIKTKQVEFQLQ
VGVPLYFRLRANPIKTILDNQK
RLDSKGNIKRCRVPLIKEAEQIA
WLQRKLGNAARVEDVHPISER
PQYFSGDGKSGKIQTVCFEVIG
CREEPDAEECHGDREQRRVFIR
EDGFF VTKDL YPNRRAVG 11 STT
AVISPSQPPIIAPRVAQSSEPRSA
AHALARAIRQSTTV
Figure imgf003320_0001
Figure imgf003321_0001
Figure imgf003322_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24974 55342 251 12 1087 3289 RLRSCGAQKNTFCNAAAGCRW
RTTLSKMGGLLIAADLQDPVQ
DMEMWQFHPTGIAC/AGVLVT
E/GCRGEGGYLLNKHGERFMER
YAPNAKDLAGRDWARSIMIEI
REGRGCDGPWGPHAKLKLDHL
GKEVLESRLPGILELSRTFAHVD
PVKEPIPVIPTCHYMMGGIPTKV
TGQALTVNEKGEDVWPGLFA
VGEIACVSVHGANRLGGNSLL
DLWFGSARQVLHLQESIAEQG
ALRDASESDVEASLDRLNRWN
NNRNGEDPVAIRKALQECMQH
NFSVFREGDAMAKGLEQLKVI
RERLKNARLDATSSEFNTQRVE
CLELHNLMETAYATAVYATFR
TERRGAHSRVFDFPDRDDENWL
CHSLYLPESESMTRRSDYTLEA
DEGRDMMLLDALIQLKEKDPS
LSFRRSCREGVCGSDGLNMNG
KNGLACITPISALNQPGKKIVIR
PLPGLPVIRDLWDMGQFYAQ
YEKIKPYLLNNGQNPPAREHLQ
MPEQREKLDGLYECILCACCST
SCPSFWWNPDKFIGPGSLLAGY
RSGLIAGIPKCSARYLSGEKRSV
AWQQIAAEIGISIDAQFNESLKG
ISRDESLRRILQHGGKEGDFNSQ
ERAQLAYRKΝLLYVHSLRELT
VΝAVLPGIRSLLADLRAQQISV
GLASVSLΝAPTILAALELREFFT
FCADASQLKΝSKPGPGIFLAGC
24975 55343 251 13 47 630 VGWETSFAQIRSRFLRIRFCRAY
SSTLFVSAAΝPTT*G/QMVF*CG
NGAQHKNILLLYRSIHGGFHFQ
RGHHINTFNKRWRR* *NRT* YQ
RDVRATTGSGGGDRKTHLAGA
VIRDITHRIERFTRRAGGNHNV
QMFQWDGRKCATAISTVLDM
FGDHFGRQKIASCQRWLAKTIR
TVQQPICKNIVRKPGCHSLA
24976 55344 251 14 731 829
Figure imgf003324_0001
Figure imgf003325_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
24993 55361 25131 473 1765 KAWLMTGIVRHWKGRAVPPA
EQLLEHLGNGKSFDSVAQAWD
AAMADAKAEDTVLWGVEAFRI
NAAVAKDDTQIHTHMCYCEFT
DIMDSIAALDADVITIETSRSDM
ELLESFEEFDYPNEIGPGVYDIH
SPNVPSVEWIEALLKKAAKRIP
AERLWVNPDCGLKTRGWPETR
AALANMVQAAQNLRRGARAG
CAADFAASARLAVGEALTNIA
ATQIGDIKRIKLSANWMAAAG
HPGEDAGLYEAVKAVGEELCP
ALGLTIPVGKDSMSMKTRWQE
GNEEREMTSPLSLVISAFARVE
DVRHTITPQLSTEDNALLLIDLG
KGNNALGATALAQVYRQLGD
KPADVRDVAQLKGFYDAIQAL
VAQRKLLAYHDRSDGGLLVTL
AEMAVLSSLGQGGYFGHMII/D
SGQYPGKSKTGQSTPDVAAQA
RQSWIP
24994 55362 25132 602 2563 WTTLTLVLTAVGSVLLLLFLV
MKARMHAFLALMWSMGAGL
FSGMPLDKIAATMEKGMGGTL
GFLAVWALGAMFGEILHETG
AVDQIAVKMLKSFGHSRAHYA
IG L AGL VC ALPLFFEVAI VLLIS
VAFSMARHTGTNLVKLVIPLFA
GVAAAAAFLVPGPAPMLLASQ
MNADFGWMILIGLCAAIPGMII
AGPLWGNFISRYVELHIPDDISE
PHLGEGKM PSFGFSLSLILLPL V
LVGLKTIAARFVPEGSTAYEWF
EFIGHPFTAILVACLVAIYGLAM
RQGMPKDKVMEICGHALVQPA
GIILLVIGAGGVFKQVLVDSGV
GPALGEALTGMGLPIAITCFVL
AAAVRIIQGSATVACLTAVGLV
MPVIEQLNYSKRRGFTGRVEEN
FGVCKGPSKYHSQHDAINRQG
REASDAHGTLENLPLPLVPQPN
AATALAALRASGLEVSENAIRD
G I ASAILPGRFQI VSESPRVIFDV
AHNPHAAEYLTGRMKALPKNG
RVLAVIGMLHDKDIAGTLAWL
KSWDDWYCAPLEGPRGATAE
QLLEHLGNGKSFDSVAQAWDA
AMADAKAEDTVLVCGSFHTVA
HVMEVIDARRSGGKLCQMHVN
VQDKIDEEINQVHICLCLVLVIV
SGCKYVGYSLKCSEHYYSLVR
Figure imgf003327_0001
Figure imgf003328_0001
Figure imgf003329_0001
Figure imgf003331_0001
Figure imgf003332_0001
Figure imgf003333_0001
Figure imgf003334_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25039 55407 25177 2635 MNKSGKYLVWTVLSIMGAFAL
GYIALNRGEQINALWIWASVC
IYLIAYRFYGLYIAKNVLAVDP
TRMTPAVRHNDGLDYVPTDKK
VLFGHHFAAIAGAGPLVGPVLA
AQMGYLPGMIWLLAGWLAG
AVQDFMVLFVSTRRDGRSLGE
LVKEEMGPTAGVIALVACFMI
MVIILAVLAMIVVKALTHSPWG
TYTVAFTIPLALFMGIYLRYLRP
GRIGEVSVIGLVFLIFAIISGGWV
AESPTWAPYFDFTGVQLTWML
RLINRSIMMDFSLTEEQELLLAS
IRELITTNFPEEYFRTCDQNGTY
PREFMRALADNGISMLGVPEEF
GGIPADYVTQMLALMEVSKCG
APAFLITOGQCIHSMRRFGSAE
QLRKTAESTLETGDPAYALALT
EPGAGSDNNSATTTYTRKNGK
VYINGQKTFITGAKEYPYMLVL
ARDPQPKDPKKAFTLWWVDSS
KPGIKINPLHKIGWHMLSTCEV
YLDNV*TSVWRTCACPALRRA
TQRYRRQSRAGNLRTSGEAIRR
DMEWTSLRSEVDWEKGDTIKQC
ISAATSCVYGVTGEGILEGGDA
QILREPLPGTKLGLLNWPVFDA
LESRSTRERDGKPGGEICGETA
NILIRGPENRPLRRDPPGKP
Figure imgf003336_0001
Figure imgf003337_0001
Figure imgf003338_0001
Figure imgf003339_0001
Figure imgf003340_0001
Figure imgf003341_0001
Figure imgf003342_0001
Figure imgf003343_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25089 55457 25227 1886 HRLNITAENDCRRLHCSLRDLS
SLLQAVGRLAEYFIGDVFAARF
NDALTWERLVKVTLYGSQIKL
YNIETAVPSVLKPDLIDVHAQS
LAGGCRLSAHWLAQYCSEIHR
QNTQQFVTLISTTMDAITPLIST
KVQDKLLLSACHLLVSLATTVR
PVFLISIPAVQKVFNRITDASAL
RLVDKAQVLVCRALSNILLLPW
PNLPENEQQWPVRSINHASLISA
LSRDYRNLKVPSAGAPQRKMPL
DDTKLIIHQTLQRLRKILWENIS
GVESTKSSTDFAYQFGCRESVVQ
VSLGLFPAFVIHQSDVTDEMLSF
FLTLFRGLRVQMGVPFTEQIIQT
FLNMFTREQLAESILHEGSTGC
RWEKFLKILQVWQEPGQVFK
PFLPSIIALCMEQVYPIIAERPSP
DVKAELFELLFRTLHHNWRYFF
KSTVLASVQRGIAVEEQMENEP
QFSAIMQAFGQSFLQPD1HLFK
QNLFVYLVETLNTKQKLVYHKKVI
FRVTAMVLFQFVNVLLQVVLVVH
KSHDLLVQEVEIGHRHSYNMASV
DFVDGFFAAFLPEFLVTSCVDGVV
DANQKSVLGRNFVKMDRLCPCL
PGPAPHSPRMCTGWSTTCGLLT
DSGNDSLAPWGLWKLLGPCLL
PGGTRDFLLVAT
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25090 55458 A 25228 689 3053 VDCRMS/FSYIEKFTDFLRLFVS
VHLRRIESYSQFPWEFLTLLFK
YTFHQPTHEGYFSCLDIWTLFL
DYLTSKIKSRLGDKEAVLNRYE
DALVLLLTEVLNRIQFRYNQAQ
VLEELVDDETLDDDQQTEWQRY
LRHSLEWAKVMELLPTHAFST
LFP VLQDN LVE V YLGLQQFI VTS
GSGHRLNITAENDCRRLHCSLR
DLSSLLQAVGRLAEYFIGDVFA
ARVFN DALTV VERL VKVTLYGS
QIKLYNIETAVPSVLKPDLIDVH
AQSLAALQAYSHWLAQYCSEV
HRQNTQQFVTLISTTMDAITPLI
STKVQDKLLLSACHLLVSLATT
VRPVFLISIPAVQKVFNRITDAS
ALRLVDKAQVLVCRALSNILLL
PWPNLPENEQQWPVRSINHASL
ISALSRDYRNLKPSAVAPQRKM
PLDDTKLIIHQTLSVLEDIVENIS
GESTKSRQICYQSLQESVQVSL
ALFPAFIHQSDVTDEMLSFFLTL
FRGLRVQMGVPFTEQIIQTFLN
MFTREQLAESILHEGSTGCRW
EKFLKILQWVQEPGQVFKPFL
PSIIALCMEQVYPIIAERPSPDVK
AELFELLFRTLHHNWRYFFKST
VLASVQRGIAEEQMENEPQFSA
IMQAFGQSFLQPDIHLFKQNLF
YLETLNTKQKLYHKKIFRTAML
FQFVNVLLQVLVHKSHDLLQE
EIGIAIYNMASVDFDGFFAAFLP
25091 55459 25229 853 MISVANEIAEAGFDPQGRTSED
LLDLAESRVFKIAESRANKDEG
PKNIADVLDATVARIEQLFQQP
HDGVTGVNTGYDDLNKKTAG
LQPSDLIIVAARPSMGKTTFAM
TLVENAGRVMLQDKPVVLIFSLE
MPSEQIMMRSLASLSRVDQTKI
RTGQLDDEDWARISGTMGILLE
KRNIYIDDSSGLTPTEVRSRARR
IVAREHGGIGLIMIDYLPLMRVP
ALFDNRTLEIAEISRSLKALAKE
LNVPWALSHLNRSLEQRAEER
PGNSDLGESGSIEQDADLIL
Figure imgf003346_0001
Figure imgf003347_0001
Figure imgf003348_0001
Figure imgf003349_0001
Figure imgf003350_0001
Figure imgf003351_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25138 55506 A 25276 652 2689 KFSVGYAVQNFNATVSTKQFQ
VFSEQNTSSYFAEPQLDVNYYQ
NDWHFVTTRDDMPEATRVHL
EPTINLPLSNNWGSINTEAKLLA
THYQQTNLDWYNSRNTTKLDE
SVNRVMPQFKVDGRMVFERD
MEMLAPGYTQTLEPRAQYLYV
PYRDQSDIYNYDSSLLQSDYSG
LFRDRTYGGLDRIASANQVTTG
VTSRIYDDAAVERFNISVGQIY
YFTESRTGDDNITWENDDKTGS
L V WAGDTY WRI SERWGLRGGI
QYDTRLDNVATSNSSIEYRRDE
DRLVQLNYRYASPEYIQATLPN
VNHHITMEHLHMKTRTQQIEEL
QKEWTQPRWEGITVRPYSAEDV
VKLRGSVNPECTLAQLGAAKM
WRLLHGESKKGYINSLGALTG
GQALQQAKAGIEAVYLSGWQV
AADANLAASMYPDQSLYPANS
VPAWERINNTFRRADQIQWSA
GIEPGDPRYVDYFLPIVADAEA
GFGGVLNAFELMKAMIEAGAA
AVHFEDQLASVKKMRHMGRT
ASDQPWLAYAPYADLVWCEPP
PPDLELARRFAQAIHAKYPGKL
LAYNCSPSFNWQKNLDDKTIAS
FQQQLSDMGYKFQFITVLAGIHS
MWFNMFDLANAYAQGEGMK
HYVEKVQQPEFAAAKDGYTFV
SHQQEVGTGYFDKVTTIIQGGT
SSVTALTGSTEESQF
Figure imgf003353_0001
Figure imgf003354_0001
Figure imgf003355_0001
Figure imgf003356_0001
Figure imgf003357_0001
Figure imgf003358_0001
Figure imgf003359_0001
Figure imgf003360_0001
Figure imgf003361_0001
Figure imgf003362_0001
Figure imgf003363_0001
Figure imgf003364_0001
Figure imgf003365_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25226 55594 25364 2701 TFFLPAGEILPPRLEPQITLPVGS
RRIISIAVTDLPEPESPHLRSGLG
WASLCCFMVICKFFPAAEGLIL
AAVCDSRGSRYWCNGNRRYG
YASDLLVRCGATVMFSEVTEV
RDAIHLLTPRAVNEEVGKRLLE
EMEWYDNYLNMGKTDRSANP
SPGNKKGGLGKRGSSKALWLP
LLQIQARKQVGLRVYTVACVL
RTGTWYDRNGPDWRVPGQLN
MGNPAPETGRYRLAHPATPHD
PRTTPDHPSHIKPHAPTNQQPDT
LTPTHHHHTDTRDPSPPPPRTRT
PRQSTHTTPPHPTTTTRHTDIHP
PTHTNPPPPTRVLTRQLH HTNRI
IPHRTHTTTCRRNHHNQMNTTT
PPPPHTPQITHRHHNTTLREQTQ
KHRQHIHGTQHHTTRKPPTPTT
RQYNTSIVGTIGGGVMVERKVIE
ESLQALQERKPRLFHGRMARN
GADAVGSDCGGAMSVFISVHG
MRPRL VLIG AG HVN RAIAQS A A
LLGFDIAVADIYRESLNPELFPP
STTLLHAESFGAAVEALDIRPD
NFVLIATNNQDREALDKLIEQPI
AWLGLLASRRKVQLFLRQLRE
KGVAEEHIARLHAPVGYNIGAE
TPQEIAIRVLAEILQVKNNAPGG
LMMKPSHPSGHQMVVIRGAGD
IASGVALRLYHAGFNVIMLEVE
KPTVIRCTVAFAQAVFDGEMT
VEGVTARLATSSAEAMKLTER
Figure imgf003367_0001
Figure imgf003368_0001
Figure imgf003369_0001
Figure imgf003370_0001
Figure imgf003371_0001
Figure imgf003372_0001
Figure imgf003373_0001
SEQ ID SEQ ID NO: Met SEQ ID NO- Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno>vn, NO: of peptide hod in USSN location of first codon for last ammo acid *=Stop codon, /=possιble nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possιble nucleotide insertion) sequence
25261 55629 25399 174 2238 LRSGDLPWEINPLSSCSLLHEKD
PPTTSGPQTVTSPRNISPISNPRQR
RHILSMDPKLRRRSWTWVGSLP
LVFNHCRDASLIIHPGFRGVRPR
RDVCLGPSPLATITDAELYVTL
TVEGKSIPFLIDTEATHSTLPSFK
GPVALASIHVQNVQ1KRDKEGH
YIMVKGSIQQEELTILNMYAPN
TGAPRFIKQVLNDLQRDLDSHT
IIMGDFNTPLSTLDRSTRQKVN
KDIQDLNSALHQADLIDIYRTL
HTKSTECTFFSAPHRTYSKIDHI
VGSKALLSKCKRTEIIINCLSDH
SAIKLELRIKKLTQNHSTTWKL
NNLLLNDYWVHKEMKAEIKM
FFEΓNENKNTTYQNLWDTFKAV
CRGKFIALNAYKRKQERSKIDT
LTSQLEELEKQEQTRSKASRRE
EITKIRAELKEIETQKTLQKINES
RSWFFEKINKIDILLARLIKKKR
EKNQIDA I KNDKGDI ATDPTEIQ
TTIREYYKHLCPNKLENLEEMD
KFLDTYTLPRLNQEEDESLNRPI
TGSEIVAIINSLPTKKSPGPDGFT
AKFYQRYKEELVPFLQKYKEEL
VPFLLKLFQSIEKEGILPNSFSEA
SHLIPKPGRDTTKKENFRPISLM
NIDAKILNKILANQIQQHIKKLI
HHDQVGFIPGMQGWFNIRKSIN
VIQHINRTKDKNHMIISIDAEKA
FDKIQQPSC
25262 55630 25400 267 EKDPLTTSGPQTVTSPRNISPISN LRLFAFTWTDPDTHQAQQITW AVLPQGFTDSPRYFSQAQISSLS VTYLSIILIKTHVLSLPIMSD
25263 55631 25401 167 408 LGSRTFPWEINPPSSCSLLREKD PPTTSGPQTVTRPRNISPISNLVS GLFLLSSPTSLTIPQPLSSFNLDD TLQSLPSLNF
25264 55632 A 25402 142 441 LRLGDLPWEINPLSSCSLLCEKD
PPTTSGPVPRNISPISNPVSGLFLL
SSPTCLTIPQPLSPFNLGATLQSL
PSLNFNSFHFLVETKETRFIRGP
KTPAP
25265 55633 25403 79 381 LGSGDLPWEINPLSSCSLLREKE PPTTSGPQTVTRPRNISPISNPELA TLAGNLATGPRNARSPGFLLSH VLSVWDPTENQTVQLTWQPLP QPLELWPKAL
Figure imgf003375_0001
Figure imgf003376_0001
Figure imgf003377_0001
Figure imgf003378_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod n USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25290 55658 A 25428 557 MARSLGVLVALPFPLPVGFNAE
EAHNIVKEFSPGKSHPRESLSQQ
RETLVQAGAFHWFGWRGKEK
LKPKFNTFGRLPNEMLEQWLSP
TNQCVDGV/LKGGEDVYNHNNI
NQWTASIVEQSLVTHLVKLGKA
YKYIVTCAVVQKSAYGFHTASS
CFWDTTSDGTCTVRL/WENRT
MNVCIVNVFCHCLFVL
25291 55659 25429 178 2270 VRRLFCVQTGGSPVGRGSLHHL
WLVFSHVLSLLCSPCLLAPGM
WGFHDRDLVLRKALYTMMRT
GAEREALKRRWRWQQTQQNK
ESGLVYTEEEWEREWTELLKL
ASSEPRTHFSKNGGTGGGHLPP
EGKPLHGCSSLLRSGKGGHVK
AEPSRLQQSQAKPPLTSQIRALH
FPIPSQINEKPGAGHWSAKLHFP
YWQLGEAGEEEGSSPPDIMRCL
LTCLSVPSTAVIPLTDSEHKLLP
LHFAVDPGKDWΕWGKDDNDN
ARLAHLILSLEAKLNLLHSYMN
VTWIRIPSETRAPLAQPESPTAS
AGEDVQSLADSLDSDRDSVCS
NSNSNNGKNGKDKEKEKQRKE
KDKTRADSVANKLGSFSKTLGI
KLKKNMGGLGGLVHGKMGRA
NSANGKNGDSAERGKEKKAKS
RKGSKEESGASASTSPSEKTTPS
PTDKAAGASPAEKGGGPRGDA
WKYSTDVKLSLNILRAAMQGE
RKFIFAGLLLTSHRHQFHEEMIG
YYLTSAQERFSAEQEQRRRDA
ATAAAAAAVPPPPPRP/ERPPRRP
ETEGVPVPERASPGPPTQLVLK
LKERPSPGPAAGRAARAAAGG
TA/CPGGRRAACERQRTSAWPQ
PPGASAPERHPRAGVGRAGRG
VRAGRGGAAAVRHVPAAEPLA
WAELQPGARRRPAHRQHGRV
AGARGARGPTGRGGDRGGGRA
Figure imgf003380_0001
Figure imgf003381_0001
Figure imgf003382_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25304 55672 A 25443 2355 MDHPGFREFCWKPHEVLKDLP
LGSYSYCSPPSSLMTYFFPTERG
GKIYYDGMGQDLADIRGSLEL
AMTHPEFYWYVDEGLSADNLK
SSLLRSEILFGAPLPNYYSVDDR
WEEQRAKFQSFWTYVAMLAK
QSTSKVQVLYGGTDLFDYEVR
RTFNNDMLLAFISSSCIAALVYI
LTSCSVFLSFFGIASIGLSCLVAL
FLYHWFGIQYLGILNGVAAFV
IVGIGVDDVFVFINTYRQATHL
EDPQLRMIHTVQTAGKATFFTS
LTTAAAYAANVFSQIPAVHDFG
LFMSLIVSCCWLAVLVTMPAA
LGLWSLYLAPLESSCQTSCHQN
CSRKTSLHFPGDVFAAPEQVGG
SPAQGPIPYLDDDIPLLEVEEEP
VSLELGDVSLVSVSPEGLQPAS
NTGSRGHLIVQLQELLHHWVL
WSAVKSRWVIVGLFVSILILSLV
FASRLRVSASRAPLLFRPDTNIQ
VLLDLKYNLSAEGISCITCSGLF
QEKPHSLQNNIRTSLEKKRRGS
GVPWASRPEATLQDFPGTVYIS
KVKSQGHPAVYRLSLNASLPAP
WQAVSPGDGEVPSFQVYRAPF
GNFTKKLTACMSTVGLLQAAS
PSRKWMLTTLACDAKRGWKF
DFSFYVATKEQQHTRKLYFAQS
HKPPFHGRVCMAPPGCLLSSSP
DGPTKGFFFVPSEKVPKARLSA
TFGFNPCVNTGCGKPAVRPLVD
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25305 55673 25444 1737 MLYAAECLRNVMKLQVIPSAL
NEALQAQSTWSDLGLASHALS
STWCLHGESAAWSQPAWECG
QPRDQLQTHSEAWLCALVATF
SQELEQFAVTGAQISDRQGHAV
CTAPPRGRPSTLMTNLVCASSA
DECGGGKTETQTAGGRLQGAE
TRSRGQLTRLITRTQAAEPRWF
LQLLKRSANRSRALTRPFSGLSS
DGARRLRLSFVSAVSSADTKPT
VPVSSDSSGRDGCGETRAEAGP
GGPHVAQPQRVAPPAAAARDF
RKPRGLNRQNVSP/PGPGEQGP
G/PRPQKPAGRAPVPEPQGEPGL
GLFPRVGRPPRSFPGSPADPAAS
VSHTQGPSPDTGLPGTCQNRLP
NPAAVDEQTCTRGSLNCRLSRS
PRVSERRG PEGTPGRRPAAPPQ
CAGTRGLSPRADLGAAIRTPRA
PYRPHGQGRPSRHNCDSVSSQG
WPPHSCWWKTTSNGMPWKAS
FCSRTAQPANQNTGNHERGLRS
VAMFGDRASREATPALSLCESQ
GCFEDEKEVFSPEPHVETSDEPL
PICSGPGDGWGDPIARPGRGNN
LSPGTSAWEVPRWIRVTGEDW
LSTGPAECVFINDLTT
Figure imgf003385_0001
Figure imgf003386_0001
Figure imgf003387_0001
Figure imgf003388_0001
Figure imgf003389_0001
Figure imgf003390_0001
Figure imgf003391_0001
Figure imgf003392_0001
Figure imgf003393_0001
Figure imgf003394_0001
Figure imgf003395_0001
Figure imgf003396_0001
Figure imgf003397_0001
Figure imgf003398_0001
Figure imgf003399_0001
Figure imgf003400_0001
Figure imgf003401_0001
Figure imgf003402_0001
Figure imgf003403_0001
Figure imgf003404_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno>vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25396 55764 25537 2204 IDMIFTPGPPSTPKHKKSQKGSA
FTFPSQQSPRNEPYVARPSTSEI
EDQSMMGKFVKVERQVQDMG
KKLDFLVDMHMQHMERLQVQ
VTEYYPTKGTSSPAEAEKKEDN
RYSDLKTIICNYSETGPPEPPYSF
HQVTIDKVSPYGFFAHDPVNLP
RGGPSSGKVQATPPSSATTYVE
RPTVLPILTLLDSRVSCHSQADL
QGPYSDRISPRQRRSITRDSDTP
LSLMSVNHEELERSPSGFSISQD
RDDYVFGPNGGSSWMREKRYL
AEGETDTDTDPFTPSGSMPLSST
GDGISDSTVPLFLSSEILQKQVG
QSITSMLGFLSRGPSMKLCMGL
AC VLSL WNTVSG I KGEAKKEK
GMTFLPTTDSKKFFSLLSVTSYS
SFAFHKFSVAVYNISNLKTVDP
AKFPTRYCYCLNNRTNDLSDFT
ALLVDIIGNSTSYLTEIFKSTSIL
SVNQSNESDCIFICVMTGKSGR
NLSDFWEIEEKYPIINYTFTSGL
SGVLALLLTQSLFGGLFTRTRM
KFGAVTRIGVDLPWEINP/LSSCS
LLHEKDPPTTSGPQTDQPKKHL
TNFKSAARPTFLGQGQVPLNPF
SFTLSGKSCFPRRQEPPNRLFPH
PNLLSLCPNPLFLCPNPFSTFLE
GKNHHPHPFSVSQVFSLGLPPS
LWEHSPMAHRLRVQPRQQPPDI
HKQVISLQSGHSLLVKSQSWSQ
DPIKFQLPHHHKNTASVCINPSV
25397 55765 25538 110 542 TALLLTQSLFGGLFTRTRMKFG AMTRIGVDLPWEINPLSSCSLLC EKHPPTTSGPQTDQPKKHLTNF KSETKETCFIHEPKTLAPVTDW EGSLPLVFNHCRDGLSDHSATF QGCQTMQGCLPWSFTLSGKSR FSGEGASCSLPG
25398 55766 25539 781 MPLQPLATPFTSPGSPSVAQVT
TNTGEPGFAQQEYAETPQTTAR
PPHPHPALRALQASVHQTPRLG
DCLGSSESATALHGSGEPNKLA
PPGRPTERGGLFKCSVPLRTVR
GCRAGLRGTETVPSSLQRQPAG
DVGWINNKSTTALDRTPTDTQ
RTRRPRPPKQKQRGRALKERPR
DSDSAREGPLKAVAGLFGGLFT
RTCMKFGAVTRIGVDLPWEΓNPL
SSCSLLH EKDPPTTSGPQTDQPK
KHLTNFKSDLLSLATED
Figure imgf003406_0001
Figure imgf003407_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25413 55781 25554 84 2804 PRGSWRCRRDLKCRPLSHSDA
DWDDLWDQFDERRYLNAKKW
RVGDDPYKPVCFSTSGRVSGSP
AIRAIPDTRHLRKKATSNSLSRQ
QLASPLPRSSYCGVTPGPIAIGP
LWTSGSLRTPNHIDTVFAPGLP
QPWFHLNYLRAMPSLSSAPKCT
LLVYCTDLPPTSIIITFHNEARST
LLRTIRRPPAVSTSQMAMTAAA
LTPAPLRTYSVHLSCRAHQLVP
MPQRKKYDETWVREHTKHSKl
SLFPSCRLHFSCLSRRRMAVGS
GKFQVMGDLGEPFLRVRWLRI
WTLGPDFHGSALNSVLNRVTPT
HLIREIILVDDFSNDPDDCKQLI
KLPKVKCLRNNERQGLVRSRIR
GADIAQGTTLTFLDSHCEVNRD
WLQPLLHRVKEDYTRWCPVI
DIINLDTFTYIESASELRGGFDW
SLHFQWEQLSPEQKARRLDPTE
PIRTPIIAGGLFVIDKAWFDYLG
KYDMDMDIWGGENFEMMTVQ
VLQPRRSEALICTWKVHHLTRE
STALLIALVQAPFTGPQAKKLQ
QSPCPAMLAPEYAPEASVLVCT
VSLLTYTTLQSEISFRVWMCGG
SLEIVPCSRVGHVFRKKHPYVF
PDGNANTYIKNTKRTAEVWMD
EYKQYYYAARPFALERPFGNV
ESRLDLRKNLRCQSFKWYLENI
YPELRYPSYLLELWLAESRGPQ
LAWSGEETDNDQLDEGFYVYQ
Figure imgf003409_0001
Figure imgf003410_0001
Figure imgf003411_0001
Figure imgf003412_0001
Figure imgf003413_0001
Figure imgf003414_0001
Figure imgf003415_0001
Figure imgf003416_0001
Figure imgf003417_0001
Figure imgf003418_0001
Figure imgf003419_0001
Figure imgf003420_0001
Figure imgf003421_0001
Figure imgf003422_0001
Figure imgf003423_0001
Figure imgf003424_0001
Figure imgf003425_0001
Figure imgf003426_0001
Figure imgf003427_0001
Figure imgf003428_0001
Figure imgf003429_0001
Figure imgf003430_0001
Figure imgf003431_0001
Figure imgf003432_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmiπo acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25575 55943 25716 924 MFAIISYSSLAAVLLTATLTAAG
IISFPVALCLVIGANLGSGLLAM
LNNSAANAAARRVALGSLLFK
LVGSLIILPFVHLLAETMGKLSL
PKAELVI Y FH VF YNLVRCL VML
PF VDPM A RFCKTIIRDEPELDTQ
LRPKHLDVSALDTPTLALANAA
RETCALATPWTDDGRKYAYSA
ASGGRRSATKVMVWTDGESH
DGSMLKAVIDQCNHDNILRFGI
AVLGYLNRNALDTKNLIKEIKA
IASIPTERYFFNVSDEAALLEKA
GTLGEQIFSIEDMDLGDEVYTV
GRPHPMIDPTLRNQLIADLGAK
PQVRVLLLDVVIGFGATADPAA
SLVSAWQKACAARLDNQPLYA
IATVTGTERDPQCRSQQIATLED
AGIAWSSLPEATLLAAALIHPL
SPAAQQHTPSLLENVAVINIGLR
SFALELQSASKPVVHYQWSPV
AGQGKWLANPELLEADADAEY
AAVIDIDLADIKEPILCAPNDPD
DARPLSAVQGEKIDEVFIGSCM
TWGHFRAAGKLLDAHKGQLPT
RLWVAPPTRMDAAQLTEEGYY
S VFGKSG ARVSS IPCA VPCV WA
RVADGATVVSTSTRNFPNRLGT
GANVFLASAELAAVAALIGKLP
TPEEYQTYVAQVDKTAVDTYR
YLNFNQLSQYTEKADGLLKPRF
RPWQRKILDTLATYHEQHRDEP
GPGRERLRRMALPMEDEALVL
25576 55944 25717 417 MG VLNN WLSEEESLWIQSRIHL
RALRYYSNWRQYFAGYTFGRQ
YWQSPEDDHLPLLREFLARYER
PMKNCWKILDIEETTDVDIIRRA
YLALLPSFHPETDPQDFKQLRQ
AYEEALRIAQSPAKSVWQPEEY
EV AEHEI LLAFRALLASDSERFL
PSAWQRFIQQLNYCSMEEIDEL
RWSLCTIAMNTAHLSFECWLL
AERLRWLQEENTGEIDEEELGS
FLYAIAKGNVFNFQTILHLPVA
RRPDICTQHNDQSRCEGYQILS
DKRGHQHGGGITALYQCGNAD
PRAKGERLFFDAPA*CASSARV
CVLSLAGASKKSRSPFARGSAL
PHWYRAVMPPPCW SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25577 55945 25718 1680 2058 YRAYAPAFLPLSPSCPPLWRHC
GSGRGENRRRDSSAHGPYAPRR
QPDMPAHRQS*SFV/PASCSAC
MDEPHKRLTVTAGTECGTSASS
AALRAILKPCSSVCSSFFLLLLL
PPPPPSSSSSSFFFLF
25578 55946 25719 1 1 15 3549 NPDRLLQLNLILETSK*QQMGD
TRKKRS/NLTVAPIISPLKRYEQ
HTELSRHINVVNLGDRNRRQQL
ILDQGKSSQRNIYLSHVAQKIPG
RATYALSHSVDMITIQLGTLLR
CAPPHRRRYAGRFFCRSAGNFL
NVQGFDAVLVAVKHDLLLCLT
IAPSLIVCTRLRSPWCGALRRCS
TDGRSDLASSGVMFSIDTESAL
TRWCLSLPHGALGKIIAEGLLS
VIASKASAIVTTVAWPVTRRSS
LVNWHSGGSGLWRCNRTDERD
QQASQGTTRMGRTQRNYSPTR
QAIHKTGSPTMATTSNTEQPPH
EKPNTLTQHPRNPDRTIHPKAT
HNELTQDTPQRDAKQNGEKTQ
RKKNCLSELNDAEDQAERFQD
RLMLKLQVTTKPCSMTRLRDC
AGIWSAANRWSGIGIDRMIMLF
TNSHTIATLFSSRRCAHRNKFH
LMNEAVRRLLRSSFRKSVLRRD
ASFHDNQRVHSLQLLKERKKSL
IHGRICAAKASLFQTRRYRWQK
APLFSAQFRGRNRAVTHNRWV
TTTTAIERTRARGLRPNSGRGF
HSSPAWQLRHRKCRRCARRYH
TVYWIKRRRSVARASTWSGRI
TSRRGENRRRDSSAIAHMLHAA
KQTCLRIAVMIIRAASCSACMD
EPPSDLTVTAGTDAGLAPAGYA
LNERRDLLHALTKSTALRTAHR
GPNRTGWLNRRIVRFVMTALF
25579 55947 25720 606 1562
25580 55948 25721 688 762
25581 55949 25722 492 737 SLPSGATLPPPALLLASAPDTA* WRCRER*YHLPAYAALSPAVA HRASLYSTLPSSVPWRRQCAGF RARRSPEQAWAYPAR
Figure imgf003435_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25588 55956 25729 2376 MAEVEEKKQRIFRKFTYRGVD
LGQLLDMSFEQLRQLYGARQL
RRLNHGLWRNAEAQAAFAAE
APMLGQEGGADHREAGSGEDA
PAGYDHPSQDGGQQGGRLQRQ
DLQPGGDQAGDDRPRPGRVLH
HLQAQRGKKPM/ISTVTFGQ*P
DPPLRAGQRL*RC/RLLIYGRGK
CWALSGNP/RSGKTTLLKSISAR
LTPQQGEIHYENRSLYAMSEAD
RRRLLRTEWGWHQHPLDGLR
RQVSAGGNIGERLMATGARHY
GDIRATAQKWLEEVEIPANRID
DLPTTFLRATHTVKMFSIPGYSS
PGQLLDPETNINIGTSYLQYVY
QQFGNNRIFSSAAYNAGPGRVA
TAIVLTLYIIYMVGFFATSGELT
YEVWIGFFASAFTKVFTLLALF
SILIHAWIGMWQVLTDYVKPLA
LRLMLQLVIWALWYVIYGFV
WWGMRAALQISQSGQTCALL
SKVFPTRSHTVSAQGGITVALG
NTHEDNWEWHMYDTVKGSDY
IDAGRKRQEEGDWNSGVGDPS
DAGDRKHQLKAAQLRPEAIKQ
RRIANRLKRRTQFPRHLKGFVN
TLKPIHGGCSVGTKCPQMASLP
EGLHHPVKLVFLGKPGCLLTDR
PLKGPNTAKPVAVDLNASATAI
TSYSLYGGSMILLFLASTLYHAI
PHQRAKMWLKKFDHCAIYLLI
AGTYTPFLLVGLDSPLARG LMI
25589 55957 25730 219 652 SASWREAICLSRCRLPRSATAR
GAGRGGCGLADRRASRQGKNL
EAESAQEQNGHQHRIHESQHPC
QGGAPVSHHQAAVRLRESQIQ
GAAEKR* PTGD VI H PGQPVSGG
/LK*YPSIPTCLLTTRTASLLKPP
GVRTSVCWQQLPR
25590 55958 25731 1902 2100 RKFFSSSSGRS*PVRCSKL*SSID
A*PLERIKRSRSYQVGSVGLCL
RKSFHSTSAISAMPIGAPG
25591 55959 25732 3223 3669
Figure imgf003437_0001
Figure imgf003438_0001
Figure imgf003439_0001
Figure imgf003440_0001
Figure imgf003441_0001
Figure imgf003442_0001
Figure imgf003443_0001
Figure imgf003444_0001
Figure imgf003445_0001
Figure imgf003446_0001
Figure imgf003447_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno\vn,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25658 56026 25799 2392 MEMVAASKMRKSQDRMAASR
PYAETMRKVIGHLAHGNLEYK
HPYLEDRDVKRVGYLWSTDR
GLCGGLNINLFKKLLAEMKTW
TDKGVQCDLAMIGSKGVSFFNS
VGGNVVAQVTGMGDNPSLSEL
IGPVKVMLQAYDEGRLDKLYI
VSNKFINTMSQVPTISQLLPLPA
SDDDDLKHKSWDYLYEPDPKA
LLEQCSCMLRSPAWGITSTDTT
RARTDLACPIWLLLDPYPKLMG
NAPKALASVDKRGVPVNTILVS
ALVTALCVLINYLAPESAFGLL
MALWSALVINWAMISLAHMK
FRRAKQEQGVVTRFPALLYPLG
NWICLLFMAAVLVIMLMTPGM
AISVYLIPPLQTNTFNSTTIAMA
DYQMESLSAEINFAAAKLARA
CADEWTARTPEKPRYVAGVLG
PTNRTASISPDVNDPAFRNITFD
GLVAAYRESTKALVEGGADLIL
IETVFDTLNAKAAVFAVKTEFE
ALGVELPIMISGTITDASGRTLS
GQTTEAFYNSLRHAEALTFGLN
CALGPDELRQYVQELSRIAECY
VTAHPNAGLPNAFGEYDLDAD
TMAKQIREWAQAGFLNIVRGC
CGTTPQHIAEMSPGRKRLTRAE
SSSVPSTLPGKAWLEHALPLIAE
QL/QGRRSAVFIPFVAGVTQTWD
DYTAKTAAVLAPLGVSVTGIHS
WDPVAAIENAEIVIVGGGNTF
25659 56027 25800 412 1115 LAGEVMDLLLLSNSTLPGKAW
LEHALPLIAEQLQGRRSAVFIPF
AGVTQTWDDYTAKTVAVLAPL
GISVTGIHSVVDPVAAIENAEIVI
VGGGNTFQLLKQCRERG LLAPI
TDWKRGALYIGWSAGANLAC
PTIRTTNDMPIVDPQGFDALNLF
PLQINPHFTNALPEGHKGDPRL
FRVIRELLWAPELTIIGLPEGNW
ITVSKGHATLGGPNTTYVFKAG
EEAVPLEAGHRF
25660 56028 25801 360 MDHAIAGIAVQFRGGNLLRFK
AFQQRVKYRMCQAERA INRLF
NIAVKRLTGNRLDDKSKQHIIDI
AVDIPKWSDHLHIIRRKHHDPT
QDSGTHLPAVADDGFRTSVRN
RSA*HIRYLTRC*KALNRNKLPP
RN*TAIPAMA
Figure imgf003449_0001
Figure imgf003450_0001
Figure imgf003451_0001
Figure imgf003452_0001
Figure imgf003453_0001
Figure imgf003454_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /^possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25704 56072 25845 489 2797 TRRRRRTMIALIQRVTRASVTV
EGEVTGEIGAGLLVLLGVEKDD
DEQKANRLCERVLGYRIFSDAE
GKMNLNVHQAAGSVLWSQFT
LAADTERGMRPSFSKGMSQLP
GLSRETRESIAMYHLRVPQTEE
ELERYYQFRWEMLRKPLHQPK
GSERDAWDAMAHHQMWDEQ
GNLVAVGRLYFNADNEASIRFM
AVHPDVQDKGLGTLMAMTLES
VAPYFSGSETGMMTLNRYRLR
HMAKQGNRSAKRVEKLLRKPD
RLISLVLIGNNLVNILASALGTI
VGMRLYGDAGVAIATGVLTFV
VLVFAEVLPKTIAALYPEKVAY
PSSFLLAPLQILMMPLVWLLNA
ITRMLMRMMGIKTDIVDMLLS
VLDLEKM/TVDDIMLPRSEIIGI
DINDDWKSIL/R/QLSHSPHGAIV
LYRDSLDDAISMLRVREAWRL
MSEKKEFTKETMLRAADEIYFV
PEGTPLSTQLVKFQRNKKKVGL
WNEYGDIQGLVTVEDILEEIV
GDFTTSMSPTLAEEVTPQNDGS
VIIDGTAN VREINKA FN WHLPE
DDARTVNGVILEALEEIPVAGT
RVRIGEYDIDILDKLSFCFRYSN
HRRTNRTTIQRITFLHNAQYVT
WRNVIRFHHCDGLMHIRVQWD
VSFGDHFNAKLTHNIQHRLQRQ
LNAFNHRRFIIRVSFISHFQRTIQ
AINHRQQFVDEFLQREFVGFFNI
25705 56073 25846 87 199 QPASVQTAGAGCRL*LHQRQW RECLVLDSKRRGDGVL
Figure imgf003456_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possiblc nucleotide insertion) sequence
25710 56078 25851 2730 2893 WPVGERLAPGDYPV TGQLCT AAGD/EWRR*KVRRHQPLWRG FPNRRYMAWRSA
2571 1 56079 25852 750 929 SFHPAGRRYLYRHSSRSGNITN
GCCRLWSAFCRLRTATTVPPAR
P*QLRPDRSRAQCPV
25712 56080 25853 2427 CRQHHRWHSGSTAVYRRVLITS
KCWNVFIRTKTWTVSILTTSW
CASARRVCVPAPRAVSSRCLSV
TTLIPSASTPW*LAHRISLAAR*
AF WFNCRIPAGIDNVKVLERI H
PDKDVDGFHPYNVGRLCQRAP
RLRPCTPRGIVTLLERYNIDTFG
LNAWIGASNIVGRPMSMELLL
AGCTTTVTHRFTKNLRHHVEN
ADLLIVAVGKPGFIPGDWIKEG
AIVIDVGINRLENGKWGDWF
EDAAKRASYITPVPGGVGPMT
VATLIETRYRQGKMFDGSSIGG
WKGINESDMVLMPDASTAVID
PFFADSTLIIRCDILEPGTLQGYD
RDPRSIAKRAEDYLRSTGIADT
VLFGPEPEFFLFDDIRFGSSISGS
HVAIDDIEGAWNSSTQYEGGN
KGHRPAVKGGYFPVPPVDSAQ
DIRSEMCLVMEQMGLWEAHH
HEVATAGQNEVATRFNTMTKK
ADEIQIYKYVVHNVAHRFGKT
ATFMPKPMFGDNGSGSPPSQPQ
SQPTPPEILLQALLSAGCFYAVY
YVTYKTRLRKRVADVPRRPLFT
MNRVETNLTWVILMGIALVSV
GIFFMHNGFLLFRLNSYSQIFSS
EVSGVALKRFFYFFIPAMLWY
FLRQDSKAWLFFLVSTVAFGLL
TYMIVGGTRANIIIAFAIFLFIGII
RGWISLWMLAAAGVLGIVGMF
WLALKRYGMNVSGDEAFYTFL
25713 56081 25854 502 885
25714 56082 25855 233 738 SNSCKRTKIISDVTSVFHAIFPAI
KITAPNSPRLRVNDSATPLTSAG
ESSGSTTLQKLCQRVAPSVSAA
SSYSGPNLFQHRLHGTHPKRNA
GKGHSHRNPQPCIGDFDIKLRQ
PFTVKTLYVKEKGQR*GGDRV
LPAQRVNPSAHQAACGLGSDS
GLGSTPSADRKRR
Figure imgf003458_0001
Figure imgf003459_0001
Figure imgf003460_0001
Figure imgf003461_0001
Figure imgf003462_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25739 56107 25880 2613 QRKPTSSRKGLSARMMSGLTG
TITTVSTGWCTTRGHNMKSRW
SKVAIFTTR/SGRRVAKRVWRR
ERDLTGWMSLSRKPQVTWYG
WDGDRLTTIQNDRSRIQTIYQP
GSFTPLIRVETATAVRRRRRWQ
WGIPAGTVQMLDRLESEILADR
VSEESRRWLASCGLTVEQMKK
TSIHASIVRHKLLTAEQALKLLG
EMFVYHMPFNRALGMELERYE
KEFAQLAFKNQPMMVGNWAQ
SILHGGVIASALDVAAGLVCVG
STLTRHETISEDELRQRLSRMGT
IDLRVDYLRPGRGERFTATSSLL
RAGNK VA VARVELHNEEQL Y I
ASATATYMSKSMFLAVATILIS
KSSLSRLRRPSTIVCGIPISSISAN
ITPGRSLRSSSSTSKTLCTQFAV
QFLSKLLYAEGFVHVHRQNRH
LERRDSVRPYDASFVWLLNRR
SYHTRYPDTVATHRQDLVTAIF
TLYGGFQCFGVLGTQLEDVTNF
DTTFDQQPTLAIRAWIASHHVT
DICNFRGSDIAIPVDAEWFTID
VSTSSEITHRRNVDLFPDQLTVT
RSQTFTAYERLSTALTVAQVCG
VQRLCNYYSARLTPLPGPDSTR
ESNHRLAQITQYARQLASSPSII
DNRSRQHLNDVGLTAWDCVIIS
QIIGFIGFQARTIATFQAYLGHP
VRWLPGLEIQNYADASLFADES
LRWRSSYEVEKLPEEHTKSSTA
25740 56108 25881 71 3478 CCLSSHPSTTATALRSLSPVRK
WPKISALIPWEWAMCSLLSHGL
MLSGRSLVAWCWTVLVQNAS
TSGRSLSGRCLPCCKASSISLVD
SALSFLLFTLRFLVGLAEAPSFP
GNSRIVAAWFPARKGTGIINEVIS
PVLANTALEGLGGSGDALCTQC
EAEGFRHITYYLDRPDVLARFT
TKIIADKIKYPFLLSNGNRVAQG
ELENGRHWVQWQDPFPKPCYL
FALVAGDFDVLRDTFTTRSGRE
VALELYVDRGN
Figure imgf003464_0001
Figure imgf003465_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25752 56120 25893 888 892 TVDLDACTKGVLTFTTALLAG
GANQAFAKENTKKAYKETYGV
SHITRHDMLQIFKQQQNEKYQV
PQFDQSTIKNIESAKGLDVWDS
WPLQNADGTVAEYNGYHWF
ALAGSPKDADDTSIYMFYQKV
GDNSIDSWKNAGRVFKDSDKF
DANDPILKDQTQEWSGSATFTS
DGKIRLFYTDYSGKHYGKQSLT
TAQVNVSKSDDTLKINGVEDH
KTIFDGDGKTYQNVQQFIDEGN
YTSGDNHTLRDPHYV/EDKGH
KYLVFEANTGTENGYQGEESLF
NKAYYGGGTNFFRKESQKLQQ
SAKKRDAELANGALGIIELNND
YTLKKVMKPLITSNTVTDEIER
ANVFKMNGKWYLFTDSRGSK
MTIDDLWKFELENDDLEYLVE
ENSKQKTIQDVIWLFLTAYTHI
RGATQAFAKENNQKAYKETYG
VSHITRHDMLQIPKQQQNEKYQ
VPQFDQSTIKNIESAKGLDVWD
SWPLQNADGTVAEYNGYHWF
ALAGSPKDADDTSIYMFYQKV
GDNSIDSWKNAGRVFKDSDKF
DANDPILKDQTQEWSGSATFTS
DGKIRLFYTDYSGKHYGKQSLT
TAQVNVSKSDDTLKINGVEDH
KTIFDGDGKTYQNVQQFIDEGN
YTPGDNHTLRDPHYVKTKAMR
Figure imgf003467_0001
Figure imgf003468_0001
Figure imgf003469_0001
Figure imgf003470_0001
Figure imgf003471_0001
Figure imgf003472_0001
Figure imgf003473_0001
Figure imgf003474_0001
Figure imgf003475_0001
Figure imgf003476_0001
Figure imgf003477_0001
Figure imgf003478_0001
Figure imgf003479_0001
Figure imgf003480_0001
Figure imgf003481_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25870 56238 26011 1854 CGGTLARACDGCLNGEFAA LI
TGPVHKGVΓNDAGI/PFTGHTEF
FEERSQAKK/VVMMLATEELRV
ALATT/HLPLRDIADAITPALL/H
QPKYLDNADAVLAMY/HDQGL
PVLKYQGFGRGV/NITLGLPFIR
TSVGHG/TALELAGRGQ/ADVG
SFIRPYSAIK*L*NAGASVRWLS
ERRICRAITGPVHKGVINDAGIP
LPVIPSFSKSVRRRKRCDDAGD
RRTSRGAGNDALPLRDIADAIT
PALLHNRNILITPTPCWRCTRSG
SSRAKIPGLRARCDITLGLPFIRT
SVGHGQPLNWRDVANRCWQF
YTPLFCHQMIVKPMHNRIPRAT
YRGAVKSAIGLGCCCLKASATR
CAYRWRRSGRRDQSRFRYFEIA
GIRSRGINFIACPTCSRRNLMLS
VRLTRWSNAWRYHHSDGRFD
YRLRGECPGEALVSTLGVTGG
NKKAASMKMGAQRPSGQQDM
IDQLEARIRAKASQRTKRVELT
FSRLKNNNVEKQDTALFALYQ
RRLRCVRKVRRYVMAAVSAGC
GGGVCAVSAYMPLNRLPNAQE
ANNEYTSFENCIKCTVCTTACR
VTLRNNDPKLPYLVKAGLKMK
RVTRSPRRYCAAASATHRFNDE
RAGKVQGMPDINKLPADRETC
STLTCARSRQNQTNPYPANRAA
NAYQWLATKALEKVSMKSP
WQHKVSSPAAVRHLPSITQRRT
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknown,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25871 56239 26012 1504 MG I V ASASVQAAEIYNKDGNK
LDVYGKVKAMHYMSDNASKD
GDQSYIRFGFKGETQINDQLTG
YGRWEAEFAAWTDMFPEFGG
DSSAQTDNFMTKRASGLATYR
KTTSSAYRWPELNLQYHRCTS
AARAMLLSGRPVMQINERTAV
RRQMTVYLRVERIHEIDPLVSL
SHSMSVIPIDVERPIAAAPCTPG
LWRTKTPLRVPSTYPLRLPGRA
RWCHRTFRLHLCKDWVFMFS
GLLIILVPLIVGYLIPLRQQAAL
KVINQLLSWMVYLILFFMGISL
AFLDNPRPYLADSVPDRQLLAP
VNSISRYKTIEWLNYIATELHK
GFTPLFRPDTPEEYKPTVRAHSE
KKLQYVNEALKDEHWICGQRF
TIADAYLFTVLRWAYAVKLNL
EGLEHIAAFMQRMAERPEVQD
ALSAEGLKGSRHLSAGARRLIL
GIIVTFSLILALICVTQPFNPLAQ
FIFLMLLWGVALIVRRMPGRFS
ALMLIVLSLTVSCRYIWWRYTS
TLNWDDPVSLVCGLILLFAETY
A WI VLVLG YFQV V WPLNRQPV
PLPKDMSLWPSVDIFVPTYNED
LNVTPDELKQVLDVAAALKAL
RAENISTKVFNSGLGISVFRDNS
TRTRFSYASALNLLGLAQQDLD
EGKSQIAHGETVRETANMISFC
ADAIGIRDDMYLGAGNAYMRE
VGAALDDGYKQASGFSEPAMR
25872 56240 26013 2903 3024
25873 56241 26014 123 335
25874 56242 26015 2634 2787
25875 56243 26016 1477 2485 NPEQLMVKTQRWITPGEPAGI
GPDLWQLAQREWPVELWCA
DATLLTNRAAMLGLPLTLRPYS
PNSPAQPQTAGTLTLLPVALRA
PVTAGQLAVENGHYWETLAR
ACDGCLNGEFAALITGPGVHKG
VINDAGIPFTGHTEFFEERSQAK
KWMMLATEELRVGLATTHLP
VRDI ADAITPALLHEG I AILHHD
LRTKFGIGEPRILWGGLNPHAG
EGVGPMGTEEIDTIIPVLNELRA
QGMKLNGPLPADTLFQPKYLD
NADAVRPMYHDQGLPVLKYQ
GFGRGVNITLGLPFIRTSVDRGT
ALELAGRGKVADVGSFITALNL
AIKMIVNTQ
Figure imgf003484_0001
Figure imgf003485_0001
Figure imgf003486_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
25896 56264 A 26037 1162 3478 LVRFGLLPLAYSARCFSARSED
RPKDECETCCIKYPNGRNVLSQ
EIHQVFVLNGIQSMSGYVYNLG
NELASMQGLVDWRLSPQGSS
AREAFSHSLDLARLAEKRGYH
RY WLAEHHNMTG I ASA ATS VLI
GYLAANTTTLHLGSGGVMLPN
HSPLVIAEQFGTLNTLYPGRIDL
GLGRAPGSDQRTMMALRRHM
SGDIDNFPRDVAELVDWFDAR
DPILEPTHNGRRLCSTVRHPRM
GYTPGVEPTTGPLGQGLANAV
GLAIAERTLAAQFNQPDHEIVD
HFTYVFMGDGCLMEEGWFTD
DTAKRFEAYHWHVIHEIDGHD
PQAVKEAILE AQS VKDKPSLI IC
RTVIGFGSPNKAGKEEAHGAPL
GEEEVALARQKLGWHHPPFEIP
KEIYHAWDAREKGEKAQQSW
NEKFAAYKKAHPQLAEEFTVRR
MSGGLPKDWEKTTQKYINELQ
ANPAKIATRKASQNTLNAYGP
MLPELLGGSADLAPSNLTIWKG
SVSLKEDPAGNYIHYGVREFG
MTAIANGIAHHGGFVPYTATFL
MFVEYARNAARMAALMKARQ
IMVYTHDSIGLGEDGPTHQAVE
QLASLRLTPNFSTWRPCDQVEA
AVGWKLAVERHNGPTALILSR
QNLAQVERTPDQVKEIARGGY
VLKDSGGKPDIILIATGSEIEITL
QAAEKLAGEGRNVRWSVLASS
25897 56265 26038 1032 MΝKSPKTRDIQEEELEVLLEEE
A AGGDEDREKEIL I ERIQSIKEE
KQKGLGLAAGWTADVSSRRPG
ASQLPPPPTHPIPQYRGWGPPD
QHLLSTLWTNVGSYKSGSFLQL
GGWCMNLDMDLΝSEESRWLF
RRHWDTGGIKPHERKRAVAHL
TLSEREEIRAGLSAKMSIRAIAT
ALΝRSPSTISREVQRΝRGRRYY
KAVDANNRANRMAKRPKPCLL
DQNLPLRKLVLEKLEMKWSPE
QISGWLRRTKPRQKTLRISPETI
YKTLYFRSREALNHLNIQVHLA
RSH WL WPG ERYVNRNGERGPV
TLWTG/TPISDGSRTSVKRALGQ
WGEHLSRRNAFFPLCATW
Figure imgf003488_0001
Figure imgf003489_0001
Figure imgf003490_0001
Figure imgf003491_0001
Figure imgf003492_0001
Figure imgf003493_0001
Figure imgf003494_0001
Figure imgf003495_0001
Figure imgf003496_0001
Figure imgf003497_0001
Figure imgf003498_0001
Figure imgf003499_0001
Figure imgf003500_0001
Figure imgf003501_0001
Figure imgf003502_0001
Figure imgf003503_0001
Figure imgf003504_0001
Figure imgf003505_0001
Figure imgf003506_0001
Figure imgf003507_0001
Figure imgf003508_0001
Figure imgf003509_0001
Figure imgf003510_0001
Figure imgf003511_0001
Figure imgf003512_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
26086 56454 26233 152 2099 GNRLLWSFIAFSKNSTHFKGIF
AMRIILLGAPGAGKGTQAQFIM
EKYGIPQISTGDMLRAAVKSGS
ELGKQAKDIMDAGKLVVTDELV
IALVKERIAQEDCRNGFLLDGF
PRTIPQADAMKEAGINGHYVLE
FDVPDELIVDRIVGRRVHAPSG
RVYHVKFNPPKVEGKDDVTGE
ELTTRKDDQEETVRKRLVEYH
QMTAPLIGYYSKEAEAGNTKY
AKVDGTKPQIYRGSLYVSDQIG
HTIVNPDGWCDCGRYGCLET
VASLSALKKQARVWLKSQPVS
TQLDPEKLTTAQLIAAWQSGEP
WITSWVDRSANAIGLSLYNFLN
ILNINQIWLYGRSCAFGENWLN
TIIRQTGFNPFDRDEGPSVKATQ
IGFGQLSRAQQGVDLGNMIITS
GQIPVNPKTGEVPADVAAQAR
QSLDNVKAIVEAAGLKVGDIV
KTTVFVKDLNDFAPVNATYEA
FFTEHNATFPARSCVEVARLPK
DVKIEIEAIAMLYTGYHYNDAP
SAVRYPRGNASGVELTPLEKLP
IGKGIVKRRGEKLAILNFGTLM
PEAAKVAESLNATLVDMRFVK
PLDEALILEMAASHEALVTVEE
NAIMGGAGSGVNEVLMAHRKP
VPVLNIGLPDFFIPQGTQEEMRA
ELGLDAAGMEAKIKAWLA
26087 56455 26234 935 1020
26088 56456 26235 145 373
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
26089 56457 26236 530 1865 RGNKRLDLMFKQSHASMYIKQ
PEQLDYQLNLKEVITAVFISMV
VLNISPTLY/PDKPSPKDKYVAY
AINIPDYELAADVYNINVTSPSG
QQETFKILINLEHLRQTLERKSL
TAVQKSQCEIITPKKPGEAILHA
FNATYQQIRENMSEFARCHYG
Y1QIPPVTTFRADGPETPEEEKG
YWFHAYQPEDLCTIHNPMGDL
QDFIALVKDAKKFGVIDIIPDYTF
NFMGIGGSGKNDLDYPSADIRA
KISKDIEDPVTKERKQIHPEDIH
LTAKDFEASKDNISKDEWENLH
ALKEKRLNGMPKTTPKSDQVI
MLQNQYVREMRKYGVRGLRY
DAAKHSKHEQIERSITPPLKNY
NERVHNTNLFNPKYHKKAVM
ΝYMEYLVTCQLDEQQMSSLLY
ERDDLSAIDFSLLMKTIKAFSFG
GDLQTLASKPGSTISSIPSERRILI
26090 56458 26237 14 2031 LIVLS*QHG/DLRQΝRSGAEHFG
LQRLMLHASQLSLTHPFTGEPL
TIHAGLDDTWMQALSQFGWRG
LLPENERVEFSAPSGQDGDPTL
GLIKEVACELSGMMRRSQPWE
EAEESIPDRGESLLEVSGTPNAG
WGAEGLAEREALLLCCCCGPM
GPDPLGLGSPGSVRGRSRLIHH
AISGEALWEVTSEGLDMAAAR
QFAIEKGAPALRAMTFIERAAM
LKAVAKHLLSEKERFYALSAQT
GATRADSWVDIEGGIGTLFTYA
SLGSRELPDDTLWPEDELIPLSK
EGGFAARHLLTSKSGVAVHINA
FNFPCWGMLEKLAPTWSGRNA
SHHQTSYRDGPLTQAMEFEDK
AQQRDVETARHFTFFRII AN ED
KKTGFPRFQTIVITASHKTIDRT
RYTYGAWRCYRPLLTVEEYRA
TGSLITQPKFPPHKSSEVHRVPR
NQGRFVAVNSTLPTEPATVAPV
RNGNASRDTAKTQTAERPSTTR
PARQQAVIEPKKPQATVKTEPK
PVAQTPKRTEPAAPVASTKAPA
ATSTPAPKETATTAPVQTASPA
QTTATPAAGAKTAGNVGSLKS
APSSHYTLQLSSSSNYDNLNGW
AKKENLKNYWYETTRNGQP
WYVLVSGVYASKEEAKKAVST
LPADVQAKNPWAKPLRQRHRD
LRGDERRDDRRSEYFNRQVIS
Figure imgf003515_0001
Figure imgf003516_0001
Figure imgf003517_0001
Figure imgf003518_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmiπo acid sequence ( X=Unknown, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
261 1 1 56479 26258 1616 5678 PLAKTGRQREQLAGLAK/AHPS
LTLHQDPVYVTRADAPVAGKV
ALLSGGGTEHEPMHCGYIGQG
MLSGACPGEIFTSPTPDKIFECA
MQVDGGEG VLLI I KN YTGDILN
FETATELLHDSGVKVTTWIDD
DVAVKDSLYTAGRRGVANTVL lEKLVGAAAERGDSLDACAELG
RKLNNQG HSIGI ALGACTVRS A
GKPSFTLADNEMEFGVGIHAIL
KGDIPEAEAILASIKPAGWSRA
DWLPNQFQALRKSFIPERPVP
VMVTRLFELPVQISLGVYSLER
PANPQPIAYLVLQPRRKANNQN
ASTSAIIRATIGEPISCSVPSRGC
EGGISVAETVPAMIIAIGITISEG
LDARYRHVNSEHDPSDRQAKF
EWRYFGNLGKTVNDWRKNNK
ADIKEDGNRRCRHTNQPHCRIE
AVEYDDGRLCPLHGAHRGQSS
DGELVKPTVKTQSLAVHYSIGK
LRHRPNHGAVYYQRLRSGMGL
MNAFDSQTEDSSPAIGRNLRSR
L ARKKLSEM VEEELEQM I RRRE
FGEGEQLPSERELMAFFNVGRL
RAGSVKTQRSGANKQRRTRCV
SRPSADTIIGELSGMAKDFLSHP
GGIAHFEQLRLFFESSLVRYAA
EHATDSRDPRYRPKFYSISRSVP
VSDNITLQQLLLSLNDQPARCK
PSARCRLHGSHTSRYECNHRHG
261 12 56480 26259 185 775 IPEIFSGETHLKVSTLSKPTFSISS
LIRCTTRFSSQLSLGIATMSHRK
RVISALKSLSRLSANSLVFANQ
VERSSLTNLGCTRCSG*SLFWQ
RPALR*PFGLCPTVALTYLIVSP
EW*KAVSVKCWRNRRLTLMTS
PICTNSGTLTTAPVDRVAQVTF
QSPECLQHMRLTIALRYFRVAL
SRRLLVYASVCFTFSG
Figure imgf003520_0001
Figure imgf003521_0001
Figure imgf003522_0001
Figure imgf003523_0001
Figure imgf003524_0001
Figure imgf003525_0001
Figure imgf003526_0001
Figure imgf003527_0001
Figure imgf003528_0001
Figure imgf003529_0001
Figure imgf003530_0001
Figure imgf003531_0001
Figure imgf003532_0001
Figure imgf003533_0001
Figure imgf003534_0001
Figure imgf003535_0001
Figure imgf003536_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
26204 56572 A 26351 2212 MKRSRCRDRPQPPPPDRREDGV
QRAAELSQSLPPRRRAPPGRQR
LEERTGPAGPEGKEQPPALASQ
SAEIAASARLPPRLGRLLGFQK
ACRCWSLNPHILMALLRSLVPP
DKKHPQVWRGRPPLHLAPNVG
LFSRVKVRSSWIEDKSMRDSR
RGLSQRRRRRKKKKRGSSSKK
KKRRKKRKKKKRKKRKRRKN
RKKKKKRKNKRKKKRKKEEK
KEEEEERRKKEEEDEEGRGRGR
RKRKRKKRKKRRSRKKKETAA
AAAAGERLGKWWPGECPVEC
VAYFLRRRLQQRLHPARQLLL
QGMAGYLSESDFVMVEEGFST
RDLLKELTLGASQATTDEVAAF
FVADLGAIVRKHFCFLKCLPRV
RPFYAVKCNSSPGVLKVLAQL
GLGFSCANKAEMELVQHIGIPA
SKIICANPCKQIVSQIKYAAKHGI
QLLSFDNEMELAKWKSHPSA
NFHIGSGCPDPQAYAQSIANAR
LVFEMGTELGHKMHVLDLGGG
FPGTEGAKVRFEEIASVINSALV
DLYFPEGCGVDIFAELGVRYYV
TSAFTVAVSIIAKKEVLLDQPGR
EEENGSTSKTIVYHLDEGVYGIF
NSVLFDNICPTPILQKSSTKSLR
TTIGEAFERLHRLLRERQKAML
EELEADTARTLTDIEQKVQRYS
QQLRKVQEGAQILQERLAETDR
HTFLAGVASLSERLKGKIHETN
26205 56573 26352 860 MKKEDAFLKAFYIVHGKWNCV
KGSLTRTPCCCTNCLAGGIWC
MRVQI FQLENKVLPGFP WACR
VDLYGQIGFDAAAQCLSLSVSK
AEPLVKKAPPELREQLARKTLS
SDNLKYLFDNKDIMKVKQSKK
KKEHSFRKNNGCIKIIAFSDHAE
DFRKLGCEVLGISVDSQFTHLA
WINNPQREGDLGPLNIPLLAEV
ASGLSEDYAVLKTDAGIAYRG
RFISDGKGVL/SQITVNDLPVGR
SVDEALRLVQAFQYAEEHAEV
CPAGWKPGTDTIKLNVGDSKE
Figure imgf003538_0001
Figure imgf003539_0001
Figure imgf003541_0001
Figure imgf003542_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
26236 56604 A 26383 2237 NRRKFSVSIGFYTVMDAQYKII
TKTAHLITKESPQEEGKEMFAT
MSKLKEQLTKVRK*FNALLYES
QQLLIPLEELEKQMTSFYDSLG
KINEIITVLEREAQSSALFKQKH
QVRTILFKWCNSSQKMTLCLVL
GSMISMLSTAKITIAGYKSEVEV
SVSHHCI*NCKKTLTLIEKGSQS
VQKFVTLSNVLKHFDQTRLQR
QIADIHVAFQVICSPC*DWKKH
VETNSRLMKKFEESRAELEKVL
RIAQEGLEEKGDPEELLRRHTV
SPPFRDQRVLNAFLKACDELTD
ILPEQEQQGLQEAVRKLHKQW
KVSQDRTAYHLLHLKIDVEKN
RFLASVEECRTELDRETKLMPQ
EGSEKIIKEHRVRLPHHSPHHLC
EKRLQLIEELCVKLPVRDPVRD
TPGTCHVTLKELRAAIDSTYRK
LMEDPDKWKDYTSR*PQNGLW
ISTNETQLKGIKGEAIDTANHGE
VKRAVEVSSGSLSKRGETLSWL
KSRLKVLTEVSSENEAQKQGDE
LAKLSSSFKALVTLLSEVQHLL
THFGDCVQYKEIVKNSLEELIS
GSKEVQEQAEKILDTENLFEAQ
QLLLHHQVKMPLSSAKKRDVQ
QQIAQAQQGEGGLPDRGHEEL
RKLESTLDGLERSRERQERRIQ
VGAGS*ERFETNKETWRYLFQ
TGSSHERFLSFSSLESLSSELEQT
KVYYGSLESIAVQAENLVKEAS
26237 56605 26384 390 GTSLFDEEGAKIVKDLMSKAEK
NGVKITLPVDFVTADDFDENAK
TGQATVASGIPAGWMGLDCG/P
ESSKKYAEAVTRAKQIVWNGP
VGVFEWEAFARGTKALMDEV
VKATSRGCITIIGKRSYTKLIPI
Figure imgf003544_0001
Figure imgf003545_0001
Figure imgf003546_0001
Figure imgf003547_0001
Figure imgf003548_0001
Figure imgf003549_0001
Figure imgf003550_0001
Figure imgf003551_0001
Figure imgf003552_0001
Figure imgf003553_0001
Figure imgf003554_0001
Figure imgf003555_0001
Figure imgf003556_0001
Figure imgf003557_0001
Figure imgf003558_0001
Figure imgf003559_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /^possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
26365 56733 26512 161 2150 YLDAEKMGQKASQQLALKDSK
EVPWCEWSEAIVHAAQKLKE
YLGFEYPPSKLCPAANTLNEIFL
IHFITFCQEKGVDEWLTTTKMT
KHQAFLFGADWIWTFWGSNKQ
IKLQLAVQTLQMSSPPPVESKP
CDLSNPESRVEESSWKKSRFDK
LEEFCNLIGEDCLGLFIIFGMPG
KPKDIRGWLDSVKSQMVRSH
LPGGKAVAQFVLETEDCVFIKE
LLRNCLSKKDGLREGGASPGSL
RLAAPGPPLTLNAACPLRLAVL
AAMAAAALPAWLSLQSRARTL
RAFSTAVYSATPVPTPSLRVDD
LHLTEIVGMLDSVLTPEDSSGK
YRFISGEVLCRITGCFTGVRVEA
KDLFGGCCSNPNEVMVTWIKVI
VEKEVWLYLRYILKALPPRTEK
MAVDQDWPSVYPVAAPFKPSA
VPLPVRMGYPVKKGVPMAKEG
NLELLKIPNFLHLTPVAIKKHCE
ALKDFCTEWPAALDSDEKCEK
HFPIEIDSTDYVSSGPSVRNPRA
RVWLRVKLSSLNLDDHAKKK
LIKLVGERYCKTTDVLTIKTDR
CPLRRQNYDYAVYLLTVLYHE
SWEYWKEWGKK*D*SRHGKSI
YGENSSSERKYPGKRFSR*KLL
RKNMGN*LKKSSLGTKEIEEYK
KSWSLKNEEENENSISQYKES
26366 56734 26513 i 186 PASTMSIRVTQKSYKVSTSGPW
FFSSCSYLSGPSAHISSLSFSRAG
SSSFQGGLGRGYGGASGMEVIT
AVMVNQSLLSPFIDKVQFLEQQ
NKMLENKWSLLQQQKMAQSN
LDNMFESYINNLRWQLETLGR
KKLKLEAELGNMQGLVEDFKN
KYEDEINKYTEMENEFVLIKKD
VDEAYMNKVELEYRLEGPTDEI
NFLRKLYEQEIRELQSQILDMS
WLSMDNSHSLDMDSIIAEVKV
QYEEIANRSWAEAERMYQN*Y
AKLSQLEAALQRAKQDMALQL
HEYQELMNFKLAQDIVITTYRK
LLESEGSWLESGMQSMSIHMK
TTSGYAGSLSSAYGGLTSPSLS
YSLGSSFGSGAGSSSFSHTSSTR
AAWKKIEAQNGKLVSKSSDV
26367 56735 B 26514 361 1215
Figure imgf003561_0001
Figure imgf003562_0001
Figure imgf003563_0001
Figure imgf003564_0001
Figure imgf003565_0001
Figure imgf003566_0001
Figure imgf003567_0001
Figure imgf003568_0001
Figure imgf003569_0001
Figure imgf003570_0001
Figure imgf003571_0001
Figure imgf003572_0001
Figure imgf003573_0001
Figure imgf003574_0001
Figure imgf003575_0001
Figure imgf003576_0001
Figure imgf003577_0001
Figure imgf003578_0001
Figure imgf003579_0001
Figure imgf003580_0001
Figure imgf003581_0001
Figure imgf003582_0001
Figure imgf003583_0001
Figure imgf003584_0001
Figure imgf003585_0001
Figure imgf003586_0001
Figure imgf003587_0001
Figure imgf003588_0001
Figure imgf003589_0001
Figure imgf003590_0001
Figure imgf003591_0001
Figure imgf003592_0001
Figure imgf003593_0001
Figure imgf003594_0001
Figure imgf003595_0001
Figure imgf003596_0001
Figure imgf003597_0001
Figure imgf003598_0001
Figure imgf003599_0001
Figure imgf003600_0001
Figure imgf003601_0001
Figure imgf003602_0001
Figure imgf003603_0001
Figure imgf003604_0001
Figure imgf003605_0001
Figure imgf003606_0001
Figure imgf003607_0001
Figure imgf003608_0001
Figure imgf003609_0001
Figure imgf003610_0001
Figure imgf003611_0001
Figure imgf003612_0001
Figure imgf003613_0001
Figure imgf003614_0001
Figure imgf003615_0001
Figure imgf003616_0001
Figure imgf003617_0001
Figure imgf003618_0001
Figure imgf003619_0001
Figure imgf003620_0001
Figure imgf003621_0001
Figure imgf003622_0001
Figure imgf003623_0001
Figure imgf003624_0001
Figure imgf003625_0001
Figure imgf003626_0001
Figure imgf003627_0001
Figure imgf003628_0001
Figure imgf003629_0001
Figure imgf003630_0001
Figure imgf003631_0001
Figure imgf003632_0001
Figure imgf003633_0001
Figure imgf003634_0001
Figure imgf003635_0001
Figure imgf003636_0001
Figure imgf003637_0001
Figure imgf003638_0001
Figure imgf003639_0001
Figure imgf003640_0001
Figure imgf003641_0001
Figure imgf003642_0001
Figure imgf003643_0001
Figure imgf003644_0001
Figure imgf003645_0001
Figure imgf003646_0001
Figure imgf003647_0001
Figure imgf003648_0001
Figure imgf003649_0001
Figure imgf003650_0001
Figure imgf003651_0001
Figure imgf003652_0001
Figure imgf003653_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n,
NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
27535 57903 A 27701 2231 MGKKQSRKTGNSKKQSASPPP
KERSSTPATEQSWSENDFDELR
EEGFRRSNYSKLQEEIQTKGKE
VKHFEKNLDECITRITNREKCL
KELMELKAKARELREECRSLRS
QCDQVEERTVLARLIKKKREKN
QIDAIKIDKGDITTNPTETQTTIR
EYYKHLYTNKLENLEEMDKFL
DTYTLPRLNQDDVESLNRPITG
SEIEAINHSLPTKKSPGPDRFTA
KFYQSLPSSCDYGHAPAHPDVF
RVSSFWWVRGLAGSGVKLQTF
AVSVTALKAARLELFIPPDSGA
QLASPSGSLTRTGGGAACQSHA
VCPHSSAFGWSMGLGAMEQG
AALIGEARASREPMEEVGGSG
MAACRSQALPRGQLRPSEKSRA
APEHSSSPAMEQSWMENDFDE
LREEGFRRSNYSELREDIQTKG
KEVENFEKNLEECITRITNTEKC
LKELMELKTKARELREECRSLR
SRCDQLEERVSAMEDEMNEMK
QEGKFREKRIKRNEQSLQEIWD
YVKRPNLHLIGVPESDGENGTK
LENTLQDIIQENFPNLARQANIQ
IQEIQRTPQTYSSRRATPRHIIVR
FTKVEMKEKMLRAAREKGDSV
ERSFSNKAELLFQLHGHQRESL
SKKEAQVWRDKGLYFRKGLLD
QAQSWSRQVASILGAQIQVEGS
PEQRLENVELAGGHLNTLKAIP
MVAVAGGISFKCGAVLLLGAA
27536 57904 27702 596 1593 KPRLENYVKNAEASGANAINW
KKGY/LVMEDEMNEMKQEGKF
REKRLKRNEQSLQEIWDYVKR
PNLRLIGVPESDGENGTKLENT
LQDIIQENFPNLARQATVHIREI
QRTPQRFSLRRSTPRHIVRFSKV
ETKEKMLRAAREKEIQTTIREY
YKHLYTTKLENLEEMEKFLDT
YTLPRLNQKEVESMKRPITSSEI
EAVINILPIKRSPGPDGLTATFY
QRYKEELERSSSPATEQSWMEN
DFDELREEGFRRSNYSELREDIQ
TKVKEVENFEKNLEECITRITNT
EKCLKELMELKTKARELREECR
SLRSRCDQLEERVSAMEDEMN
Figure imgf003655_0001
Figure imgf003656_0001
Figure imgf003657_0001
Figure imgf003658_0001
Figure imgf003659_0001
Figure imgf003660_0001
Figure imgf003661_0001
Figure imgf003662_0001
Figure imgf003663_0001
Figure imgf003664_0001
Figure imgf003665_0001
Figure imgf003666_0001
Figure imgf003667_0001
Figure imgf003668_0001
Figure imgf003669_0001
Figure imgf003670_0001
Figure imgf003671_0001
Figure imgf003672_0001
Figure imgf003673_0001
Figure imgf003674_0001
Figure imgf003675_0001
Figure imgf003676_0001
Figure imgf003677_0001
Figure imgf003678_0001
Figure imgf003679_0001
Figure imgf003680_0001
Figure imgf003681_0001
Figure imgf003682_0001
Figure imgf003683_0001
Figure imgf003684_0001
Figure imgf003685_0001
Figure imgf003686_0001
Figure imgf003687_0001
Figure imgf003688_0001
Figure imgf003689_0001
Figure imgf003690_0001
Figure imgf003691_0001
Figure imgf003692_0001
Figure imgf003693_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
28135 58503 28305 189 1890 MKMASSLAFLLLNFHVSLLLV
QLLTPCSAQFSVLGPSGPILAM
VGEDADLPCHLFPTMSAETME
LKWVSSSLRQWNVYADGKEV
EDRQSAPYRGRTSILRDGITAG
KAALRIHNVTASDSGKYLCYFQ
DGDFYEKALVELKVAALGSΝL
HVEVKGYEDGGIHLECRSTGW
YPQPQIQWSΝAKGEΝIPAVEAP
WADGVGLYEVAASVIMRGGS
GEGVSCIIRΝSLLGLEKTASISIA
DPFFRSAQPWIAALAGTLPILLL
LLAGASYFLWRQQKEITALSSEI
ESEQEMKEMGYAATEREISLRE
RKKIQYLTPDVILYPDMAΝAIL
LVSEDQRSVQRAEEPHDLPDΝP
ERFEWRYCVLGCESFMSERHY
WEVEVGDRKEWHIGVCSKΝVE
RKKVWVKMTPEΝGYWTMGLT
DGΝKYRALTEPRTOLKLPEPPR
KVGVILDYETGHISFYΝATDGS
HIYTFLHASSSEPLYPVFRILTLE
PTALTVCPIPK/GREFPRFPTLVP
DHSLEIPLTPGLANESGEPQAEV
TSLLLPAQPGAKGLTLHNSQSE
PYSYRHTLKHFTDIHSIIP
Figure imgf003695_0001
Figure imgf003696_0001
Figure imgf003697_0001
Figure imgf003698_0001
Figure imgf003699_0001
Figure imgf003700_0001
Figure imgf003701_0001
Figure imgf003702_0001
Figure imgf003703_0001
Figure imgf003704_0001
Figure imgf003705_0001
Figure imgf003706_0001
Figure imgf003707_0001
Figure imgf003708_0001
Figure imgf003709_0001
Figure imgf003710_0001
Figure imgf003711_0001
Figure imgf003712_0001
Figure imgf003713_0001
Figure imgf003714_0001
Figure imgf003715_0001
Figure imgf003716_0001
Figure imgf003717_0001
Figure imgf003718_0001
Figure imgf003719_0001
Figure imgf003720_0001
Figure imgf003721_0001
Figure imgf003722_0001
Figure imgf003723_0001
Figure imgf003724_0001
Figure imgf003725_0001
Figure imgf003726_0001
Figure imgf003727_0001
Figure imgf003728_0001
Figure imgf003729_0001
Figure imgf003730_0001
Figure imgf003731_0001
Figure imgf003732_0001
Figure imgf003733_0001
Figure imgf003734_0001
Figure imgf003735_0001
Figure imgf003736_0001
Figure imgf003737_0001
Figure imgf003738_0001
Figure imgf003739_0001
Figure imgf003740_0001
Figure imgf003741_0001
Figure imgf003742_0001
Figure imgf003743_0001
Figure imgf003744_0001
Figure imgf003745_0001
Figure imgf003746_0001
Figure imgf003747_0001
Figure imgf003748_0001
Figure imgf003749_0001
Figure imgf003750_0001
Figure imgf003751_0001
Figure imgf003752_0001
Figure imgf003753_0001
Figure imgf003754_0001
Figure imgf003755_0001
Figure imgf003756_0001
Figure imgf003757_0001
Figure imgf003758_0001
Figure imgf003759_0001
Figure imgf003760_0001
Figure imgf003761_0001
Figure imgf003762_0001
Figure imgf003763_0001
Figure imgf003764_0001
Figure imgf003765_0001
Figure imgf003766_0001
Figure imgf003767_0001
Figure imgf003768_0001
Figure imgf003769_0001
Figure imgf003770_0001
Figure imgf003771_0001
Figure imgf003772_0001
Figure imgf003773_0001
Figure imgf003774_0001
Figure imgf003775_0001
Figure imgf003776_0001
Figure imgf003777_0001
Figure imgf003778_0001
Figure imgf003779_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Amino acid sequence ( X=Unknovvn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
29139 59507 29314 2104 STLAQQWQAGDS1WSRPAIRVF
ATYAKWDEKWGYDYTGNADN
NANFGKAVPADFNGGSFGRGD
SDEWTFGAQMEIWCSYLLALR
QCQADIHSAGCICHGWLVND
QCLPPVKCWGRCGAKSPFCAIT
FRVRWTIRKSTKCCANRAYKR
NITVGRIRRLRRIRQPLPDATLC
VLSGLQTEHNRRWPNSVCHFSP
DSTFYNDLSGRMKNVRLMFNG
IHRDNGFSKRPTDLNYTRKPLV
LAFQTAWFITVRVEIVGFRGINR
LSLMLEQNNVLIGENAWGKSS
LLDALTLLLSPESDLYHFERDD
FWFPPGDINGREHHLHI1LTFRE
SLPGRHRVRRYRPLEACWTPCT
DG YHRI FYRLEGES AEDGS VMT
LRSFLDKDGHPIECRG PIKAR
HLVVRLMPVVLVRLRECPVLMRR
IRVNGTVPNVPNVEVTARQLDF
LARGGQARYRLMRRRASNEQR
SWRYLDIINRMIERPETRYTREI
GFTSTMDLIYGLPKQTPESFAF
TLKRVAELNPDRLSVFNYAHLP
TIFAAQRKIKDADLPSPQQKLDI
LQETIAFLTQSGYQFIGMDHFA
RPDDELAVAQREGVLHRNFQG
YTTQGDTDLLGMGVSAISMIGD
CYAQNQKELKQYYQQVDEQG
NALWRGIALTRDDCIRRDVIKS
LICNFRLDYAPIKKQGDLHFAD
YFAEDLKLLAPLAKDGLGDVD
Figure imgf003781_0001
Figure imgf003782_0001
Figure imgf003783_0001
Figure imgf003784_0001
Figure imgf003785_0001
Figure imgf003786_0001
Figure imgf003787_0001
Figure imgf003788_0001
Figure imgf003789_0001
Figure imgf003790_0001
Figure imgf003791_0001
Figure imgf003792_0001
Figure imgf003793_0001
Figure imgf003794_0001
Figure imgf003795_0001
Figure imgf003796_0001
Figure imgf003797_0001
Figure imgf003798_0001
Figure imgf003799_0001
Figure imgf003800_0001
Figure imgf003801_0001
Figure imgf003802_0001
Figure imgf003803_0001
Figure imgf003804_0001
Figure imgf003805_0001
Figure imgf003806_0001
Figure imgf003807_0001
Figure imgf003808_0001
Figure imgf003809_0001
Figure imgf003810_0001
Figure imgf003811_0001
Figure imgf003812_0001
Figure imgf003813_0001
Figure imgf003814_0001
Figure imgf003815_0001
Figure imgf003816_0001
Figure imgf003817_0001
Figure imgf003818_0001
Figure imgf003819_0001
Figure imgf003820_0001
Figure imgf003821_0001
Figure imgf003822_0001
Figure imgf003823_0001
Figure imgf003824_0001
Figure imgf003825_0001
Figure imgf003826_0001
Figure imgf003827_0001
Figure imgf003828_0001
Figure imgf003829_0001
Figure imgf003830_0001
Figure imgf003831_0001
Figure imgf003832_0001
Figure imgf003833_0001
Figure imgf003834_0001
Figure imgf003835_0001
Figure imgf003836_0001
Figure imgf003837_0001
Figure imgf003838_0001
Figure imgf003839_0001
Figure imgf003840_0001
Figure imgf003841_0001
Figure imgf003842_0001
Figure imgf003843_0001
Figure imgf003844_0001
Figure imgf003845_0001
Figure imgf003846_0001
Figure imgf003847_0001
Figure imgf003848_0001
Figure imgf003849_0001
Figure imgf003850_0001
Figure imgf003851_0001
Figure imgf003852_0001
Figure imgf003853_0001
Figure imgf003854_0001
Figure imgf003855_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno n, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
30126 60494 30309 153 2031 RRNYQRQQKKRGSEGAGRPDR
RSRISAAYSPRSEFNNAAVTEH
Y*NRF*T/LVPRSR/HANADTVT
REAVNQVIALLDSGALRVAEKI
DGLPTNQMQASRRAKEEEVHL
TGQPSRVQSPPRRCEEKQYMVL
MIVSGRSGSGKSVALRALEDM
GFYCVDΝLPWLLPDLARTLA
DREISAAVSIDVRΝMPESPEIFE
QAMSNLPDAFSPQLLFLDADRN
TLIRRYSDTRRLHPLSSKNLSLE
SAIDKESDLLEPLHGFPDYDTV
GFSCKRRIDYVCRIKHSRRIRQV
VLLNFAKSGAFSTTRGTDDKTR
RSLLVTLVRIFCVRVIFAYDIRD
GIHVRIIQINSKSGKVGSKHNSG
YSAAKFGGVGLTQSLALDLAE
YGITVHSLM LGNLLKSPMFQSL
LPQYATKLG I KPDQ VEQ Y YIDK
VPFKRGCDYQDVLNMLLFYAS
PKAWLAPDSRSMSPAVSGRFK
PRVWAIALDDQQRIVDTLFMK
GLTVFARPQKIPAITGRHSGATL
QKQGKCSVEELAQYFDTTGTT
MRKDLVILEHAGTVIRTSGGW
PDSPPHTRRDPRRFSMAFPWFD
NIRSAEFHHVTTLLAEIPRQQND
IHRPAHAAAAPKVETRSGDETN
RRWΝRPAQHLFA
30127 60495 30310 720 872 EKVPVSIGPGGMQELPMQSPDR RSAGKPGPASRAGR*TGGGAFS TKRDYR
30128 60496 50311 1967 2452 SRRCSASIΝQRPRPGHEKMVSV
RIAPASΝVPTCRPITVTTGSIALR
SA WTI ITRMRVSPLARAVRM* S
SPSTSSIEERVIRTMTASGIVPST
MAGKIIWATASIKLPSSPQMAV
SISIKPVΝGLESSRΝTISLTRPET
GVRFQCTETSMISIMPHQKIGIE
30129 60497 30312 1912 3960
30130 60498 30313 250 LIRKVST*SVLMKAFLRIA*KDC ERLGLKCFWSGSEKGCPLVΝT ΝAFGGHCEHHQWVSSVSRVRV SQSAGGCPLFΝILLΝTV
30131 60499 30314 300 SVFSH*AKKIWKGVSQRALGQ
ΝSRGSGSGCQASWTIRFPVGFH
TDGTRLRRΝPGMQAFLGPVAR
FLALEARSLDCAFSSSLLFKRKL
SGRWGRASRGTKL
Figure imgf003857_0001
Figure imgf003858_0001
Figure imgf003859_0001
Figure imgf003860_0001
TC980/l0Sfl/13d Z.90S/.0/Ϊ0 OΛV
Figure imgf003861_0001
Figure imgf003862_0001
Figure imgf003863_0001
Figure imgf003864_0001
Figure imgf003865_0001
Figure imgf003866_0001
Figure imgf003867_0001
Figure imgf003868_0001
Figure imgf003869_0001
Figure imgf003870_0001
SEQ ID SEQ ID NO: Met SEQ ID NO: Nucleotide Nucleotide location of last Λmino acid sequence ( X=Unkno\vn, NO: of peptide hod in USSN location of first codon for last amino acid *=Stop codon, /=possible nucleotide sequence 09/540,217 codon for peptide of peptide sequence deletion, \=possible nucleotide insertion) sequence
30366 60734 30551 711 MAEIQHKTIRPLLEGRDLLAAV
KTGSGKTLAVLIPAIELWKLKF
MPRNGTGVLILSPTRQLAMQTF
GVLKELMTHHVHTYGLIIGGSN
RSAEAQKLANGINIIWTPGRLL
DHTQNTPGFMYKNLQVEDLAR
ISPKKEPLYVGVDEDKANATVD
GLEQGH F VCPSEKRYLLLFTFL
KKNQKKKLMVFFSACMSVKYP
YGLLKYIDL/PVLAIHGKQKQN
KHTTTFF*YCNADSGTLL
30367 60735 A ,0552 661 987 VTFYSSEHSNPCHKNLRKARRK DTKRIILKW*HTLV*GRDI*N*NI I ITRNTRYSLLCP WATKKLKAC FISQK*KRDVIERNSAQCLQPKS IYTLVR*VQILKSTKILL
30368 60736 30553 188 2188 KFQGASNLTLSETQNGDVSEET
MGSRKVKKSKQKPMNVGLSET
QNGGMSQEAVGNIKVTKSPQK
STVLTNGEAAMQSSNSESKGK
MKKKRKMVNDAEPDTKKAKT
ENKGKSEEESAETTKETENNVE
KPDNDEDESEVPSLPLGLTGAF
EDTSFASLCNLVNENTLKAIKE
MGFTNMTEIQHKSIRPLLEGRD
LLAAAKTGSGKTLAFLIPAVELI
VVKLRFMPRNGTGVLILSPVTRE
LAMQTFGVLKELMTHHVHTYG
LIMGGSNRSAEAQKLGNGINIIV
ATPGRLLDHMQNTPGFMYKNL
QCLVIDEADRILDVGFEEELKQI
IKLLPTRRQTMLFSATQTRKVE
DLARISLKKEPLYVGVDDDKA
NATVDGLEQGYVVVCPSEKRFL
LLFTFLKKNRKKKLMVFFSSCM
SVKYHYELLNYIDLPVLAIHGK
QKQNKRTTVTFFQFCNADSGVTL
LCTVDVVAARGLDIPEVDWIVQY
DPPDDPKEYIVHRVG* EQPEGLN
GEEGHALASFLRPRKDLGFFFR
LL*KHSKGFPLSGIWTFSW/SLK
ISDIQFSAWRNWIGKVITFLHKS
AVQEAYKSYITEPMDSPFL*NRS
FNVVNNLNLASGLLCQFGFKVVP
PFVDLNVNSNEGKQKKRGGGG
GFGLPRKTQEKLEKSVKIFTH*
ARKSSGQAGQFSH

Claims

WHAT IS CLAIMED IS:
1. An isolated polynucleotide comprising a nucleotide sequence selected from the group consisting of SEQ ID NO: 1-30368, a mature protein coding portion of SEQ ID NO: 1-30368, an active domain of SEQ ID NO: 1-30368, and complementary sequences thereof.
2. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said polynucleotide hybridizes to the polynucleotide of claim 1 under stringent hybridization conditions.
3. An isolated polynucleotide encoding a polypeptide with biological activity, wherein said polynucleotide has greater than about 90% sequence identity with the polynucleotide of claim 1.
4. The polynucleotide of claim 1 wherein said polynucleotide is DNA.
5. An isolated polynucleotide of claim 1 wherein said polynucleotide comprises the complementary sequences.
6. A vector comprising the polynucleotide of claim 1.
7. An expression vector comprising the polynucleotide of claim 1.
8. A host cell genetically engineered to comprise the polynucleotide of claim 1.
9. A host cell genetically engineered to comprise the polynucleotide of claim 1 operatively associated with a regulatory sequence that modulates expression of the polynucleotide in the host cell.
10. An isolated polypeptide, wherein the polypeptide is selected from the group consisting of:
(a) a polypeptide encoded by any one of the polynucleotides of claim 1 ; and (b) a polypeptide encoded by a polynucleotide hybridizing under stringent conditions with any one of SEQ ID NO: 1 -30368.
1 1. A composition comprising the polypeptide of claim 10 and a carrier.
12. An antibody directed against the polypeptide of claim 10.
5871
13. A method for detecting the polynucleotide of claim 1 in a sample, comprising: a) contacting the sample with a compound that binds to and forms a complex with the polynucleotide of claim 1 for a period sufficient to form the complex; and b) detecting the complex, so that if a complex is detected, the polynucleotide of claim 1 is detected.
14. A method for detecting the polynucleotide of claim 1 in a sample, comprising: a) contacting the sample under stringent hybridization conditions with nucleic acid primers that anneal to the polynucleotide of claim 1 under such conditions;
b) amplifying a product comprising at least a portion of the polynucleotide of claim 1 ; and c) detecting said product and thereby the polynucleotide of claim 1 in the sample.
15. The method of claim 14, wherein the polynucleotide is an RNA molecule and the method further comprises reverse transcribing an annealed RNA molecule into a cDNA polynucleotide.
16. A method for detecting the polypeptide of claim 10 in a sample, comprising: a) contacting the sample with a compound that binds to and forms a complex with the polypeptide under conditions and for a period sufficient to form the complex; and b) detecting formation of the complex, so that if a complex formation is detected, the polypeptide of claim 10 is detected.
17. A method for identifying a compound that binds to the polypeptide of claim 10, comprising: a) contacting the compound with the polypeptide of claim 10 under conditions sufficient to form a polypeptide/compound complex; and b) detecting the complex, so that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide of claim 10 is identified.
18. A method for identifying a compound that binds to the polypeptide of claim 10, comprising: a) contacting the compound with the polypeptide of claim 10, in a cell, under conditions sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a reporter gene sequence in the cell; and b) detecting the complex by detecting reporter gene sequence expression, so that if the polypeptide/compound complex is detected, a compound that binds to the polypeptide of claim 10 is identified.
19. A method of producing the polypeptide of claim 10, comprising, a) culturing a host cell comprising a polynucleotide sequence selected from the group consisting of a polynucleotide sequence of SEQ ID NO: 1 -30368, a mature protein coding portion of SEQ ID NO: 1-30368, an active domain of SEQ ID NO: 1-30368, complementary sequences thereof and a polynucleotide sequence hybridizing under stringent conditions to SEQ ID NO: 1-30368, under conditions sufficient to express the polypeptide in said cell; and b) isolating the polypeptide from the cell culture or cells of step (a).
20. An isolated polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID NO: 30369-60736, the mature protein portion thereof, or the active domain thereof.
21 The polypeptide of claim 20 wherein the polypeptide is provided on a polypeptide array.
22. A collection of polynucleotides, wherein the collection comprises the sequence information of at least one of SEQ ID NO: 1-30368.
23. The collection of claim 22, wherein the collection is provided on a nucleic acid array.
24. The collection of claim 23, wherein the array detects full-matches to any one of the polynucleotides in the collection.
25. The collection of claim 23, wherein the array detects mismatches to any one of the polynucleotides in the collection.
26. The collection of claim 22, wherein the collection is provided in a computer-readable format.
27. A method of treatment comprising administering to a mammalian subject in need thereof a therapeutic amount of a composition comprising a polypeptide of claim 10 or 20 and a pharmaceutically acceptable carrier.
28 A method of treatment comprising administering to a mammalian subject in need thereof a therapeutic amount of a composition comprising an antibody that specifically binds to a polypeptide of claim 10 or 20 and a pharmaceutically acceptable carrier.
PCT/US2001/008631 1998-12-03 2001-03-30 Novel nucleic acids and polypeptides WO2001075067A2 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
US10/450,763 US20050196754A1 (en) 2000-03-31 2001-03-30 Novel nucleic acids and polypeptides
AU2001249251A AU2001249251A1 (en) 2000-03-31 2001-03-30 Novel nucleic acids and polypeptides
US10/128,558 US20040219521A1 (en) 2000-01-21 2002-04-22 Novel nucleic acids and polypeptides
US10/243,552 US20030224379A1 (en) 2000-01-21 2002-09-12 Novel nucleic acids and polypeptides
US10/302,689 US20080050393A1 (en) 1998-12-03 2002-11-22 Novel nucleic acids and polypeptides

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US54021700A 2000-03-31 2000-03-31
US09/540,217 2000-03-31
US64916700A 2000-08-23 2000-08-23
US09/649,167 2000-08-23

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/008656 Continuation-In-Part WO2001079449A2 (en) 1998-12-03 2001-04-16 Novel nucleic acids and polypeptides

Related Child Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/004941 Continuation-In-Part WO2002016439A2 (en) 1998-12-03 2001-03-05 Novel nucleic acids and polypeptides

Publications (4)

Publication Number Publication Date
WO2001075067A2 WO2001075067A2 (en) 2001-10-11
WO2001075067A3 WO2001075067A3 (en) 2002-04-04
WO2001075067A9 true WO2001075067A9 (en) 2002-10-31
WO2001075067A8 WO2001075067A8 (en) 2004-10-14

Family

ID=27066367

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2001/008631 WO2001075067A2 (en) 1998-12-03 2001-03-30 Novel nucleic acids and polypeptides

Country Status (3)

Country Link
US (7) US6436703B1 (en)
AU (1) AU2001249251A1 (en)
WO (1) WO2001075067A2 (en)

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8835387B2 (en) 2012-02-16 2014-09-16 Atyr Pharma, Inc. Histidyl-tRNA synthetases for treating autoimmune and inflammatory diseases
US8945541B2 (en) 2010-05-14 2015-02-03 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of phenylalanyl-beta-tRNA synthetases
US8946157B2 (en) 2010-05-03 2015-02-03 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of seryl-tRNA synthetases
US8961961B2 (en) 2010-05-03 2015-02-24 a Tyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related protein fragments of arginyl-tRNA synthetases
US8961960B2 (en) 2010-04-27 2015-02-24 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of isoleucyl tRNA synthetases
US8962560B2 (en) 2010-06-01 2015-02-24 Atyr Pharma Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Lysyl-tRNA synthetases
US8969301B2 (en) 2010-07-12 2015-03-03 Atyr Pharma Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of aspartyl-tRNA synthetases
US8980253B2 (en) 2010-04-26 2015-03-17 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of cysteinyl-tRNA synthetase
US8981045B2 (en) 2010-05-03 2015-03-17 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of methionyl-tRNA synthetases
US8986680B2 (en) 2010-04-29 2015-03-24 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Asparaginyl tRNA synthetases
US8986681B2 (en) 2010-04-27 2015-03-24 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of threonyl-tRNA synthetases
US8993723B2 (en) 2010-04-28 2015-03-31 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of alanyl-tRNA synthetases
US8999321B2 (en) 2010-07-12 2015-04-07 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of glycyl-tRNA synthetases
US9034320B2 (en) 2010-04-29 2015-05-19 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Valyl-tRNA synthetases
US9034598B2 (en) 2010-05-17 2015-05-19 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of leucyl-tRNA synthetases
US9034321B2 (en) 2010-05-03 2015-05-19 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of phenylalanyl-alpha-tRNA synthetases
US9127268B2 (en) 2009-12-11 2015-09-08 Atyr Pharma, Inc. Aminoacyl tRNA synthetases for modulating inflammation
US9347053B2 (en) 2010-05-27 2016-05-24 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of glutaminyl-tRNA synthetases
US9399770B2 (en) 2010-10-06 2016-07-26 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of tryptophanyl-tRNA synthetases
US9404104B2 (en) 2010-05-04 2016-08-02 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of P38 multi-tRNA synthetase complex
US9422363B2 (en) 2007-08-28 2016-08-23 Uab Research Foundation Synthetic apolipoprotein E mimicking polypeptides and methods of use
US9422539B2 (en) 2010-07-12 2016-08-23 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of histidyl-tRNA synthetases
US9428743B2 (en) 2010-08-25 2016-08-30 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of tyrosyl-trna synthetases
US9453214B2 (en) 2009-02-27 2016-09-27 Atyr Pharma, Inc. Polypeptide structural motifs associated with cell signaling activity
US9574187B2 (en) 2010-05-04 2017-02-21 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of glutamyl-prolyl-tRNA synthetases
US9587235B2 (en) 2013-03-15 2017-03-07 Atyr Pharma, Inc. Histidyl-tRNA synthetase-Fc conjugates
JP7453921B2 (en) 2018-03-02 2024-03-21 ディセルナ ファーマシューティカルズ インコーポレイテッド Compositions and methods for inhibiting GYS2 expression

Families Citing this family (539)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5972697A (en) 1995-11-13 1999-10-26 The Salk Institute For Biological Studies NIMA interacting proteins
US7125677B2 (en) 1995-11-13 2006-10-24 The Salk Institute For Biological Studies NIMA interacting proteins
US7351421B2 (en) * 1996-11-05 2008-04-01 Hsing-Wen Sung Drug-eluting stent having collagen drug carrier chemically treated with genipin
US20050163818A1 (en) * 1996-11-05 2005-07-28 Hsing-Wen Sung Drug-eluting device chemically treated with genipin
JP2001512306A (en) 1997-01-08 2001-08-21 ライフ テクノロジーズ,インコーポレイテッド Methods for producing proteins
US20030082540A1 (en) * 1997-09-17 2003-05-01 Genentech, Inc. Secreted and transmembrane polypeptides and nucleic acids encoding the same
US7417126B2 (en) * 1997-10-17 2008-08-26 Genentech, Inc. PRO246 antibodies
JP2002504331A (en) * 1998-02-19 2002-02-12 スミスクライン ビーチャム コーポレーション G protein-coupled receptor AmMaid
US6696247B2 (en) 1998-03-18 2004-02-24 Corixa Corporation Compounds and methods for therapy and diagnosis of lung cancer
US6960570B2 (en) 1998-03-18 2005-11-01 Corixa Corporation Compositions and methods for the therapy and diagnosis of lung cancer
US7258860B2 (en) 1998-03-18 2007-08-21 Corixa Corporation Compositions and methods for the therapy and diagnosis of lung cancer
US7579160B2 (en) 1998-03-18 2009-08-25 Corixa Corporation Methods for the detection of cervical cancer
EP1086223B1 (en) 1998-06-01 2009-07-29 Agensys, Inc. Novel serpentine transmembrane antigens expressed in human cancers and uses thereof
US20030149531A1 (en) 2000-12-06 2003-08-07 Hubert Rene S. Serpentine transmembrane antigens expressed in human cancers and uses thereof
US6833438B1 (en) 1999-06-01 2004-12-21 Agensys, Inc. Serpentine transmembrane antigens expressed in human cancers and uses thereof
US7339033B2 (en) * 1998-06-26 2008-03-04 Genentech, Inc. Pro1481
US7091315B1 (en) 1998-07-15 2006-08-15 Human Genome Sciences, Inc. Protein HDPBQ71
US20070072248A1 (en) * 1998-10-13 2007-03-29 Arena Pharmaceuticals, Inc. Constitutively activated human G protein coupled receptors
US20030229216A1 (en) * 1998-10-13 2003-12-11 Ruoping Chen Constitutively activated human G protein coupled receptors
US20080213778A1 (en) 1998-12-30 2008-09-04 Millennium Pharmaceuticals, Inc. Novel genes encoding proteins having prognostic, diagnostic, preventive, therapeutic, and other uses
US20030190669A1 (en) 1998-12-30 2003-10-09 Genentech, Inc. Secreted and transmembrane polypeptides and nucleic acids encoding the same
US7041474B2 (en) 1998-12-30 2006-05-09 Millennium Pharmaceuticals, Inc. Nucleic acid encoding human tango 509
US7235642B1 (en) * 1999-01-12 2007-06-26 Genentech, Inc. Anti-PRO 1313 antibodies
US6303321B1 (en) 1999-02-11 2001-10-16 North Shore-Long Island Jewish Research Institute Methods for diagnosing sepsis
US6649342B1 (en) 1999-03-15 2003-11-18 Eos Biotechnology, Inc. Methods of diagnosing breast cancer, compositions, and methods of screening for breast cancer modulators
US6943235B1 (en) 1999-04-12 2005-09-13 Agensys, Inc. Transmembrane protein expressed in prostate cancer
AU6394400A (en) * 1999-07-30 2001-02-19 Millennium Pharmaceuticals, Inc. Secreted proteins and uses thereof
EP1218411A4 (en) * 1999-09-20 2004-09-01 Millennium Pharm Inc Secreted proteins and uses thereof
US20030228640A1 (en) * 1999-10-15 2003-12-11 University Of Pittsburgh Nuclear matrix proteins, polynucleotide sequences encoding them, and their use
CA2490853A1 (en) * 1999-12-01 2001-06-07 Genentech, Inc. Secreted and transmembrane polypeptides and nucleic acids encoding the same
JP2003515325A (en) * 1999-12-01 2003-05-07 ヒューマン ジノーム サイエンシーズ, インコーポレイテッド Four disulfide core domain containing (FDCD) polynucleotides, polypeptides, and antibodies
ES2360155T3 (en) 2000-01-26 2011-06-01 Agensys, Inc. 84P2A9: A SPECIFIC PROTEIN OF PROSTATE AND TESTICLES HIGHLY EXPRESSED IN PROSTATE CANCER.
AU2001236658A1 (en) * 2000-02-03 2001-08-14 Nuvelo, Inc. Methods and materials relating to neurotrimin-like polypeptides and polynucleotides
US6806254B2 (en) * 2000-02-03 2004-10-19 Nuvelo, Inc. Methods and materials relating to alpha-2-macroglobulin-like polypeptides and polynucleotides
US7700341B2 (en) 2000-02-03 2010-04-20 Dendreon Corporation Nucleic acid molecules encoding transmembrane serine proteases, the encoded proteins and methods based thereon
AU2001241446A1 (en) * 2000-02-03 2001-08-14 Lexicon Genetics Incorporated Human transporter proteins and polynucleotides encoding the same
WO2002026803A2 (en) * 2000-09-25 2002-04-04 Millenium Pharmaceuticals, Inc. 22108 and 47916, novel human thioredoxin family members and uses thereof
PT1276867E (en) * 2000-03-22 2006-12-29 Bayer Healthcare Ag Use of human latrophilin-like g protein-coupled receptor in screening methods
WO2001075075A2 (en) * 2000-03-27 2001-10-11 Bayer Aktiengesellschaft Human tyrosine phosphatase-like enzymes
AU5521401A (en) * 2000-03-31 2001-10-15 Hyseq Inc Method and materials relating to insulin-like growth factor binding protein-likepolypeptides and polynucleotides
US6436703B1 (en) * 2000-03-31 2002-08-20 Hyseq, Inc. Nucleic acids and polypeptides
EP1268762A4 (en) * 2000-03-31 2003-08-27 Nuvelo Inc Novel nucleic acids and polypeptides
GB0008305D0 (en) * 2000-04-06 2000-05-24 Neutec Pharma Plc Treatment of fungal infections
JP2003530873A (en) * 2000-04-19 2003-10-21 メルク パテント ゲゼルシャフト ミット ベシュレンクテル ハフトング NK-2 homeobox transcription regulator
US7371388B1 (en) 2000-05-04 2008-05-13 Human Genome Sciences, Inc. Treatment of Sjogren's syndrome by administration of TR18 polypeptides
AU6531101A (en) * 2000-06-02 2001-12-17 Genentech Inc Secreted and transmembrane polypeptides and nucleic acids encoding the same
EP1395655A2 (en) 2000-06-06 2004-03-10 Millennium Pharmaceuticals, Inc. 52906, 33408, and 12189, potassium channel family members and uses thereof
WO2001094614A2 (en) * 2000-06-07 2001-12-13 Cyclacel Limited Methods for monitoring enzyme activity
EP1309676A2 (en) * 2000-06-12 2003-05-14 Bayer Aktiengesellschaft Human epithin-like serine protease
EP1293510A4 (en) * 2000-06-22 2005-02-09 Takeda Pharmaceutical Novel polypeptide and its dna
PT1297006E (en) 2000-06-23 2006-05-31 Bayer Healthcare Ag REGULATION OF SERENIAL PROTEASE SIMILAR TO HUMAN PROSTASIN
AU2001272493A1 (en) * 2000-06-26 2002-01-08 Merck Patent G.M.B.H Receptor for phosphoinositides-1 associated scaffold protein
DK1325115T3 (en) * 2000-06-26 2016-11-21 Zymogenetics Inc CYTOKINRECEPTOR ZCYTOR17
EP1294882A2 (en) * 2000-06-28 2003-03-26 Amgen, Inc. B7-like molecules and uses thereof
US7355023B2 (en) * 2000-06-29 2008-04-08 Duke University Isolated nucleic acids and polypeptides associated with glucose homeostasis disorders and method of detecting the same
US20020064821A1 (en) * 2000-07-05 2002-05-30 Merkulov Gennady V. Isolated human ion channel proteins, nucleic acid molecules encoding human ion channel proteins, and uses thereof
US20030171324A1 (en) * 2000-07-18 2003-09-11 Shyam Ramakrishnan Regulation of human desc1-like serine protease
AU2001278488A1 (en) * 2000-07-25 2002-02-05 Bayer Aktiengesellschaft Regulation of human matriptase-like serine protease
US7402660B2 (en) 2000-08-02 2008-07-22 The Johns Hopkins University Endothelial cell expression patterns
US6924364B2 (en) 2000-08-04 2005-08-02 Zymogenetics, Inc. Human secreted protein, Zzp1
WO2002012469A2 (en) * 2000-08-09 2002-02-14 Millennium Pharmaceuticals, Inc. A human adenylate kinase family member and uses therefor
DK1313766T3 (en) * 2000-08-21 2008-10-27 Organon Nv Coaster, a human coactivator for steroid receptors
ATE328899T1 (en) 2000-09-22 2006-06-15 Kyowa Hakko Kogyo Kk CELL NUCLEAR LOCALIZING PROTEIN FOR BETA-CATENIN
AU2002210514A1 (en) * 2000-09-27 2002-04-08 Bayer Aktiengesellschaft Regulation of human phosphatidylinositol-specific phospholipase c-like enzyme
WO2002029058A2 (en) * 2000-10-05 2002-04-11 Curagen Corporation Human proteins, polynucleotides encoding them and methods of using the same
WO2002029049A2 (en) * 2000-10-05 2002-04-11 Bayer Aktiengesellschaft Regulation of human sodium-dependent monoamine transporter
US20030124704A1 (en) * 2000-10-06 2003-07-03 Strittmatter Stephen M. Nogo receptor homologs
WO2002030974A2 (en) * 2000-10-12 2002-04-18 Curagen Corporation Proteins and nucleic acids encoding same
EP1326979A2 (en) * 2000-10-18 2003-07-16 ZymoGenetics, Inc. Human cyclic nucleotide binding protein
JP4460829B2 (en) * 2000-10-20 2010-05-12 コモ・バイオテック・アンパルトセルスカブ Pregnancy-related plasma protein-A2 (PAPP-A2)
US20030068624A1 (en) * 2000-10-26 2003-04-10 Recipon Herve E. Compositions and methods relating to lung specific genes and proteins
AU2002243444A1 (en) 2000-10-31 2002-06-24 Diadexus, Inc. Compositions and methods relating to colon specific genes and proteins
WO2002050278A2 (en) * 2000-11-15 2002-06-27 Lexicon Genetics Incorporated Novel human secreted proteins and polynucleotides encoding the same
ES2349114T3 (en) * 2000-11-22 2010-12-28 University Of Maryland, Baltimore USE OF CLYA HEMOLISIN FOR THE EXCRETION OF FUSION PROTEINS.
AU2002233201A1 (en) * 2000-11-27 2002-06-03 Bayer Aktiengesellschaft Regulation of human tyrosine phosphatase
US20020081653A1 (en) * 2000-11-28 2002-06-27 Chunhua Yan Isolated human transporter proteins, nucleic acid molecules encoding human transporter proteins, and uses thereof
WO2002046477A2 (en) * 2000-12-07 2002-06-13 Chiron Corporation Endogenous retroviruses up-regulated in prostate cancer
CA2431007A1 (en) 2000-12-11 2002-07-18 Lexicon Genetics Incorporated Novel human kinase and polynucleotides encoding the same
DE10061931A1 (en) * 2000-12-13 2001-09-06 Marianne Kroepelin A new nucleic acid encoding M73791 HUMNOVGENE also known as HUMQM and human RPL10 is involved in tumor suppression and useful to diagnose and treat cancer
WO2002057460A2 (en) 2000-12-20 2002-07-25 Bristol-Myers Squibb Company Polynucleotides encoding human phosphatases
US6949363B2 (en) 2000-12-22 2005-09-27 Brilstol-Myers Squibb Polynucleotides encoding novel human leucine-rich repeat containing protein expressed predominately in bone marrow, HLRRBM1
US7247705B2 (en) * 2000-12-28 2007-07-24 Takeda Pharmaceutical Company Limited Proteins having glucose transporter activity
WO2002053748A1 (en) * 2001-01-02 2002-07-11 Bayer Aktiengesellschaft Regulation of human 3-ss-hydroxysteroid dehydrogenase-isomerase
GB0100750D0 (en) * 2001-01-11 2001-02-21 Inpharmatica Ltd Novel proteins
US6605710B2 (en) * 2001-01-11 2003-08-12 Board Of Trustees Of Michigan State University Gene whose expression promotes differentiation of myeloid progenitor cells into neutrophils and/or monocytes/macrophages
WO2002079473A2 (en) * 2001-01-12 2002-10-10 Incyte Genomics, Inc. Molecules for diagnostics and therapeutics
EP1490509A4 (en) * 2001-01-12 2005-06-01 Exelixis Inc Modulating insulin receptor signaling
GB2375110A (en) * 2001-01-30 2002-11-06 Aeomica Inc Human LCCL_domain containing peptides
WO2002061049A2 (en) * 2001-01-31 2002-08-08 Millennium Pharmaceuticals, Inc. Novel molecules of the pyrin/nbs/lrr protein family and uses thereof
US7041643B2 (en) 2001-01-31 2006-05-09 Millennium Pharmaceuticals, Inc. Molecules of the PYRIN/NBS/LRR protein family and uses thereof
AU2002229440A1 (en) * 2001-02-02 2002-08-19 Mount Sinai Hospital Human acid phosphatase gene
US20030166041A1 (en) * 2001-02-08 2003-09-04 Hillman Jennifer L. TIMM8b-related protein
US7601825B2 (en) 2001-03-05 2009-10-13 Agensys, Inc. Nucleic acid and corresponding protein entitled 121P1F1 useful in treatment and detection of cancer
US20060035237A1 (en) 2002-08-26 2006-02-16 Markowitz Sanford D Methods and compositions for categorizing patients
US7081516B2 (en) 2002-08-26 2006-07-25 Case Western Reserve University Methods for categorizing patients
US7118912B2 (en) * 2002-08-26 2006-10-10 Case Western Reserve University Methods and compositions for categorizing patients
CA2439693A1 (en) * 2001-03-05 2002-10-31 Aventis Pharma S.A. Nucleic acids of the human abcc12 gene, vectors containing such nucleic acids and uses thereof
US6924358B2 (en) 2001-03-05 2005-08-02 Agensys, Inc. 121P1F1: a tissue specific protein highly expressed in various cancers
MXPA03008039A (en) 2001-03-08 2004-05-24 Immunex Corp Human serpin polypeptides.
US8647826B2 (en) 2001-03-14 2014-02-11 Agensys, Inc. Nucleic acid and corresponding protein entitled 125P5C8 useful in treatment and detection of cancer
US7271240B2 (en) 2001-03-14 2007-09-18 Agensys, Inc. 125P5C8: a tissue specific protein highly expressed in various cancers
AU2002250143A1 (en) * 2001-03-16 2002-10-03 Eli Lilly And Company Lp mammalian proteins; related reagents
US6551809B2 (en) * 2001-03-20 2003-04-22 Applera Corporation Isolated human phosphatase proteins, nucleic acid molecules encoding human phophatase proteins, and uses thereof
JP4590157B2 (en) 2001-03-21 2010-12-01 アイシス イノヴェイション リミテッド Assays, methods and means
US20040209264A1 (en) * 2001-03-23 2004-10-21 Thomas Kramps New essential downstream component of the wingless signalling pathway
JP2004529639A (en) * 2001-03-23 2004-09-30 バイエル アクチェンゲゼルシャフト Regulation of human bispecific protein phosphatase 7-like protein
NZ527971A (en) * 2001-03-27 2006-03-31 Dendreon Corp Nucleic acid molecules encoding a transmembrane serine protease 9, the encoded polypeptides and methods based thereon
WO2002081515A1 (en) * 2001-04-03 2002-10-17 Kyowa Hakko Kogyo Co., Ltd. Insulin-like growth factor-binding protein
WO2002088303A2 (en) 2001-04-03 2002-11-07 Bristol-Myers Squibb Company Polynucleotide encoding a novel cysteine protease of the calpain superfamily, can-12, and variants thereof
AU2002258626B2 (en) 2001-04-10 2007-01-18 Agensys, Inc. Nucleid acid and corresponding protein entitled 158P3D2 useful in treatment and detection of cancer
WO2002083921A2 (en) 2001-04-10 2002-10-24 Agensys, Inc. Nuleic acids and corresponding proteins useful in the detection and treatment of various cancers
AU2002303340A1 (en) 2001-04-10 2002-10-28 Agensys, Inc. Nucleic acid and corresponding protein entitled 184p1e2 useful in treatment and detection of cancer
US20030191073A1 (en) 2001-11-07 2003-10-09 Challita-Eid Pia M. Nucleic acid and corresponding protein entitled 161P2F10B useful in treatment and detection of cancer
WO2002083866A2 (en) * 2001-04-17 2002-10-24 The Board Of Trustees Of The University Of Arkansas Repeat sequences of the ca125 gene and their use for diagnostic and therapeutic interventions
US7309760B2 (en) * 2001-04-17 2007-12-18 The Board Of Trustees Of The University Of Arkansas Repeat sequences of the CA125 gene and their use for diagnostic and therapeutic interventions
CA2444504A1 (en) * 2001-04-26 2002-11-07 Pe Corporation (Ny) Isolated human secreted proteins, nucleic acid molecules encoding human secreted proteins, and uses thereof
US7713526B2 (en) 2001-05-01 2010-05-11 The Regents Of The University Of California Wnt and frizzled receptors as targets for immunotherapy in head and neck squamous cell carcinomas
AU2002308557A1 (en) 2001-05-01 2002-11-11 The Regents Of The University Of California Wnt and frizzled receptors as targets for immunotherapy in head and neck squamous cell carcinomas
JP2004535794A (en) 2001-05-03 2004-12-02 チルドレンズ・メディカル・センター・コーポレイション Sperm-specific cation channels and uses thereof
US7217785B2 (en) * 2001-05-09 2007-05-15 The Regents Of The University Of California Cysteine-containing peptides having antioxidant properties
US20040152092A1 (en) * 2001-05-11 2004-08-05 Alex Smolyar Regulation of human phosphatidic acid phosphatase type 2c-like protein
US7304034B2 (en) 2001-05-15 2007-12-04 The Feinstein Institute For Medical Research Use of HMGB fragments as anti-inflammatory agents
US7220723B2 (en) 2001-05-15 2007-05-22 The Feinstein Institute For Medical Research Inhibitors of the interaction between HMGB polypeptides and toll-like receptor 2 as anti-inflammatory agents
CN100447154C (en) * 2001-05-15 2008-12-31 费因斯坦医学研究学院 Use of HMG fragments as anti-inflammatory agents
WO2002095002A2 (en) * 2001-05-22 2002-11-28 University Of Chicago N4 virion single-stranded dna dependent rna polymerase
WO2002099134A1 (en) * 2001-06-05 2002-12-12 Auckland Uniservices Limited Methods and compositions for assessment of pulmonary function and disorders
WO2002098889A2 (en) * 2001-06-05 2002-12-12 Exelixis Inc. MAP3Ks AS MODIFIER OF THE p53 PATHWAY AND METHODS OF USE
WO2003008450A1 (en) * 2001-06-12 2003-01-30 Kyogo Itoh Tumor antigen
EP1266906A1 (en) * 2001-06-13 2002-12-18 Millennium Pharmaceuticals, Inc. 46584, a human transporter family member and uses therefor
JP2004531268A (en) * 2001-06-14 2004-10-14 レキシコン・ジェネティクス・インコーポレーテッド Novel human transporter protein and polynucleotide encoding the same
US20030143668A1 (en) * 2001-06-18 2003-07-31 National Institute Of Advanced Industrial Guanosine triphosphate-binding protein coupled receptors
US20030120040A1 (en) 2001-06-29 2003-06-26 Genentech, Inc. Secreted and Transmembrane polypeptides and nucleic acids encoding the same
ITMI20011465A1 (en) * 2001-07-10 2003-01-10 Novuspharma Spa GENE WITH ONCOSUPPRESSIVE ACTIVITY
WO2003006688A2 (en) * 2001-07-13 2003-01-23 University Of Aarhus Methods for diagnosis and treatment of diseases associated with altered expression of gnas
JP2003024080A (en) * 2001-07-19 2003-01-28 Univ Tokyo Protein inducing p53-dependent apoptosis and method for screening apoptosis controlling agent
US20040210042A1 (en) * 2001-07-19 2004-10-21 Tsuchida Jun-Ichi Polypeptides relating to signal transfer of advanced glycation end product receptor
WO2003010198A1 (en) * 2001-07-26 2003-02-06 Kenton Srl Identification of specific tumour antigens by selection of cdna libraries with sera and use of said antigens in diagnostic techniques
WO2003012038A2 (en) 2001-07-27 2003-02-13 Wyeth Aggrecanase molecules
WO2003024389A2 (en) 2001-07-30 2003-03-27 Immunex Corporation T. reesei phytase enyzmes, polynucleides encoding the enzymes, vectors and host cells thereof, and methods of using
FR2828209A1 (en) * 2001-08-01 2003-02-07 Cytomics Systems New nucleic acid encoding HECT-domain polypeptides, useful e.g. for treatment of cancer or inflammation, also new proteins, antibodies and modulators
JP2003135063A (en) * 2001-08-24 2003-05-13 Chiba Prefecture New gene nedl-1
EP1757693A3 (en) * 2001-08-24 2007-05-09 Hisamitsu Pharmaceutical Co. Inc. Nucleic acids showing difference in expression between hepatoblastoma and normal liver
IL160651A0 (en) 2001-08-31 2004-07-25 Agensys Inc Nucleic acid and correspondidng protein entitled 205pib5 and pharmaceutical compositions containing the same
ES2537074T3 (en) 2001-09-06 2015-06-02 Agensys, Inc. Nucleic acid and corresponding protein called STEAP-1 useful in the treatment and detection of cancer
US7494646B2 (en) 2001-09-06 2009-02-24 Agensys, Inc. Antibodies and molecules derived therefrom that bind to STEAP-1 proteins
EP2339029B1 (en) * 2001-09-14 2016-03-09 Clinical Genomics Pty. Ltd Nucleic acid markers for use in determining predisposition to neoplasm and/or adenoma
WO2003099318A2 (en) * 2002-05-29 2003-12-04 DeveloGen Aktiengesellschaft für entwicklungsbiologische Forschung Pancreas-specific proteins
US20050222070A1 (en) 2002-05-29 2005-10-06 Develogen Aktiengesellschaft Fuer Entwicklungsbiologische Forschung Pancreas-specific proteins
AU2002326957B2 (en) * 2001-09-19 2006-08-10 The Government Of The United States Of America, Represented By The Secretary, Department Of Health And Human Services Transductin-1 and transductin-2 and applications to hereditary deafness
JP2003093056A (en) * 2001-09-26 2003-04-02 Japan Science & Technology Corp ENZYME PHOSPHORYLATING Ser46 of p53
US20030108928A1 (en) * 2001-10-09 2003-06-12 Millennium Pharmaceuticals, Inc. MID 241 receptor, a novel G-protein coupled receptor
CA2463578A1 (en) * 2001-10-16 2003-04-24 Autogen Research Pty Ltd Differentially expressed genes associated with obesity and type 2 diabetes
US20030100034A1 (en) * 2001-10-16 2003-05-29 Millennium Pharmaceuticals, Inc. 9136, a human aldehyde dehydrogenase family member and uses therefor
DE10152598A1 (en) * 2001-10-19 2003-05-22 Schering Ag ARL-2-for fertility control
US8729248B2 (en) 2001-10-22 2014-05-20 Children's Medical Center Corporation Sperm-specific cation channel, CATSPER2, and uses therefor
AU2002364939A1 (en) 2001-10-22 2003-07-09 Board Of Regents, The University Of Texas System Sperm-specific cation channel, catsper2, and uses therefor
DE10153259A1 (en) * 2001-10-31 2003-05-22 Osram Opto Semiconductors Gmbh Optoelectronic component
AU2002359331A1 (en) * 2001-10-31 2003-06-10 Millennium Pharmaceuticals, Inc. Methods and compositions for the treatment and diagnosis of cellular proliferative disorders using 32222
EP1446507A2 (en) * 2001-10-31 2004-08-18 Pfizer Products Inc. Therapeutics and diagnostics for disorders of erythropoiesis
IL161268A0 (en) 2001-11-07 2004-09-27 Agensys Inc Nucleic acid and corresponding protein entitled 161p2f10b and pharmaceutical and diagnostic compositions containing the same
EP1442118A2 (en) * 2001-11-08 2004-08-04 DeveloGen Aktiengesellschaft für entwicklungsbiologische Forschung Men protein, gst2, rab-rp1, csp, f-box protein lilina/fbl7, abc50, coronin, sec61 alpha, or vhappa1-1, or homologous proteins involved in the regulation of energy homeostasis
WO2003042370A2 (en) * 2001-11-14 2003-05-22 The Government Of The United States Of America As Represented By The Secretary Of The Department Of Health And Human Services New gene expressed in prostate cancer and methods of use
US7074576B2 (en) * 2001-11-23 2006-07-11 Syn X Pharma, Inc. Protein biopolymer markers indicative of alzheimer's disease
US20030100016A1 (en) * 2001-11-23 2003-05-29 George Jackowski Complement C3 precursor biopolymer markers predictive of Alzheimers disease
CA2468326C (en) * 2001-11-26 2015-11-17 Pierre Legrain Protein-protein interactions in human immunodeficiency virus
WO2003046165A1 (en) * 2001-11-26 2003-06-05 Bayer Healthcare Ag Regulation of human aldose reductase-like protein
AU2002360460A1 (en) * 2001-12-03 2003-06-17 Nuvelo, Inc. Methods and materials relating to novel polypeptides and polynucleotides
EP1318406B1 (en) * 2001-12-04 2005-11-09 B.R.A.H.M.S Aktiengesellschaft Use of Glycine-N-Acyl-Transferase (GNAT) for diagnosis of inflammatory diseases and sepsis
US20060275747A1 (en) * 2001-12-07 2006-12-07 Hardy Stephen F Endogenous retrovirus up-regulated in prostate cancer
AU2002351332A1 (en) * 2001-12-07 2003-06-23 Chiron Corporation Endogenous retrovirus polypeptides linked to oncogenic transformation
GB2398786A (en) * 2001-12-10 2004-09-01 Amersham Plc Human prostate cancer candidate protein 1
WO2003052119A2 (en) * 2001-12-14 2003-06-26 Incyte Genomics, Inc. Transporters and ion channels
AU2002357850A1 (en) * 2001-12-14 2003-06-30 Incyte Genomics, Inc. Neurotransmission-associated proteins
ES2337989T3 (en) * 2001-12-18 2010-05-03 Endocube Sas NEW PROTEINS ASSOCIATED WITH THE DEATH OF THE THAP FAMILY AND RELATED PAR4 ROUTES INVOLVED IN THE CONTROL OF APOPTOSIS.
US7858297B2 (en) * 2001-12-18 2010-12-28 Centre National De La Recherche Scientifique Cnrs Chemokine-binding protein and methods of use
JP3837524B2 (en) 2001-12-19 2006-10-25 国立大学法人 長崎大学 Nucleic acid, probe comprising the nucleic acid, and screening method using the probe
JP2003245084A (en) 2001-12-20 2003-09-02 Morinaga Milk Ind Co Ltd New gene useful for diagnosis and treatment of aplasia of corpus callosum and aspermatogenesis and use thereof
US20050112618A1 (en) * 2001-12-21 2005-05-26 Fagan Richard J. Secreted protein
GB0130720D0 (en) * 2001-12-21 2002-02-06 Serono Internat S A Proteins
WO2003059938A1 (en) * 2001-12-21 2003-07-24 Ramot At Tel Aviv University Ltd. Chromatin remodeling protein as a marker expressed by stromal progenitor cells
WO2003060129A1 (en) * 2002-01-16 2003-07-24 Bayer Healthcare Ag Regulation of human fatty acid coa ligase-like amp-binding enzyme
WO2003060110A2 (en) * 2002-01-16 2003-07-24 Bayer Healthcare Ag Regulation of human fatty acid coa ligase-like amp-binding enzyme
US20060088828A1 (en) * 2002-01-23 2006-04-27 Harris Peter C Polycystic kidney disease nucleic acids and proteins
WO2003062429A1 (en) * 2002-01-23 2003-07-31 Yamanouchi Pharmaceutical Co., Ltd. Novel serine protease
GB0202149D0 (en) * 2002-01-30 2002-03-20 Univ Edinburgh Pluripotency determining factors and uses thereof
EP1474515A2 (en) * 2002-02-01 2004-11-10 Bayer HealthCare AG Regulation of human hematopoietin receptor-like protein
EP1485406A4 (en) * 2002-02-20 2006-02-01 Invitrogen Corp Highly homogeneous molecular markers for electrophoresis
EP1338606A1 (en) * 2002-02-22 2003-08-27 Magnus Von Knebel Doeberitz Nucleic Acid encoding an HERV-H polypeptide for diagnosing colorectal cancer
GB2388113A (en) * 2002-03-06 2003-11-05 Oxford Glycosciences B-cell malignancy-associated protein
JP2003259872A (en) * 2002-03-07 2003-09-16 Japan Science & Technology Corp Human esophageal carcinoma antigenic peptide and method for diagnosing esophageal carcinoma
JPWO2003074701A1 (en) * 2002-03-07 2005-06-30 株式会社 ビーエフ研究所 p18AβrP gene and p18AβrP protein, a novel gene / protein (p60TRP) that interacts with these to suppress cell death, and a substance that promotes cell death
IL163733A0 (en) 2002-03-08 2005-12-18 Eisai Co Ltd Presynaptic protein p120
DE10211088A1 (en) * 2002-03-13 2003-09-25 Ugur Sahin Gene products differentially expressed in tumors and their use
AU2012216265B2 (en) * 2002-03-13 2015-01-22 Biontech Ag Genetic products differentially expressed in tumors and use thereof
EP1491630B1 (en) * 2002-03-14 2009-11-11 National Institute of Advanced Industrial Science and Technology N-acetylglucosamine transferase, nucleic acid encoding the same and use thereof in diagnosing cancer and/or tumor
WO2003083078A2 (en) 2002-03-28 2003-10-09 Bristol-Myers Squibb Company Novel human cell surface protein with immunoglobulin folds, bgs-19
US20040002593A1 (en) * 2002-04-04 2004-01-01 Reed John C. PAAD domain-containing polypeptides, encoding nucleic acids, and methods of use
AU2003222189A1 (en) * 2002-04-05 2003-10-27 Incyte Corporation Secreted proteins
AU2003222965A1 (en) * 2002-04-05 2003-10-27 University Court Of The University Of Edinburgh Schizophrenia associated genes
US7544767B2 (en) * 2002-04-05 2009-06-09 Burnham Institute For Medical Research HMGN2 peptides and related molecules that selectively home to tumor blood vessels and tumor cells
EP1576099A4 (en) 2002-04-12 2007-07-11 Bristol Myers Squibb Co Novel polynucleotides encoding the human citron kinase polypeptide, bmsnkc 0020/0021
US7459424B2 (en) 2002-04-22 2008-12-02 The Borad Of Trustees Of The Leland Stanford Junior University Peptide inhibitors of protein kinase C γ for pain management
EP1576096A4 (en) * 2002-04-23 2007-05-23 Univ Georgia Res Found N-ACETYLGLUCOSAMINYLTRANSFERASE Vb CODING SEQUENCE, RECOMBINANT CELLS AND METHODS
US20060160728A1 (en) * 2002-04-26 2006-07-20 Heinz Von Der Kammer Diagnostic and therapeutic use of ensandin-0138 gene and protein for neurodegenerative diseases
AU2003301984A1 (en) * 2002-05-15 2004-06-03 Incyte Corporation Vesicle-associated proteins
ATE414789T1 (en) * 2002-05-21 2008-12-15 Mtm Lab Ag MARKER FOR LUNG TUMORS
US7339028B2 (en) * 2002-06-06 2008-03-04 The Trustees Of Columbia University In The City Of New York Mahoganoid polypeptides, and related compositions and methods
SG145559A1 (en) 2002-06-06 2008-09-29 Oncotherapy Science Inc Genes and polypeptides relating to human colon cancers
JP2005528913A (en) * 2002-06-10 2005-09-29 メルク エンド カムパニー インコーポレーテッド Isolated nucleic acid molecules encoding novel centromere-related motor proteins and uses thereof
AU2003276679A1 (en) * 2002-06-13 2003-12-31 Chiron Corporation Vectors for expression of hml-2 polypeptides
AU2003238032A1 (en) * 2002-06-24 2004-01-06 Bayer Healthcare Ag Regulation of human growth hormone-releasing hormone receptor
FR2841261B1 (en) * 2002-06-25 2004-08-27 Univ Pasteur COMPOSITIONS AND METHODS FOR DETECTING PATHOLOGIES AFFECTING NEUROMUSCULAR TRANSMISSION
WO2004001017A2 (en) * 2002-06-25 2003-12-31 Tao Biosciences, Llc Methods for identifying antimicrobial agents, the agents identified therewith and methods of using same
AU2003251008A1 (en) * 2002-07-09 2004-01-23 Vlaams Interuniversitair Instituut Voor Biotechnologie Vzw Diagnostic tests for the detection of peripheral neuropathy
AU2003253930A1 (en) * 2002-07-11 2004-02-02 Seattle Genetics, Inc. Sga-72m, a cancer associated antigen, and uses thereof
CA2493061A1 (en) 2002-07-16 2004-01-22 Guangjie Cheng Regulatory protein for nox enzymes
EP1382613A1 (en) 2002-07-16 2004-01-21 Sumitomo Chemical Company, Limited Novel G protein, polynucleotide encoding the same and utilization thereof
GB0216661D0 (en) * 2002-07-17 2002-08-28 Ares Trading Sa Protein
US7135556B2 (en) 2002-07-19 2006-11-14 Schering Corporation NPC1L1 (NPC3) and methods of use thereof
US20040132058A1 (en) * 2002-07-19 2004-07-08 Schering Corporation NPC1L1 (NPC3) and methods of use thereof
US20050163821A1 (en) * 2002-08-02 2005-07-28 Hsing-Wen Sung Drug-eluting Biodegradable Stent and Delivery Means
US20050019404A1 (en) * 2003-06-30 2005-01-27 Hsing-Wen Sung Drug-eluting biodegradable stent
AU2003258056A1 (en) 2002-08-07 2004-02-25 Children's Medical Center Corporation Sperm-specific cation channel, catsper-3, and uses therefor
AU2003263981A1 (en) 2002-08-07 2004-02-25 Children's Medical Center Corporation Sperm-specific cation channel, catsper-4, and uses therefor
US7408048B2 (en) 2002-08-09 2008-08-05 Melbourne Health Mammalian grainyhead transcription factors
AU2003304034A1 (en) * 2002-08-09 2004-11-01 Curagen Corporation Therapeutic polypeptides, nucleic acids encoding same, and methods of use
WO2004016733A2 (en) 2002-08-16 2004-02-26 Agensys, Inc. Nucleic acid and corresponding protein entitled 251p5g2 useful in treatment and detection of cancer
AU2003282785A1 (en) * 2002-08-26 2004-03-11 Case Western Reserve University Methods for treating patients and identifying therapeutics
DE10242146A1 (en) 2002-09-04 2004-03-18 Eberhard-Karls-Universität Tübingen Universitätsklinikum Antibodies for the identification and / or isolation of hematopoietic stem cells
EP1575497A4 (en) * 2002-09-11 2010-11-03 Genentech Inc Novel composition and methods for the treatment of psoriasis
EP1398372A1 (en) * 2002-09-13 2004-03-17 Universitätsklinikum Charité an der Humboldt- Universität zu Berlin Lipid phosphate phosphatases and uses thereof for treating neuronal diseases
WO2004024914A1 (en) * 2002-09-13 2004-03-25 Sankyo Company, Limited 2’,5’-oligoadenylate phosphodiesterase
US6892925B2 (en) * 2002-09-18 2005-05-17 International Business Machines Corporation Solder hierarchy for lead free solder joint
US20060292668A1 (en) * 2002-09-19 2006-12-28 Zhengbin Yao Nuclear factor of activated t cell receptor
JP2006500074A (en) * 2002-09-27 2006-01-05 オリディス・ビオメド・フォルシュングス−ウント・エントヴィックルングス・ゲーエムベーハー Polypeptides and nucleic acids encoding them and their use for prevention, diagnosis or treatment of liver damage and epithelial cancer
TW200413725A (en) * 2002-09-30 2004-08-01 Oncotherapy Science Inc Method for diagnosing non-small cell lung cancers
AU2003279829A1 (en) * 2002-10-04 2004-05-04 Incyte Corp Protein modification and maintenance molecules
EP1559725A4 (en) * 2002-10-04 2005-12-14 Mitsubishi Pharma Corp Antigen recognizing antibody
EP1575518A4 (en) 2002-10-10 2007-08-22 Wyeth Corp Compositions, organisms and methodologies employing a novel human kinase
GB0224116D0 (en) * 2002-10-16 2002-11-27 Ares Trading Sa Proteins
WO2004038411A2 (en) * 2002-10-24 2004-05-06 Evotec Neurosciences Gmbh Diagnostic and therapeutic use of ensadin-0289 gene and protein for neurodegenerative diseases
EP1554385A2 (en) * 2002-10-24 2005-07-20 Wyeth Calcineurin-like human phoshphoesterase
EP1557178A4 (en) * 2002-10-29 2009-07-22 Takeda Pharmaceutical Use of sglt homolog
US7537914B1 (en) * 2002-11-04 2009-05-26 Florida State University Research Foundation, Inc. Nucleic acid and allergenic polypeptides encoded thereby in cashew nuts (Anacardium occidentale)
WO2004044205A1 (en) * 2002-11-11 2004-05-27 Korea Chungang Educational Foundation Plant seed-specific expression promoter derived from sesame and seed-specific expression vector comprising the promoter
NZ541504A (en) * 2002-11-13 2009-04-30 Uab Research Foundation Synthetic single domain polypeptides mimicking apolipoprotein E and methods of use
US7405289B2 (en) * 2002-11-13 2008-07-29 Metabolex, Inc. Pancreatic islet transcription factor and uses thereof
WO2004048521A2 (en) * 2002-11-21 2004-06-10 Wyeth Composition and method for treating lupus nephritis
DE10254601A1 (en) * 2002-11-22 2004-06-03 Ganymed Pharmaceuticals Ag Gene products differentially expressed in tumors and their use
EP1572975B1 (en) 2002-11-26 2014-07-16 University Of Tennessee Research Foundation Streptococcus uberis adhesion molecule
AU2003290664A1 (en) 2002-11-27 2004-06-23 Wei Liu Compositions, organisms and methodologies employing a novel human kinase
WO2004050702A2 (en) * 2002-12-04 2004-06-17 Applied Research Systems Ars Holding N.V. Novel ifngamma-like polypeptides
US6854636B2 (en) * 2002-12-06 2005-02-15 International Business Machines Corporation Structure and method for lead free solder electronic package interconnections
US7465581B2 (en) * 2002-12-18 2008-12-16 The Scripps Research Institute ANKTM1, a cold-activated TRP-like channel expressed in nociceptive neurons
US20050049214A1 (en) * 2003-01-15 2005-03-03 Chiron Corporation Inhibition of E3-ubiquitin ligase HAKAI for treatment of proliferative disorders
US7638270B2 (en) 2003-01-24 2009-12-29 Agensys, Inc. Nucleic acids and corresponding proteins entitled 254P1D6B useful in treatment and detection of cancer
AU2007231823B2 (en) * 2003-01-24 2011-07-21 Agensys, Inc. Nucleic acids and corresponding proteins entitled 254P1D6B useful in treatment and detection of cancer
US7304036B2 (en) 2003-01-28 2007-12-04 Microbia, Inc. Methods and compositions for the treatment of gastrointestinal disorders
WO2004069174A2 (en) * 2003-01-31 2004-08-19 Slil Biomedical Corporation Monitoring and treatment of amyotrophic lateral sclerosis
US7521534B1 (en) * 2003-03-03 2009-04-21 The University Board Of Regents Of Texas System IKK gamma gene products and methods for making and using same
EP1923400A1 (en) * 2003-03-03 2008-05-21 Genentech, Inc. Compositions and methods for the treatment of systemic lupus erythematosis
US7749695B2 (en) 2003-03-03 2010-07-06 Alexander Abbas PRO polypeptides for diagnosis of systemic lupus erythematosis
CA2518552A1 (en) * 2003-03-11 2004-09-23 Genentech, Inc. Novel compositions and methods for the treatment of immune related disease
EP1608983A2 (en) * 2003-03-20 2005-12-28 Novartis Forschungsstiftung, Zweigniederlassung Friedrich Miescher Institute for Biomedical Research Materials and methods for modulating cell motility
EP1464651A1 (en) * 2003-04-03 2004-10-06 Deutsches Krebsforschungszentrum Stiftung des öffentlichen Rechts Dystrophin-related protein (Drop1), a marker for carcinomas
WO2004087748A1 (en) * 2003-04-04 2004-10-14 Centre National De La Recherche Scientifique Protein with fusogenic activity, nucleic acid sequences encoding said protein and pharmaceutical compositions comprising the same
WO2004092213A1 (en) * 2003-04-08 2004-10-28 The Government Of The United States Of America As Represented By The Secretary, Department Of Healthand Human Services Gene expressed in prostate cancer, methods and use thereof
WO2004094651A2 (en) * 2003-04-18 2004-11-04 Five Prime Therapeutics, Inc. Novel human polypeptides encoded by polynucleotides
US20050032695A1 (en) * 2003-05-30 2005-02-10 Aventis Pharma Deutschland Gmbh Method for alleviating pain using protein associated with MYC and related compounds, and assays for identifying such compounds
US7696169B2 (en) 2003-06-06 2010-04-13 The Feinstein Institute For Medical Research Inhibitors of the interaction between HMGB polypeptides and toll-like receptor 2 as anti-inflammatory agents
FR2843753A1 (en) * 2003-06-20 2004-02-27 Gene Signal Antisense nucleic molecule useful as inhibitor of angiogenesis in the treatment of angiogenic disorders, e.g., rheumatoid arthritis, atherosclerosis and endometriosis
EP1636379A2 (en) * 2003-06-26 2006-03-22 Exonhit Therapeutics S.A. Prostate specific genes and the use thereof as targets for prostate cancer therapy and diagnosis
FR2857098B1 (en) * 2003-07-04 2006-05-05 Inst Nat Sante Rech Med METHOD FOR THE EVIDENCE OF A MOLECULAR EVENT IN A CELL USING FLUORESCENT MARKER PROTEINS
GB0315827D0 (en) * 2003-07-05 2003-08-13 Imp College Innovations Ltd Methods
WO2005049791A2 (en) * 2003-07-14 2005-06-02 The Uab Research Foundation Identification of reagents for the diagnosis and study of francisella
EP1507004A1 (en) * 2003-08-14 2005-02-16 DKFZ Deutsches Krebsforschungszentrum Method to inhibit the propagation of an undesired cell population
WO2005019255A1 (en) 2003-08-25 2005-03-03 Pieris Proteolab Ag Muteins of tear lipocalin
WO2005024011A2 (en) * 2003-09-04 2005-03-17 Applied Research Systems Ars Holding N.V. Novel ubp8rp polypeptides and their use in the treatment of psoriasis
US8283307B2 (en) 2003-09-08 2012-10-09 Merck Serono Sa Treatment of fibrotic disease
DE10341812A1 (en) * 2003-09-10 2005-04-07 Ganymed Pharmaceuticals Ag Differentially expressed in tumors gene products and their use
CA2538763C (en) 2003-09-11 2015-05-05 Critical Therapeutics, Inc. Monoclonal antibodies against hmgb1
WO2005030953A1 (en) * 2003-09-30 2005-04-07 Daiichi Pharmaceutical Co., Ltd. Tetrahydrofolate synthase gene
EP1692260A2 (en) * 2003-10-06 2006-08-23 Centocor, Inc. Hypoxia - responsive human cngh-0002 genes and polypeptides
US7985401B2 (en) 2003-10-31 2011-07-26 The Regents Of The University Of California Peptides whose uptake by cells is controllable
US7431915B2 (en) * 2003-10-31 2008-10-07 The Regents Of The University Of California Peptides whose uptake by cells is controllable
WO2005051981A1 (en) * 2003-11-19 2005-06-09 Pepharm R & D Limited Biologically active peptides comprising isoleucyl-valyl-threonyl-asparaginyl-threonyl-threonine (ivtntt)
WO2005051994A2 (en) * 2003-11-21 2005-06-09 Zymogenetics, Inc. Ztnf11, a tumor necrosis factor
NZ547629A (en) * 2003-11-26 2009-08-28 Max Planck Gesellschaft Antibodies binding to conformationally discriminating epitopes of human IGG Fc receptor IIb and IIb
CA2549710A1 (en) * 2003-12-15 2005-06-23 Genesis Research And Development Corporation Limited Polynucleotides and polypeptides isolated from lactobacillus rhamnosus hn001 materials incorporating them and methods for using them
GB0329958D0 (en) * 2003-12-24 2004-01-28 Univ Manchester Treatment of viral infections
CA2553769C (en) 2004-01-16 2011-01-04 Merck & Co., Inc. Npc1l1 (npc3) and methods of identifying ligands thereof
US7785805B2 (en) 2004-01-28 2010-08-31 Exelixis, Inc. MAN2As as modifiers of the IGFR pathway and methods of use
US7148192B2 (en) * 2004-01-29 2006-12-12 Ebwe Pharma Ges. M.H. Nfg.Kg Neuroprotective dietary supplement
US20050255530A1 (en) * 2004-02-06 2005-11-17 Keefer Carol L Identification and isolation of pluripotency determining factors
GB0403142D0 (en) * 2004-02-12 2004-03-17 Ares Trading Sa Protein
US20050186577A1 (en) 2004-02-20 2005-08-25 Yixin Wang Breast cancer prognostics
CA2560066A1 (en) 2004-03-01 2005-09-15 The Cbr Institute For Biomedical Research, Inc. Natural igm antibodies and inhibitors thereof
US7166433B2 (en) * 2004-03-03 2007-01-23 The United States Of America As Represented By The Department Of Health And Human Services Transductin-1 and transductin-2 and applications to hereditary deafness
US20070254283A1 (en) * 2004-03-04 2007-11-01 Roy Deudutta Brhf1 as a Cancer Diagnostic Marker
US20080050315A1 (en) * 2004-03-24 2008-02-28 Stefan Golz Diagnostics and Therapeutics for Diseases Associated with Protein Kinase, Cgmp-Dependent, Type I (Prkg1)
WO2005098440A2 (en) * 2004-03-26 2005-10-20 Immusol, Inc. Methods for identifying agents useful for the treatment of neurodegenerative diseases
EP1733743A4 (en) 2004-04-09 2007-06-27 Takeda Pharmaceutical Preventives/remedies for cancer
WO2005108563A2 (en) * 2004-04-19 2005-11-17 University Of Chicago Peptidoglycan-hydrolyzing protein encoded by bacteriophage n4
BRPI0418766B8 (en) 2004-04-22 2021-05-25 Agensys Inc antibody or fragment thereof, vector, pharmaceutical composition, assay and method for detecting the presence of step-1 protein, as well as a method for delivering a cytotoxic agent or a diagnostic agent
CA2504920C (en) 2004-05-14 2014-04-01 Euroscreen S.A. Ligand for g-protein coupled receptor fprl2 and uses thereof
DE102004024617A1 (en) 2004-05-18 2005-12-29 Ganymed Pharmaceuticals Ag Differentially expressed in tumors gene products and their use
JP4649575B2 (en) * 2004-05-19 2011-03-09 財団法人ヒューマンサイエンス振興財団 Diagnosis of novel mucin genes and mucosal-related diseases
WO2005114200A1 (en) * 2004-05-21 2005-12-01 Bayer Healthcare Ag Diagnostics and therapeutics for diseases associated with g protein-coupled receptor kinase 7 (grk7)
DK2031063T3 (en) 2004-06-18 2014-01-27 Univ Duke Modulators of fragrance receptors
CA2559760A1 (en) * 2004-07-02 2006-07-06 Metanomics Gmbh Process for the production of fine chemicals
EP2287310B1 (en) * 2004-07-22 2015-05-06 Five Prime Therapeutics, Inc. Compositions and methods using MGD-CSF in disease treatment
WO2006022689A1 (en) * 2004-08-06 2006-03-02 Applera Corporation Method and compositions for treating diseases targeting cd51
US8435507B2 (en) 2004-08-19 2013-05-07 University Of Maryland Prostate-specific antigen-derived MHC class II restricted peptides and their use in vaccines to treat or prevent prostate cancer
JP2008512120A (en) * 2004-09-08 2008-04-24 ルードヴィッヒ インスティテュート フォー キャンサー リサーチ Cancer-testis antigen
EP2292753A1 (en) 2004-09-24 2011-03-09 Oncotherapy Science, Inc. Method for diagnosing non-small cell lung cancers by tRNA-dihydrouridine synthase activity of URLC8
US9944713B2 (en) * 2004-11-24 2018-04-17 Medicinal Bioconvergence Research Center Antibody specific to the AIMP2-DX2
CA2590180A1 (en) * 2004-12-10 2006-06-15 The Corporation Of The Trustees Of The Order Of The Sisters Of Mercy In Queensland Binding partners of antibodies specific for dendritic cell antigens
DE102004059781A1 (en) * 2004-12-10 2006-06-22 Sanofi-Aventis Deutschland Gmbh Use of serum / glucocorticoid-regulated kinase
PT1838733E (en) 2004-12-21 2011-12-13 Medimmune Ltd Antibodies directed to angiopoietin-2 and uses thereof
EP1851543A2 (en) 2005-02-24 2007-11-07 Compugen Ltd. Novel diagnostic markers, especially for in vivo imaging, and assays and methods of use thereof
AU2006230563B8 (en) 2005-03-31 2010-06-17 Agensys, Inc. Antibodies and related molecules that bind to 161P2F10B proteins
US8350009B2 (en) 2005-03-31 2013-01-08 Agensys, Inc. Antibodies and related molecules that bind to 161P2F10B proteins
WO2006111586A2 (en) * 2005-04-20 2006-10-26 Proyecto De Biomedicina Cima, S.L. Method for the in vitro determination of the degree of methylation of the line-1 promoter
AU2006248201A1 (en) * 2005-05-19 2006-11-23 Synergenz Bioscience Limited Methods for the assesssment of risk of developing lung cancer using analysis of genetic polymorphisms
CA2608158A1 (en) * 2005-05-19 2006-11-23 Synergenz Bioscience Limited Methods and compositions for assessment of pulmonary function and disorders
AU2006248189A1 (en) * 2005-05-20 2006-11-23 Synergenz Bioscience Limited Methods of analysis of polymorphisms and uses thereof
AU2006284716B2 (en) * 2005-09-01 2012-02-09 The Feinstein Institute For Medical Research Peptide mimics of melanocyte stimulating hormone
US7659252B2 (en) 2005-09-15 2010-02-09 Novomed Technologies, Inc. (Shanghai) Transdermal delivery peptides and method of use thereof
WO2007035474A2 (en) * 2005-09-15 2007-03-29 Novomed Technologies, Inc. (Shanghai) Transdermal delivery peptides and method of use thereof
WO2007053803A2 (en) * 2005-10-31 2007-05-10 Centocor, Inc. Antagonist of teb4 and methods of use
EP1790664A1 (en) 2005-11-24 2007-05-30 Ganymed Pharmaceuticals AG Monoclonal antibodies against claudin-18 for treatment of cancer
US20070186295A1 (en) * 2006-01-25 2007-08-09 Daniel Chelsky TAT-036 and methods of assessing and treating cancer
BRPI0706784A2 (en) * 2006-01-30 2011-04-05 Ludwig Inst Cancer Res testis cancer antigen ctsp
WO2007090194A2 (en) * 2006-02-01 2007-08-09 The Burnham Institute For Medical Research Lymphatic zip codes in tumors and pre-malignant lesions
CN107446033B (en) * 2006-02-22 2022-04-01 朱正崙 Treatment of developmental related disorders
WO2007100807A2 (en) 2006-02-24 2007-09-07 Schering Corporation Npc1l1 orthologues
WO2007104511A1 (en) * 2006-03-13 2007-09-20 Mubio Products Bv Cancer vaccine
US7811787B2 (en) * 2006-04-25 2010-10-12 Bristol-Myers Squibb Company Polynucleotides encoding human SLAP-2 variant, hSLAP-2v3
EP2502628B1 (en) * 2006-06-23 2016-12-14 Alethia Biotherapeutics Inc. Polynucleotides and polypeptide sequences involved in cancer
KR20090064378A (en) * 2006-08-10 2009-06-18 온코세라피 사이언스 가부시키가이샤 Genes and polypeptides relating breast cancers
EP2695896B1 (en) 2006-10-06 2018-08-22 The Government of the United States of America as represented by the Secretary of the Department of Health and Human Services Prevention of tissue ischemia, related methods and compositions
PT2502938E (en) 2006-10-27 2015-06-05 Genentech Inc Antibodies and immunoconjugates and uses therefor
EP2052085B1 (en) * 2006-12-21 2019-05-22 Cognosci, Inc. Methods for modulating set and uses thereof
US8309687B2 (en) * 2007-03-15 2012-11-13 Reverse Proteomics Research Institute Co., Ltd. Biomarker specific for cancer
WO2008121007A1 (en) * 2007-03-29 2008-10-09 Auckland Uniservices Limited Novel saratan polypeptides and polynucleotides and methods of use thereof
WO2008126517A1 (en) * 2007-03-30 2008-10-23 National University Corporation Okayama University Novel slc17-type transporter protein in mammal and use thereof
US20100249038A1 (en) * 2007-06-12 2010-09-30 Board Of Regents, University Of Texas System Antagonists of the receptor for advanced glycation end-products (rage)
FR2919616A1 (en) * 2007-08-02 2009-02-06 Ecole Norm Superieure Lyon INSULATORY POLYNUCLEOTIDES DERIVED FROM THE D4Z4 ELEMENT AND THEIR USES IN TRANSGENESIS
US8163874B2 (en) * 2007-08-06 2012-04-24 The United States Of America, As Represented By The Secretary Of The Navy Beta helical peptide structures stable in aqueous and non-aqueous media and methods for preparing same
CN103483424A (en) * 2007-08-09 2014-01-01 诺瓦提斯公司 Thiopeptide precursor protein, gene encoding it and uses thereof
EP3492488A1 (en) 2007-08-22 2019-06-05 The Regents of The University of California Activatable binding polypeptides and methods of identification and use thereof
CA2695559A1 (en) * 2007-08-24 2009-03-05 Ramot At Tel-Aviv University, Ltd. Neuroprotection using nap-like and sal-like peptide mimetics
DE102007041657A1 (en) * 2007-09-03 2009-03-05 Protagen Ag Marker sequences for multiple sclerosis and their use
US8450279B2 (en) * 2007-09-04 2013-05-28 Procell Therapeutics Inc. Cell permeable Nm23 recombinant proteins, polynucleotides encoding the same, and anti-metastatic composition comprising the same
EP2198306A1 (en) 2007-09-06 2010-06-23 Case Western Reserve University Methods for diagnosing and treating cancers
WO2009039188A1 (en) * 2007-09-17 2009-03-26 Ludwig Institute For Cancer Research Ltd Peptides and methods for the treatment of gliomas and other cancers
AU2008301888A1 (en) * 2007-09-20 2009-03-26 R. Bradley Jones Long interspersed nuclear element polypeptide compositions and methods of use thereof
GB0719367D0 (en) 2007-10-03 2007-11-14 Procarta Biosystems Ltd Transcription factor decoys, compositions and methods
JP2011521626A (en) * 2008-05-02 2011-07-28 エピセンター テクノロジーズ コーポレーション RNA polyphosphatase compositions, kits and uses thereof
US8324165B2 (en) 2008-05-14 2012-12-04 Five Prime Therapeutics, Inc. Compositions for treating arthritis comprising HG1023575 polypeptide
US9051574B2 (en) * 2008-06-03 2015-06-09 University Of Maryland, Baltimore Non-hemolytic ClyA for excretion of proteins
CN102066945A (en) * 2008-06-19 2011-05-18 科技研究局 Modulators of STAT3 signalling
EP2324048A2 (en) * 2008-07-30 2011-05-25 Cosmix Therapeutics Llc Peptide therapeutics that bind vegf and methods of use thereof
EP2172477A1 (en) * 2008-10-06 2010-04-07 Atlas Antibodies AB Epitopes derived from SATB2 and uses thereof
US8156130B2 (en) 2008-10-17 2012-04-10 Embarq Holdings Company Llc System and method for collapsing search results
US8874564B2 (en) * 2008-10-17 2014-10-28 Centurylink Intellectual Property Llc System and method for communicating search results to one or more other parties
US8326829B2 (en) * 2008-10-17 2012-12-04 Centurylink Intellectual Property Llc System and method for displaying publication dates for search results
WO2010048125A2 (en) * 2008-10-22 2010-04-29 Trustees Of Dartmouth College Compositions and methods for inhibiting the interaction between cftr and cal
US9421239B2 (en) 2008-10-22 2016-08-23 Trustees Of Dartmouth College Therapy and kit for the prevention and treatment of cystic fibrosis
US9333235B2 (en) 2008-10-22 2016-05-10 Trustees Of Dartmouth College Combination therapy and kit for the prevention and treatment of cystic fibrosis
US8999919B2 (en) 2008-10-22 2015-04-07 Trustees Of Dartmouth College Compositions and methods for inhibiting the interaction between CFTR and CAL
EP2342355A4 (en) * 2008-10-29 2012-06-27 Beaumont Hospital William Methods of using biomarkers
CA2739705A1 (en) * 2008-10-30 2010-05-06 Yissum Research Development Company Of The Hebrew University Of Jerusale M Ltd. Efficient expression of truncated human rnaset2 in e. coli
US9855291B2 (en) 2008-11-03 2018-01-02 Adc Therapeutics Sa Anti-kidney associated antigen 1 (KAAG1) antibodies
US20110286924A1 (en) * 2008-12-09 2011-11-24 Alethia Biotherapeutics Inc Novel human endogenous retroviral erv3 variant and uses thereof in the diagnosing ovarian cancer
BRPI1006141B8 (en) * 2009-01-12 2021-05-25 Cytomx Therapeutics Llc modified antibody compositions, methods of making and using the same
CA2753294A1 (en) 2009-02-23 2010-08-26 Cytomx Therapeutics, Inc. Proproteins and methods of use thereof
CN102333543B (en) * 2009-02-24 2014-05-14 国立大学法人宫崎大学 Cell adhesion inhibitor and use thereof
US8404242B2 (en) 2009-03-16 2013-03-26 Atyr Pharma, Inc. Compositions and methods comprising histidyl-tRNA synthetase splice variants having non-canonical biological activities
PL2408913T3 (en) 2009-03-18 2017-07-31 Oncotherapy Science, Inc. Neil3 peptides and vaccines including the same
WO2010120509A2 (en) 2009-03-31 2010-10-21 Atyr Pharma, Inc. Compositions and methods comprising aspartyl-trna synthetases having non-canonical biological activities
WO2010118324A2 (en) 2009-04-09 2010-10-14 Nuclea Biotechnologies, LLC Antibodies against fatty acid synthase
CA2767570C (en) 2009-07-13 2018-05-01 Universite De Montreal Mutated sumo isoforms and uses thereof
JP2012533560A (en) 2009-07-15 2012-12-27 ザ リージェンツ オブ ザ ユニバーシティ オブ カリフォルニア Peptides that can control uptake in cells
WO2011031517A1 (en) 2009-08-27 2011-03-17 Nuclea Biotechnologies, LLC Method and assay for determining fas expression
KR101854820B1 (en) * 2009-09-25 2018-05-04 샤이어 오펀 테라피즈 게엠베하 Novel NPR-B Agonists
WO2011038066A2 (en) * 2009-09-25 2011-03-31 Alcon Research, Ltd. Novel npr-b agonists
WO2011040417A1 (en) * 2009-09-29 2011-04-07 学校法人福岡大学 Inositol-1,4,5-trisphosphate receptor binding protein, and transgenic non-human mammal containing same
WO2011072266A2 (en) * 2009-12-11 2011-06-16 Atyr Pharma, Inc. Aminoacyl trna synthetases for modulating hematopoiesis
US8828395B2 (en) 2009-12-11 2014-09-09 Atyr Pharma, Inc. Antibodies that bind tyrosyl-tRNA synthetases
US9023802B2 (en) * 2009-12-14 2015-05-05 Immatics Biotechnologies Gmbh HLA-binding peptides derived from prostate-associated antigenic molecules and methods of use thereof
TW201121995A (en) 2009-12-18 2011-07-01 Alcon Res Ltd Novel NPR-B agonists
PL2530091T3 (en) 2010-01-29 2018-08-31 Chugai Seiyaku Kabushiki Kaisha Anti-dll3 antibody
AU2011213609B2 (en) 2010-02-08 2016-11-03 Agensys, Inc. Antibody drug conjugates (ADC) that bind to 161P2F10B proteins
GB201002413D0 (en) 2010-02-12 2010-03-31 Procarta Biosystems Ltd Nucleic acid complexes
JP5655254B2 (en) * 2010-02-16 2015-01-21 国立大学法人京都工芸繊維大学 Polycarbonate and / or polymethyl methacrylate affinity peptide and use thereof
GB201005545D0 (en) 2010-04-01 2010-05-19 Procarta Biosystems Ltd Transcription factor decoys
WO2011146942A1 (en) * 2010-05-21 2011-11-24 The Translational Genomics Research Institute Methods and kits to analyze microrna by nucleic acid sequencing
RU2012157597A (en) 2010-05-31 2014-07-20 Лондон Хелс Сайенсис Сентр Рисерч Инк. RHAMM-BINDING PEPTIDES
JP6116479B2 (en) 2010-07-12 2017-04-19 エータイアー ファーマ, インコーポレイテッド Innovative discovery of therapeutic, diagnostic and antibody compositions related to protein fragments of glycyl-tRNA synthetase
US10865445B2 (en) * 2010-08-18 2020-12-15 Fred Hutchinson Cancer Research Center Methods for alleviating facioscapulohumeral dystrophy (FSHD) by N siRNA molecule inhibiting the expression of DUX4-FL
CA2812644A1 (en) 2010-09-27 2012-04-05 Zestagen, S.A. Compositions and methods for treating neoplasia
TWI764324B (en) 2010-09-29 2022-05-11 美商艾澤西公司 Antibody drug conjugates (adc) that bind to 191p4d12 proteins
KR20130126671A (en) * 2010-12-14 2013-11-20 이매틱스 바이오테크놀로지스 게엠베하 Hla-binding peptides derived from prostate-associated antigenic molecules and methods of use thereof
WO2012112842A2 (en) 2011-02-17 2012-08-23 The Government Of The United States Of America, As Represented By The Secretary, Department Of Health & Human Services Compositions and methods for treating poliovirus
US10370417B2 (en) 2011-03-01 2019-08-06 Rutgers, The State University Of New Jersey Methods for the diagnosis of cancer through detection of human endogenous retrovirus WL (HERV-WL) envelope expression
WO2012118974A2 (en) * 2011-03-01 2012-09-07 University Of Medicine And Dentistry Of New Jersey Detection of human endogenous retrovirus expression in cancer and normal cells
AU2012234754B2 (en) 2011-03-31 2016-09-29 Adc Therapeutics Sa Antibodies against kidney associated antigen 1 and antigen binding fragments thereof
EP2511295A1 (en) * 2011-04-15 2012-10-17 Institut National De La Sante Et De La Recherche Medicale Compositions for preventing and/or treating an infection by an HIV-1 virus
US10654916B2 (en) 2011-04-21 2020-05-19 The Regents Of The University Of California, A California Corporation Compositions and methods for the treatment of neuromyelitis optica
CA2841249C (en) 2011-07-29 2023-09-26 Avelas Biosciences, Inc. Selective delivery molecules and methods of use
US9714419B2 (en) 2011-08-09 2017-07-25 Atyr Pharma, Inc. PEGylated tyrosyl-tRNA synthetase polypeptides
SG10201601840RA (en) * 2011-09-09 2016-04-28 Agency Science Tech & Res P53 activating peptides
US20130084582A1 (en) 2011-10-04 2013-04-04 Err-Cheng Chan Serological markers for detecting colorectal cancer and their application for inhibiting colorectal cancer cells
JP2014532408A (en) * 2011-10-26 2014-12-08 ニュークレア バイオテクノロジーズ インコーポレイテッドNuclea Biotechnologies,Inc. USP2a peptide and antibody
WO2013086216A1 (en) 2011-12-06 2013-06-13 Atyr Pharma, Inc. Improved aspartyl-trna synthetases
US9822353B2 (en) 2011-12-06 2017-11-21 Atyr Pharma, Inc. PEGylated aspartyl-tRNA synthetase polypeptides
AU2012368189B2 (en) 2011-12-29 2017-08-31 Atyr Pharma, Inc. Aspartyl-tRNA synthetase-Fc conjugates
JP6282597B2 (en) 2012-01-09 2018-02-21 アー・デー・ツェー・セラピューティクス・エス・アー How to treat breast cancer
WO2013167153A1 (en) 2012-05-09 2013-11-14 Ganymed Pharmaceuticals Ag Antibodies useful in cancer diagnosis
WO2013174404A1 (en) 2012-05-23 2013-11-28 Ganymed Pharmaceuticals Ag Combination therapy involving antibodies against claudin 18.2 for treatment of cancer
EP3971571A3 (en) 2012-05-25 2022-06-08 The University of Vermont and State Agriculture College Compositions and methods for assaying platelet reactivity and treatment selection
EP2861622A4 (en) 2012-06-15 2016-06-01 Sinomab Bioscience Ltd Anti-cd22 anti-idiotypic antibodies and uses thereof
US9957557B2 (en) * 2012-08-13 2018-05-01 Life Genetics Lab, Llc Development of a highly sensitive quantification system for assessing DNA degradation and quality in forensic samples
KR20240033145A (en) 2012-11-13 2024-03-12 비온테크 에스이 Agents for treatment of claudin expressing cancer diseases
WO2014120837A2 (en) 2013-01-29 2014-08-07 The Regents Of The University Of California Pretargeted activatable cell penetrating peptide with intracellulary releaseable prodrug
CN105102484B (en) 2013-01-30 2020-03-10 艾维拉斯生物科学公司 Selective delivery molecules and methods of use
WO2014127785A1 (en) 2013-02-20 2014-08-28 Ganymed Pharmaceuticals Ag Combination therapy involving antibodies against claudin 18.2 for treatment of cancer
KR20210108497A (en) 2013-02-26 2021-09-02 메모리얼 슬로안 케터링 캔서 센터 Compositions and methods for immunotherapy
US9409977B2 (en) 2013-03-12 2016-08-09 Decimmune Therapeutics, Inc. Humanized, anti-N2 antibodies
US10295547B2 (en) 2013-03-14 2019-05-21 University Of Florida Research Foundation, Incorporated Use and treatment of di-amino acid repeat-containing proteins associated with ALS
IL300508A (en) 2013-03-15 2023-04-01 Memorial Sloan Kettering Cancer Center Compositions and methods for immunotherapy
WO2014146672A1 (en) 2013-03-18 2014-09-25 Ganymed Pharmaceuticals Ag Therapy involving antibodies against claudin 18.2 for treatment of cancer
EP2981607B1 (en) 2013-04-03 2020-08-19 Memorial Sloan Kettering Cancer Center Effective generation of tumor-targeted t-cells derived from pluripotent stem cells
EP2999710B1 (en) * 2013-05-20 2019-10-02 Nanyang Technological University Peptides binding to parallel-stranded g-quadruplexes
KR101841211B1 (en) * 2014-03-10 2018-03-22 한양대학교 산학협력단 Cell penetrating peptide and method for delivery of biologically active materials using it
WO2015176010A1 (en) 2014-05-15 2015-11-19 The United States Of America, As Represented By The Secretary, Departmentof Health & Human Services Treatment or prevention of an intestinal disease or disorder
WO2015188141A2 (en) 2014-06-06 2015-12-10 Memorial Sloan-Kettering Cancer Ceneter Mesothelin-targeted chimeric antigen receptors and uses thereof
US10653747B2 (en) 2014-07-31 2020-05-19 Uab Research Foundation ApoE mimetic peptides and higher potency to clear plasma cholesterol
WO2016022597A1 (en) * 2014-08-04 2016-02-11 Case Western Reserve University Targeting peptides and methods of use
US10925980B2 (en) 2014-08-04 2021-02-23 Case Western Reserve University Molecular probes and methods of use
CN107002046B (en) 2014-08-25 2021-05-11 经纬生物科技有限公司 Non-replicating transduction particles and transduction particle-based reporter systems
US10385380B2 (en) 2014-10-02 2019-08-20 The Regents Of The University Of California Personalized protease assay to measure protease activity in neoplasms
KR101693431B1 (en) 2014-11-18 2017-01-06 울산대학교 산학협력단 Targeting Peptides for Gastric Cancer and Medical Use Thereof
CA2969870A1 (en) 2014-12-05 2016-06-09 Memorial Sloan-Kettering Cancer Center Chimeric antigen receptors targeting b-cell maturation antigen and uses thereof
MY195115A (en) 2014-12-05 2023-01-10 Memorial Sloan Kettering Cancer Center Antibodies targeting g-protein coupled receptor and methods of use
SI3226897T1 (en) 2014-12-05 2021-08-31 Memorial Sloan Kettering Cancer Center Antibodies targeting b-cell maturation antigen and methods of use
JP7174522B2 (en) 2014-12-05 2022-11-17 メモリアル スローン ケタリング キャンサー センター Chimeric antigen receptor targeting Fc receptor-like 5 and uses thereof
HRP20211978T1 (en) 2014-12-05 2022-04-01 Memorial Sloan-Kettering Cancer Center Chimeric antigen receptors targeting g-protein coupled receptor and uses thereof
US10539572B2 (en) 2015-01-30 2020-01-21 Salk Institute For Biological Studies Compositions and methods for treating age-related diabetes and related disorders
US11497767B2 (en) 2015-02-18 2022-11-15 Enlivex Therapeutics R&D Ltd Combination immune therapy and cytokine control therapy for cancer treatment
US11318163B2 (en) 2015-02-18 2022-05-03 Enlivex Therapeutics Ltd Combination immune therapy and cytokine control therapy for cancer treatment
US11596652B2 (en) 2015-02-18 2023-03-07 Enlivex Therapeutics R&D Ltd Early apoptotic cells for use in treating sepsis
US11000548B2 (en) 2015-02-18 2021-05-11 Enlivex Therapeutics Ltd Combination immune therapy and cytokine control therapy for cancer treatment
US11304976B2 (en) 2015-02-18 2022-04-19 Enlivex Therapeutics Ltd Combination immune therapy and cytokine control therapy for cancer treatment
US11512289B2 (en) 2015-02-18 2022-11-29 Enlivex Therapeutics Rdo Ltd Combination immune therapy and cytokine control therapy for cancer treatment
EP3285877B1 (en) 2015-04-21 2022-10-19 Enlivex Therapeutics Rdo Ltd Therapeutic pooled blood apoptotic cell preparations and uses thereof
US10596259B2 (en) 2015-05-20 2020-03-24 The Regents Of The University Of California Tumor radiosensitization with monomethyl auristatin E (MMAE) and derivatives thereof
WO2016191263A1 (en) 2015-05-22 2016-12-01 Trustees Of Dartmouth College Therapy and kit for the prevention and treatment of cystic fibrosis
TW201702272A (en) 2015-05-22 2017-01-16 美國紀念斯隆 凱特琳癌症中心 T cell receptor-like antibodies specific for a PRAME peptide
WO2016196324A1 (en) 2015-05-29 2016-12-08 University Of Floridia Research Foundation, Inc. Methods for diagnosing huntington's disease
GB201511546D0 (en) 2015-07-01 2015-08-12 Immatics Biotechnologies Gmbh Novel peptides and combination of peptides for use in immunotherapy against ovarian cancer and other cancers
MA54884A (en) 2015-07-01 2022-01-12 Immatics Biotechnologies Gmbh NEUARTIGE PEPTIDE UND KOMBINATION AUS PEPTIDEN ZUR VERWENDUNG IN DER IMMUNTHERAPIE GEGEN OVARIALKARZINOM UND ANDERE KARZINOME
EP3328994A4 (en) 2015-07-31 2019-04-17 Memorial Sloan-Kettering Cancer Center Antigen-binding proteins targeting cd56 and uses thereof
CN115925806A (en) 2015-08-28 2023-04-07 伊玛提克斯生物技术有限公司 Novel peptides, peptide compositions and scaffolds for immunotherapy of various cancers
GB201515321D0 (en) 2015-08-28 2015-10-14 Immatics Biotechnologies Gmbh Novel peptides, combination of peptides and scaffolds for use in immunotherapeutic treatment of various cancers
EP3341482B1 (en) * 2015-08-28 2022-07-06 CureVac AG Artificial nucleic acid molecules
WO2017040526A2 (en) * 2015-09-01 2017-03-09 Eisai R&D Management Co., Ltd. Splice variants associated with neomorphic sf3b1 mutants
CA2997551A1 (en) 2015-09-04 2017-03-09 Memorial Sloan Kettering Cancer Center Immune cell compositions and methods of use
EP3336098B1 (en) * 2015-09-10 2021-08-11 IUCF-HYU (Industry-University Cooperation Foundation Hanyang University) Skin permeable peptide and method for using same
AU2016362402A1 (en) 2015-12-04 2018-06-28 Eureka Therapeutics, Inc. Antibodies targeting Fc receptor-like 5 and methods of use
WO2017100428A1 (en) 2015-12-09 2017-06-15 Memorial Sloan Kettering Cancer Center Immune cell compositions and methods of using same
JP7022067B2 (en) 2016-01-14 2022-02-17 メモリアル スローン ケタリング キャンサー センター T-cell receptor-like antibody specific for FOXP3-derived peptide
CA3014885A1 (en) 2016-02-18 2017-08-24 Enlivex Therapeutics Ltd. Combination immune therapy and cytokine control therapy for cancer treatment
GB201604490D0 (en) 2016-03-16 2016-04-27 Immatics Biotechnologies Gmbh Peptides combination of peptides for use in immunotherapy against cancers
US10940161B2 (en) 2016-04-04 2021-03-09 University Of Florida Research Foundation, Incorporated Manipulation of EIF3 to modulate repeat associated non-ATG (RAN) translation
PT3443075T (en) 2016-04-15 2022-12-16 Memorial Sloan Kettering Cancer Center Transgenic t cell and chimeric antigen receptor t cell compositions and related methods
WO2017218949A2 (en) * 2016-06-17 2017-12-21 Aileron Therapeutics, Inc. Peptidomimetic macrocycles and uses thereof
US20180052169A1 (en) * 2016-08-22 2018-02-22 The Board Of Regents Of The University Of Texas System Tsp1 as a marker for cancer
CA3034691A1 (en) 2016-08-30 2018-03-08 Memorial Sloan Kettering Cancer Center Immune cell compositions and methods of use for treating viral and other infections
AU2017342555A1 (en) 2016-10-14 2019-05-30 Children's Medical Center Corporation Compositions and methods for treating diseases and disorders of the central nervous system
CA3042706A1 (en) 2016-11-09 2018-05-17 Intrexon Corporation Frataxin expression constructs
JP2020502056A (en) * 2016-11-13 2020-01-23 イマジン ファーマ Compositions and methods for treating diabetes, hypertension and hypercholesterolemia
GB201621737D0 (en) * 2016-12-20 2017-02-01 Aberystwyth Univ Compositions and methods of treatment
US11566060B2 (en) 2017-01-05 2023-01-31 Kahr Medical Ltd. PD1-CD70 fusion protein and methods of use thereof
US11299530B2 (en) 2017-01-05 2022-04-12 Kahr Medical Ltd. SIRP alpha-CD70 fusion protein and methods of use thereof
HRP20220230T1 (en) 2017-01-05 2022-04-29 Kahr Medical Ltd. A sirp1 alpha-41bbl fusion protein and methods of use thereof
DK3565579T3 (en) 2017-01-05 2023-09-04 Kahr Medical Ltd PD1-41BBL FUSION PROTEIN AND METHODS OF USING THEREOF
AU2018210853B2 (en) 2017-01-17 2023-09-28 Children's Medical Center Corporation Compositions and methods for treating lysosomal storage diseases and disorders
EP3576781B9 (en) 2017-01-18 2024-03-06 Icahn School of Medicine at Mount Sinai Neoantigens and uses thereof for treating cancer
US11260101B2 (en) * 2017-02-28 2022-03-01 Jinan University Repair peptide for use in promoting post-traumatic tissue repair and regeneration, and application thereof
AU2018231190B2 (en) 2017-03-08 2023-05-25 Memorial Sloan Kettering Cancer Center Immune cell compositions and methods of use
ES2955852T3 (en) 2017-04-03 2023-12-07 Hoffmann La Roche STEAP-1 binding antibodies
JP2020516674A (en) 2017-04-17 2020-06-11 ユニバーシティー オブ フロリダ リサーチ ファンデーション, インク. Regulation of RAN protein translation by the PKR and eIF2A-P pathways
EP3612215A4 (en) 2017-04-20 2021-05-26 aTyr Pharma, Inc. Compositions and methods for treating lung inflammation
SG11201910101SA (en) 2017-05-08 2019-11-28 Gritstone Oncology Inc Alphavirus neoantigen vectors
CA3063464A1 (en) * 2017-05-19 2018-11-22 Encoded Therapeutics, Inc. High activity regulatory elements
WO2019067587A1 (en) 2017-09-26 2019-04-04 University Of Florida Research Foundation, Incorporated Use of metformin and analogs thereof to reduce ran protein levels in the treatment of neurological disorders
WO2019067592A1 (en) 2017-09-27 2019-04-04 University Of Georgia Research Foundation Treatment and detection of infection and disease associated with different fungal pathogens
AU2018367452A1 (en) 2017-11-14 2020-06-11 Memorial Sloan-Kettering Cancer Center IL-36 secreting immunoresponsive cells and uses thereof
US11008602B2 (en) 2017-12-20 2021-05-18 Roche Molecular Systems, Inc. Non-replicative transduction particles and transduction particle-based reporter systems
TWI817974B (en) 2017-12-28 2023-10-11 日商中外製藥股份有限公司 Cytotoxicity-inducing therapeutic agent
JP7416695B2 (en) 2017-12-29 2024-01-17 メモリアル スローン-ケタリング キャンサー センター Enhanced chimeric antigen receptors and their uses
BR112020013679A2 (en) 2018-01-04 2020-12-01 Iconic Therapeutics, Inc. antitecid factor antibodies, antibody-drug conjugates and related methods
CA3081125A1 (en) 2018-02-01 2019-08-08 Innovent Biologics (Suzhou) Co., Ltd. Fully humanized anti-b cell maturation antigen (bcma) single-chain antibody and use thereof
KR20200120673A (en) 2018-02-11 2020-10-21 메모리얼 슬로안 케터링 캔서 센터 Non-HLA restricted T cell receptor and uses thereof
CA3091143A1 (en) 2018-02-15 2019-08-22 Memorial Sloan-Kettering Cancer Center Foxp3 targeting agent compositions and methods of use for adoptive cell therapy
SG11202013167UA (en) 2018-07-11 2021-01-28 Kahr Medical Ltd SIRPalpha-4-1BBL VARIANT FUSION PROTEIN AND METHODS OF USE THEREOF
EP3880714A4 (en) 2018-11-16 2022-07-20 Memorial Sloan Kettering Cancer Center Antibodies to mucin-16 and methods of use thereof
EP3947471A4 (en) 2019-03-27 2023-03-15 The Trustees of The University of Pennsylvania Tn-muc1 chimeric antigen receptor (car) t cell therapy
CN110256574B (en) * 2019-05-21 2020-12-04 山东大学 Fusion polypeptide and application thereof in preparing medicament for resisting depression and neurodegenerative diseases
KR20220016137A (en) 2019-05-30 2022-02-08 그릿스톤 바이오, 인코포레이티드 modified adenovirus
EP4041745A4 (en) * 2019-10-08 2024-02-07 Univ North Carolina State Immunoglobulin purification peptides and their use
BR112022007952A2 (en) 2019-10-28 2022-10-04 Noile Immune Biotech Inc DRUG TO TREAT CANCER, COMBINATION DRUG, DRUG COMPOSITION, IMMUNE RESPONSIVE CELL, NUCLEIC ACID DELIVERY VEHICLE AND PRODUCT
CA3160468A1 (en) * 2019-11-05 2021-05-14 Board Of Regents, The University Of Texas System Hla restricted hormad1 t cell receptors and uses thereof
WO2021158818A1 (en) * 2020-02-05 2021-08-12 Synthetic Genomics, Inc. Regulatory sequences for expression of transgenes
CA3172447A1 (en) 2020-03-27 2021-09-30 Erik Hans MANTING In vivo use of modified cells of leukemic origin for enhancing the efficacy of adoptive cell therapy
CN115916963A (en) 2020-03-27 2023-04-04 门德斯有限公司 Ex vivo use of modified cells of leukemia origin for enhancing the efficacy of adoptive cell therapy
CN115397866A (en) 2020-03-31 2022-11-25 中外制药株式会社 DLL 3-targeted multispecific antigen-binding molecules and uses thereof
AU2021300112A1 (en) 2020-06-29 2023-02-16 Cell Medica, Inc. Methods and compositions for the reduction of chimeric antigen receptor tonic signaling
KR20230046313A (en) 2020-08-06 2023-04-05 그릿스톤 바이오, 인코포레이티드 Multi-epitope vaccine cassette
WO2022035793A1 (en) 2020-08-10 2022-02-17 Precision Biosciences, Inc. Antibodies and fragments specific for b-cell maturation antigen and uses thereof
EP4222267A1 (en) 2020-09-30 2023-08-09 Vor Biopharma Inc. Chimeric antigen receptor expression systems
JP2023547520A (en) 2020-11-05 2023-11-10 メンドゥス・ベスローテン・フェンノートシャップ Use of tumor-independent antigens in immunotherapy
WO2022104090A1 (en) 2020-11-13 2022-05-19 Vor Biopharma Inc. Methods and compositions relating to genetically engineered cells expressing chimeric antigen receptors
WO2022189626A2 (en) 2021-03-11 2022-09-15 Mnemo Therapeutics Tumor neoantigenic peptides
JP2024511941A (en) * 2021-03-11 2024-03-18 テンザ、インク. Synthetic signal peptides to direct secretion of heterologous proteins in yeast
EP4304635A1 (en) 2021-03-11 2024-01-17 Mnemo Therapeutics Tumor neoantigenic peptides and uses thereof
KR20240006721A (en) 2021-03-11 2024-01-15 엥스띠뛰 퀴리 Membrane-transforming neoantigen peptide
WO2022221420A2 (en) * 2021-04-13 2022-10-20 Capsida, Inc. Selected aav compositions having preferred brain enrichment
JP2024514933A (en) 2021-04-23 2024-04-03 ベイラー カレッジ オブ メディスン CAR NKT expressing artificial microRNA-embedded SHRNA for down-regulation of MHC class I and II expression
WO2022235832A1 (en) 2021-05-06 2022-11-10 Ludwig Institute For Cancer Research Ltd Compositions and methods for immunotherapy
WO2022238386A1 (en) 2021-05-10 2022-11-17 Institut Curie Methods for the treatment of cancer, inflammatory diseases and autoimmune diseases
EP4346912A1 (en) 2021-05-25 2024-04-10 Institut Curie Myeloid cells overexpressing bcl2
WO2023288203A2 (en) 2021-07-12 2023-01-19 Ludwig Institute For Cancer Research Ltd T cell receptors specific for tumor-associated antigens and methods of use thereof
WO2023022788A1 (en) * 2021-08-18 2023-02-23 The Regents Of The University Of California Methods and compositions for determining risk of autism spectrum disorders
WO2023064872A1 (en) 2021-10-14 2023-04-20 Precision Biosciences, Inc. Combinations of anti-bcma car t cells and gamma secretase inhibitors
WO2023192626A1 (en) * 2022-04-01 2023-10-05 The Regents Of The University Of California Methods and compositions for treating gliomas
US20240002800A1 (en) 2022-05-16 2024-01-04 Mendus B.V. Use of leukemia-derived cells for enhancing natural killer (nk) cell therapy

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6287801B1 (en) * 1996-07-22 2001-09-11 Smithkline Beecham Corporation Nucleic acids encoding the G-protein coupled receptor HNFDS78
US6555339B1 (en) * 1997-04-14 2003-04-29 Arena Pharmaceuticals, Inc. Non-endogenous, constitutively activated human protein-coupled receptors
US20030235883A1 (en) * 2000-03-31 2003-12-25 Tang Y. Tom Novel nucleic acids and polypeptides
US6436703B1 (en) * 2000-03-31 2002-08-20 Hyseq, Inc. Nucleic acids and polypeptides

Cited By (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9422363B2 (en) 2007-08-28 2016-08-23 Uab Research Foundation Synthetic apolipoprotein E mimicking polypeptides and methods of use
US9453214B2 (en) 2009-02-27 2016-09-27 Atyr Pharma, Inc. Polypeptide structural motifs associated with cell signaling activity
US9540628B2 (en) 2009-12-11 2017-01-10 Atyr Pharma, Inc. Aminoacyl tRNA synthetases for modulating inflammation
US9328340B2 (en) 2009-12-11 2016-05-03 Atyr Pharma, Inc. Amino acyl tRNA synthetases for modulating inflammation
US9127268B2 (en) 2009-12-11 2015-09-08 Atyr Pharma, Inc. Aminoacyl tRNA synthetases for modulating inflammation
US8980253B2 (en) 2010-04-26 2015-03-17 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of cysteinyl-tRNA synthetase
US9540629B2 (en) 2010-04-26 2017-01-10 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Cysteinyl-tRNA synthetase
US8986681B2 (en) 2010-04-27 2015-03-24 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of threonyl-tRNA synthetases
US8961960B2 (en) 2010-04-27 2015-02-24 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of isoleucyl tRNA synthetases
US9528103B2 (en) 2010-04-27 2016-12-27 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of isoleucyl tRNA synthetases
US9580706B2 (en) 2010-04-27 2017-02-28 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of threonyl-tRNA synthetases
US8993723B2 (en) 2010-04-28 2015-03-31 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of alanyl-tRNA synthetases
US9320782B2 (en) 2010-04-28 2016-04-26 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of alanyl tRNA synthetases
US9556425B2 (en) 2010-04-29 2017-01-31 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Valyl-tRNA synthetases
US8986680B2 (en) 2010-04-29 2015-03-24 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Asparaginyl tRNA synthetases
US9034320B2 (en) 2010-04-29 2015-05-19 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Valyl-tRNA synthetases
US8981045B2 (en) 2010-05-03 2015-03-17 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of methionyl-tRNA synthetases
US9034321B2 (en) 2010-05-03 2015-05-19 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of phenylalanyl-alpha-tRNA synthetases
US8961961B2 (en) 2010-05-03 2015-02-24 a Tyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related protein fragments of arginyl-tRNA synthetases
US9593323B2 (en) 2010-05-03 2017-03-14 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of phenylalanyl-alpha-tRNA synthetases
US8946157B2 (en) 2010-05-03 2015-02-03 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of seryl-tRNA synthetases
US9422538B2 (en) 2010-05-03 2016-08-23 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of methionyl-tRNA synthetasis
US9340780B2 (en) 2010-05-03 2016-05-17 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of seryl-tRNA synthetases
US9593322B2 (en) 2010-05-03 2017-03-14 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of arginyl-trna synthetases
US9574187B2 (en) 2010-05-04 2017-02-21 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of glutamyl-prolyl-tRNA synthetases
US9404104B2 (en) 2010-05-04 2016-08-02 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of P38 multi-tRNA synthetase complex
US8945541B2 (en) 2010-05-14 2015-02-03 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of phenylalanyl-beta-tRNA synthetases
US9034598B2 (en) 2010-05-17 2015-05-19 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of leucyl-tRNA synthetases
US9347053B2 (en) 2010-05-27 2016-05-24 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of glutaminyl-tRNA synthetases
US8962560B2 (en) 2010-06-01 2015-02-24 Atyr Pharma Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Lysyl-tRNA synthetases
US9322009B2 (en) 2010-06-01 2016-04-26 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Lysyl-tRNA synthetases
US8969301B2 (en) 2010-07-12 2015-03-03 Atyr Pharma Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of aspartyl-tRNA synthetases
US9422539B2 (en) 2010-07-12 2016-08-23 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of histidyl-tRNA synthetases
US9315794B2 (en) 2010-07-12 2016-04-19 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of aspartyl-tRNA synthetases
US8999321B2 (en) 2010-07-12 2015-04-07 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of glycyl-tRNA synthetases
US9428743B2 (en) 2010-08-25 2016-08-30 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of tyrosyl-trna synthetases
US9399770B2 (en) 2010-10-06 2016-07-26 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of tryptophanyl-tRNA synthetases
US9273302B2 (en) 2012-02-16 2016-03-01 Atyr Pharma, Inc. Histidyl-tRNA synthetases for treating autoimmune and inflammatory diseases
US8835387B2 (en) 2012-02-16 2014-09-16 Atyr Pharma, Inc. Histidyl-tRNA synthetases for treating autoimmune and inflammatory diseases
US9587235B2 (en) 2013-03-15 2017-03-07 Atyr Pharma, Inc. Histidyl-tRNA synthetase-Fc conjugates
JP7453921B2 (en) 2018-03-02 2024-03-21 ディセルナ ファーマシューティカルズ インコーポレイテッド Compositions and methods for inhibiting GYS2 expression

Also Published As

Publication number Publication date
US6610536B2 (en) 2003-08-26
US20020146757A1 (en) 2002-10-10
US20030096279A1 (en) 2003-05-22
US20020061567A1 (en) 2002-05-23
WO2001075067A2 (en) 2001-10-11
WO2001075067A8 (en) 2004-10-14
US6436703B1 (en) 2002-08-20
WO2001075067A3 (en) 2002-04-04
AU2001249251A1 (en) 2001-10-15
US20030092112A1 (en) 2003-05-15
US20030087370A1 (en) 2003-05-08
US20050196754A1 (en) 2005-09-08

Similar Documents

Publication Publication Date Title
US20050196754A1 (en) Novel nucleic acids and polypeptides
US20070042392A1 (en) Novel nucleic acids and polypeptides
WO2001064835A2 (en) Novel nucleic acids and polypeptides
WO2001053455A2 (en) Novel nucleic acids and polypeptides
WO2001053312A1 (en) Novel nucleic acids and polypeptides
WO2001057188A2 (en) Novel nucleic acids and polypeptides
EP1261743A2 (en) Novel nucleic acids and polypeptides
EP1285084A1 (en) Novel nucleic acids and polypeptides
WO2002022660A2 (en) Novel nucleic acids and polypeptides
WO2001055437A2 (en) Novel nucleic acids and polypeptides
EP1276751A2 (en) Novel nucleic acids and polypeptides
WO2001079449A2 (en) Novel nucleic acids and polypeptides
US20070060743A1 (en) Novel nucleic acids and polypeptides
WO2002016439A2 (en) Novel nucleic acids and polypeptides
WO2002044340A2 (en) Novel nucleic acids and polypeptides
EP1574520A2 (en) Methods and materials relating to neuronal guidance molecule-like (NGM-like) polypeptides and polynucleotides
WO2001053485A1 (en) Methods and materials relating to novel von willebrand/thrombospondin-like polypeptides and polynucleotides
WO2001053453A2 (en) Novel bone marrow nucleic acids and polypeptides
US20020137044A1 (en) Novel nucleic acids and polypeptides
WO2001087917A1 (en) Novel nucleic acids and polypeptides
US20030228584A1 (en) Novel nucleic acids and polypeptides
EP1268762A1 (en) Novel nucleic acids and polypeptides
US20040034208A1 (en) Novel nucleic acids and polypeptides
WO2001055337A9 (en) Methods and materials relating to carcinoembryonic antigen-like (cea-like) polypeptides and polynucleotides
AU2001236660A1 (en) Methods and materials relating to leucine-rich repeat protein-like (lrr protein-like) polypeptides and polynucleotides

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

AK Designated states

Kind code of ref document: C2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: C2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

COP Corrected version of pamphlet

Free format text: PAGES 99-3870, DESCRIPTION, ADDED; PAGES 99-102, CLAIMS, RENUMBERED AS 3871-3874

122 Ep: pct application non-entry in european phase
WWE Wipo information: entry into national phase

Ref document number: 10450763

Country of ref document: US

CFP Corrected version of a pamphlet front page
CR1 Correction of entry in section i

Free format text: IN PCT GAZETTE 41/2001 UNDER "PUBLISHED" ADD "SEQUENCE LISTING PART OF DESCRIPTION PUBLISHED SEPARATELY IN ELECTRONIC FORM AND AVAILABLE UPON REQUEST FROM THE INTERNATIONAL BUREAU"

NENP Non-entry into the national phase

Ref country code: JP