US20080108798A1 - Selection of fibronectin scaffolds using nucleic acid-protein fusions - Google Patents

Selection of fibronectin scaffolds using nucleic acid-protein fusions Download PDF

Info

Publication number
US20080108798A1
US20080108798A1 US11/890,627 US89062707A US2008108798A1 US 20080108798 A1 US20080108798 A1 US 20080108798A1 US 89062707 A US89062707 A US 89062707A US 2008108798 A1 US2008108798 A1 US 2008108798A1
Authority
US
United States
Prior art keywords
thr
protein
domain
pro
ser
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/890,627
Inventor
Dasa Lipovsek
Richard Wagner
Robert Kuimelis
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Bristol Myers Squibb Co
Original Assignee
Adnexus a Bristol Myers Squibb R&D Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/515,260 external-priority patent/US6818418B1/en
Application filed by Adnexus a Bristol Myers Squibb R&D Co filed Critical Adnexus a Bristol Myers Squibb R&D Co
Priority to US11/890,627 priority Critical patent/US20080108798A1/en
Assigned to ADNEXUS THERAPEUTICS, INC. reassignment ADNEXUS THERAPEUTICS, INC. CHANGE OF NAME (SEE DOCUMENT FOR DETAILS). Assignors: COMPOUND THERAPEUTICS INC.
Assigned to COMPOUND THERAPEUTICS, INC. reassignment COMPOUND THERAPEUTICS, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: PHYLOS, INC.
Assigned to PHYLOS, INC reassignment PHYLOS, INC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: KUIMELIS, ROBERT G., LIPOVSEK, DASA, WAGNER, RICHARD W.
Publication of US20080108798A1 publication Critical patent/US20080108798A1/en
Assigned to ADNEXUS, A BRISTOL-MYERS SQUIBB R&D COMPANY reassignment ADNEXUS, A BRISTOL-MYERS SQUIBB R&D COMPANY CHANGE OF NAME (SEE DOCUMENT FOR DETAILS). Assignors: ADNEXUS, A BMS R&D COMPANY
Assigned to ADNEXUS, A BMS R&D COMPANY reassignment ADNEXUS, A BMS R&D COMPANY MERGER (SEE DOCUMENT FOR DETAILS). Assignors: ADNEXUS THERAPEUTICS, INC.
Assigned to BRISTOL-MYERS SQUIBB COMPANY reassignment BRISTOL-MYERS SQUIBB COMPANY ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: ADNEXUS, A BRISTOL-MYERS SQUIBB R&D COMPANY
Abandoned legal-status Critical Current

Links

Images

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/46Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates
    • C07K14/47Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from vertebrates from mammals
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/52Cytokines; Lymphokines; Interferons
    • C07K14/525Tumour necrosis factor [TNF]
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/78Connective tissue peptides, e.g. collagen, elastin, laminin, fibronectin, vitronectin, cold insoluble globulin [CIG]
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K16/00Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K16/00Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies
    • C07K16/18Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans
    • C07K16/24Immunoglobulins [IGs], e.g. monoclonal or polyclonal antibodies against material from animals or humans against cytokines, lymphokines or interferons
    • C07K16/241Tumor Necrosis Factors
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1037Screening libraries presented on the surface of microorganisms, e.g. phage display, E. coli display
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • CCHEMISTRY; METALLURGY
    • C40COMBINATORIAL TECHNOLOGY
    • C40BCOMBINATORIAL CHEMISTRY; LIBRARIES, e.g. CHEMICAL LIBRARIES
    • C40B40/00Libraries per se, e.g. arrays, mixtures
    • C40B40/02Libraries contained in or displayed by microorganisms, e.g. bacteria or animal cells; Libraries contained in or displayed by vectors, e.g. plasmids; Libraries containing only microorganisms or vectors
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2317/00Immunoglobulins specific features
    • C07K2317/20Immunoglobulins specific features characterized by taxonomic origin
    • C07K2317/22Immunoglobulins specific features characterized by taxonomic origin from camelids, e.g. camel, llama or dromedary
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2318/00Antibody mimetics or scaffolds
    • C07K2318/20Antigen-binding scaffold molecules wherein the scaffold is not an immunoglobulin variable region or antibody mimetics
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K2319/00Fusion polypeptide
    • C07K2319/30Non-immunoglobulin-derived peptide or protein having an immunoglobulin constant or Fc region, or a fragment thereof, attached thereto

Definitions

  • This invention relates to protein scaffolds useful, for example, for the generation of products having novel binding characteristics.
  • Proteins having relatively defined three-dimensional structures may be used as reagents for the design of engineered products. These scaffolds typically contain one or more regions which are amenable to specific or random sequence variation, and such sequence randomization is often carried out to produce libraries of proteins from which desired products may be selected.
  • One particular area in which such scaffolds are useful is the field of antibody design.
  • a “minibody” scaffold which is related to the immunoglobulin fold, has been designed by deleting three beta strands from a heavy chain variable domain of a monoclonal antibody (Tramontano et al., J. Mol. Recognit. 7:9, 1994).
  • This protein includes 61 residues and can be used to present two hypervariable loops. These two loops have been randomized and products selected for antigen binding, but thus far the framework appears to have somewhat limited utility due to solubility problems.
  • the present invention provides a new family of proteins capable of evolving to bind any compound of interest.
  • These proteins which generally make use of a scaffold derived from a fibronectin type III (Fn3) or Fn3-like domain, function in a manner characteristic of natural or engineered antibodies (that is, polyclonal, monoclonal, or single-chain antibodies) and, in addition, possess structural advantages.
  • the structure of these antibody mimics has been designed for optimal folding, stability, and solubility, even under conditions that normally lead to the loss of structure and function in antibodies.
  • These antibody mimics may be utilized for the purpose of designing proteins which are capable of binding to virtually any compound (for example, any protein) of interest.
  • the 10 Fn3-based molecules described herein may be used as scaffolds which are subjected to directed evolution to form a population with one or more randomized Fn3 loops that are analogous by position and structure to the complementarity-determining regions (CDRs) of an antibody variable region, and/or to randomize Fn3's other three solvent exposed loops.
  • CDRs complementarity-determining regions
  • the scaffolds described herein may be used to display defined exposed loops (for example, loops previously randomized and selected on the basis of antigen binding) in order to direct the evolution of molecules that bind to such introduced loops.
  • a selection of this type may be carried out to identify recognition molecules for any individual CDR-like loop or, alternatively, for the recognition of two or all three CDR-like loops combined into a non-linear epitope.
  • the present invention features randomized or mutated scaffold proteins.
  • the invention features a non-antibody protein including a domain having an immunoglobulin-like fold, the non-antibody protein deriving from a reference protein by having a mutated amino acid sequence, wherein the non-antibody protein binds with a Kd at least as tight as 1 ⁇ M to a compound that is not bound as tightly by the reference protein.
  • the invention features a non-antibody protein deriving from a scaffold protein including a domain having an immunoglobulin-like fold, wherein the amino acid sequence of the domain in the derived protein is more than 50% identical to the amino acid sequence of the domain in the scaffold protein.
  • the invention features a protein that includes a fibronectin type III domain having at least one randomized loop, the protein being characterized by the ability of the Fn3 domain to bind to a compound that is not bound by the corresponding naturally-occurring Fn3 domain.
  • any of these proteins of the invention bind to their target compounds with a Kd at least as tight as 500 nM, preferably, with a Kd at least as tight as 100 nM or 10 nM, and, more preferably, with a Kd at least as tight as 1 nM, 500 ⁇ M, 100 ⁇ M, or even 20 ⁇ M.
  • the protein preferably contains one, two, or three mutated loops and at least one of the loops, and preferably two or all three of the loops, contributes to the binding of the protein to the compound.
  • the reference protein preferably lacks disulfide bonds, and the derivative protein may have at least one disulfide bond.
  • the domain having an immunoglobulin-like fold preferably has a molecular mass less than 10 kD or greater than 7.5 kD, and, more preferably, has a molecular mass between 7.5-10 kD.
  • the proteins of the invention may be monomers under physiological conditions or may be multimers, for example, dimers.
  • the reference protein used to derive a mutated protein of the invention is a naturally-occurring mammalian protein (for example, a human protein); and the domain having an immunoglobulin-like fold is mutated and includes up to 50%, and preferably up to 34%, mutated amino acids as compared to the immunoglobulin-like fold of the reference protein.
  • the domain having the immunoglobulin-like fold preferably consists of approximately 50-150 amino acids, and more preferably consists of approximately 50 amino acids.
  • Derivative proteins of the invention may be derived from any appropriate reference protein including, but not limited to, the preferred proteins, fibronectin or a fibronectin dimer, tenascin, N-cadherin, E-cadherin, ICAM, titin, GCSF-receptor, cytokine receptor, glycosidase inhibitor, antibiotic chromoprotein, myelin membrane adhesion molecule P0, CD8, CD4, CD2, class I MHC, T-cell antigen receptor, CD1, C2 and I-set domains of VCAM-1,1-set immunoglobulin domain of myosin-binding protein C, I-set immunoglobulin domain of myosin-binding protein H, I-set immunoglobulin domain of telokin, NCAM, twitchin, neuroglian, growth hormone receptor, erythropoietin receptor, prolactin receptor, interferon-gamma receptor, ⁇ -galactosidase/glucuronidase,
  • the fibronectin type III domain is a mammalian (for example, a human) fibronectin type III domain; and the protein includes the tenth module of the fibronectin type III ( 10 Fn3) domain.
  • compound binding is preferably mediated by either one, two, or three 10 Fn3 loops.
  • the second (DE) loop of 10 Fn3 may be extended in length relative to the naturally-occurring module, or the 10 Fn3 may lack an integrin-binding motif.
  • the integrin-binding motif may be replaced by an amino acid sequence in which a polar amino acid-neutral amino acid-acidic amino acid sequence (in the N-terminal to C-terminal direction) replaces the integrin-binding motif; alternatively, one preferred sequence is serine-glycine-glutamate.
  • the fibronectin type III domain-containing proteins of the invention lack disulfide bonds.
  • any of the proteins of the invention may be formulated as part of a fusion protein.
  • the fusion protein includes a heterologous protein that does not itself bind to the compound of interest.
  • the heterologous protein may, for example, be an antibody or antibody domain (such as an immunoglobulin F c domain), a complement protein, a toxin protein, or an albumin protein.
  • any of the proteins of the invention (for example, the fibronectin type III domain proteins) may be covalently bound to a nucleic acid (for example, an RNA), and the nucleic acid may encode the protein.
  • the protein may be a multimer, or, particularly if it lacks an integrin-binding motif, it may be formulated in a physiologically-acceptable carrier.
  • the present invention also features proteins that include a fibronectin type III domain having at least one mutation in a ⁇ -sheet sequence. Again, these proteins are characterized by their ability to bind to compounds that are not bound or are not bound as tightly by the corresponding naturally-occurring fibronectin domain.
  • any of the proteins of the invention may be immobilized on a solid support (for example, a bead or chip), and these proteins may be arranged in any configuration on the solid support, including an array.
  • a solid support for example, a bead or chip
  • the invention further features nucleic acids encoding any of the proteins of the invention.
  • the nucleic acid is DNA or RNA.
  • the invention also features a method for generating a protein which includes a fibronectin type III domain and which is pharmaceutically acceptable to a mammal, involving removing the integrin-binding domain of said fibronectin type III domain.
  • This method may be applied to any of the fibronectin type III domain-containing proteins described above and is particularly useful for generating proteins for human therapeutic applications.
  • the invention also features such fibronectin type III domain-containing proteins which lack integrin-binding domains.
  • the invention features methods of obtaining derivative non-antibody proteins which bind to compounds of interest.
  • One such method involves: (a) providing a non-antibody scaffold protein including an immunoglobulin-like fold, wherein the scaffold protein does not bind to the compound with a Kd as tight as 1 ⁇ M; (b) generating mutated derivatives of the non-antibody scaffold protein, thereby producing a library of mutated proteins; (c) contacting the library with the compound; (d) selecting from the library at least one derivative protein which binds to the compound with a Kd at least as tight as 1 ⁇ M; and (e) optionally repeating steps (b)-(d) substituting for the non-antibody scaffold protein in repeated step (b) the product from the previous step (d).
  • This technique may also be carried out with any of the proteins of the invention (for example, any of the fibronectin type III domain-containing proteins).
  • the invention features screening methods which may be used to obtain or evolve randomized or mutated proteins of the invention capable of binding to compounds of interest, or to obtain or evolve compounds (for example, proteins) capable of binding to a particular protein containing a randomized or mutated motif.
  • the invention features screening procedures which combine these two methods, in any order, to obtain either compounds or proteins of interest.
  • a first screening method useful for the isolation or identification of randomized or mutated proteins of interest, involves: (a) contacting a compound of interest with a candidate protein, the candidate protein being a derivative non-antibody protein including a domain having an immunoglobulin-like fold, the non-antibody protein deriving from a reference protein by having a mutated amino acid sequence wherein the non-antibody protein binds with a Kd at least as tight as 1 ⁇ M to a compound that is not bound as tightly by the reference protein, wherein the contacting is carried out under conditions that allow compound-protein complex formation; and (b) obtaining, from the complex, the derivative protein that binds to the compound.
  • This general technique may also be carried out with a fibronectin type III domain protein having at least one randomized or mutated loop.
  • the second screening method is for isolating or identifying a compound which binds to a protein of the invention.
  • This method begins with a non-antibody protein including a domain having an immunoglobulin-like fold and deriving from a reference protein by having a mutated amino acid sequence, wherein the non-antibody protein binds with a Kd at least as tight as 1 ⁇ M to a compound that is not bound as tightly by the reference protein.
  • This derivative protein is then contacted with a candidate compound, wherein the contacting is carried out under conditions that allow compound-protein complex formation, and the compound which binds to the derivative protein is obtained from the complex.
  • this general technique may be carried out with any protein of the invention, for example, a protein with a mutated fibronectin type III domain.
  • the invention features diagnostic methods which employ the proteins of the invention (for example, fibronectin type III scaffold proteins and their derivatives). Such diagnostic methods may be carried out on a sample (for example, a biological sample) to detect one analyte or to simultaneously detect many different analytes in the sample.
  • the method may employ any of the scaffold molecules described herein.
  • the method involves (a) contacting the sample with a protein of the invention that binds to the compound analyte, the contacting being carried out under conditions that allow compound-protein complex formation; and (b) detecting the complex, and therefore the compound in the sample.
  • this method may be used to quantitate, as well as detect, compound levels in a sample.
  • the protein of the invention binds to its target compound with a Kd at least as tight as 1 ⁇ M or 500 nM, preferably, with a Kd at least as tight as 100 nM or 10 nM, and, more preferably, with a Kd at least as tight as 1 nM, 500 pM, 100 pM, or even 20 pM.
  • the protein preferably contains one, two, or three mutated loops and at least one of the loops, and preferably two or all three of the loops contributes to the binding of the protein to the compound.
  • the reference protein preferably lacks disulfide bonds, and the derivative protein may have at least one disulfide bond.
  • the domain having an immunoglobulin-like fold preferably has a molecular mass less than 10 kD or greater than 7.5 kD, and, more preferably, has a molecular mass between 7.5-10 kD.
  • the proteins of the invention may be monomers under physiological conditions or may be multimers, for example, dimers.
  • the reference protein used to derive a mutated protein of the invention is a naturally-occurring mammalian protein (for example, a human protein); and the domain having an immunoglobulin-like fold is mutated and includes up to 50%, and preferably up to 34%, mutated amino acids as compared to the immunoglobulin-like fold of the reference protein.
  • the domain having an immunoglobulin-like fold preferably consists of approximately 50-150 amino acids, and more preferably consists of approximately 50 amino acids.
  • Derivative proteins used in the methods of the invention may be derived from any appropriate reference protein including, but not limited to, the preferred proteins, fibronectin or a fibronectin dimer, tenascin, N-cadherin, E-cadherin, ICAM, titin, GCSF-receptor, cytokine receptor, glycosidase inhibitor, antibiotic chromoprotein, myelin membrane adhesion molecule P0, CD8, CD4, CD2, class I MHC, T-cell antigen receptor, CD1, C2 and I-set domains of VCAM-1,1-set immunoglobulin domain of myosin-binding protein C, I-set immunoglobulin domain of myosin-binding protein H, I-set immunoglobulin domain of telokin, NCAM, twitchin, neuroglian, growth hormone receptor, erythropoietin receptor, prolactin receptor, interferon-gamma receptor, ⁇ -galactosidase/glucu
  • the steps of the selection methods described herein may be repeated with further mutation or randomization being carried out between cycles.
  • at least one loop of the fibronectin type III domain of the protein obtained in step (b) may be mutated and steps (a) and (b) repeated using the further randomized protein, or the compound obtained in step (b) may be modified and steps (a) and (b) repeated using the further modified compound.
  • the compound is preferably a protein
  • the fibronectin type III domain is preferably a mammalian (for example, a human) fibronectin type III domain.
  • the protein includes the tenth module of the fibronectin type III domain ( 10 Fn3), and binding is mediated by one, two, or three 10 Fn3 loops.
  • the second (DE) loop of 10 Fn3 may be extended in length relative to the naturally-occurring module, or 10 Fn3 may lack an integrin-binding motif.
  • the integrin-binding motif may be replaced by an amino acid sequence in which a basic amino acid-neutral amino acid-acidic amino acid sequence (in the N-terminal to C-terminal direction) replaces the integrin-binding motif; alternatively, one preferred replacement sequence is serine-glycine-glutamate.
  • any of the proteins of the invention for example, a fibronectin type III domain-containing protein.
  • any of these proteins may be formulated as part of a fusion protein with a heterologous protein (for example, an antibody or antibody domain (including an immunoglobulin F c domain) that does not itself bind the compound of interest, or a complement protein, toxin protein, or albumin protein).
  • selections and diagnostic methods may be carried out using the proteins of the invention (for example, the fibronectin type III domain proteins) covalently bound to nucleic acids (for example, RNAs or any nucleic acid which encodes the protein).
  • the selections and diagnostic methods may be carried out using these proteins (for example, the fibronectin domain-containing proteins) as monomers or as multimers, such as dimers.
  • the selections and diagnostic methods involve the immobilization of the binding target on a solid support.
  • solid supports include columns (for example, affinity columns, such as agarose-based affinity columns), microchips, or beads.
  • the proteins for example, the Fn3 domain-containing proteins
  • the proteins may be immobilized and contacted with one or more potential binding targets.
  • the compound is often a protein, but may also be any other analyte in a sample.
  • Detection may be accomplished by any standard technique including, without limitation, radiography, fluorescence detection, mass spectroscopy, or surface plasmon resonance.
  • the invention features a non-antibody protein that binds tumor necrosis factor- ⁇ (TNF- ⁇ ) with a Kd at least as tight as 1 ⁇ M, the protein having a sequence that is less than 20% identical to TNF- ⁇ receptor (for example, a naturally-occurring TNF- ⁇ receptor, such as a mammalian or human TNF- ⁇ receptor).
  • TNF- ⁇ tumor necrosis factor- ⁇
  • the protein having a sequence that is less than 20% identical to TNF- ⁇ receptor (for example, a naturally-occurring TNF- ⁇ receptor, such as a mammalian or human TNF- ⁇ receptor).
  • this protein includes a mutated fibronectin type III domain and the protein is mutated in the DE, BC, and FG loops.
  • the mutated FG loop is the same length as the wild-type FG loop.
  • the protein includes an immunoglobulin-like fold (preferably, having a molecular mass less than 10 kD, greater than 7.5 kD, or between 7.5-10 kD) that consists of approximately 50-150 amino acids, and preferably, approximately 50 amino acids.
  • the TNF- ⁇ binders according to the invention bind TNF- ⁇ with a Kd at least as tight as 1 ⁇ M, preferably, at least as tight as 500 nM, 100 nM, or 10 nM, more preferably, at least as tight as 1 nM or 500 pM, and, most preferably, at least as tight as 100 pM or even 20 pM.
  • these proteins contain one, two, or three mutated loops, and at least one, and preferably two or all three of the loops, contribute to the binding of the non-antibody protein to TNF- ⁇ .
  • the non-antibody protein has at least one disulfide bond, and the non-antibody protein is a monomer or dimer under physiological conditions.
  • the TNF- ⁇ binders may be immobilized on a solid support (for example, a chip or bead), and may be part of an array.
  • any of the TNF- ⁇ binders may be joined to a heterologous protein (for example, a heterologous protein that is an antibody or an antibody domain that does not bind TNF- ⁇ , an immunoglobulin F c domain, a complement protein, or an albumin protein).
  • the protein may include a mutated fibronectin type III domain (for example, one derived from a human fibronectin type III domain, such as a mutated tenth module of the fibronectin type III domain ( 10 Fn3)).
  • the protein may lack an 10 Fn3 integrin-binding motif.
  • TNF- ⁇ binders preferably include a non-naturally occurring sequence in a loop of 10 Fn3 (for example, the loop sequence PW(A/G), and may include a non-naturally occurring sequence in a ⁇ -sheet of 10 Fn3. Particularly preferred TNF- ⁇ binders of the invention are shown in FIG. 25 (SEQ ID NOS: 34-140).
  • the invention features nucleic acids encoding any of the TNF- ⁇ binding proteins of the invention, as well as a loop structure on any protein that includes any one of the amino acid sequences of FIG. 25 (SEQ ID NOS: 34-140).
  • non-antibody protein a protein that is not produced by the B cells of a mammal either naturally or following immunization of a mammal. This term also excludes antibody fragments of more than 100 amino acids, preferably, more than 80 amino acids, and, most preferably, more than 50 amino acids in length.
  • immunoglobulin-like fold is meant a protein domain of between about 80-150 amino acid residues that includes two layers of antiparallel beta-sheets, and in which the flat, hydrophobic faces of the two beta-sheets are packed against each other. Proteins according to the invention may include several immunoglobulin-like folds covalently bound or associated non-covalently into larger structures.
  • scaffold is meant a protein used to select or design a protein framework with specific and favorable properties, such as binding.
  • amino acid residues that are important for the framework's favorable properties are retained, while others residues may be varied.
  • Such a scaffold has less than 50% of the amino acid residues that vary between protein derivatives having different properties and greater than or equal to 50% of the residues that are constant between such derivatives. Most commonly, these constant residues confer the same overall three-dimensional fold to all the variant domains, regardless of their properties.
  • fibronectin type III domain is meant a domain having 7 or 8 beta strands which are distributed between two beta sheets, which themselves pack against each other to form the core of the protein, and further containing loops which connect the beta strands to each other and are solvent exposed. There are at least three such loops at each edge of the beta sheet sandwich, where the edge is the boundary of the protein perpendicular to the direction of the beta strands.
  • Naturally occurring is meant any protein that is encoded by a living organism.
  • randomized or “mutated” is meant including one or more amino acid alterations relative to a template sequence.
  • randomizing or “mutating” is meant the process of introducing, into a sequence, such an amino acid alteration. Randomization or mutation may be accomplished through intentional, blind, or spontaneous sequence variation, generally of a nucleic acid coding sequence, and may occur by any technique, for example, PCR, error-prone PCR, or chemical DNA synthesis.
  • a “corresponding, non-mutated protein” is meant a protein that is identical in sequence, except for the introduced amino acid mutations.
  • Protein By a “protein” is meant any sequence of two or more amino acids, regardless of length, post-translation modification, or function. “Protein” and “peptide” are used interchangeably herein.
  • RNA is meant a sequence of two or more covalently bonded, naturally occurring or modified ribonucleotides.
  • a modified RNA included within this term is phosphorothioate RNA.
  • DNA is meant a sequence of two or more covalently bonded, naturally occurring or modified deoxyribonucleotides.
  • nucleic acid is meant any two or more covalently bonded nucleotides or nucleotide analogs or derivatives. As used herein, this term includes, without limitation, DNA, RNA, and PNA.
  • pharmaceutically acceptable is meant a compound or protein that may be administered to an animal (for example, a mammal) without significant adverse medical consequences.
  • physiologically acceptable carrier is meant a carrier which does not have a significant detrimental impact on the treated host and which retains the therapeutic properties of the compound with which it is administered.
  • physiologically acceptable carrier is physiological saline.
  • Other physiologically acceptable carriers and their formulations are known to one skilled in the art and are described, for example, in Remington's Pharmaceutical Sciences , (18 th edition), ed. A. Gennaro, 1990, Mack Publishing Company, Easton, Pa., incorporated herein by reference.
  • fusion protein a protein that includes (i) a scaffold protein of the invention joined to (ii) a second, different (i.e., “heterologous”) protein.
  • Fusion proteins are distinguished from “nucleic acid-protein fusions” and “RNA-protein fusions” in that a “fusion protein” is composed entirely of amino acids, while both a “nucleic acid-protein fusion” and an “RNA-protein fusion” include a stretch of nucleic acids (the nucleic acid or RNA component) joined to a stretch of amino acids (the protein component).
  • a “selecting” step provides at least a 2-fold, preferably, at least a 30-fold, more preferably, at least a 100-fold, and, most preferably, at least a 1000-fold enrichment of a desired molecule relative to undesired molecules in a population following the selection step.
  • a selection step may be repeated any number of times, and different types of selection steps may be combined in a given approach.
  • binding partner any molecule which has a specific, covalent or non-covalent affinity for a portion of a desired compound (for example, protein) of interest.
  • binding partners include, without limitation, members of antigen/antibody pairs, protein/inhibitor pairs, receptor/ligand pairs (for example cell surface receptor/ligand pairs, such as hormone receptor/peptide hormone pairs), enzyme/substrate pairs (for example, kinase/substrate pairs), lectin/carbohydrate pairs, oligomeric or heterooligomeric protein aggregates, DNA binding protein/DNA binding site pairs, RNA/protein pairs, and nucleic acid duplexes, heteroduplexes, or ligated strands, as well as any molecule which is capable of forming one or more covalent or non-covalent bonds (for example, disulfide bonds) with any portion of another molecule (for example, a compound or protein).
  • solid support is meant, without limitation, any column (or column material), bead, test tube, microtiter dish, solid particle (for example, agarose or sepharose), microchip (for example, silicon, silicon-glass, or gold chip), or membrane (for example, an inorganic membrane, nitrocellulose, or the membrane of a liposome or vesicle) to which an antibody mimic or an affinity complex may be bound, either directly or indirectly (for example, through other binding partner intermediates such as other antibodies or Protein A), or in which an antibody mimic or an affinity complex may be embedded (for example, through a receptor or channel).
  • the present antibody mimics exhibit improved biophysical properties, such as stability under reducing conditions and solubility at high concentrations.
  • these molecules may be readily expressed and folded in prokaryotic systems, such as E. coli in eukaryotic systems, such as yeast, and in in vitro translation systems, such as the rabbit reticulocyte lysate system.
  • prokaryotic systems such as E. coli in eukaryotic systems, such as yeast
  • in vitro translation systems such as the rabbit reticulocyte lysate system.
  • these molecules are extremely amenable to affinity maturation techniques involving multiple cycles of selection, including in vitro selection using RNA-protein fusion technology (Roberts and Szostak, Proc. Natl. Acad. Sci. USA 94:12297, 1997; Szostak et al., U.S. Ser. No.
  • FIG. 1 is a photograph showing a comparison between the structures of antibody heavy chain variable regions from camel (dark blue) and llama (light blue), in each of two orientations.
  • FIG. 2 is a photograph showing a comparison between the structures of the camel antibody heavy chain variable region (dark blue), the llama antibody heavy chain variable region (light blue), and a fibronectin type III module number 10 ( 10 Fn3) (yellow).
  • FIG. 3 is a photograph showing a fibronectin type III module number 10 ( 10 Fn3), with the loops corresponding to the antigen-binding loops in IgG heavy chains highlighted in red.
  • FIG. 4 is a graph illustrating a sequence alignment between a fibronectin type III protein domain and related protein domains.
  • FIG. 5 is a photograph showing the structural similarities between a 10 Fn3 domain and 15 related proteins, including fibronectins, tenascins, collagens, and undulin.
  • the regions are labeled as follows: constant, dark blue; conserved, light blue; neutral, white; variable, red; and RGD integrin-binding motif (variable), yellow.
  • FIG. 6 is a photograph showing space filling models of fibronectin III modules 9 and 10, in each of two different orientations.
  • the two modules and the integrin binding loop (RGD) are labeled.
  • blue indicates positively charged residues
  • red indicates negatively charged residues
  • white indicates uncharged residues.
  • FIG. 7 is a photograph showing space filling models of fibronectin III modules 7-10, in each of three different orientations. The four modules are labeled. In this figure, blue indicates positively charged residues, red indicates negatively charged residues, and white indicates uncharged residues.
  • FIG. 8 is a photograph illustrating the formation, under different salt conditions, of RNA-protein fusions which include fibronectin type III domains.
  • FIG. 9 is a series of photographs illustrating the selection of fibronectin type III domain-containing RNA-protein fusions, as measured by PCR signal analysis.
  • FIG. 10 is a graph illustrating an increase in the percent TNF- ⁇ binding during the selections described herein, as well as a comparison between RNA-protein fusion and free protein selections.
  • FIG. 11 is a series of schematic representations showing IgG, 10 Fn3, Fn-CH 1 —CH 2 —CH 3 , and Fn-CH 2 —CH 3 (clockwise from top left).
  • FIG. 12 is a photograph showing a molecular model of Fn-CH 1 —CH 2 —CH 3 based on known three-dimensional structures of IgG (X-ray crystallography) and 10 Fn3 (NMR and X-ray crystallography).
  • FIG. 13 is a graph showing the time course of an exemplary 10 Fn3-based nucleic acid-protein fusion selection of TNF- ⁇ binders. The proportion of nucleic acid-protein fusion pool (open diamonds) and free protein pool (open circles) that bound to TNF- ⁇ -Sepharose, and the proportion of free protein pool (full circles) that bound to underivatized Sepharose, are shown.
  • FIGS. 14 and 15 are graphs illustrating TNF- ⁇ binding by TNF- ⁇ Fn-binders. In particular, these figures show mass spectra data obtained from a 10 Fn3 fusion chip and non-fusion chip, respectively.
  • FIGS. 16 and 17 are the phosphorimage and fluorescence scan, respectively, of an 10 Fn3 array, illustrating TNF- ⁇ binding.
  • FIG. 18 is a graph showing an alignment of the primary sequences of the llama V H domain and the wild-type human 10 F3 domain. Homologous residues between the two sequences are indicated. The 10 F3 residues outside the randomized loops that were found to have mutated in approximately 45% of the selected clones are marked with arrows under the wild-type 10 Fn3 sequence and with the letter that identifies the selected residue.
  • FIG. 19 shows schematic representations of the llama V H domain and the wild-type human 10 F3 domain. The locations of the mutated framework residues are indicated.
  • FIG. 20 is a graph illustrating the efficiency and specificity of binding of a free-protein pool translated from the original library (R0) and after ten rounds of selection with TNF- ⁇ (R10). Protein pool binding to underivatized Sepharose, to TNF- ⁇ -Sepharose, to IL-1 ⁇ -Sepharose, and to IL-13-Sepharose is compared.
  • FIG. 21 is a series of IgG-like scaffolds for the display of up to three loops.
  • FIG. 22 is a series of IgG-like scaffolds for the display of up to four, or even six, loops.
  • FIG. 23 is a series of scaffolds, unrelated to IgG, for the display of loop structures.
  • FIGS. 24A-24D are photographic and graphic illustrations demonstrating the specific capture of a target (TNF- ⁇ ) by a mimic immobilized on a solid surface.
  • FIG. 25 is a graph listing exemplary TNF- ⁇ binders (SEQ ID NOS: 33-140) according to the invention.
  • novel antibody mimics described herein have been designed to be superior both to antibody-derived fragments and to non-antibody frameworks, for example, those frameworks cited above.
  • antibody mimics over antibody fragments is structural.
  • These antibody-mimics are derived from whole, stable, and soluble structural scaffolds.
  • the Fn3 scaffold is found in the human body. Consequently, they exhibit better folding and thermostability properties than antibody fragments, whose creation involves the removal of parts of the antibody native fold, often exposing amino acid residues that, in an intact antibody, would be buried in a hydrophobic environment, such as an interface between variable and constant domains. Exposure of such hydrophobic residues to solvent increases the likelihood of aggregation of the antibody fragments.
  • the scaffolds described herein have no disulfide bonds, which have been reported to retard or prevent proper folding of antibody fragments under certain conditions. Since the present scaffolds do not rely on disulfides for native fold stability, they are stable under reducing conditions, unlike antibodies and their fragments which unravel upon disulfide bond reduction.
  • these scaffolds provide the functional advantages of antibody molecules.
  • the 10 Fn3 module is not an immunoglobulin
  • its overall fold is close to that of the variable region of the IgG heavy chain ( FIG. 2 ), making it possible to display the three fibronectin loops analogous to CDRs in relative orientations similar to those of native antibodies.
  • the present antibody mimics possess antigen binding properties that are similar in nature and affinity to those of antibodies, and a loop randomization and shuffling strategy may be employed in vitro that is similar to the process of affinity maturation of antibodies in vivo.
  • exemplary scaffolds for example, fibronectin-based scaffolds, and their use for identifying, selecting, and evolving novel binding proteins as well as their target ligands. These examples are provided for the purpose of illustrating, and not limiting, the invention.
  • Preferred antibody mimics of the present invention are based on the structure of a fibronectin module of type III (Fn3), a common domain found in mammalian blood and structural proteins. This domain occurs more than 400 times in the protein sequence database and has been estimated to occur in 2% of the proteins sequenced to date, including fibronectins, tenascin, intracellular cytoskeletal proteins, and prokaryotic enzymes (Bork and Doolittle, Proc. Natl. Acad. Sci. USA 89:8990, 1992; Bork et al., Nature Biotech. 15:553, 1997; Meinke et al., J. Bacteriol. 175:1910, 1993; Watanabe et al., J. Biol. Chem.
  • Fn3 fibronectin module of type III
  • a particular scaffold is the tenth module of human Fn3 ( 10 Fn3), which comprises 94 amino acid residues.
  • the overall fold of this domain is closely related to that of the smallest functional antibody fragment, the variable region of the heavy chain, which comprises the entire antigen recognition unit in camel and llama IgG ( FIG. 1, 2 ).
  • the major differences between camel and llama domains and the 10 Fn3 domain are that (i) 10 Fn3 has fewer beta strands (seven vs. nine) and (ii) the two beta sheets packed against each other are connected by a disulfide bridge in the camel and llama domains, but not in 10 F3.
  • the three loops of 10 Fn3 corresponding to the antigen-binding loops of the IgG heavy-chain run between amino acid residues 21-31 (BC), 51-56 (DE), and 76-88 (FG) ( FIG. 3 ).
  • the length of the BC and DE loop, 10 and 6 residues, respectively, fall within the narrow range of the corresponding antigen-recognition loops found in antibody heavy chains, that is, 7-10 and 4-8 residues, respectively. Accordingly, once randomized and selected for high antigen affinity, these two loops may make contacts with antigens equivalent to the contacts of the corresponding loops in antibodies.
  • the FG loop of 10 Fn3 is 12 residues long, whereas the corresponding loop in antibody heavy chains ranges from 4-28 residues.
  • the length of the FG loop of 10 F3 is preferably randomized in length as well as in sequence to cover the CDR3 range of 4-28 residues to obtain the greatest possible flexibility and affinity in antigen binding.
  • the lengths as well as the sequences of the CDR-like loops of the antibody mimics may be randomized during in vitro or in vivo affinity maturation (as described in more detail below).
  • the tenth human fibronectin type III domain, 10 Fn3, refolds rapidly even at low temperature; its backbone conformation has been recovered within 1 second at 5° C.
  • Fn3 Fn3 fibronectin, a glycoprotein that exists in a soluble form in body fluids and in an insoluble form in the extracellular matrix (Dickinson et al., J. Mol. Biol. 236:1079, 1994).
  • a fibronectin monomer of 220-250 kD contains 12 type I modules, two type II modules, and 17 fibronectin type III modules (Potts and Campbell, Curr. Opin. Cell Biol. 6:648, 1994). Different type III modules are involved in the binding of fibronectin to integrins, heparin, and chondroitin sulfate.
  • 10 Fn3 was found to mediate cell adhesion through an integrin-binding Arg-Gly-Asp (RGD) motif on one of its exposed loops. Similar RGD motifs have been shown to be involved in integrin binding by other proteins, such as fibrinogen, von Wellebrand factor, and vitronectin (Hynes et al., Cell 69:11, 1992). No other matrix- or cell-binding roles have been described for 10 Fn3.
  • RGD integrin-binding Arg-Gly-Asp
  • the 10 F3 framework possesses exposed loop sequences tolerant of randomization, facilitating the generation of diverse pools of antibody mimics. This determination was made by examining the flexibility of the 10 Fn3 sequence.
  • the human 10 Fn3 sequence was aligned with the sequences of fibronectins from other sources as well as sequences of related proteins ( FIG. 4 ), and the results of this alignment were mapped onto the three-dimensional structure of the human 10 F3 domain ( FIG. 5 ).
  • changes in the ⁇ -sheet sequences may also be used to evolve new proteins. These mutations change the scaffold and thereby indirectly alter loop structure(s). If this approach is taken, mutations should not saturate the sequence, but rather few mutations should be introduced. Preferably, no more than between 3-20 changes should be introduced to the ⁇ -sheet sequences by this approach.
  • Sequence variation may be introduced by any technique including, for example, mutagenesis by Taq polymerase (Tindall and Kunkel, Biochemistry 27:6008 (1988)), fragment recombination, or a combination thereof.
  • mutagenesis by Taq polymerase (Tindall and Kunkel, Biochemistry 27:6008 (1988)
  • fragment recombination or a combination thereof.
  • an increase of the structural diversity of libraries for example, by varying the length as well as the sequence of the CDR-like loops, or by structural redesign based on the advantageous framework mutations found in selected pools, may be used to introduce further improvements in antibody mimic scaffolds.
  • the antibody mimics described herein may be fused to other protein domains.
  • these mimics may be integrated with the human immune response by fusing the constant region of an IgG (F c ) with an antibody mimic, such as an 10 Fn3 module, preferably through the C-terminus of 10 Fn3.
  • the F c in such a 10 Fn3-F c fusion molecule activates the complement component of the immune response and increases the therapeutic value of the antibody mimic.
  • a fusion between an antibody mimic, such as 10 Fn3, and a complement protein, such as Clq may be used to target cells
  • a fusion between an antibody mimic, such as 10 F3, and a toxin may be used to specifically destroy cells that carry a particular antigen.
  • an antibody scaffold such as 10 Fn3, in any form may be fused with albumin to increase its half-life in the bloodstream and its tissue penetration.
  • Any of these fusions may be generated by standard techniques, for example, by expression of the fusion protein from a recombinant fusion gene constructed using publically available gene sequences.
  • any of the scaffold constructs described herein may be generated as dimers or multimers of antibody mimics as a means to increase the valency and thus the avidity of antigen binding.
  • Such multimers may be generated through covalent binding.
  • individual 10 Fn3 modules may be bound by imitating the natural 8 Fn3- 9 Fn3- 10 Fn3 C-to-N-terminus binding or by imitating antibody dimers that are held together through their constant regions.
  • a 10 Fn3-Fc construct may be exploited to design dimers of the general scheme of 10 Fn3-Fc::Fc- 10 Fn3.
  • the bonds engineered into the Fc::Fc interface may be covalent or non-covalent.
  • dimerizing or multimerizing partners other than Fc can be used in hybrids, such as 10 Fn3 hybrids, to create such higher order structures.
  • covalently bonded multimers may be generated by constructing fusion genes that encode the multimer or, alternatively, by engineering codons for cysteine residues into monomer sequences and allowing disulfide bond formation to occur between the expression products.
  • Non-covalently bonded multimers may also be generated by a variety of techniques. These include the introduction, into monomer sequences, of codons corresponding to positively and/or negatively charged residues and allowing interactions between these residues in the expression products (and therefore between the monomers) to occur. This approach may be simplified by taking advantage of charged residues naturally present in a monomer subunit, for example, the negatively charged residues of fibronectin.
  • Another means for generating non-covalently bonded antibody mimics is to introduce, into the monomer gene (for example, at the amino- or carboxy-termini), the coding sequences for proteins or protein domains known to interact.
  • proteins or protein domains include coil-coil motifs, leucine zipper motifs, and any of the numerous protein subunits (or fragments thereof) known to direct formation of dimers or higher order multimers.
  • 10 Fn3 represents a preferred scaffold for the generation of antibody mimics
  • other molecules may be substituted for 10 Fn3 in the molecules described herein.
  • These include, without limitation, human fibronectin modules 10 Fn3- 9 Fn3 and 11 Fn3- 17 Fn3 as well as related Fn3 modules from non-human animals and prokaryotes.
  • Fn3 modules from other proteins with sequence homology to 10 Fn3, such as tenascins and undulins may also be used.
  • Other exemplary scaffolds having immunoglobulin-like folds (but with sequences that are unrelated to the V H domain) are shown in FIG.
  • Yet further domains with related structures may be derived from myelin membrane adhesion molecule P0, CD8, CD4, CD2, class I MHC, T-cell antigen receptor, CD1, C2 and I-set domains of VCAM-1, I-set immunoglobulin domain of myosin-binding protein C, I-set immunoglobulin domain of myosin-binding protein H, I-set immunoglobulin domain of telokin, telikin, NCAM, twitchin, neuroglian, growth hormone receptor, erythropoietin receptor, prolactin receptor, GC-SF receptor, interferon-gamma receptor, ⁇ -galactosidase/glucuronidase, ⁇ -glucuronidase, and transglutaminase.
  • any other protein that includes one or more immunoglobulin-like folds may be utilized.
  • Such proteins may be identified, for example, using the program SCOP (Murzin et al., J. Mol. Biol. 247:536 (1995); Lo Conte et al., Nucleic Acids Res. 25:257 (2000).
  • any molecule that exhibits a structural relatedness to the V H domain may be utilized as an antibody mimic.
  • Such molecules may, like fibronectin, include three loops at the N-terminal pole of the molecule and three loops at the C-terminal pole, each of which may be randomized to create diverse libraries; alternatively, larger domains may be utilized, having larger numbers of loops, as long as a number of such surface randomizable loops are positioned closely enough in space that they can participate in antigen binding.
  • FIG. 22 shows examples of useful domains having more than three loops positioned close to each other.
  • T-cell antigen receptor and superoxide dismutase which each have four loops that can be randomized; and an Fn3 dimer, tissue factor domains, and cytokine receptor domains, each of which have three sets of two similar domains where three randomizable loops are part of the two domains (bringing the total number of loops to six).
  • any protein having variable loops positioned close enough in space may be utilized for candidate binding protein production.
  • large proteins having spatially related, solvent accessible loops may be used, even if unrelated structurally to an immunoglobulin-like fold.
  • Exemplary proteins include, without limitation, cytochrome F, green fluorescent protein, GroEL, and thaumatin ( FIG. 23 ).
  • the loops displayed by these proteins may be randomized and superior binders selected from a randomized library as described herein. Because of their size, molecules may be obtained that exhibit an antigen binding surface considerably larger than that found in an antibody-antigen interaction.
  • Other useful scaffolds of this type may also be identified using the program SCOP (Murzin et al., J. Mol. Biol. 247:536 (1995)) to browse among candidate proteins having numerous loops, particularly loops positioned among parallel beta sheets or a number of alpha-helices.
  • Modules from different organisms and parent proteins may be most appropriate for different applications. For example, in designing an antibody mimic, it may be most desirable to generate that protein from a fibronectin or fibronectin-like molecule native to the organism for which a therapeutic is intended. In contrast, the organism of origin is less important or even irrelevant for antibody mimics that are to be used for in vitro applications, such as diagnostics, or as research reagents.
  • libraries may be generated and used to select binding proteins by any of the methods described herein.
  • the antibody mimics described herein may be used in any technique for evolving new or improved binding proteins.
  • the target of binding is immobilized on a solid support, such as a column resin or microtiter plate well, and the target contacted with a library of candidate scaffold-based binding proteins.
  • a library may consist of antibody mimic clones, such as 10 Fn3 clones constructed from the wild type 10 Fn3 scaffold through randomization of the sequence and/or the length of the 10 Fn3 CDR-like loops.
  • this library may be an RNA-protein fusion library generated, for example, by the techniques described in Szostak et al., U.S. Ser. No.
  • it may be a DNA-protein library (for example, as described in Lohse, DNA-Protein Fusions and Uses Thereof, U.S. Ser. No. 60/110,549, U.S. Ser. No. 09/459,190, and WO 00/32823).
  • the fusion library is incubated with the immobilized target, the support is washed to remove non-specific binders, and the tightest binders are eluted under very stringent conditions and subjected to PCR to recover the sequence information or to create a new library of binders which may be used to repeat the selection process, with or without further mutagenesis of the sequence. A number of rounds of selection may be performed until binders of sufficient affinity for the antigen are obtained.
  • the 10 Fn3 scaffold may be used as the selection target.
  • a protein is required that binds a specific peptide sequence presented in a ten residue loop
  • a single 10 Fn3 clone is constructed in which one of its loops has been set to the length of ten and to the desired sequence.
  • the new clone is expressed in vivo and purified, and then immobilized on a solid support.
  • An RNA-protein fusion library based on an appropriate scaffold is then allowed to interact with the support, which is then washed, and desired molecules eluted and re-selected as described above.
  • the scaffolds described herein may be used to find natural proteins that interact with the peptide sequence displayed by the scaffold, for example, in an 10 F3 loop.
  • the scaffold protein such as the 10 F3 protein, is immobilized as described above, and an RNA-protein fusion library is screened for binders to the displayed loop.
  • the binders are enriched through multiple rounds of selection and identified by DNA sequencing.
  • RNA-protein libraries represent exemplary libraries for directed evolution
  • any type of scaffold-based library may be used in the selection methods of the invention.
  • the antibody mimics described herein may be evolved to bind any antigen of interest. These proteins have thermodynamic properties superior to those of natural antibodies and can be evolved rapidly in vitro. Accordingly, these antibody mimics may be employed in place of antibodies in all areas in which antibodies are used, including in the research, therapeutic, and diagnostic fields. In addition, because these scaffolds possess solubility and stability properties superior to antibodies, the antibody mimics described herein may also be used under conditions which would destroy or inactivate antibody molecules. Finally, because the scaffolds of the present invention may be evolved to bind virtually any compound, these molecules provide completely novel binding proteins which also find use in the research, diagnostic, and therapeutic areas.
  • Exemplary scaffold molecules described above were generated and tested, for example, in selection protocols, as follows.
  • a complex library was constructed from three fragments, each of which contained one randomized area corresponding to a CDR-like loop.
  • the randomized residues are indicated in FIG. 18 as underlined sequences, specifically, residues 23-29 of the 10 Fn3 BC loop (corresponding to CDR-H1 of the llama V H ); residues 52-55 of the 10 Fn3 DE loop (corresponding to CDR-H2. of the llama V H ); and residues 78-87 of the 10 Fn3 FG loop (corresponding to CDR-H3 of the llama V H ).
  • each of the fragments contained stretches encoding an N-terminal HiS6 domain or a C-terminal FLAG peptide tag.
  • each DNA fragment contained recognition sequences for the EarI Type IIS restriction endonuclease. This restriction enzyme allowed the splicing together of adjacent fragments while removing all foreign, non-10Fn3, sequences. It also allowed for a recombination-like mixing of the three 10 Fn3 fragments between cycles of mutagenesis and selection.
  • the wild-type, human 10 Fn3 gene was cloned from a human liver library (Maxim Biotech, South San Francisco, Calif.) using the primers Hu5PCR-NdeI 5′CATATGGTTTCTGATGTTCCGAGG3′; SEQ ID NO: 28) and Hu3PCR-EcoRI (5′GAATTCCTATGTTCGGTAATTAATGGAAATTG3′; SEQ ID NO: 29).
  • Hu5PCR-NdeI 5′CATATGGTTTCTGATGTTCCGAGG3′; SEQ ID NO: 28
  • Hu3PCR-EcoRI 5′GAATTCCTATGTTCGGTAATTAATGGAAATTG3′
  • Three different libraries were constructed from the wild-type segments obtained by the PCR of the 10 Fn3 clone and from randomized segments obtained by oligonucleotide synthesis.
  • the BC r -DE r -FG r library was obtained by randomizing the selected residues in BC, DE, and FG loops; the BC r -DE wt -FG r library was obtained by randomizing the selected residues in BC and FG loops, leaving the DE loop sequence wild-type; and the BC wt -DE wt -FG r library was obtained by randomizing the selected residues in the FG loop only.
  • the BC r , DE r , and FG r fragments were made synthetically. Each fragment was assembled from two overlapping oligonucleotides, which were first annealed, then extended to form the double-stranded DNA form of the fragment.
  • the oligonucleotides that were used to construct and process the three fragments are listed below; the “Top” and “Bottom” species for each fragment are the oligonucleotides that contained the entire 10 Fn3 encoding sequence. In these oligonucleotides designations, “N” indicates A, T, C, or G; and “S” indicates C or G.
  • HfnLbcTop (His): (SEQ ID NO:1) 5′-GG AAT TCC TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA TTT ACA ATT ACA ATG CAT CAC CAT CAC CAT CAC GTT TCT GAT GTT CCG AGG GAC CTG GAA GTT GTT GCT GCG ACC CCC ACC AGC-3′ HfnLbcTop (an alternative N-terminus): (SEQ ID NO:2) 5′-GG AAT TCC TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA TTT ACA ATT ACA ATG GTT TCT GAT GTT CCG AGG GAC CTG GAA GTT GTT GCT GCG ACC CCC ACC AGC-3′ HFnLBCBot-flag8: (SEQ ID NO:3) 5′-AGG GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC GCT CTT CCC TGT T
  • the oligonucleotide pair BC Top and BC Bot-flag8 was used to construct the fragment which contains the randomized BC loop; the pair DE Top and DE Bot-flag8 was used to construct the fragment which contains the randomized DE loop; the pair BC Top and DE 3-Flag8 was used to PCR-amplify the BC wt -D wt fragments; and the pair FG Top and FG Bot-Flag8 was used to construct the fragment which contains the randomized FG loop.
  • the pairs of oligonucleotides (500 pmol of each) were annealed in 100 ⁇ L of 10 mM Tris 7.5, 50 mM NaCl for 10 minutes at 85° C., followed by a slow (0.5-1 hour) cooling to room temperature.
  • the annealed fragments with single-stranded overhangs were then extended using 100 U Klenow (New England Biolabs, Beverly, Mass.) for each 100 ⁇ L aliquot of annealed oligos, and the buffer made of 838.5 ⁇ l H 2 O, 9 ⁇ l 1 M Tris 7.5, 5 ⁇ l 1M MgCl 2 , 20 ⁇ l 10 mM dNTPs, and 7.5 ⁇ l 1M DTT.
  • the extension reactions proceeded for 1 hour at 25° C.
  • the randomized residues were encoded by (NNS) n , where N stands for any nucleotide and S for an equimolar mixture of C and G; only one of the three stop codons (TAG) conforms to the NNS restriction.
  • N stands for any nucleotide and S for an equimolar mixture of C and G; only one of the three stop codons (TAG) conforms to the NNS restriction.
  • the gene fragments contained the 5′ Tobacco Mosaic Virus (TMV) untranslated region and the T7 promoter, as well as the sequences encoding a 5′ hexahistidine protein purification tag and a 3′FLAG epitope purification tag.
  • Ear I restriction endonuclease recognition sites were engineered into the overlaps between adjacent fragments in order to facilitate the assembly of the three fragments.
  • each of the double-stranded fragments was transformed into an RNA-protein fusion (PROfusionTM) using the technique developed by Szostak et al., U.S. Ser. No. 09/007,005 and U.S. Ser. No. 09/247,190; Szostak et al., WO98/31700; and Roberts & Szostak, Proc. Natl. Acad. Sci. USA (1997) vol. 94, p. 12297-12302.
  • the fragments were transcribed using an Ambion in vitro transcription kit, T7-MEGAshortscriptTM (Ambion, Austin, Tex.), and the resulting mRNA was gel-purified and ligated to a 5′-phosphorylated DNA-puromycin linker, preferably, 5′ dA 18 PEG 2 dCdCPur) using DNA ligase (Promega, Madison, Wis.); the mRNA was aligned with the DNA linker using a DNA splint oligonucleotide (5′ TTTTTTTTTNAGCGGATGC 3′; SEQ ID NO: 30) as described in Szostak (supra).
  • the mRNA-DNA-puromycin molecule was then translated using the Ambion rabbit reticulocyte lysate-based translation kit in the presence of 35 S-methionine.
  • the resulting mRNA-DNA-puromycin-protein fusion was purified using Oligo(dT) cellulose, (Type 7, Amersham Pharmacia, Piscataway, N.J.) and a complementary DNA strand was synthesized using reverse transcriptase (SuperscriptTM II, Gibco, Life Technologies, Rockville, Md.) and the RT primers described above (Unisplint-S or flagASA), following the manufacturer's instructions (preferably, a two-minute annealing at 70° C. and a 40 minute reaction at 42° C.).
  • RNA-protein fusion with annealing cDNA obtained for each fragment was next purified on the resin appropriate to its peptide purification tag, i.e., on Ni-NTA agarose (Qiagen, Valencia, Calif.) for the His 6 -tag and M2 Anti-Flag Agarose (Sigma, St. Louis, Mo.) for the FLAG-tag, following the procedures recommended by the manufacturers.
  • the fragment-encoding genetic information recovered by KOH elution was amplified by PCR using Pharmacia Ready-to-Go PCR Beads, 10 pmol of 5′ and 3′ PCR primers, and the following PCR program (Pharmacia, Piscataway, N.J.): Step 1: 95° C.
  • Step 2 95° C. for 30 seconds, 58/62° C. for 30 seconds, 72° C. for 1 minute, 20/25/30 cycles, as required; Step 3: 72° C. for 5 minutes; Step 4: 4° C. until end (typically, 25 cycles).
  • the resulting DNA was cleaved by 5-6 U EarI (New England Biolabs) per ⁇ g DNA; the reaction took place in T4 DNA Ligase Buffer (New England Biolabs) at 37° C., for 1 hour, and was followed by an optional incubation at 70° C. for 15 minutes to inactivate Ear I. Equal amounts of the BC, DE, and FG fragments were combined and ligated to form a full-length 10 Fn3 gene with randomized loops. The ligation required 10 U of fresh EarI (New England Biolabs) and 20 U of T4 DNA Ligase (Promega, Madison, Wis.), and took 1 hour at 37° C. Earl and ligase were then inactivated by a 15 minute incubation at 65° C.
  • sequences of 76 randomly picked clones from the original, randomized, BC r -DE r -FG r library showed no pattern in the randomized loops (data not shown); the amino acid frequency in the library varied in proportion to the number of codons available that encoded each residue, between 1% per position (glutamic acid, methionine, tryptophan) and 14% per position (proline). In contrast, the average probability for a residue in the preserved, beta-sheet framework to have remained as wild type was found to be 99%.
  • RNA-protein fusions were obtained from the master library following the general procedure described in Szostak et al., U.S. Ser. No. 09/007,005 and 09/247,190; Szostak et al., WO98/31700; and Roberts & Szostak, Proc. Natl. Acad. Sci. USA (1997) vol. 94, p. 12297-12302 ( FIG. 8 ), except that affinity purification performed in rounds three to ten used only M2-Sepharose (see below).
  • the master library in the RNA-protein fusion form was subjected to selection for binding to TNF- ⁇ (Pepro Tech, Rocky Hill, N.J.). Two initial protocols were employed: one in which the target was immobilized on an agarose column and one in which the target was immobilized on a BIACORE chip.
  • Two initial protocols were employed: one in which the target was immobilized on an agarose column and one in which the target was immobilized on a BIACORE chip.
  • an extensive optimization of conditions to minimize background binders to the agarose column yielded the favorable buffer conditions of 50 mM HEPES pH 7.4, 0.02% Triton, 100 ⁇ g/ml sheared salmon sperm DNA. In this buffer, the non-specific binding of the 10 F3-RNA fusion to TNF- ⁇ Sepharose was 0.3%.
  • the non-specific binding background of the 10 Fn3-RNA/cDNA library to TNF- ⁇ Sepharose was found to be 0.1%.
  • the library was first preincubated for an hour with underivatized Sepharose to remove any remaining non-specific binders; the flow-through from this pre-clearing was incubated for another hour with TNF- ⁇ Sepharose. The TNF- ⁇ Sepharose was washed for 3-30 minutes.
  • the cDNA component of the complex that had been eluted from the solid support with 0.3 M NaOH or 0.1M KOH was amplified by PCR; a DNA band of the expected size persisted through multiple rounds of selection ( FIG. 9 ); similar results were observed in the two alternative selection protocols, and only the data from the agarose column selection is shown in FIG. 9 .
  • the fusion pools selected in the first eight rounds of selection bound to TNF- ⁇ -Sepharose at levels close to the background ( ⁇ 0.25%) ( FIG. 13 ).
  • the binding of fusion to TNF- ⁇ -Sepharose increased sharply to 0.7%, and, after ten rounds of selection (R10), the binding increased further to 7% ( FIG. 13 ).
  • These selections were carried out using TNF- ⁇ immobilized on Epoxy-Activated SepharoseTM 6B (Amersham Pharmacia) at 10 mg TNF/g Sepharose in 10 mL.
  • TNF- ⁇ -derivatized Sepharose was blocked in Binding Buffer (50 mM HEPES, pH 7.4, 0.02% Triton, 0.1 mg/mL sheared salmon sperm DNA (Ambion)), overnight, at 4° C.
  • Binding Buffer 50 mM HEPES, pH 7.4, 0.02% Triton, 0.1 mg/mL sheared salmon sperm DNA (Ambion)
  • the 10 Fn3-based master library was transcribed, ligated to the puromycin-bearing linker, translated into an mRNA-protein library in the presence of 5-10 ⁇ L/300 ⁇ L 35 S-methionine, affinity purified on Oligo(dT) Cellulose, reverse-transcribed into a DNA/mRNA-protein library, and affinity-purified on M2-Sepharose (for rounds 3-10), as described above. Forty pmol of DNA/mRNA-protein fusion library molecules, the equivalent of 20 copies of 4 ⁇ 10 12 different sequences, were recovered, then subjected to the first round (R1) of the selection.
  • the supernatant was recovered by microcentrifugation through a Micro Bio-Spin® chromatography column (BIO-RAD, Hercules, Calif.), then incubated with 30 ⁇ L of TNF- ⁇ -Sepharose (6 ⁇ M) in 300 ⁇ L of the Binding Buffer for 1 hour at 4° C. (during Rounds 7-10, the Binding Buffer contained an additional 1 mg/mL of BSA).
  • the TNF- ⁇ -Sepharose was recovered on a spin column, then washed with 3 ⁇ 300 ⁇ L of Binding Buffer, eluted with 100 ⁇ L of 0.1 M KOH, and finally neutralized with 1 ⁇ L of 1 M Tris 8.0, 8 ⁇ L of 1 M HCl.
  • Samples of the library, of the TNF- ⁇ -Sepharose before and after the elution, of the washes, and of the elutions were quantified by counting 35 S-methionine in the sample in a scintillation counter.
  • the next round of selection began with the formation of a new DNA/mRNA-protein pool by PCR amplification, which was transcribed, translated, and reverse-transcribed from the PCR product.
  • the DNA pools obtained from the elution after nine and after ten rounds were cloned into the TOPOTM TA®, pCR2.1 cloning vector (Invitrogen, Carlsbad, Calif.) and transformed into E. coli . Between 30 and 100 clones were picked and grown into plasmid minipreps (Qiagen). Thirty-eight clones from R9 and 29 clones from R10 were picked at random and sequenced (DNA Sequencing Core Facility, Massachusetts General Hospital, Dept. of Molecular Biology, Boston, Mass.). The program ClustalW 60 was used to align the resulting protein sequences.
  • the most common motif found in the selected loop sequences is PWA(S/T), which is found in the DE loop of 33 of the 61 clones; the more loosely defined sequence of PW(A/G) is seen in 41/61 clones.
  • NRSGLQS (12/61) SEQ ID NO: 31
  • AQTGHHLHDK 6/61)
  • SEQ ID NO: 32 The sequences selected most frequently in the BC loop.
  • An NRSGLQS BC loop and an AQTGHHLHDK FG loop have not been found in the same molecule, but two clones were found which contain the most frequently isolated sequences on two of the three randomized loops. These clones, T10.06 (BC: NRSGLQS, DE: PWA) and T09.12 (DE: PWA, FG: AQTGHHLHDK), have two of the lowest four dissociation constants from TNF- ⁇ of the clones examined (Table 1).
  • FIG. 18 indicates the residues in the 10 F3 beta sheet that had not been randomized, but nevertheless mutated during selection. This mutagenesis occurred at the frequency of 26-28 of the 61 clones; these mutations are marked with arrows under the wild-type 10 Fn3 sequence and with the letter that identifies the selected residue. In particular, 28 of the 61 clones mutated from Leucine 18 to Arginine or to Glutamine, and 26 clones mutated from Threonine 56 to Isoleucine. FIG. 19 shows the location of these scaffold mutations.
  • the PCR product of the elution after the tenth round of selection was transcribed and translated in vitro, in the presence of 35 S-methionine but without forming the mRNA-protein fusion.
  • the resulting fraction of the free protein bound to TNF- ⁇ -Sepharose, to IL-1 ⁇ -Sepharose, to IL-13-Sepharose at approximately 10 ⁇ M, 30 ⁇ M, and 50 ⁇ M, respectively, and to underivatized Sepharose was compared ( FIG. 20 ), using the procedure described above for DNA/mRNA-protein fusion binding to TNF- ⁇ -Sepharose.
  • the amount of the selected pool bound to each of the targets was measured by scintillation counting of the washed beads.
  • FIG. 20 shows that, whereas the binding of R0 to TNF- ⁇ , IL-1 ⁇ , and IL-13 was similar (2%, 4%, and 3%, respectively), the ten rounds of selection resulted in 32% binding to the targeted TNF- ⁇ , in 3% binding to IL-1 ⁇ , and in 1% binding to IL-13.
  • the absolute and the relative increase of protein binding to TNF- ⁇ demonstrates the ability of the 10 Fn3 scaffold and of the DNA/mRNA-protein fusion-based selection system to select target-specific binders.
  • clone T09.12 was immobilized in a microarray format (as generally described below) and was tested for binding to soluble TNF- ⁇ . Specific binding of TNF- ⁇ to this clone was detected using fluorescence detection ( FIG. 24A ) and mass spectroscopy ( FIG. 24B ). For the mass spectroscopy results, binding assays were carried out in the presence of fetal bovine serum, an exemplary complex biological fluid containing a variety of potential interfering proteins. For fluorescence detection ( FIG.
  • RNA- 10 Fn3 fusion of wild-type 10 Fn3 and of the T09.12 variant (Table 1) was hybridized onto a DNA microarray on which oligonucleotides complementary to the RNA portion of the fusion molecules had been immobilized at 600 micron pitch, with 24 replicate features. After removal of unhybridized fusion by washing, the surface was exposed to biotin-TNF- ⁇ (2.6 ⁇ g/mL in TBS, 0.02% Tween-20, 0.2% BSA), washed, and air-dried.
  • RNA- 10 Fn3 fusion of the T09.12 variant ( FIG. 24B ) and wild-type 10 Fn3 ( FIG. 24C ) was treated with RNase A to generate a fusion between the protein and the DNA linker.
  • the resulting DNA-linked protein was hybridized to a glass coverslip arrayed with an immobilized oligonucleotide complementary to the DNA linker ( FIGS. 24B and 24C ; no fusion was applied in FIG. 24D ).
  • TNF- ⁇ (1.5 mg/mL in 90% v/v PBS/10% fetal bovine serum).
  • the dried chip was spotted with MALDI matrix and analyzed with a Voyager DE MALDI-TOF mass spectrometer (PerSeptive Biosystems).
  • a signal at 17.4 kD which corresponded to the expected molecular mass of monomeric TNF- ⁇ , was detected on the 200 ⁇ m features that contained T09.12 protein ( FIG. 24A ), but not on the features that contained wild-type 10 Fn3 ( FIG. 24B ) nor on the features that did not contain DNA-protein fusion ( FIG. 24C ).
  • Dissociation constants were determined for all the clones that were represented more than once in the two pools generated after nine and after ten rounds of selection, as well as for the only clone that originated from the BC wt -DE wt -FG r library (clone T09.08).
  • biotinylated TNF- ⁇ was prepared using the NHS-LC-LC-Biotin reagent supplied by Pierce (Rockford, Ill.).
  • MALDI-TOF mass spectrometry was used to estimate that more than 80% of the monomeric TNF- ⁇ , and hence more than 99% of the trimer, was biotinylated.
  • each sample was loaded on a pre-soaked, SAM 2R Biotin Capture Membrane (Promega, Madison, Wis.) using a 96 well, Easy-TiterTM ELIFA system (Pierce). Under vacuum, each spot was washed with 200 ⁇ L of HBS pH 7.4, 1% BSA, 0.05% Triton; next the entire membrane was rinsed in the buffer and air-dried. The membrane was exposed with a Storage Phosphor Screen (Molecular Dynamics, Sunnyvale, Calif.) overnight, and the intensities of the resulting individual spots were quantified using a STORM 860 phosphoimager with the ImageQuaNT densitometry program (Molecular Dynamics). The K d of the binding was determined by fitting the equilibrium equation to the resulting binding curve (KaleidaGraph, Synergy Software); the error of the experiment was estimated from 24 independent experiments.
  • the TNF- ⁇ -binding pools after nine and after ten rounds of selection contained numerous different clones, i.e., 38 different sequences in 61 clones sampled. Therefore, further selection, with more stringent binding requirements, was undertaken to recover a subset of these clones with superior TNF- ⁇ binding properties. Consequently, four further rounds of selections (R11-R14) were conducted in solution, where the concentration of the target was controlled more easily.
  • the concentration of TNF- ⁇ was limited to 0.5 nM and the concentration of DNA/mRNA- 10 Fn3 pool to 0.1 nM; in addition, the length and the temperature of the washes of the 10 Fn3/TNF- ⁇ complex bound to streptavidin-coated paramagnetic beads were increased.
  • nM DNA/mRNA- 10 Fn3 fusion library which had been made as described above, was pre-cleared by tumbling for 1 hour at 4° C. with 100 ⁇ L of Dynabeads® M-280 (streptavidin-coated paramagnetic beads; Dynal, Lake Success, N.Y.) that had been pre-blocked in Binding Buffer.
  • the resulting pre-cleared fusion mixture was combined with 0.5 nM biotinylated TNF- ⁇ in 300 ⁇ L of the above Binding Buffer, and the complex incubated at 4° C. for 1 hour.
  • DNA was eluted from the washed DNA/mRNA- 10 Fn3:TNF- ⁇ -biotin:streptavidin-bead complexes with two washes of 100 ⁇ L 0.1 M KOH, and treated as described above for the column-based selection to produce the next generation DNA/mRNA- 10 Fn3 fusion library.
  • Round 14 differed from R11-R13 in that the selection was performed at 30° C. and in the presence of an additional 150 mM NaCl. Except for the elevated temperature, the sequence of washes was the same for R14 as for R13.
  • the selections described herein may also be combined with mutagenesis after all or a subset of the selection steps to further increase library diversity.
  • error-prone PCR was incorporated into the amplification of DNA between rounds (Cadwell and Joyce, PCR Methods Appl 2:28 (1992)). This technique was carried out beginning with the diverse DNA pool eluted after R8 above. This pool was amplified using error-prone PCR, with the pool divided into seven equal parts and mutagenized at the target frequency of 0.8%, 1.6%, 2.4%, 3.2%, 4.0%, 4.8%, and 5.6%. The seven PCR reactions were combined, and cDNA/RNA-protein fusion was made from the mixture and subjected to a round of selection in solution.
  • the randomized loops remained the length of the corresponding, wild-type 10 Fn3 loops.
  • the length of the loops as well as their sequences may be varied, to incorporate favorable mutations in the 10 Fn3 beta-sheet into the wild-type scaffold used for library construction, and to create libraries with randomized beta-sheet scaffolds which will allow selection of structures even more successful at mimicking antibodies.
  • Wild-type 10 Fn3 contains an integrin-binding tripepetide motif, Arginine 78-Glycine 79-Aspartate 80 (the “RGD motif”) at the tip of the FG loop.
  • RGD motif Arginine 78-Glycine 79-Aspartate 80
  • SGE mutant Serine 78-Glycine 79-Glutamate 80
  • This SGE mutant was expressed as an N-terminally His 6 -tagged, free protein in E. coli , and purified to homogeneity on a metal chelate column followed by a size exclusion column.
  • the DNA sequence encoding His 6 - 10 F3(SGE) was cloned into the pET9a expression vector and transformed into BL21 DE3 pLysS cells.
  • the induced culture was further incubated, under the same conditions, overnight (14-18 hours); the bacteria were recovered by standard, low speed centrifugation.
  • the cell pellet was resuspended in 1/50 of the original culture volume of lysis buffer (50 mM Tris 8.0, 0.5 M NaCl, 5% glycerol, 0.05% Triton X-100, and 1 mM PMSF), and the cells were lysed by passing the resulting paste through a Microfluidics Corporation Microfluidizer M110-EH, three times.
  • the lysate was clarified by centrifugation, and the supernatant was filtered through a 0.45 ⁇ m filter followed by filtration through a 0.2 ⁇ m filter.
  • the column was run at 0.8 mL/min, in buffer 1; all fractions that contained a protein of the expected MW were pooled, concentrated 10 ⁇ as described above, then dialyzed into PBS. Endotoxin screens and animal studies were performed on the resulting sample (Toxikon; MA).
  • This 10 Fn3-F c molecule contains the —CH 1 —CH 2 —CH 3 ( FIG. 11 ) or —CH 2 —CH 3 domains of the IgG constant region of the host; in these constructs, the 10 Fn3 domain is grafted onto the N-terminus in place of the IgG V H domain ( FIGS. 11 and 12 ).
  • Such antibody-like constructs are to improve the pharmacokinetics of the protein as well as its ability to harness the natural immune response.
  • the —CH 1 —CH 2 —CH 3 region was first amplified from a mouse liver spleen cDNA library (Clontech), then ligated into the pET25b vector.
  • the primers used in the cloning were 5′ Fc Nest and 3′ 5 Fc Nest, and the primers used to graft the appropriate restriction sites onto the ends of the recovered insert were 5′Fc HIII and 3′Fc Nhe: 5′ Fc Nest (SEQ ID NO:15) 5′GCG GCA GGG TTT GCT TAC TGG GGC CAA GGG 3′; 3′ Fc Nest (SEQ ID NO:16) 5′GGG AGG GGT GGA GGT AGG TCA CAG TCC 3′; 3′ Fc Nhe (SEQ ID NO:17) 5′ TTT GCT AGC TTT ACC AGG AGA GTG GGA GGC 3′; and 5′ Fc HIII (SEQ ID NO:18) 5′ AAA AAG CTT GCC AAA ACG ACA CCC CCA TCT GTC 3′.
  • the oligonucleotides used in the cloning of murine wild-type 10 Fn3 were: Mo 5PCR-NdeI: (SEQ ID NO:19) 5′ CATATGGTTTCTGATATTCCGAGAGATCTGGAG 3′; Mo5PCR-His-NdeI (for an alternative N-terminus with the His 6 purification tag): (SEQ ID NO:20) 5′ CAT ATG CAT CAC CAT CAC CAT CAC GTT TCT GAT ATT CCG AGA G 3′; and Mo3PCR-EcoRI: (SEQ ID NO:21) 5′ GAATTCCTATGTTTTATAATTGATGGAAAC3′.
  • the human equivalents of the clones are constructed using the same strategy with human oligonucleotide sequences.
  • any of the antibody mimics described herein may be immobilized on a solid support, such as a microchip.
  • a solid support such as a microchip.
  • the suitability of the present scaffolds, for example, the 10 Fn3 scaffold, for protein chip applications is the consequence of (1) their ability to support many binding functions which can be selected rapidly on the bench or in an automated setup, and (2) their superior biophysical properties.
  • 10 Fn3 The versatile binding properties of 10 Fn3 are a function of the loops displayed by the Fn3 immunoglobulin-like, beta sandwich fold. As discussed above, these loops are similar to the complementarity determining regions of antibody variable domains and can cooperate in a way similar to those antibody loops in order to bind antigens.
  • 10 Fn3 loops BC for example, residues 21-30
  • DE for example, residues 51-56
  • FG for example, residues 76-87
  • binders in such libraries are then enriched based on their affinity for an immobilized or tagged target, until a small population of high affinity binders are generated. Also, error-prone PCR and recombination can be employed to facilitate affinity maturation of selected binders. Due to the rapid and efficient selection and affinity maturation protocols, binders to a large number of targets can be selected in a short time.
  • the 10 F3 domain As a scaffold for binders to be immobilized on protein chips, the 10 F3 domain has the advantage over antibody fragments and single-chain antibodies of being smaller and easier to handle.
  • 10 Fn3 unlike single-chain scaffolds or isolated variable domains of antibodies, which vary widely in their stability and solubility, and which require an oxidizing environment to preserve their structurally essential disulfide bonds, 10 Fn3 is extremely stable, with a melting temperature of 110° C., and solubility at a concentration >16 mg/mL.
  • the 10 Fn3 scaffold also contains no disulfides or free cysteines; consequently, it is insensitive to the redox potential of its environment.
  • a further advantage of 10 F3 is that its antigen-binding loops and N-terminus are on the edge of the beta-sandwich opposite to the C-terminus; thus the attachment of a 10 Fn3 scaffold to a chip by its C-terminus aligns the antigen-binding loops, allowing for their greatest accessibility to the solution being assayed. Since 10 Fn3 is a single domain of only 94 amino acid residues, it is also possible to immobilize it onto a chip surface at a higher density than is used for single-chain antibodies, with their approximately 250 residues. In addition, the hydrophilicity of the 10 F3 scaffold, which is reflected in the high solubility of this domain, minimizes unwanted binding of 10 Fn3 to a chip surface.
  • the stability of the 10 F3 scaffold as well as its suitability for library formation and selection of binders are likely to be shared by the large, Fn3-like class of protein domains with an immunoglobulin-like fold, such as the domains of tenascin, N-cadherin, E-cadherin, ICAM, titin, GCSF-R, cytokine receptor, glycosidase inhibitor, and antibiotic chromoprotein.
  • the key features shared by all such domains are a stable framework provided by two beta-sheets, which are packed against each other and which are connected by at least three solvent-accessible loops per edge of the sheet; such loops can be randomized to generate a library of potential binders without disrupting the structure of the framework (as described above).
  • any of these loops may be immobilized alone or in combination with other loops onto a solid support surface.
  • antibody mimics such as Fn3-based antibody mimics
  • a number of exemplary techniques may be utilized.
  • antibody mimics may be immobilized as RNA-protein fusions by Watson-Crick hybridization of the RNA moiety of the fusion to a base complementary DNA immobilized on the chip surface (as described, for example, in Addressable Protein Arrays, U.S. Ser. No. 60/080,686; U.S. Ser. No. 09/282,734; and WO 99/51773; and Methods for Encoding and Sorting In Vitro Translated Proteins, U.S. Ser. No. 60/151,261 and U.S. Ser. No. 09/648,040).
  • antibody mimics can be immobilized as free proteins directly on a chip surface.
  • Manual as well as robotic devices may be used for deposition of the antibody mimics on the chip surface.
  • Spotting robots can be used for deposition of antibody mimics with high density in an array format (for example, by the method of Lueking et al., Anal Biochem. 1999 May 15; 270(1): 103-11).
  • Different methods may also be utilized for anchoring the antibody mimic on the chip surface.
  • a number of standard immobilization procedures may be used including those described in Methods in Enzymology (K. Mosbach and B. Danielsson, eds.), vols. 135 and 136, Academic Press, Orlando, Fla., 1987; Nilsson et al., Protein Expr. Purif.
  • Oriented immobilization of antibody mimics can help to increase the binding capacity of chip-bound antibody mimics.
  • Exemplary approaches for achieving oriented coupling are described in Lu et al., The Analyst (1996), vol. 121, p. 29R-32R; and Turkova, J Chromatogr B Biomed Sci App. 1999 Feb. 5; 722(1-2): 11-31.
  • any of the methods described herein for anchoring antibody mimics to chip surfaces can also be applied to the immobilization of antibody mimics on beads, or other supports.
  • Selected populations of scaffold-binders may be used for detection and/or quantitation of analyte targets, for example, in samples such as biological samples.
  • selected scaffold-binders to targets of interest are immobilized on an appropriate support to form multi-featured protein chips.
  • a sample is applied to the chip, and the components of the sample that associate with the binders are identified based on the target-specificity of the immobilized binders.
  • one or more components may be simultaneously identified or quantitated in a sample (for example, as a means to carry out sample profiling).
  • Methods for target detection allow measuring the levels of bound protein targets and include, without limitation, radiography, fluorescence scanning, mass spectroscopy (MS), and surface plasmon resonance (SPR).
  • Autoradiography using a phosphorimager system can be used for detection and quantification of target protein which has been radioactively labeled, e.g., using 35 S methionine.
  • Fluorescence scanning using a laser scanner may be used for detection and quantification of fluorescently labeled targets.
  • fluorescence scanning may be used for the detection of fluorescently labeled ligands which themselves bind to the target protein (e.g., fluorescently labeled target-specific antibodies or fluorescently labeled streptavidin binding to target-biotin, as described below).
  • fluorescently labeled ligands which themselves bind to the target protein (e.g., fluorescently labeled target-specific antibodies or fluorescently labeled streptavidin binding to target-biotin, as described below).
  • Mass spectroscopy can be used to detect and identify bound targets based on their molecular mass. Desorption of bound target protein can be achieved with laser assistance directly from the chip surface as described below. Mass detection also allows determinations, based on molecular mass, of target modifications including post-translational modifications like phosophorylation or glycosylation. Surface plasmon resonance can be used for quantification of bound protein targets where the scaffold-binder(s) are immobilized on a suitable gold-surface (for example, as obtained from Biacore, Sweden).
  • binders in this case, Fn-binders specific for the protein, TNF- ⁇
  • TNF- ⁇ binders specific for the protein
  • This example is provided for the purpose of illustrating the invention, and should not be construed as limiting.
  • an 10 Fn3-based selection was performed against TNF- ⁇ , using a library of human 10 Fn3 variants with randomized loops BC, DE, and FG.
  • the library was constructed from three DNA fragments, each of which contained nucleotide sequences that encoded approximately one third of human 10 Fn3, including one of the randomized loops.
  • the DNA sequences that encoded the loop residues listed above were rebuilt by oligonucleotide synthesis, so that the codons for the residues of interest were replaced by (NNS) n , where N represents any of the four deoxyribonucleotides (A, C, G, or T), and S represents either C or G.
  • N represents any of the four deoxyribonucleotides (A, C, G, or T)
  • S represents either C or G.
  • the C-terminus of each fragment contained the sequence for the FLAG purification tag.
  • each DNA fragment was transcribed, ligated to a puromycin-containing DNA linker, and translated in vitro, as described by Szostak et al. (Roberts and Szostak, Proc. Natl. Acad. Sci. USA 94:12297, 1997; Szostak et al., U.S. Ser. No. 09/007,005 and U.S. Ser. No. 09/247,190; Szostak et al., WO98/31700), to generate an mRNA-peptide fusion, which was then reverse-transcribed into a DNA-mRNA-peptide fusion.
  • the binding of the FLAG-tagged peptide to M2 agarose separated full-length fusion molecules from those containing frameshifts or superfluous stop codons; the DNA associated with the purified full-length fusion was amplified by PCR, then the three DNA fragments were cut by Ear I restriction endonuclease and ligated to form the full length template.
  • the template was transcribed, ligated to puromycin-containing DNA linkers, and translated to generate a 10 Fn3-RNA/cDNA library, which was then reverse-transcribed to yield the DNA-mRNA-peptide fusion library which was subsequently used in the selection.
  • TNF- ⁇ binders were placed in 50 mM HEPES, pH 7.4, 0.02% Triton-X, 0.1 mg/mL salmon sperm DNA.
  • the PROfusionTM library was incubated with Sepharose-immobilized TNF- ⁇ ; after washing, the DNA associated with the tightest binders was eluted with 0.1 M KOH, amplified by PCR, and transcribed, ligated, translated, and reverse-transcribed into the starting material for the next round of selection.
  • an oligonucleotide capture probe was prepared with an automated DNA synthesizer (PE BioSystems Expedite 8909) using the solid-support phosphoramidite approach. All reagents were obtained from Glen Research. Synthesis was initiated with a solid support containing a disulfide bond to eventually provide a 3′-terminal thiol functionality. The first four monomers to be added were hexaethylene oxide units, followed by 20 T monomers. The 5′-terminal DMT group was not removed.
  • the capture probe was cleaved from the solid support and deprotected with ammonium hydroxide, concentrated to dryness in a vacuum centrifuge, and purified by reverse-phase HPLC using an acetonitrile gradient in triethylammonium acetate buffer. Appropriate fractions from the HPLC were collected, evaporated to dryness in a vacuum centrifuge, and the 5′-terminal DMT group was removed by treatment with 80% AcOH for 30 minutes. The acid was removed by evaporation, and the oligonucleotide was then treated with 100 mM DTT for 30 minutes to cleave the disulfide bond. DTT was removed by repeated extraction with EtOAc. The oligonucleotide was ethanol precipitated from the remaining aqueous layer and checked for purity by reverse-phase HPLC.
  • the 3′-thiol capture probe was adjusted to 250 ⁇ M in degassed 1 ⁇ PBS buffer and applied as a single droplet (75 ⁇ L) to a 9 ⁇ 9 mm gold-coated chip (Biacore) in an argon-flushed chamber containing a small amount of water. After 18 hours at room temperature, the capture probe solution was removed, and the functionalized chip was washed with 50 mL 1 ⁇ PBS buffer (2 ⁇ for 15 minutes each) with gentle agitation, and then rinsed with 50 mL water (2 ⁇ for 15 minutes each) in the same fashion. Remaining liquid was carefully removed and the functionalized chips were either used immediately or stored at 4° C. under argon.
  • MALDI-TOF matrix (15 mg/mL 3,5-dimethoxy-4-hydroxycinnamic acid in 1:1 ethanol/10% formic acid in water) was uniformly applied to the gold chips with a high-precision 3-axis robot (MicroGrid, BioRobotics).
  • a 16-pin tool was used to transfer the matrix from a 384-well microtiter plate to the chips, producing 200 micron diameter features with a 600 micron pitch.
  • the gold chips were individually placed on a MALDI sample stage modified to keep the level of the chip the same as the level of the stage, thus allowing proper flight distance.
  • the instrument's video monitor and motion control system were used to direct the laser beam to individual matrix features.
  • FIGS. 14 and 15 show the mass spectra from the 10 Fn3 fusion chip and the non-fusion chip, respectively. In each case, a small number of 200 micron features were analyzed to collect the spectra, but FIG. 15 required significantly more acquisitions.
  • the signal at 17.4 kDa corresponds to TNF- ⁇ monomer.
  • Pre-cleaned 1 ⁇ 3 inch glass microscope slides (Goldseal, #3010) were treated with Nanostrip (Cyantek) for 15 minutes, 10% aqueous NaOH at 70° C. for 3 minutes, and 1% aqueous HCl for 1 minute, thoroughly rinsing with deionized water after each reagent.
  • the slides were then dried in a vacuum desiccator over anhydrous calcium sulfate for several hours.
  • a 1% solution of aminopropytrimethoxysilane in 95% acetone/5% water was prepared and allowed to hydrolyze for 20 minutes.
  • the glass slides were immersed in the hydrolyzed silane solution for 5 minutes with gentle agitation.
  • silane was removed by subjecting the slides to ten 5-minute washes, using fresh portions of 95% acetone/5% water for each wash, with gentle agitation. The slides were then cured by heating at 110° C. for 20 minutes. The silane treated slides were immersed in a freshly prepared 0.2% solution of phenylene 1,4-diisothiocyanate in 90% DMF/10% pyridine for two hours, with gentle agitation. The slides were washed sequentially with 90% DMF/10% pyridine, methanol, and acetone. After air drying, the functionalized slides were stored at 0° C. in a vacuum desiccator over anhydrous calcium sulfate. Similar results were obtained with commercial amine-reactive slides (3-D Link, Surmodics).
  • Oligonucleotide capture probes were prepared with an automated DNA synthesizer (PE BioSystems Expedite 8909) using conventional phosphoramidite chemistry. All reagents were from Glen Research. Synthesis was initiated with a solid support bearing an orthogonally protected amino functionality, whereby the 3′-terminal amine is not unmasked until final deprotection step. The first four monomers to be added were hexaethylene oxide units, followed by the standard A, G, C and T monomers.
  • the purified, amine-labeled capture oligos were adjusted to a concentration of 250 ⁇ M in 50 mM sodium carbonate buffer (pH 9.0) containing 10% glycerol.
  • the probes were spotted onto the amine-reactive glass surface at defined positions in a 5 ⁇ 5 ⁇ 6 array pattern with a 3-axis robot (MicroGrid, BioRobotics).
  • a 16-pin tool was used to transfer the liquid from 384-well microtiter plates, producing 200 micron features with a 600 micron pitch. Each sub-grid of 24 features represents a single capture probe (i.e., 24 duplicate spots).
  • the arrays were incubated at room temperature in a moisture-saturated environment for 12-18 hours.
  • the attachment reaction was terminated by immersing the chips in 2% aqueous ammonium hydroxide for five minutes with gentle agitation, followed by rinsing with distilled water (3 ⁇ for 5 minutes each).
  • the array was finally soaked in 10 ⁇ PBS solution for 30 minutes at room temperature, and then rinsed again for 5 minutes in distilled water.
  • Each of the capture probes possesses a 3′-amino terminus and four hexaethylene oxide spacer units, as described above.
  • Recombinant human TNF- ⁇ (500 ⁇ g, lyophilized, from PreproTech) was taken up in 230 ⁇ L 1 ⁇ PBS and dialyzed against 700 mL stirred 1 ⁇ PBS at 4° C. for 18 hours in a Microdialyzer unit (3,500 MWCO, Pierce).
  • the dialyzed TNF- ⁇ was treated with EZ-Link NHS-LC-LC biotinylation reagent (20 ⁇ g, Pierce) for 2 hours at 0° C., and again dialyzed against 700 mL stirred 1 ⁇ PBS at 4° C. for 18 hours in a Microdialyzer unit (3,500 MWCO, Pierce).
  • the resulting conjugate was analyzed by MALDI-TOF mass spectrometry and was found to be almost completely functionalized with a single biotin moiety.
  • the protein microarray surface was passivated by treatment with 1 ⁇ TBS containing 0.02% Tween-20 and 0.2% BSA (200 ⁇ L) for 60 minutes.
  • Biotinylated TNF- ⁇ 100 nM concentration made up in the passivation buffer
  • the microarray was washed with 1 ⁇ TBS containing 0.02% Tween-20 (3 ⁇ 50 mL, 5 minutes each wash).
  • Fluorescently labeled streptavidin (2.5 ⁇ g/mL Alexa 546-streptavidin conjugate from Molecular Probes, made up in the passivation buffer) was contacted with the microarray for 60 minutes.
  • the microarray was washed with 1 ⁇ TBS containing 0.02% Tween-20 (2 ⁇ 50 mL, 5 minutes each wash) followed by a 3 minute rinse with 1 ⁇ TBS. Traces of liquid were removed by centrifugation, and the slide was allowed to air-dry at room temperature.
  • Fluorescence laser scanning was performed with a GSI Lumonics ScanArray 5000 system-using 10 ⁇ M pixel resolution and preset excitation and emission wavelengths for Alexa 546 dye.
  • Phosphorimage analysis was performed with a Molecular Dynamics Storm system. Exposure time was 48 hours with direct contact between the microarray and the phosphor storage screen. Phosphorimage scanning was performed at the 50 micron resolution setting, and data was extracted with ImageQuant v.4.3 software.
  • FIGS. 16 and 17 are the phosphorimage and fluorescence scan, respectively, of the same array.
  • the phosphorimage shows where the 10 Fn3 fusion hybridized based on the 35 S methionine signal.
  • the fluorescence scan shows where the labeled TNF- ⁇ bound.

Abstract

Disclosed herein are proteins that include an immunoglobulin fold and that can be used as scaffolds. Also disclosed herein are nucleic acids encoding such proteins and the use of such proteins in diagnostic methods and in methods for evolving novel compound-binding species and their ligands.

Description

    CROSS REFERENCE TO RELATED APPLICATIONS
  • This application is a continuation of U.S. patent application Ser. No. 11/483,918, filed Jul. 7, 2006, which is a continuation of U.S. patent application Ser. No. 10/728,078, filed Dec. 3, 2003, which is a continuation of U.S. patent application Ser. No. 09/688,566, filed Oct. 16, 2000, which is a continuation-in-part of U.S. patent application Ser. No. 09/515,260, filed Feb. 29, 2000, now U.S. Pat. No. 6,818,418, which is a continuation-in-part of U.S. patent application Ser. No. 09/456,693, filed Dec. 9, 1999, now abandoned, which claims the benefit of U.S. Provisional application 60/111,737, filed Dec. 10, 1998.
  • BACKGROUND OF THE INVENTION
  • This invention relates to protein scaffolds useful, for example, for the generation of products having novel binding characteristics.
  • Proteins having relatively defined three-dimensional structures, commonly referred to as protein scaffolds, may be used as reagents for the design of engineered products. These scaffolds typically contain one or more regions which are amenable to specific or random sequence variation, and such sequence randomization is often carried out to produce libraries of proteins from which desired products may be selected. One particular area in which such scaffolds are useful is the field of antibody design.
  • A number of previous approaches to the manipulation of the mammalian immune system to obtain reagents or drugs have been attempted. These have included injecting animals with antigens of interest to obtain mixtures of polyclonal antibodies reactive against specific antigens, production of monoclonal antibodies in hybridoma cell culture (Koehler and Milstein, Nature 256:495, 1975), modification of existing monoclonal antibodies to obtain new or optimized recognition properties, creation of novel antibody fragments with desirable binding characteristics, and randomization of single chain antibodies (created by connecting the variable regions of the heavy and light chains of antibody molecules with a flexible peptide linker) followed by selection for antigen binding by phage display (Clackson et al., Nature 352:624, 1991).
  • In addition, several non-immunoglobulin protein scaffolds have been proposed for obtaining proteins with novel binding properties. For example, a “minibody” scaffold, which is related to the immunoglobulin fold, has been designed by deleting three beta strands from a heavy chain variable domain of a monoclonal antibody (Tramontano et al., J. Mol. Recognit. 7:9, 1994). This protein includes 61 residues and can be used to present two hypervariable loops. These two loops have been randomized and products selected for antigen binding, but thus far the framework appears to have somewhat limited utility due to solubility problems. Another framework used to display loops has been tendamistat, a 74 residue, six-strand beta sheet sandwich held together by two disulfide bonds (McConnell and Hoess, J. Mol. Biol. 250:460, 1995). This scaffold includes three loops, but, to date, only two of these loops have been examined for randomization potential.
  • Other proteins have been tested as frameworks and have been used to display randomized residues on alpha helical surfaces (Nord et al., Nat. Biotechnol. 15:772, 1997; Nord et al., Protein Eng. 8:601, 1995), loops between alpha helices in alpha helix bundles (Ku and Schultz, Proc. Natl. Acad. Sci. USA 92:6552, 1995), and loops constrained by disulfide bridges, such as those of the small protease inhibitors (Markland et al., Biochemistry 35:8045, 1996; Markland et al., Biochemistry 35:8058, 1996; Rottgen and Collins, Gene 164:243, 1995; Wang et al., J. Biol. Chem. 270:12250, 1995).
  • SUMMARY OF THE INVENTION
  • The present invention provides a new family of proteins capable of evolving to bind any compound of interest. These proteins, which generally make use of a scaffold derived from a fibronectin type III (Fn3) or Fn3-like domain, function in a manner characteristic of natural or engineered antibodies (that is, polyclonal, monoclonal, or single-chain antibodies) and, in addition, possess structural advantages. Specifically, the structure of these antibody mimics has been designed for optimal folding, stability, and solubility, even under conditions that normally lead to the loss of structure and function in antibodies.
  • These antibody mimics may be utilized for the purpose of designing proteins which are capable of binding to virtually any compound (for example, any protein) of interest. For example, the 10Fn3-based molecules described herein may be used as scaffolds which are subjected to directed evolution to form a population with one or more randomized Fn3 loops that are analogous by position and structure to the complementarity-determining regions (CDRs) of an antibody variable region, and/or to randomize Fn3's other three solvent exposed loops. Such a directed evolution approach results in the production of antibody-like molecules with high affinities for antigens of interest. In addition, the scaffolds described herein may be used to display defined exposed loops (for example, loops previously randomized and selected on the basis of antigen binding) in order to direct the evolution of molecules that bind to such introduced loops. A selection of this type may be carried out to identify recognition molecules for any individual CDR-like loop or, alternatively, for the recognition of two or all three CDR-like loops combined into a non-linear epitope.
  • Accordingly, in a first aspect, the present invention features randomized or mutated scaffold proteins. In particular, the invention features a non-antibody protein including a domain having an immunoglobulin-like fold, the non-antibody protein deriving from a reference protein by having a mutated amino acid sequence, wherein the non-antibody protein binds with a Kd at least as tight as 1 μM to a compound that is not bound as tightly by the reference protein.
  • In addition, the invention features a non-antibody protein deriving from a scaffold protein including a domain having an immunoglobulin-like fold, wherein the amino acid sequence of the domain in the derived protein is more than 50% identical to the amino acid sequence of the domain in the scaffold protein.
  • In yet another embodiment, the invention features a protein that includes a fibronectin type III domain having at least one randomized loop, the protein being characterized by the ability of the Fn3 domain to bind to a compound that is not bound by the corresponding naturally-occurring Fn3 domain.
  • In various preferred embodiments, any of these proteins of the invention bind to their target compounds with a Kd at least as tight as 500 nM, preferably, with a Kd at least as tight as 100 nM or 10 nM, and, more preferably, with a Kd at least as tight as 1 nM, 500 μM, 100 μM, or even 20 μM. The protein preferably contains one, two, or three mutated loops and at least one of the loops, and preferably two or all three of the loops, contributes to the binding of the protein to the compound. Additionally, the reference protein preferably lacks disulfide bonds, and the derivative protein may have at least one disulfide bond.
  • With respect to certain embodiments, the domain having an immunoglobulin-like fold preferably has a molecular mass less than 10 kD or greater than 7.5 kD, and, more preferably, has a molecular mass between 7.5-10 kD. The proteins of the invention may be monomers under physiological conditions or may be multimers, for example, dimers. In other preferred embodiments, the reference protein used to derive a mutated protein of the invention is a naturally-occurring mammalian protein (for example, a human protein); and the domain having an immunoglobulin-like fold is mutated and includes up to 50%, and preferably up to 34%, mutated amino acids as compared to the immunoglobulin-like fold of the reference protein. In addition, the domain having the immunoglobulin-like fold preferably consists of approximately 50-150 amino acids, and more preferably consists of approximately 50 amino acids.
  • Derivative proteins of the invention may be derived from any appropriate reference protein including, but not limited to, the preferred proteins, fibronectin or a fibronectin dimer, tenascin, N-cadherin, E-cadherin, ICAM, titin, GCSF-receptor, cytokine receptor, glycosidase inhibitor, antibiotic chromoprotein, myelin membrane adhesion molecule P0, CD8, CD4, CD2, class I MHC, T-cell antigen receptor, CD1, C2 and I-set domains of VCAM-1,1-set immunoglobulin domain of myosin-binding protein C, I-set immunoglobulin domain of myosin-binding protein H, I-set immunoglobulin domain of telokin, NCAM, twitchin, neuroglian, growth hormone receptor, erythropoietin receptor, prolactin receptor, interferon-gamma receptor, β-galactosidase/glucuronidase, β-glucuronidase, transglutaminase, T-cell antigen receptor, superoxide dismutase, tissue factor domain, cytochrome F, green fluorescent protein, GroEL, and thaumatin.
  • In further preferred embodiments of Fn3 domain-containing proteins, the fibronectin type III domain is a mammalian (for example, a human) fibronectin type III domain; and the protein includes the tenth module of the fibronectin type III (10Fn3) domain. In such proteins, compound binding is preferably mediated by either one, two, or three 10Fn3 loops. In other preferred embodiments, the second (DE) loop of 10Fn3 may be extended in length relative to the naturally-occurring module, or the 10Fn3 may lack an integrin-binding motif. In these molecules, the integrin-binding motif may be replaced by an amino acid sequence in which a polar amino acid-neutral amino acid-acidic amino acid sequence (in the N-terminal to C-terminal direction) replaces the integrin-binding motif; alternatively, one preferred sequence is serine-glycine-glutamate. In another preferred embodiment, the fibronectin type III domain-containing proteins of the invention lack disulfide bonds.
  • Any of the proteins of the invention (for example, the fibronectin type III domain-containing proteins) may be formulated as part of a fusion protein. If the fusion protein is to be used for compound binding or compound binding selections, the fusion protein includes a heterologous protein that does not itself bind to the compound of interest. The heterologous protein may, for example, be an antibody or antibody domain (such as an immunoglobulin Fc domain), a complement protein, a toxin protein, or an albumin protein. In addition, any of the proteins of the invention (for example, the fibronectin type III domain proteins) may be covalently bound to a nucleic acid (for example, an RNA), and the nucleic acid may encode the protein. Moreover, the protein may be a multimer, or, particularly if it lacks an integrin-binding motif, it may be formulated in a physiologically-acceptable carrier.
  • The present invention also features proteins that include a fibronectin type III domain having at least one mutation in a β-sheet sequence. Again, these proteins are characterized by their ability to bind to compounds that are not bound or are not bound as tightly by the corresponding naturally-occurring fibronectin domain.
  • Any of the proteins of the invention may be immobilized on a solid support (for example, a bead or chip), and these proteins may be arranged in any configuration on the solid support, including an array.
  • In a related aspect, the invention further features nucleic acids encoding any of the proteins of the invention. In preferred embodiments, the nucleic acid is DNA or RNA.
  • In another related aspect, the invention also features a method for generating a protein which includes a fibronectin type III domain and which is pharmaceutically acceptable to a mammal, involving removing the integrin-binding domain of said fibronectin type III domain. This method may be applied to any of the fibronectin type III domain-containing proteins described above and is particularly useful for generating proteins for human therapeutic applications. The invention also features such fibronectin type III domain-containing proteins which lack integrin-binding domains.
  • In yet another related aspect, the invention features methods of obtaining derivative non-antibody proteins which bind to compounds of interest. One such method involves: (a) providing a non-antibody scaffold protein including an immunoglobulin-like fold, wherein the scaffold protein does not bind to the compound with a Kd as tight as 1 μM; (b) generating mutated derivatives of the non-antibody scaffold protein, thereby producing a library of mutated proteins; (c) contacting the library with the compound; (d) selecting from the library at least one derivative protein which binds to the compound with a Kd at least as tight as 1 μM; and (e) optionally repeating steps (b)-(d) substituting for the non-antibody scaffold protein in repeated step (b) the product from the previous step (d). This technique may also be carried out with any of the proteins of the invention (for example, any of the fibronectin type III domain-containing proteins).
  • In yet other related aspects, the invention features screening methods which may be used to obtain or evolve randomized or mutated proteins of the invention capable of binding to compounds of interest, or to obtain or evolve compounds (for example, proteins) capable of binding to a particular protein containing a randomized or mutated motif. In addition, the invention features screening procedures which combine these two methods, in any order, to obtain either compounds or proteins of interest.
  • In particular, a first screening method, useful for the isolation or identification of randomized or mutated proteins of interest, involves: (a) contacting a compound of interest with a candidate protein, the candidate protein being a derivative non-antibody protein including a domain having an immunoglobulin-like fold, the non-antibody protein deriving from a reference protein by having a mutated amino acid sequence wherein the non-antibody protein binds with a Kd at least as tight as 1 μM to a compound that is not bound as tightly by the reference protein, wherein the contacting is carried out under conditions that allow compound-protein complex formation; and (b) obtaining, from the complex, the derivative protein that binds to the compound. This general technique may also be carried out with a fibronectin type III domain protein having at least one randomized or mutated loop.
  • The second screening method is for isolating or identifying a compound which binds to a protein of the invention. This method begins with a non-antibody protein including a domain having an immunoglobulin-like fold and deriving from a reference protein by having a mutated amino acid sequence, wherein the non-antibody protein binds with a Kd at least as tight as 1 μM to a compound that is not bound as tightly by the reference protein. This derivative protein is then contacted with a candidate compound, wherein the contacting is carried out under conditions that allow compound-protein complex formation, and the compound which binds to the derivative protein is obtained from the complex. Again, this general technique may be carried out with any protein of the invention, for example, a protein with a mutated fibronectin type III domain.
  • In addition, the invention features diagnostic methods which employ the proteins of the invention (for example, fibronectin type III scaffold proteins and their derivatives). Such diagnostic methods may be carried out on a sample (for example, a biological sample) to detect one analyte or to simultaneously detect many different analytes in the sample. The method may employ any of the scaffold molecules described herein. Preferably, the method involves (a) contacting the sample with a protein of the invention that binds to the compound analyte, the contacting being carried out under conditions that allow compound-protein complex formation; and (b) detecting the complex, and therefore the compound in the sample. In addition, this method may be used to quantitate, as well as detect, compound levels in a sample.
  • In preferred embodiments of any of the selection or diagnostic methods described herein, the protein of the invention binds to its target compound with a Kd at least as tight as 1 μM or 500 nM, preferably, with a Kd at least as tight as 100 nM or 10 nM, and, more preferably, with a Kd at least as tight as 1 nM, 500 pM, 100 pM, or even 20 pM. The protein preferably contains one, two, or three mutated loops and at least one of the loops, and preferably two or all three of the loops contributes to the binding of the protein to the compound. Additionally, the reference protein preferably lacks disulfide bonds, and the derivative protein may have at least one disulfide bond.
  • With respect to certain embodiments of the methods, the domain having an immunoglobulin-like fold preferably has a molecular mass less than 10 kD or greater than 7.5 kD, and, more preferably, has a molecular mass between 7.5-10 kD. The proteins of the invention may be monomers under physiological conditions or may be multimers, for example, dimers. In other preferred embodiments, the reference protein used to derive a mutated protein of the invention is a naturally-occurring mammalian protein (for example, a human protein); and the domain having an immunoglobulin-like fold is mutated and includes up to 50%, and preferably up to 34%, mutated amino acids as compared to the immunoglobulin-like fold of the reference protein. In addition, the domain having an immunoglobulin-like fold preferably consists of approximately 50-150 amino acids, and more preferably consists of approximately 50 amino acids.
  • Derivative proteins used in the methods of the invention may be derived from any appropriate reference protein including, but not limited to, the preferred proteins, fibronectin or a fibronectin dimer, tenascin, N-cadherin, E-cadherin, ICAM, titin, GCSF-receptor, cytokine receptor, glycosidase inhibitor, antibiotic chromoprotein, myelin membrane adhesion molecule P0, CD8, CD4, CD2, class I MHC, T-cell antigen receptor, CD1, C2 and I-set domains of VCAM-1,1-set immunoglobulin domain of myosin-binding protein C, I-set immunoglobulin domain of myosin-binding protein H, I-set immunoglobulin domain of telokin, NCAM, twitchin, neuroglian, growth hormone receptor, erythropoietin receptor, prolactin receptor, interferon-gamma receptor, β-galactosidase/glucuronidase, β-glucuronidase, transglutaminase, T-cell antigen receptor, superoxide dismutase, tissue factor domain, cytochrome F, green fluorescent protein, GroEL, and thaumatin.
  • In addition, the steps of the selection methods described herein may be repeated with further mutation or randomization being carried out between cycles. For example, for the methods involving a protein having a mutated or randomized fibronectin type III domain, at least one loop of the fibronectin type III domain of the protein obtained in step (b) may be mutated and steps (a) and (b) repeated using the further randomized protein, or the compound obtained in step (b) may be modified and steps (a) and (b) repeated using the further modified compound. In these methods, the compound is preferably a protein, and the fibronectin type III domain is preferably a mammalian (for example, a human) fibronectin type III domain. In other preferred embodiments, the protein includes the tenth module of the fibronectin type III domain (10Fn3), and binding is mediated by one, two, or three 10Fn3 loops. In addition, the second (DE) loop of 10Fn3 may be extended in length relative to the naturally-occurring module, or 10Fn3 may lack an integrin-binding motif. Again, as described above, the integrin-binding motif may be replaced by an amino acid sequence in which a basic amino acid-neutral amino acid-acidic amino acid sequence (in the N-terminal to C-terminal direction) replaces the integrin-binding motif; alternatively, one preferred replacement sequence is serine-glycine-glutamate.
  • The selection and diagnostic methods described herein may be carried out using any of the proteins of the invention (for example, a fibronectin type III domain-containing protein). In addition, any of these proteins may be formulated as part of a fusion protein with a heterologous protein (for example, an antibody or antibody domain (including an immunoglobulin Fc domain) that does not itself bind the compound of interest, or a complement protein, toxin protein, or albumin protein). In addition, selections and diagnostic methods may be carried out using the proteins of the invention (for example, the fibronectin type III domain proteins) covalently bound to nucleic acids (for example, RNAs or any nucleic acid which encodes the protein). Moreover, the selections and diagnostic methods may be carried out using these proteins (for example, the fibronectin domain-containing proteins) as monomers or as multimers, such as dimers.
  • Preferably, the selections and diagnostic methods involve the immobilization of the binding target on a solid support. Preferred solid supports include columns (for example, affinity columns, such as agarose-based affinity columns), microchips, or beads. Alternatively, the proteins (for example, the Fn3 domain-containing proteins) may be immobilized and contacted with one or more potential binding targets.
  • For the diagnostic methods, the compound is often a protein, but may also be any other analyte in a sample. Detection may be accomplished by any standard technique including, without limitation, radiography, fluorescence detection, mass spectroscopy, or surface plasmon resonance.
  • In a final aspect, the invention features a non-antibody protein that binds tumor necrosis factor-α (TNF-α) with a Kd at least as tight as 1 μM, the protein having a sequence that is less than 20% identical to TNF-α receptor (for example, a naturally-occurring TNF-α receptor, such as a mammalian or human TNF-α receptor).
  • In preferred embodiments, this protein includes a mutated fibronectin type III domain and the protein is mutated in the DE, BC, and FG loops. Preferably, the mutated FG loop is the same length as the wild-type FG loop. In other preferred embodiments, the protein includes an immunoglobulin-like fold (preferably, having a molecular mass less than 10 kD, greater than 7.5 kD, or between 7.5-10 kD) that consists of approximately 50-150 amino acids, and preferably, approximately 50 amino acids.
  • The TNF-α binders according to the invention bind TNF-α with a Kd at least as tight as 1 μM, preferably, at least as tight as 500 nM, 100 nM, or 10 nM, more preferably, at least as tight as 1 nM or 500 pM, and, most preferably, at least as tight as 100 pM or even 20 pM. Preferably, these proteins contain one, two, or three mutated loops, and at least one, and preferably two or all three of the loops, contribute to the binding of the non-antibody protein to TNF-α. In other preferred embodiments, the non-antibody protein has at least one disulfide bond, and the non-antibody protein is a monomer or dimer under physiological conditions.
  • The TNF-α binders may be immobilized on a solid support (for example, a chip or bead), and may be part of an array. In addition, any of the TNF-α binders may be joined to a heterologous protein (for example, a heterologous protein that is an antibody or an antibody domain that does not bind TNF-α, an immunoglobulin Fc domain, a complement protein, or an albumin protein).
  • If desired, the protein may include a mutated fibronectin type III domain (for example, one derived from a human fibronectin type III domain, such as a mutated tenth module of the fibronectin type III domain (10Fn3)). In addition, the protein may lack an 10Fn3 integrin-binding motif. TNF-α binders preferably include a non-naturally occurring sequence in a loop of 10Fn3 (for example, the loop sequence PW(A/G), and may include a non-naturally occurring sequence in a β-sheet of 10Fn3. Particularly preferred TNF-α binders of the invention are shown in FIG. 25 (SEQ ID NOS: 34-140).
  • In addition, in related aspects, the invention features nucleic acids encoding any of the TNF-α binding proteins of the invention, as well as a loop structure on any protein that includes any one of the amino acid sequences of FIG. 25 (SEQ ID NOS: 34-140).
  • As used herein, by “non-antibody protein” is meant a protein that is not produced by the B cells of a mammal either naturally or following immunization of a mammal. This term also excludes antibody fragments of more than 100 amino acids, preferably, more than 80 amino acids, and, most preferably, more than 50 amino acids in length.
  • By “immunoglobulin-like fold” is meant a protein domain of between about 80-150 amino acid residues that includes two layers of antiparallel beta-sheets, and in which the flat, hydrophobic faces of the two beta-sheets are packed against each other. Proteins according to the invention may include several immunoglobulin-like folds covalently bound or associated non-covalently into larger structures.
  • By “scaffold” is meant a protein used to select or design a protein framework with specific and favorable properties, such as binding. When designing proteins from the scaffold, amino acid residues that are important for the framework's favorable properties are retained, while others residues may be varied. Such a scaffold has less than 50% of the amino acid residues that vary between protein derivatives having different properties and greater than or equal to 50% of the residues that are constant between such derivatives. Most commonly, these constant residues confer the same overall three-dimensional fold to all the variant domains, regardless of their properties.
  • By “fibronectin type III domain” is meant a domain having 7 or 8 beta strands which are distributed between two beta sheets, which themselves pack against each other to form the core of the protein, and further containing loops which connect the beta strands to each other and are solvent exposed. There are at least three such loops at each edge of the beta sheet sandwich, where the edge is the boundary of the protein perpendicular to the direction of the beta strands. Preferably, a fibronectin type III domain includes a sequence which exhibits at least 30% amino acid identity, and preferably at least 50% amino acid identity, to the sequence encoding the structure of the 10Fn3 domain referred to as “1ttg” (ID=“1ttg” (one ttg)) available from the RCSB (Research Collaboratory for Structural Bioinformatics) Protein Data Base. Sequence identity referred to in this definition is determined by the Homology program, available from Molecular Simulation (San Diego, Calif.). The invention further includes polymers of 10Fn3-related molecules, which are an extension of the use of the monomer structure, whether or not the subunits of the polyprotein are identical.
  • By “naturally occurring” is meant any protein that is encoded by a living organism.
  • By “randomized” or “mutated” is meant including one or more amino acid alterations relative to a template sequence. By “randomizing” or “mutating” is meant the process of introducing, into a sequence, such an amino acid alteration. Randomization or mutation may be accomplished through intentional, blind, or spontaneous sequence variation, generally of a nucleic acid coding sequence, and may occur by any technique, for example, PCR, error-prone PCR, or chemical DNA synthesis. By a “corresponding, non-mutated protein” is meant a protein that is identical in sequence, except for the introduced amino acid mutations.
  • By a “protein” is meant any sequence of two or more amino acids, regardless of length, post-translation modification, or function. “Protein” and “peptide” are used interchangeably herein.
  • By “RNA” is meant a sequence of two or more covalently bonded, naturally occurring or modified ribonucleotides. One example of a modified RNA included within this term is phosphorothioate RNA.
  • By “DNA” is meant a sequence of two or more covalently bonded, naturally occurring or modified deoxyribonucleotides.
  • By a “nucleic acid” is meant any two or more covalently bonded nucleotides or nucleotide analogs or derivatives. As used herein, this term includes, without limitation, DNA, RNA, and PNA.
  • By “pharmaceutically acceptable” is meant a compound or protein that may be administered to an animal (for example, a mammal) without significant adverse medical consequences.
  • By “physiologically acceptable carrier” is meant a carrier which does not have a significant detrimental impact on the treated host and which retains the therapeutic properties of the compound with which it is administered. One exemplary physiologically acceptable carrier is physiological saline. Other physiologically acceptable carriers and their formulations are known to one skilled in the art and are described, for example, in Remington's Pharmaceutical Sciences, (18th edition), ed. A. Gennaro, 1990, Mack Publishing Company, Easton, Pa., incorporated herein by reference.
  • By a “fusion protein” is meant a protein that includes (i) a scaffold protein of the invention joined to (ii) a second, different (i.e., “heterologous”) protein. “Fusion proteins” are distinguished from “nucleic acid-protein fusions” and “RNA-protein fusions” in that a “fusion protein” is composed entirely of amino acids, while both a “nucleic acid-protein fusion” and an “RNA-protein fusion” include a stretch of nucleic acids (the nucleic acid or RNA component) joined to a stretch of amino acids (the protein component).
  • By “selecting” is meant substantially partitioning a molecule from other molecules in a population. As used herein, a “selecting” step provides at least a 2-fold, preferably, at least a 30-fold, more preferably, at least a 100-fold, and, most preferably, at least a 1000-fold enrichment of a desired molecule relative to undesired molecules in a population following the selection step. A selection step may be repeated any number of times, and different types of selection steps may be combined in a given approach.
  • By “binding partner,” as used herein, is meant any molecule which has a specific, covalent or non-covalent affinity for a portion of a desired compound (for example, protein) of interest. Examples of binding partners include, without limitation, members of antigen/antibody pairs, protein/inhibitor pairs, receptor/ligand pairs (for example cell surface receptor/ligand pairs, such as hormone receptor/peptide hormone pairs), enzyme/substrate pairs (for example, kinase/substrate pairs), lectin/carbohydrate pairs, oligomeric or heterooligomeric protein aggregates, DNA binding protein/DNA binding site pairs, RNA/protein pairs, and nucleic acid duplexes, heteroduplexes, or ligated strands, as well as any molecule which is capable of forming one or more covalent or non-covalent bonds (for example, disulfide bonds) with any portion of another molecule (for example, a compound or protein).
  • By a “solid support” is meant, without limitation, any column (or column material), bead, test tube, microtiter dish, solid particle (for example, agarose or sepharose), microchip (for example, silicon, silicon-glass, or gold chip), or membrane (for example, an inorganic membrane, nitrocellulose, or the membrane of a liposome or vesicle) to which an antibody mimic or an affinity complex may be bound, either directly or indirectly (for example, through other binding partner intermediates such as other antibodies or Protein A), or in which an antibody mimic or an affinity complex may be embedded (for example, through a receptor or channel).
  • The present invention provides a number of advantages. For example, as described in more detail below, the present antibody mimics exhibit improved biophysical properties, such as stability under reducing conditions and solubility at high concentrations. In addition, these molecules may be readily expressed and folded in prokaryotic systems, such as E. coli in eukaryotic systems, such as yeast, and in in vitro translation systems, such as the rabbit reticulocyte lysate system. Moreover, these molecules are extremely amenable to affinity maturation techniques involving multiple cycles of selection, including in vitro selection using RNA-protein fusion technology (Roberts and Szostak, Proc. Natl. Acad. Sci. USA 94:12297, 1997; Szostak et al., U.S. Ser. No. 09/007,005 and U.S. Ser. No. 09/247,190; Szostak et al. WO98/31700), phage display (see, for example, Smith and Petrenko, Chem. Rev. 97:317, 1997), and yeast display systems (see, for example, Boder and Wittrup, Nature Biotech. 15:553, 1997).
  • Other features and advantages of the present invention will be apparent from the following detailed description thereof, and from the claims.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a photograph showing a comparison between the structures of antibody heavy chain variable regions from camel (dark blue) and llama (light blue), in each of two orientations.
  • FIG. 2 is a photograph showing a comparison between the structures of the camel antibody heavy chain variable region (dark blue), the llama antibody heavy chain variable region (light blue), and a fibronectin type III module number 10 (10Fn3) (yellow).
  • FIG. 3 is a photograph showing a fibronectin type III module number 10 (10Fn3), with the loops corresponding to the antigen-binding loops in IgG heavy chains highlighted in red.
  • FIG. 4 is a graph illustrating a sequence alignment between a fibronectin type III protein domain and related protein domains.
  • FIG. 5 is a photograph showing the structural similarities between a 10Fn3 domain and 15 related proteins, including fibronectins, tenascins, collagens, and undulin. In this photograph, the regions are labeled as follows: constant, dark blue; conserved, light blue; neutral, white; variable, red; and RGD integrin-binding motif (variable), yellow.
  • FIG. 6 is a photograph showing space filling models of fibronectin III modules 9 and 10, in each of two different orientations. The two modules and the integrin binding loop (RGD) are labeled. In this figure, blue indicates positively charged residues, red indicates negatively charged residues, and white indicates uncharged residues.
  • FIG. 7 is a photograph showing space filling models of fibronectin III modules 7-10, in each of three different orientations. The four modules are labeled. In this figure, blue indicates positively charged residues, red indicates negatively charged residues, and white indicates uncharged residues.
  • FIG. 8 is a photograph illustrating the formation, under different salt conditions, of RNA-protein fusions which include fibronectin type III domains.
  • FIG. 9 is a series of photographs illustrating the selection of fibronectin type III domain-containing RNA-protein fusions, as measured by PCR signal analysis.
  • FIG. 10 is a graph illustrating an increase in the percent TNF-α binding during the selections described herein, as well as a comparison between RNA-protein fusion and free protein selections.
  • FIG. 11 is a series of schematic representations showing IgG, 10Fn3, Fn-CH1—CH2—CH3, and Fn-CH2—CH3 (clockwise from top left).
  • FIG. 12 is a photograph showing a molecular model of Fn-CH1—CH2—CH3 based on known three-dimensional structures of IgG (X-ray crystallography) and 10Fn3 (NMR and X-ray crystallography).
  • FIG. 13 is a graph showing the time course of an exemplary 10Fn3-based nucleic acid-protein fusion selection of TNF-α binders. The proportion of nucleic acid-protein fusion pool (open diamonds) and free protein pool (open circles) that bound to TNF-α-Sepharose, and the proportion of free protein pool (full circles) that bound to underivatized Sepharose, are shown.
  • FIGS. 14 and 15 are graphs illustrating TNF-α binding by TNF-α Fn-binders. In particular, these figures show mass spectra data obtained from a 10Fn3 fusion chip and non-fusion chip, respectively.
  • FIGS. 16 and 17 are the phosphorimage and fluorescence scan, respectively, of an 10Fn3 array, illustrating TNF-α binding.
  • FIG. 18 is a graph showing an alignment of the primary sequences of the llama VH domain and the wild-type human 10F3 domain. Homologous residues between the two sequences are indicated. The 10F3 residues outside the randomized loops that were found to have mutated in approximately 45% of the selected clones are marked with arrows under the wild-type 10Fn3 sequence and with the letter that identifies the selected residue.
  • FIG. 19 shows schematic representations of the llama VH domain and the wild-type human 10F3 domain. The locations of the mutated framework residues are indicated.
  • FIG. 20 is a graph illustrating the efficiency and specificity of binding of a free-protein pool translated from the original library (R0) and after ten rounds of selection with TNF-α (R10). Protein pool binding to underivatized Sepharose, to TNF-α-Sepharose, to IL-1α-Sepharose, and to IL-13-Sepharose is compared.
  • FIG. 21 is a series of IgG-like scaffolds for the display of up to three loops.
  • FIG. 22 is a series of IgG-like scaffolds for the display of up to four, or even six, loops.
  • FIG. 23 is a series of scaffolds, unrelated to IgG, for the display of loop structures.
  • FIGS. 24A-24D are photographic and graphic illustrations demonstrating the specific capture of a target (TNF-α) by a mimic immobilized on a solid surface.
  • FIG. 25 is a graph listing exemplary TNF-α binders (SEQ ID NOS: 33-140) according to the invention.
  • DETAILED DESCRIPTION
  • The novel antibody mimics described herein have been designed to be superior both to antibody-derived fragments and to non-antibody frameworks, for example, those frameworks cited above.
  • The major advantage of these antibody mimics over antibody fragments is structural. These antibody-mimics are derived from whole, stable, and soluble structural scaffolds. For example, the Fn3 scaffold is found in the human body. Consequently, they exhibit better folding and thermostability properties than antibody fragments, whose creation involves the removal of parts of the antibody native fold, often exposing amino acid residues that, in an intact antibody, would be buried in a hydrophobic environment, such as an interface between variable and constant domains. Exposure of such hydrophobic residues to solvent increases the likelihood of aggregation of the antibody fragments.
  • In addition, the scaffolds described herein have no disulfide bonds, which have been reported to retard or prevent proper folding of antibody fragments under certain conditions. Since the present scaffolds do not rely on disulfides for native fold stability, they are stable under reducing conditions, unlike antibodies and their fragments which unravel upon disulfide bond reduction.
  • Moreover, these scaffolds provide the functional advantages of antibody molecules. In particular, despite the fact that the 10Fn3 module is not an immunoglobulin, its overall fold is close to that of the variable region of the IgG heavy chain (FIG. 2), making it possible to display the three fibronectin loops analogous to CDRs in relative orientations similar to those of native antibodies. Because of this structure, the present antibody mimics possess antigen binding properties that are similar in nature and affinity to those of antibodies, and a loop randomization and shuffling strategy may be employed in vitro that is similar to the process of affinity maturation of antibodies in vivo.
  • There are now described below exemplary scaffolds, for example, fibronectin-based scaffolds, and their use for identifying, selecting, and evolving novel binding proteins as well as their target ligands. These examples are provided for the purpose of illustrating, and not limiting, the invention.
  • 10Fn3 Structural Motif
  • Preferred antibody mimics of the present invention are based on the structure of a fibronectin module of type III (Fn3), a common domain found in mammalian blood and structural proteins. This domain occurs more than 400 times in the protein sequence database and has been estimated to occur in 2% of the proteins sequenced to date, including fibronectins, tenascin, intracellular cytoskeletal proteins, and prokaryotic enzymes (Bork and Doolittle, Proc. Natl. Acad. Sci. USA 89:8990, 1992; Bork et al., Nature Biotech. 15:553, 1997; Meinke et al., J. Bacteriol. 175:1910, 1993; Watanabe et al., J. Biol. Chem. 265:15659, 1990). A particular scaffold is the tenth module of human Fn3 (10Fn3), which comprises 94 amino acid residues. The overall fold of this domain is closely related to that of the smallest functional antibody fragment, the variable region of the heavy chain, which comprises the entire antigen recognition unit in camel and llama IgG (FIG. 1, 2). The major differences between camel and llama domains and the 10Fn3 domain are that (i) 10Fn3 has fewer beta strands (seven vs. nine) and (ii) the two beta sheets packed against each other are connected by a disulfide bridge in the camel and llama domains, but not in 10F3.
  • The three loops of 10Fn3 corresponding to the antigen-binding loops of the IgG heavy-chain run between amino acid residues 21-31 (BC), 51-56 (DE), and 76-88 (FG) (FIG. 3). The length of the BC and DE loop, 10 and 6 residues, respectively, fall within the narrow range of the corresponding antigen-recognition loops found in antibody heavy chains, that is, 7-10 and 4-8 residues, respectively. Accordingly, once randomized and selected for high antigen affinity, these two loops may make contacts with antigens equivalent to the contacts of the corresponding loops in antibodies.
  • In contrast, the FG loop of 10Fn3 is 12 residues long, whereas the corresponding loop in antibody heavy chains ranges from 4-28 residues. To optimize antigen binding, therefore, the length of the FG loop of 10F3 is preferably randomized in length as well as in sequence to cover the CDR3 range of 4-28 residues to obtain the greatest possible flexibility and affinity in antigen binding. Indeed, in general, the lengths as well as the sequences of the CDR-like loops of the antibody mimics may be randomized during in vitro or in vivo affinity maturation (as described in more detail below).
  • The tenth human fibronectin type III domain, 10Fn3, refolds rapidly even at low temperature; its backbone conformation has been recovered within 1 second at 5° C. Thermodynamic stability of 10Fn3 is high (ΔGU=24 kJ/mol=5.7 kcal/mol), correlating with its high melting temperature of 110° C.
  • One of the physiological roles of 10Fn3 is as a subunit of fibronectin, a glycoprotein that exists in a soluble form in body fluids and in an insoluble form in the extracellular matrix (Dickinson et al., J. Mol. Biol. 236:1079, 1994). A fibronectin monomer of 220-250 kD contains 12 type I modules, two type II modules, and 17 fibronectin type III modules (Potts and Campbell, Curr. Opin. Cell Biol. 6:648, 1994). Different type III modules are involved in the binding of fibronectin to integrins, heparin, and chondroitin sulfate. 10Fn3 was found to mediate cell adhesion through an integrin-binding Arg-Gly-Asp (RGD) motif on one of its exposed loops. Similar RGD motifs have been shown to be involved in integrin binding by other proteins, such as fibrinogen, von Wellebrand factor, and vitronectin (Hynes et al., Cell 69:11, 1992). No other matrix- or cell-binding roles have been described for 10Fn3.
  • The observation that 10F3 has only slightly more adhesive activity than a short peptide containing RGD is consistent with the conclusion that the cell-binding activity of 10F3 is localized in the RGD peptide rather than distributed throughout the 10Fn3 structure (Baron et al., Biochemistry 31:2068, 1992). The fact that 10Fn3 without the RGD motif is unlikely to bind to other plasma proteins or extracellular matrix makes 10Fn3 a useful scaffold to replace antibodies. In addition, the presence of 10Fn3 in natural fibrinogen in the bloodstream suggests that 10Fn3 itself is unlikely to be immunogenic in the organism of origin.
  • In addition, we have determined that the 10F3 framework possesses exposed loop sequences tolerant of randomization, facilitating the generation of diverse pools of antibody mimics. This determination was made by examining the flexibility of the 10Fn3 sequence. In particular, the human 10Fn3 sequence was aligned with the sequences of fibronectins from other sources as well as sequences of related proteins (FIG. 4), and the results of this alignment were mapped onto the three-dimensional structure of the human 10F3 domain (FIG. 5). This alignment revealed that the majority of conserved residues are found in the core of the beta sheet sandwich, whereas the highly variable residues are located along the edges of the beta sheets, including the N- and C-termini, on the solvent-accessible faces of both beta sheets, and on three solvent-accessible loops that serve as the hypervariable loops for affinity maturation of the antibody mimics. In view of these results, the randomization of these three loops are unlikely to have an adverse effect on the overall fold or stability of the 10Fn3 framework itself.
  • For the human 10Fn3 sequence, this analysis indicates that, at a minimum, amino acids 1-9, 44-50, 61-54, 82-94 (edges of beta sheets); 19, 21, 30-46 (even), 79-65 (odd) (solvent-accessible faces of both beta sheets); 21-31, 51-56, 76-88 (CDR-like solvent-accessible loops); and 14-16 and 36-45 (other solvent-accessible loops and beta turns) may be randomized to evolve new or improved compound-binding proteins. In addition, as discussed above, alterations in the lengths of one or more solvent exposed loops may also be included in such directed evolution methods.
  • Alternatively, changes in the β-sheet sequences may also be used to evolve new proteins. These mutations change the scaffold and thereby indirectly alter loop structure(s). If this approach is taken, mutations should not saturate the sequence, but rather few mutations should be introduced. Preferably, no more than between 3-20 changes should be introduced to the β-sheet sequences by this approach.
  • Sequence variation may be introduced by any technique including, for example, mutagenesis by Taq polymerase (Tindall and Kunkel, Biochemistry 27:6008 (1988)), fragment recombination, or a combination thereof. Similarly, an increase of the structural diversity of libraries, for example, by varying the length as well as the sequence of the CDR-like loops, or by structural redesign based on the advantageous framework mutations found in selected pools, may be used to introduce further improvements in antibody mimic scaffolds.
  • Antibody Mimic Fusions
  • The antibody mimics described herein may be fused to other protein domains. For example, these mimics may be integrated with the human immune response by fusing the constant region of an IgG (Fc) with an antibody mimic, such as an 10Fn3 module, preferably through the C-terminus of 10Fn3. The Fc in such a 10Fn3-Fc fusion molecule activates the complement component of the immune response and increases the therapeutic value of the antibody mimic. Similarly, a fusion between an antibody mimic, such as 10Fn3, and a complement protein, such as Clq, may be used to target cells, and a fusion between an antibody mimic, such as 10F3, and a toxin may be used to specifically destroy cells that carry a particular antigen. In addition, an antibody scaffold, such as 10Fn3, in any form may be fused with albumin to increase its half-life in the bloodstream and its tissue penetration. Any of these fusions may be generated by standard techniques, for example, by expression of the fusion protein from a recombinant fusion gene constructed using publically available gene sequences.
  • Scaffold Multimers
  • In addition to monomers, any of the scaffold constructs described herein may be generated as dimers or multimers of antibody mimics as a means to increase the valency and thus the avidity of antigen binding. Such multimers may be generated through covalent binding. For example, individual 10Fn3 modules may be bound by imitating the natural 8Fn3-9Fn3-10Fn3 C-to-N-terminus binding or by imitating antibody dimers that are held together through their constant regions. A 10Fn3-Fc construct may be exploited to design dimers of the general scheme of 10Fn3-Fc::Fc-10Fn3. The bonds engineered into the Fc::Fc interface may be covalent or non-covalent. In addition, dimerizing or multimerizing partners other than Fc can be used in hybrids, such as 10Fn3 hybrids, to create such higher order structures.
  • In particular examples, covalently bonded multimers may be generated by constructing fusion genes that encode the multimer or, alternatively, by engineering codons for cysteine residues into monomer sequences and allowing disulfide bond formation to occur between the expression products. Non-covalently bonded multimers may also be generated by a variety of techniques. These include the introduction, into monomer sequences, of codons corresponding to positively and/or negatively charged residues and allowing interactions between these residues in the expression products (and therefore between the monomers) to occur. This approach may be simplified by taking advantage of charged residues naturally present in a monomer subunit, for example, the negatively charged residues of fibronectin. Another means for generating non-covalently bonded antibody mimics is to introduce, into the monomer gene (for example, at the amino- or carboxy-termini), the coding sequences for proteins or protein domains known to interact. Such proteins or protein domains include coil-coil motifs, leucine zipper motifs, and any of the numerous protein subunits (or fragments thereof) known to direct formation of dimers or higher order multimers.
  • Fibronectin-Like Molecules
  • Although 10Fn3 represents a preferred scaffold for the generation of antibody mimics, other molecules may be substituted for 10Fn3 in the molecules described herein. These include, without limitation, human fibronectin modules 10Fn3-9Fn3 and 11Fn3-17Fn3 as well as related Fn3 modules from non-human animals and prokaryotes. In addition, Fn3 modules from other proteins with sequence homology to 10Fn3, such as tenascins and undulins, may also be used. Other exemplary scaffolds having immunoglobulin-like folds (but with sequences that are unrelated to the VH domain) are shown in FIG. 21 and include N-cadherin, ICAM-2, titin, GCSF receptor, cytokine receptor, glycosidase inhibitor, E-cadherin, and antibiotic chromoprotein. Yet further domains with related structures may be derived from myelin membrane adhesion molecule P0, CD8, CD4, CD2, class I MHC, T-cell antigen receptor, CD1, C2 and I-set domains of VCAM-1, I-set immunoglobulin domain of myosin-binding protein C, I-set immunoglobulin domain of myosin-binding protein H, I-set immunoglobulin domain of telokin, telikin, NCAM, twitchin, neuroglian, growth hormone receptor, erythropoietin receptor, prolactin receptor, GC-SF receptor, interferon-gamma receptor, β-galactosidase/glucuronidase, β-glucuronidase, and transglutaminase. Alternatively, any other protein that includes one or more immunoglobulin-like folds may be utilized. Such proteins may be identified, for example, using the program SCOP (Murzin et al., J. Mol. Biol. 247:536 (1995); Lo Conte et al., Nucleic Acids Res. 25:257 (2000).
  • Generally, any molecule that exhibits a structural relatedness to the VH domain (as identified, for example, using the computer program above) may be utilized as an antibody mimic. Such molecules may, like fibronectin, include three loops at the N-terminal pole of the molecule and three loops at the C-terminal pole, each of which may be randomized to create diverse libraries; alternatively, larger domains may be utilized, having larger numbers of loops, as long as a number of such surface randomizable loops are positioned closely enough in space that they can participate in antigen binding. FIG. 22 shows examples of useful domains having more than three loops positioned close to each other. These examples include T-cell antigen receptor and superoxide dismutase, which each have four loops that can be randomized; and an Fn3 dimer, tissue factor domains, and cytokine receptor domains, each of which have three sets of two similar domains where three randomizable loops are part of the two domains (bringing the total number of loops to six).
  • In yet another alternative, any protein having variable loops positioned close enough in space may be utilized for candidate binding protein production. For example, large proteins having spatially related, solvent accessible loops may be used, even if unrelated structurally to an immunoglobulin-like fold. Exemplary proteins include, without limitation, cytochrome F, green fluorescent protein, GroEL, and thaumatin (FIG. 23). The loops displayed by these proteins may be randomized and superior binders selected from a randomized library as described herein. Because of their size, molecules may be obtained that exhibit an antigen binding surface considerably larger than that found in an antibody-antigen interaction. Other useful scaffolds of this type may also be identified using the program SCOP (Murzin et al., J. Mol. Biol. 247:536 (1995)) to browse among candidate proteins having numerous loops, particularly loops positioned among parallel beta sheets or a number of alpha-helices.
  • Modules from different organisms and parent proteins may be most appropriate for different applications. For example, in designing an antibody mimic, it may be most desirable to generate that protein from a fibronectin or fibronectin-like molecule native to the organism for which a therapeutic is intended. In contrast, the organism of origin is less important or even irrelevant for antibody mimics that are to be used for in vitro applications, such as diagnostics, or as research reagents.
  • For any of these molecules, libraries may be generated and used to select binding proteins by any of the methods described herein.
  • Directed Evolution of Scaffold-Based Binding Proteins
  • The antibody mimics described herein may be used in any technique for evolving new or improved binding proteins. In one particular example, the target of binding is immobilized on a solid support, such as a column resin or microtiter plate well, and the target contacted with a library of candidate scaffold-based binding proteins. Such a library may consist of antibody mimic clones, such as 10Fn3 clones constructed from the wild type 10Fn3 scaffold through randomization of the sequence and/or the length of the 10Fn3 CDR-like loops. If desired, this library may be an RNA-protein fusion library generated, for example, by the techniques described in Szostak et al., U.S. Ser. No. 09/007,005 and 09/247,190; Szostak et al., WO98/31700; and Roberts & Szostak, Proc. Natl. Acad. Sci. USA (1997) vol. 94, p. 12297-12302. Alternatively, it may be a DNA-protein library (for example, as described in Lohse, DNA-Protein Fusions and Uses Thereof, U.S. Ser. No. 60/110,549, U.S. Ser. No. 09/459,190, and WO 00/32823). The fusion library is incubated with the immobilized target, the support is washed to remove non-specific binders, and the tightest binders are eluted under very stringent conditions and subjected to PCR to recover the sequence information or to create a new library of binders which may be used to repeat the selection process, with or without further mutagenesis of the sequence. A number of rounds of selection may be performed until binders of sufficient affinity for the antigen are obtained.
  • In one particular example, the 10Fn3 scaffold may be used as the selection target. For example, if a protein is required that binds a specific peptide sequence presented in a ten residue loop, a single 10Fn3 clone is constructed in which one of its loops has been set to the length of ten and to the desired sequence. The new clone is expressed in vivo and purified, and then immobilized on a solid support. An RNA-protein fusion library based on an appropriate scaffold is then allowed to interact with the support, which is then washed, and desired molecules eluted and re-selected as described above.
  • Similarly, the scaffolds described herein, for example, the 10F3 scaffold, may be used to find natural proteins that interact with the peptide sequence displayed by the scaffold, for example, in an 10F3 loop. The scaffold protein, such as the 10F3 protein, is immobilized as described above, and an RNA-protein fusion library is screened for binders to the displayed loop. The binders are enriched through multiple rounds of selection and identified by DNA sequencing.
  • In addition, in the above approaches, although RNA-protein libraries represent exemplary libraries for directed evolution, any type of scaffold-based library may be used in the selection methods of the invention.
  • Use
  • The antibody mimics described herein may be evolved to bind any antigen of interest. These proteins have thermodynamic properties superior to those of natural antibodies and can be evolved rapidly in vitro. Accordingly, these antibody mimics may be employed in place of antibodies in all areas in which antibodies are used, including in the research, therapeutic, and diagnostic fields. In addition, because these scaffolds possess solubility and stability properties superior to antibodies, the antibody mimics described herein may also be used under conditions which would destroy or inactivate antibody molecules. Finally, because the scaffolds of the present invention may be evolved to bind virtually any compound, these molecules provide completely novel binding proteins which also find use in the research, diagnostic, and therapeutic areas.
  • Experimental Results
  • Exemplary scaffold molecules described above were generated and tested, for example, in selection protocols, as follows.
  • Library Construction
  • A complex library was constructed from three fragments, each of which contained one randomized area corresponding to a CDR-like loop. The randomized residues are indicated in FIG. 18 as underlined sequences, specifically, residues 23-29 of the 10Fn3 BC loop (corresponding to CDR-H1 of the llama VH); residues 52-55 of the 10Fn3 DE loop (corresponding to CDR-H2. of the llama VH); and residues 78-87 of the 10Fn3 FG loop (corresponding to CDR-H3 of the llama VH). The fragments were named BC, DE, and FG based on the names of the CDR-H-like loops contained within them; in addition to 10Fn3 and a randomized sequence, each of the fragments contained stretches encoding an N-terminal HiS6 domain or a C-terminal FLAG peptide tag. At each junction between two fragments (i.e., between the BC and DE fragments or between the DE and FG fragments), each DNA fragment contained recognition sequences for the EarI Type IIS restriction endonuclease. This restriction enzyme allowed the splicing together of adjacent fragments while removing all foreign, non-10Fn3, sequences. It also allowed for a recombination-like mixing of the three 10Fn3 fragments between cycles of mutagenesis and selection.
  • The wild-type, human 10Fn3 gene was cloned from a human liver library (Maxim Biotech, South San Francisco, Calif.) using the primers Hu5PCR-NdeI 5′CATATGGTTTCTGATGTTCCGAGG3′; SEQ ID NO: 28) and Hu3PCR-EcoRI (5′GAATTCCTATGTTCGGTAATTAATGGAAATTG3′; SEQ ID NO: 29). Three different libraries were constructed from the wild-type segments obtained by the PCR of the 10Fn3 clone and from randomized segments obtained by oligonucleotide synthesis. The BCr-DEr-FGr library was obtained by randomizing the selected residues in BC, DE, and FG loops; the BCr-DEwt-FGr library was obtained by randomizing the selected residues in BC and FG loops, leaving the DE loop sequence wild-type; and the BCwt-DEwt-FGr library was obtained by randomizing the selected residues in the FG loop only.
  • The BCr, DEr, and FGr fragments were made synthetically. Each fragment was assembled from two overlapping oligonucleotides, which were first annealed, then extended to form the double-stranded DNA form of the fragment. The oligonucleotides that were used to construct and process the three fragments are listed below; the “Top” and “Bottom” species for each fragment are the oligonucleotides that contained the entire 10Fn3 encoding sequence. In these oligonucleotides designations, “N” indicates A, T, C, or G; and “S” indicates C or G.
    HfnLbcTop (His):
    (SEQ ID NO:1)
    5′-GG AAT TCC TAA TAC GAC TCA CTA TAG GGA CAA TTA
    CTA TTT ACA ATT ACA ATG CAT CAC CAT CAC CAT CAC
    GTT TCT GAT GTT CCG AGG GAC CTG GAA GTT GTT GCT
    GCG ACC CCC ACC AGC-3′
    HfnLbcTop (an alternative N-terminus):
    (SEQ ID NO:2)
    5′-GG AAT TCC TAA TAC GAC TCA CTA TAG GGA CAA TTA
    CTA TTT ACA ATT ACA ATG GTT TCT GAT GTT CCG AGG
    GAC CTG GAA GTT GTT GCT GCG ACC CCC ACC AGC-3′
    HFnLBCBot-flag8:
    (SEQ ID NO:3)
    5′-AGG GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC GCT
    CTT CCC TGT TTC TCC GTA AGT GAT CCT GTA ATA TCT
    (SNN)7 CCA GCT GAT CAG TAG GCT GGT GGG GGT CGC
    AGC-3′
    HFnBC3′-flag8:
    (SEQ ID NO:4)
    5′-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC GCT
    CTT CCC TGT TTC TCC GTA AGT GAT CC-3′
    HFnLDETop:
    (SEQ ID NO:5)
    5′-GG AAT TCC TAA TAC GAC TCA CTA TAG GGA CAA TTA
    CTA TTT ACA ATT ACA ATG CAT CAC CAT CAC CAT CAC
    CTC TTC ACA GGA GGA AAT AGC CCT GTC C-3′
    HFnLDEBot-flag8:
    (SEQ ID NO:6)
    5′-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC GCT
    CTT CGT ATA ATC AAC TCC AGG TTT AAG GCC GCT GAT
    GGT AGC TGT (SNN)4 AGG CAC AGT GAA CTC CTG GAC AGG
    GCT ATT TCC TCC TGT-3′
    HFnDE3′-flag8:
    (SEQ ID NO:7)
    5′-AGG GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC GCT
    CTT CGT ATA ATC AAC TCC AGG TTT AAG G-3′
    HFnLFGTop:
    (SEQ ID NO:8)
    5′-GG AAT TCC TAA TAC GAC TCA CTA TAG GGA CAA TTA
    CTA TTT ACA ATT ACA ATG CAT CAC CAT CAC CAT CAC
    CTC TTC TAT ACC ATC ACT GTG TAT GCT GTC-3′
    HFnLFGBot-flag8:
    (SEQ ID NO:9)
    5′-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC TGT
    TCG GTA ATT AAT GGA AAT TGG (SNN)10 AGT GAC AGC
    ATA CAC AGT GAT GGT ATA-3′
    HFnFG3′-flag8:
    (SEQ ID NO:10)
    5′-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC TGT
    TCG GTA ATT AAT GGA AAT TGG-3′
    T7Tmv (introduces T7 promoter and TMV untranslated
    region needed for in vitro translation):
    (SEQ ID NO:11)
    5′-GCG TAA TAC GAC TCA CTA TAG GGA CAA TTA CTA TTT
    ACA ATT ACA-3′
    ASAflag8:
    (SEQ ID NO:12)
    5′-AGC GGA TGC CTT GTC GTC GTC GTC CTT GTA GTC-3′
    Unispl-s (spint oligonucleotide used to ligate
    mRNA to the puromycin-containing linker,
    described by Roberts et al, 1997, supra):
    (SEQ ID NO:13)
    5′-TTTTTTTTTNAGCGGATGC-3′
    A18---2PEG (DNA-puromycin linker):
    (SEQ ID NO:14)
    5′-(A)18(PEG)2CCPur
  • The oligonucleotide pair BCTop and BCBot-flag8 was used to construct the fragment which contains the randomized BC loop; the pair DETop and DEBot-flag8 was used to construct the fragment which contains the randomized DE loop; the pair BCTop and DE3-Flag8 was used to PCR-amplify the BCwt-Dwt fragments; and the pair FGTop and FGBot-Flag8 was used to construct the fragment which contains the randomized FG loop. The pairs of oligonucleotides (500 pmol of each) were annealed in 100 μL of 10 mM Tris 7.5, 50 mM NaCl for 10 minutes at 85° C., followed by a slow (0.5-1 hour) cooling to room temperature. The annealed fragments with single-stranded overhangs were then extended using 100 U Klenow (New England Biolabs, Beverly, Mass.) for each 100 μL aliquot of annealed oligos, and the buffer made of 838.5 μl H2O, 9 μl 1 M Tris 7.5, 5 μl 1M MgCl2, 20 μl 10 mM dNTPs, and 7.5 μl 1M DTT. The extension reactions proceeded for 1 hour at 25° C.
  • In order to reduce the frequency of stop codons introduced by the random sequences, the randomized residues were encoded by (NNS)n, where N stands for any nucleotide and S for an equimolar mixture of C and G; only one of the three stop codons (TAG) conforms to the NNS restriction. In addition to the sequence encoding 10Fn3, the gene fragments contained the 5′ Tobacco Mosaic Virus (TMV) untranslated region and the T7 promoter, as well as the sequences encoding a 5′ hexahistidine protein purification tag and a 3′FLAG epitope purification tag. In addition, as noted above, Ear I restriction endonuclease recognition sites were engineered into the overlaps between adjacent fragments in order to facilitate the assembly of the three fragments.
  • Next, each of the double-stranded fragments was transformed into an RNA-protein fusion (PROfusion™) using the technique developed by Szostak et al., U.S. Ser. No. 09/007,005 and U.S. Ser. No. 09/247,190; Szostak et al., WO98/31700; and Roberts & Szostak, Proc. Natl. Acad. Sci. USA (1997) vol. 94, p. 12297-12302. Briefly, the fragments were transcribed using an Ambion in vitro transcription kit, T7-MEGAshortscript™ (Ambion, Austin, Tex.), and the resulting mRNA was gel-purified and ligated to a 5′-phosphorylated DNA-puromycin linker, preferably, 5′ dA18PEG2dCdCPur) using DNA ligase (Promega, Madison, Wis.); the mRNA was aligned with the DNA linker using a DNA splint oligonucleotide (5′ TTTTTTTTTNAGCGGATGC 3′; SEQ ID NO: 30) as described in Szostak (supra). The mRNA-DNA-puromycin molecule was then translated using the Ambion rabbit reticulocyte lysate-based translation kit in the presence of 35S-methionine. The resulting mRNA-DNA-puromycin-protein fusion was purified using Oligo(dT) cellulose, (Type 7, Amersham Pharmacia, Piscataway, N.J.) and a complementary DNA strand was synthesized using reverse transcriptase (Superscript™ II, Gibco, Life Technologies, Rockville, Md.) and the RT primers described above (Unisplint-S or flagASA), following the manufacturer's instructions (preferably, a two-minute annealing at 70° C. and a 40 minute reaction at 42° C.).
  • The RNA-protein fusion with annealing cDNA obtained for each fragment was next purified on the resin appropriate to its peptide purification tag, i.e., on Ni-NTA agarose (Qiagen, Valencia, Calif.) for the His6-tag and M2 Anti-Flag Agarose (Sigma, St. Louis, Mo.) for the FLAG-tag, following the procedures recommended by the manufacturers. The fragment-encoding genetic information recovered by KOH elution was amplified by PCR using Pharmacia Ready-to-Go PCR Beads, 10 pmol of 5′ and 3′ PCR primers, and the following PCR program (Pharmacia, Piscataway, N.J.): Step 1: 95° C. for 3 minutes; Step 2: 95° C. for 30 seconds, 58/62° C. for 30 seconds, 72° C. for 1 minute, 20/25/30 cycles, as required; Step 3: 72° C. for 5 minutes; Step 4: 4° C. until end (typically, 25 cycles).
  • The resulting DNA was cleaved by 5-6 U EarI (New England Biolabs) per μg DNA; the reaction took place in T4 DNA Ligase Buffer (New England Biolabs) at 37° C., for 1 hour, and was followed by an optional incubation at 70° C. for 15 minutes to inactivate Ear I. Equal amounts of the BC, DE, and FG fragments were combined and ligated to form a full-length 10Fn3 gene with randomized loops. The ligation required 10 U of fresh EarI (New England Biolabs) and 20 U of T4 DNA Ligase (Promega, Madison, Wis.), and took 1 hour at 37° C. Earl and ligase were then inactivated by a 15 minute incubation at 65° C.
  • Three different libraries, BCwt-DEwt-FGr, BCr-DEwt-FGr, and BCr-DEr-FGr, were made in the manner described above. Each contained the form of the FG loop with 10 randomized residues. The BC and the DE loops of the first library bore the wild type 10Fn3 sequence; a BC loop with 7 randomized residues and a wild type DE loop made up the second library; and a BC loop with 7 randomized residues and a DE loop with 4 randomized residues made up the third library. The complexity of the FG loop in each of these three libraries was 1013; the further two randomized loops provided the potential for a complexity too large to be sampled in a laboratory. The combination of these libraries provided a master library having 1012 unique clones.
  • The sequences of 76 randomly picked clones from the original, randomized, BCr-DEr-FGr library showed no pattern in the randomized loops (data not shown); the amino acid frequency in the library varied in proportion to the number of codons available that encoded each residue, between 1% per position (glutamic acid, methionine, tryptophan) and 14% per position (proline). In contrast, the average probability for a residue in the preserved, beta-sheet framework to have remained as wild type was found to be 99%.
  • Equimolar amounts of the three libraries (2 pmoles of DNA each) were combined into one master library in order to simplify the selection process; target binding itself was expected to select the most suitable library for a particular challenge. RNA-protein fusions were obtained from the master library following the general procedure described in Szostak et al., U.S. Ser. No. 09/007,005 and 09/247,190; Szostak et al., WO98/31700; and Roberts & Szostak, Proc. Natl. Acad. Sci. USA (1997) vol. 94, p. 12297-12302 (FIG. 8), except that affinity purification performed in rounds three to ten used only M2-Sepharose (see below).
  • Fusion Selections
  • The master library in the RNA-protein fusion form was subjected to selection for binding to TNF-α (Pepro Tech, Rocky Hill, N.J.). Two initial protocols were employed: one in which the target was immobilized on an agarose column and one in which the target was immobilized on a BIACORE chip. First, an extensive optimization of conditions to minimize background binders to the agarose column yielded the favorable buffer conditions of 50 mM HEPES pH 7.4, 0.02% Triton, 100 μg/ml sheared salmon sperm DNA. In this buffer, the non-specific binding of the 10F3-RNA fusion to TNF-α Sepharose was 0.3%. The non-specific binding background of the 10Fn3-RNA/cDNA library to TNF-α Sepharose was found to be 0.1%.
  • During each round of selection on TNF-α Sepharose, the library was first preincubated for an hour with underivatized Sepharose to remove any remaining non-specific binders; the flow-through from this pre-clearing was incubated for another hour with TNF-α Sepharose. The TNF-α Sepharose was washed for 3-30 minutes.
  • After each selection, the cDNA component of the complex that had been eluted from the solid support with 0.3 M NaOH or 0.1M KOH was amplified by PCR; a DNA band of the expected size persisted through multiple rounds of selection (FIG. 9); similar results were observed in the two alternative selection protocols, and only the data from the agarose column selection is shown in FIG. 9.
  • In this selection, in the first seven rounds, the binding of Fn3-RNA/cDNA molecules to the target remained low; in contrast, when free protein was translated from DNA pools at different stages of the selection, the proportion of the column binding species increased significantly between rounds (FIG. 10).
  • In later selections, the fusion pools selected in the first eight rounds of selection (R1-8) bound to TNF-α-Sepharose at levels close to the background (<0.25%) (FIG. 13). After nine rounds of selection (R9), the binding of fusion to TNF-α-Sepharose increased sharply to 0.7%, and, after ten rounds of selection (R10), the binding increased further to 7% (FIG. 13). These selections were carried out using TNF-α immobilized on Epoxy-Activated Sepharose™ 6B (Amersham Pharmacia) at 10 mg TNF/g Sepharose in 10 mL. Before use, the TNF-α-derivatized Sepharose was blocked in Binding Buffer (50 mM HEPES, pH 7.4, 0.02% Triton, 0.1 mg/mL sheared salmon sperm DNA (Ambion)), overnight, at 4° C.
  • The 10Fn3-based master library was transcribed, ligated to the puromycin-bearing linker, translated into an mRNA-protein library in the presence of 5-10 μL/300 μL 35S-methionine, affinity purified on Oligo(dT) Cellulose, reverse-transcribed into a DNA/mRNA-protein library, and affinity-purified on M2-Sepharose (for rounds 3-10), as described above. Forty pmol of DNA/mRNA-protein fusion library molecules, the equivalent of 20 copies of 4×1012 different sequences, were recovered, then subjected to the first round (R1) of the selection.
  • In the first step of the selection, 40 pmoles of the DNA/mRNA-protein library was incubated for 1 hour at 4° C., with tumbling, in 300 μL of Binding Buffer with 30 μL of Epoxy-Sepharose that had been subjected to the derivatization procedure in the absence of TNF-α. In the second round, 24 pmol of the library was added, and in the remaining eight rounds, 0.1-2 pmol of the library was added. The supernatant was recovered by microcentrifugation through a Micro Bio-Spin® chromatography column (BIO-RAD, Hercules, Calif.), then incubated with 30 μL of TNF-α-Sepharose (6 μM) in 300 μL of the Binding Buffer for 1 hour at 4° C. (during Rounds 7-10, the Binding Buffer contained an additional 1 mg/mL of BSA). The TNF-α-Sepharose was recovered on a spin column, then washed with 3×300 μL of Binding Buffer, eluted with 100 μL of 0.1 M KOH, and finally neutralized with 1 μL of 1 M Tris 8.0, 8 μL of 1 M HCl. Samples of the library, of the TNF-α-Sepharose before and after the elution, of the washes, and of the elutions were quantified by counting 35S-methionine in the sample in a scintillation counter. The next round of selection began with the formation of a new DNA/mRNA-protein pool by PCR amplification, which was transcribed, translated, and reverse-transcribed from the PCR product.
  • The DNA pools obtained from the elution after nine and after ten rounds were cloned into the TOPO™ TA®, pCR2.1 cloning vector (Invitrogen, Carlsbad, Calif.) and transformed into E. coli. Between 30 and 100 clones were picked and grown into plasmid minipreps (Qiagen). Thirty-eight clones from R9 and 29 clones from R10 were picked at random and sequenced (DNA Sequencing Core Facility, Massachusetts General Hospital, Dept. of Molecular Biology, Boston, Mass.). The program ClustalW60 was used to align the resulting protein sequences.
  • Amino Acid Residue Sequences of the TNF-α Binding Clones
  • Thirty-eight of the 61 clones derived from R9 and from R10 had unique amino acid sequences, a surprising diversity. The ten clones that were isolated more than once, presumably because of their superior binding to TNF-α, are listed in Table 1 (full sequences in FIG. 25).
  • Of the 61 clones picked randomly from the winning pool, only one (clone T09.08, sequence not shown) had its origin in the BCwt-DEwt-FGr library, with another six from the BCr-DEwt-FGr library. The observation that the remaining 54 (88% of the winners) were selected from the BCr-DEr-FGr library points out the importance for TNF-α cooperative binding of the target by several loops.
  • The most common motif found in the selected loop sequences is PWA(S/T), which is found in the DE loop of 33 of the 61 clones; the more loosely defined sequence of PW(A/G) is seen in 41/61 clones. Such a strong selection for a specific DE sequence is surprising since the analogous CDR-H2 loops of antibody VH domains generally make only a small contribution to antigen binding. On the other hand, the short length of the DE loop, which means that 107 copies of each possible tetrapeptide sequence would be expected to be present in the library, would facilitate the optimization of any contribution of the DE loop to the selected properties. A survey of other Fn3 domains (Dickenson et al., J. Mol. Biol. 236:1079-1092 (1994)) shows that proline is found at positions equivalent to the 10 Fn3 residue 52 as frequently as is the wild-type glycine; similarly, alanine, glycine, and the wild-type lysine are all common at positions equivalent to the 10Fn3 position 54. In consequence, it appears likely that the selected residues at positions 52 and 54 are at least consistent with favorable biophysical properties. In contrast, no tryptophan is found at the position equivalent to the 10Fn3 residue 53, which suggests that Tryptophan 53 may have been selected for a reason specific to the present selection, such as due to a contribution to TNF-α binding. This is consistent with the absence of this motif in later selections against other antigens, again suggesting that the PWA/G motif is more likely to contribute to TNF-α binding directly than through stability or solubility of the 10Fn3 domain. The preference for the PWA/G motif on loop DE suggests another possible reason for the preference for the BCr-DEr-FGr library during the selection: the BCr-DEr-FGr library alone contained the randomized DE loop, and would be expected to outcompete the other two libraries if the PWA/G sequence were important to target binding.
  • The sequences selected most frequently in the BC loop is NRSGLQS (12/61) (SEQ ID NO: 31), whereas the sequence selected most commonly in the FG loop is AQTGHHLHDK (6/61) (SEQ ID NO: 32). An NRSGLQS BC loop and an AQTGHHLHDK FG loop have not been found in the same molecule, but two clones were found which contain the most frequently isolated sequences on two of the three randomized loops. These clones, T10.06 (BC: NRSGLQS, DE: PWA) and T09.12 (DE: PWA, FG: AQTGHHLHDK), have two of the lowest four dissociation constants from TNF-α of the clones examined (Table 1).
  • Due to the use of a Taq polymerase that contains no proofreading activity, every round of PCR introduced additional random mutations into both the CDR-like loops and the beta-sheet scaffold of the 10Fn3 sequence, at the estimated rate of 0.01% per base pair, i.e., 3% per 10Fn3 gene per round of PCR and approximately 75% per round of selection. Consequently, it is likely that the residues preserved as wild-type and those preserved in a non-wild-type stable sequence indicate that such sequences were selected due to their superior properties. In the mutated loops, it is impossible to distinguish between the mutations introduced by oligonucleotide synthesis or by PCR mutagenesis, but in the beta-strand scaffold, most of the mutations selected originate from Taq errors. The selected clones showed several conserved changes in the scaffold of the protein, which had not been randomized intentionally. FIG. 18 indicates the residues in the 10F3 beta sheet that had not been randomized, but nevertheless mutated during selection. This mutagenesis occurred at the frequency of 26-28 of the 61 clones; these mutations are marked with arrows under the wild-type 10Fn3 sequence and with the letter that identifies the selected residue. In particular, 28 of the 61 clones mutated from Leucine 18 to Arginine or to Glutamine, and 26 clones mutated from Threonine 56 to Isoleucine. FIG. 19 shows the location of these scaffold mutations. Whereas position 56 is at the stem loop DE and thus would be expected to affect the conformation and the target-binding properties of this loop, the distance of position 18 from the presumed TNF-α-binding loops suggests that the selective advantage of this mutation may arise from an indirect effect on the conformation of loop BC or from an effect on the stability of the protein (FIG. 19). This is supported by an experiment in which clone T10.06, which contains the frequently seen L18R and T56I changes from the wild-type, was mutagenized to reverse position 18 back to the wild-type leucine. This change caused an increase of the Kd of the variant by approximately 10-fold. The weaker binding of the T10.06(L18) protein to TNF-α suggests that the residue at position 18 has an effect on the binding of the target by the CDR-like loops, possibly by a minor structural change that is transmitted through the beta-strand to loop BC.
  • Affinity and Specificity of the Selected TNF-α Binding Pools
  • The apparent average Kd values of free protein pools for TNF-α after nine and after ten rounds of selection were found to be indistinguishable (4 and 6 nM, respectively; Table 1); this similarity in affinity is consistent with the relatively low (10 fold) level of enrichment observed in the last round of selection and with the similarity in the sequence composition of the two pools. The apparent average Kd values of free protein pool after four further rounds of selection was 3 nM, also indistinguishable from those of R9 and R10 pools
  • In order to assess the specificity of the binding of the pool selected after ten rounds of selection, we compared the binding of two different free protein pools to three cytokines immobilized on Sepharose to TNF-α, the target of the selection, and to IL-1α and IL-13, which were irrelevant to the selection. The first pool had been transcribed and translated from the initial, randomized DNA library before the selection (R0), and the second pool, from the library after ten rounds of selection (R10).
  • To carry out these experiments, the PCR product of the elution after the tenth round of selection was transcribed and translated in vitro, in the presence of 35S-methionine but without forming the mRNA-protein fusion. The resulting fraction of the free protein bound to TNF-α-Sepharose, to IL-1α-Sepharose, to IL-13-Sepharose at approximately 10 μM, 30 μM, and 50 μM, respectively, and to underivatized Sepharose was compared (FIG. 20), using the procedure described above for DNA/mRNA-protein fusion binding to TNF-α-Sepharose. The amount of the selected pool bound to each of the targets was measured by scintillation counting of the washed beads.
  • FIG. 20 shows that, whereas the binding of R0 to TNF-α, IL-1α, and IL-13 was similar (2%, 4%, and 3%, respectively), the ten rounds of selection resulted in 32% binding to the targeted TNF-α, in 3% binding to IL-1α, and in 1% binding to IL-13. The absolute and the relative increase of protein binding to TNF-α demonstrates the ability of the 10Fn3 scaffold and of the DNA/mRNA-protein fusion-based selection system to select target-specific binders.
  • To examine the specificity of binding further, clone T09.12 was immobilized in a microarray format (as generally described below) and was tested for binding to soluble TNF-α. Specific binding of TNF-α to this clone was detected using fluorescence detection (FIG. 24A) and mass spectroscopy (FIG. 24B). For the mass spectroscopy results, binding assays were carried out in the presence of fetal bovine serum, an exemplary complex biological fluid containing a variety of potential interfering proteins. For fluorescence detection (FIG. 24A), a mixture of RNA-10Fn3 fusion of wild-type 10Fn3 and of the T09.12 variant (Table 1) was hybridized onto a DNA microarray on which oligonucleotides complementary to the RNA portion of the fusion molecules had been immobilized at 600 micron pitch, with 24 replicate features. After removal of unhybridized fusion by washing, the surface was exposed to biotin-TNF-α (2.6 μg/mL in TBS, 0.02% Tween-20, 0.2% BSA), washed, and air-dried. The captured biotin-TNF-α was detected by Cy3-labeled anti-biotin monoclonal antibody (Sigma) using a ScanArray 5000 system (GSI Lumonics). For mass spectroscopy detection, RNA-10Fn3 fusion of the T09.12 variant (FIG. 24B) and wild-type 10Fn3 (FIG. 24C) was treated with RNase A to generate a fusion between the protein and the DNA linker. The resulting DNA-linked protein was hybridized to a glass coverslip arrayed with an immobilized oligonucleotide complementary to the DNA linker (FIGS. 24B and 24C; no fusion was applied in FIG. 24D). After washing, the above surfaces were exposed to TNF-α (1.5 mg/mL in 90% v/v PBS/10% fetal bovine serum). The dried chip was spotted with MALDI matrix and analyzed with a Voyager DE MALDI-TOF mass spectrometer (PerSeptive Biosystems). A signal at 17.4 kD, which corresponded to the expected molecular mass of monomeric TNF-α, was detected on the 200 μm features that contained T09.12 protein (FIG. 24A), but not on the features that contained wild-type 10Fn3 (FIG. 24B) nor on the features that did not contain DNA-protein fusion (FIG. 24C).
  • Kd of the Selected TNF-α Binding Clones
  • Dissociation constants were determined for all the clones that were represented more than once in the two pools generated after nine and after ten rounds of selection, as well as for the only clone that originated from the BCwt-DEwt-FGr library (clone T09.08).
  • To determine these binding constants, biotinylated TNF-α was prepared using the NHS-LC-LC-Biotin reagent supplied by Pierce (Rockford, Ill.). MALDI-TOF mass spectrometry was used to estimate that more than 80% of the monomeric TNF-α, and hence more than 99% of the trimer, was biotinylated.
  • For the R9 and R10 pools (and the R14 and M12 pools discussed below), as well as for the characterized clones derived from these two pools, eleven samples of 0.25 nM, in vitro-translated, 35S-methionine-labeled free protein were incubated with the biotinylated TNF-α at a concentration between 17 pM and 23 nM, in 200 μL 10 mM HEPES, pH 7.4, 150 mM NaCl, 1% BSA, 0.02% Triton, for one hour at room temperature. Subsequently, each sample was loaded on a pre-soaked, SAM2R Biotin Capture Membrane (Promega, Madison, Wis.) using a 96 well, Easy-Titer™ ELIFA system (Pierce). Under vacuum, each spot was washed with 200 μL of HBS pH 7.4, 1% BSA, 0.05% Triton; next the entire membrane was rinsed in the buffer and air-dried. The membrane was exposed with a Storage Phosphor Screen (Molecular Dynamics, Sunnyvale, Calif.) overnight, and the intensities of the resulting individual spots were quantified using a STORM 860 phosphoimager with the ImageQuaNT densitometry program (Molecular Dynamics). The Kd of the binding was determined by fitting the equilibrium equation to the resulting binding curve (KaleidaGraph, Synergy Software); the error of the experiment was estimated from 24 independent experiments.
  • In these studies, the Kd values were found to be in the narrow range of 1-24 nM (Table 1). The T09.12 and T10.06 clones, which contained the most commonly isolated sequences in two loops each, have the low Kd of 4 and 2 nM, respectively; on the other hand, a number of clones with less frequently seen loops, such as clones T09.07 and T10.15, showed similarly tight binding.
  • A sample comparison of TNF-α binding between free protein and the cDNA/mRNA-protein complex derived from the same sequence showed that the two dissociation constants were within experimental error of each other, a property of the system that makes it possible to use the cDNA/mRNA-protein complex to select for target-binding properties of the protein itself.
  • High-Stringency Selection of TNF-α Binding Clones
  • Despite the duplicate clones isolated, the TNF-α-binding pools after nine and after ten rounds of selection contained numerous different clones, i.e., 38 different sequences in 61 clones sampled. Therefore, further selection, with more stringent binding requirements, was undertaken to recover a subset of these clones with superior TNF-α binding properties. Consequently, four further rounds of selections (R11-R14) were conducted in solution, where the concentration of the target was controlled more easily. The concentration of TNF-α was limited to 0.5 nM and the concentration of DNA/mRNA-10Fn3 pool to 0.1 nM; in addition, the length and the temperature of the washes of the 10Fn3/TNF-α complex bound to streptavidin-coated paramagnetic beads were increased.
  • Specifically, these selections were carried out as follows. For rounds 11-13, 0.1 nM DNA/mRNA-10Fn3 fusion library, which had been made as described above, was pre-cleared by tumbling for 1 hour at 4° C. with 100 μL of Dynabeads® M-280 (streptavidin-coated paramagnetic beads; Dynal, Lake Success, N.Y.) that had been pre-blocked in Binding Buffer. The resulting pre-cleared fusion mixture was combined with 0.5 nM biotinylated TNF-α in 300 μL of the above Binding Buffer, and the complex incubated at 4° C. for 1 hour. Next, 100 μl of resuspended Dynabeads® M-280 Streptavidin at 1.3 g/cm3, which had been blocked by overnight incubation in Binding Buffer, were added to the mixture and incubated at 4° C., with tumbling, for 45 additional minutes. The paramagnetic beads were separated from the supernatant on a Dynal MPC-S rack, the supernatant was removed, and the beads were washed with the Binding Buffer for 1, 15, and 30 minutes in the case of R11 and R12, or for 1 minute, followed by nine ten-minute washes in the case of R13-R14. DNA was eluted from the washed DNA/mRNA-10Fn3:TNF-α-biotin:streptavidin-bead complexes with two washes of 100 μL 0.1 M KOH, and treated as described above for the column-based selection to produce the next generation DNA/mRNA-10Fn3 fusion library. Round 14 differed from R11-R13 in that the selection was performed at 30° C. and in the presence of an additional 150 mM NaCl. Except for the elevated temperature, the sequence of washes was the same for R14 as for R13.
  • Twenty-two clones derived from the DNA eluted after four further rounds of selection (R14) were picked at random and found to represent 15 different loop sequences (Table 2; full sequences in FIG. 25). The clone T10.06, isolated previously from R10 as described above, was picked eight separate times, whereas the remaining sequences, including T09.31, which had been isolated from the R9 pool, were found in one isolate each. Similar to the isolates from rounds nine and ten, the R14 clones examined showed a preference (18 of 22 clones) for the PWA/G sequence in the DE loop, and four new, non-wild-type DE sequences were revealed.
  • Whereas the apparent average Kd values of the R14 free protein pool, 3 nM, is similar to those measured for the pools after nine and ten rounds (4 and 6 nM, respectively), several Kd values of the clones isolated from the R14 pool were an order of magnitude lower than the lowest values observed in the R9 and R10 pools (Table 2). The clones that bound TNF-α most tightly, T14.07 and T14.25, had a Kd of 90 pmol. Thus, the conditions used in the last four rounds of selections were stringent enough to favor 10Fn3 molecules with subnanomolar Kd, but not so stringent as to eliminate such molecules.
  • Mutagenic Affinity Maturation
  • As discussed above, the selections described herein may also be combined with mutagenesis after all or a subset of the selection steps to further increase library diversity. In one parallel selection strategy, error-prone PCR was incorporated into the amplification of DNA between rounds (Cadwell and Joyce, PCR Methods Appl 2:28 (1992)). This technique was carried out beginning with the diverse DNA pool eluted after R8 above. This pool was amplified using error-prone PCR, with the pool divided into seven equal parts and mutagenized at the target frequency of 0.8%, 1.6%, 2.4%, 3.2%, 4.0%, 4.8%, and 5.6%. The seven PCR reactions were combined, and cDNA/RNA-protein fusion was made from the mixture and subjected to a round of selection in solution. Before the second mutagenic round, M10, error-prone PCR was performed in three separate reactions, at 0.8%, 1.6%, and 2.4%. The two remaining rounds, M11 and M12, were performed using standard Taq PCR. Except for mutagenesis, the selection conditions for M9-M12 were the same as for R11-R14. The twenty M12 clones tested showed tighter binding to TNF-α than the clones selected using the two earlier selection protocols (Table 3; full sequences in FIG. 25); the tightest binding of TNF-α was seen in M12.04, and had the observed Kd of 20 pM. These results demonstrated that low-level, random mutagenesis late in a selection can improve both the binding affinity of selected antibody mimics (20 pM vs. 90 pM) and the speed with which they can be selected (12 rounds vs. 14 rounds). In addition, the frequency of tight binders in this mutagenesis approach was observed to be about 5%, whereas the frequency is approximately 3% in other selections.
  • Superiority of Fn Binders
  • The selection of 10Fn3 variants capable of binding to TNF-α, performed using covalent mRNA-protein fusion as the unit of selection, was won by molecules with dissociation constants as low as 20 pM. These Kd values compared favorably against the standards of selection of others that used other antibody mimic scaffolds and selection methods. Consequently, the 10Fn3-based scaffold and covalent mRNA-protein fusion-based in vitro selection method may be utilized for the development of antibody mimics against a broad range of antigens. In addition, the subnanomolar, TNF-α-binding 10Fn3 variants described herein represent potential therapeutic, research, and diagnostic agents. Moreover, since this in vitro selection method can be automated, such a combination of scaffold and selection methods have applications on the genomic scale.
  • One of the factors that contributed to the success of the present selection was the randomization of all three CDR-like loops of 10Fn3; similar libraries which contained only one or two randomized loops were less likely to include tight binders than the library with three randomized, CDR-like loops.
  • In the selection reported above, the randomized loops remained the length of the corresponding, wild-type 10Fn3 loops. To generate further library diversity, the length of the loops as well as their sequences may be varied, to incorporate favorable mutations in the 10Fn3 beta-sheet into the wild-type scaffold used for library construction, and to create libraries with randomized beta-sheet scaffolds which will allow selection of structures even more successful at mimicking antibodies.
  • Selections similar to those described herein may be carried out with any other binding species target (for example, IL-1 or IL-13).
  • Animal Studies
  • Wild-type 10Fn3 contains an integrin-binding tripepetide motif, Arginine 78-Glycine 79-Aspartate 80 (the “RGD motif”) at the tip of the FG loop. In order to avoid integrin binding and a potential inflammatory response based on this tripeptide in vivo, a mutant form of 10F3 was generated that contained an inert sequence, Serine 78-Glycine 79-Glutamate 80 (the “SGE mutant”), a sequence which is found in the closely related, wild-type 11Fn3 domain. This SGE mutant was expressed as an N-terminally His6-tagged, free protein in E. coli, and purified to homogeneity on a metal chelate column followed by a size exclusion column.
  • In particular, the DNA sequence encoding His6-10F3(SGE) was cloned into the pET9a expression vector and transformed into BL21 DE3 pLysS cells. The culture was then grown in LB broth containing 50 μg/mL kanamycin at 37° C., with shaking, to A560=1.0, and was then induced with 0.4 mM IPTG. The induced culture was further incubated, under the same conditions, overnight (14-18 hours); the bacteria were recovered by standard, low speed centrifugation. The cell pellet was resuspended in 1/50 of the original culture volume of lysis buffer (50 mM Tris 8.0, 0.5 M NaCl, 5% glycerol, 0.05% Triton X-100, and 1 mM PMSF), and the cells were lysed by passing the resulting paste through a Microfluidics Corporation Microfluidizer M110-EH, three times. The lysate was clarified by centrifugation, and the supernatant was filtered through a 0.45 μm filter followed by filtration through a 0.2 μm filter. 100 mL of the clarified lysate was loaded onto a 5 mL Talon cobalt column (Clontech, Palo Alto, Calif.), washed by 70 mL of lysis buffer, and eluted with a linear gradient of 0-30 mM imidazole in lysis buffer. The flow rate through the column through all the steps was 1 mL/min. The eluted protein was concentrated 10-fold by dialysis (MW cutoff=3,500) against 15,000-20,000 PEG. The resulting sample was dialysed into buffer 1 (lysis buffer without the glycerol), then loaded, 5 mL at a time, onto a 16×60 mm Sephacryl 100 size exclusion column equilibrated in buffer 1. The column was run at 0.8 mL/min, in buffer 1; all fractions that contained a protein of the expected MW were pooled, concentrated 10× as described above, then dialyzed into PBS. Endotoxin screens and animal studies were performed on the resulting sample (Toxikon; MA).
  • The endotoxin levels in the samples examined to date have been below the detection level of the assay. In a preliminary animal toxicology study, this protein was injected into two mice at the estimated 100× therapeutic dose of 2.6 mg/mouse. The animals survived the two weeks of the study with no apparent ill effects. These safety results support the use of 10Fn3 incorporated into an IV drug.
  • Alternative Constructs for In Vivo Use
  • To extend the half life of the 8 kD 10Fn3 domain, a larger molecule has also been constructed that mimics natural antibodies. This 10Fn3-Fc molecule contains the —CH1—CH2—CH3 (FIG. 11) or —CH2—CH3 domains of the IgG constant region of the host; in these constructs, the 10Fn3 domain is grafted onto the N-terminus in place of the IgG VH domain (FIGS. 11 and 12). Such antibody-like constructs are to improve the pharmacokinetics of the protein as well as its ability to harness the natural immune response.
  • In order to construct the murine form of the 10Fn3-CH1—CH2—CH3 clone, the —CH1—CH2—CH3 region was first amplified from a mouse liver spleen cDNA library (Clontech), then ligated into the pET25b vector. The primers used in the cloning were 5′ Fc Nest and 3′ 5 Fc Nest, and the primers used to graft the appropriate restriction sites onto the ends of the recovered insert were 5′Fc HIII and 3′Fc Nhe:
    5′ Fc Nest
    (SEQ ID NO:15)
    5′GCG GCA GGG TTT GCT TAC TGG GGC CAA GGG 3′;
    3′ Fc Nest
    (SEQ ID NO:16)
    5′GGG AGG GGT GGA GGT AGG TCA CAG TCC 3′;
    3′ Fc Nhe
    (SEQ ID NO:17)
    5′ TTT GCT AGC TTT ACC AGG AGA GTG GGA GGC 3′;
    and
    5′ Fc HIII
    (SEQ ID NO:18)
    5′ AAA AAG CTT GCC AAA ACG ACA CCC CCA TCT GTC 3′.
  • Further PCR was used to remove the CH1 region from this clone and to create the Fc part of the shorter, 10Fn3-CH2—CH3 clone. The sequence encoding 10Fn3 was spliced onto the 5′ end of each clone; either the wild type 10Fn3 cloned from the same mouse spleen cDNA library or a modified 10Fn3 obtained by mutagenesis or randomization of the molecules can be used. The oligonucleotides used in the cloning of murine wild-type 10Fn3 were:
    Mo 5PCR-NdeI:
    (SEQ ID NO:19)
    5′ CATATGGTTTCTGATATTCCGAGAGATCTGGAG 3′;
    Mo5PCR-His-NdeI (for an alternative N-terminus
    with the His6 purification tag):
    (SEQ ID NO:20)
    5′ CAT ATG CAT CAC CAT CAC CAT CAC GTT TCT GAT ATT
    CCG AGA G 3′;
    and
    Mo3PCR-EcoRI:
    (SEQ ID NO:21)
    5′ GAATTCCTATGTTTTATAATTGATGGAAAC3′.
  • The human equivalents of the clones are constructed using the same strategy with human oligonucleotide sequences.
  • Antibody Mimics in Protein Chip Applications
  • Any of the antibody mimics described herein may be immobilized on a solid support, such as a microchip. The suitability of the present scaffolds, for example, the 10Fn3 scaffold, for protein chip applications is the consequence of (1) their ability to support many binding functions which can be selected rapidly on the bench or in an automated setup, and (2) their superior biophysical properties.
  • The versatile binding properties of 10Fn3 are a function of the loops displayed by the Fn3 immunoglobulin-like, beta sandwich fold. As discussed above, these loops are similar to the complementarity determining regions of antibody variable domains and can cooperate in a way similar to those antibody loops in order to bind antigens. In our system, 10Fn3 loops BC (for example, residues 21-30),DE (for example, residues 51-56), and FG (for example, residues 76-87) are randomized either in sequence, in length, or in both sequence and length in order to generate diverse libraries of mRNA-10Fn3 fusions. The binders in such libraries are then enriched based on their affinity for an immobilized or tagged target, until a small population of high affinity binders are generated. Also, error-prone PCR and recombination can be employed to facilitate affinity maturation of selected binders. Due to the rapid and efficient selection and affinity maturation protocols, binders to a large number of targets can be selected in a short time.
  • As a scaffold for binders to be immobilized on protein chips, the 10F3 domain has the advantage over antibody fragments and single-chain antibodies of being smaller and easier to handle. For example, unlike single-chain scaffolds or isolated variable domains of antibodies, which vary widely in their stability and solubility, and which require an oxidizing environment to preserve their structurally essential disulfide bonds, 10Fn3 is extremely stable, with a melting temperature of 110° C., and solubility at a concentration >16 mg/mL. The 10Fn3 scaffold also contains no disulfides or free cysteines; consequently, it is insensitive to the redox potential of its environment. A further advantage of 10F3 is that its antigen-binding loops and N-terminus are on the edge of the beta-sandwich opposite to the C-terminus; thus the attachment of a 10Fn3 scaffold to a chip by its C-terminus aligns the antigen-binding loops, allowing for their greatest accessibility to the solution being assayed. Since 10Fn3 is a single domain of only 94 amino acid residues, it is also possible to immobilize it onto a chip surface at a higher density than is used for single-chain antibodies, with their approximately 250 residues. In addition, the hydrophilicity of the 10F3 scaffold, which is reflected in the high solubility of this domain, minimizes unwanted binding of 10Fn3 to a chip surface.
  • The stability of the 10F3 scaffold as well as its suitability for library formation and selection of binders are likely to be shared by the large, Fn3-like class of protein domains with an immunoglobulin-like fold, such as the domains of tenascin, N-cadherin, E-cadherin, ICAM, titin, GCSF-R, cytokine receptor, glycosidase inhibitor, and antibiotic chromoprotein. The key features shared by all such domains are a stable framework provided by two beta-sheets, which are packed against each other and which are connected by at least three solvent-accessible loops per edge of the sheet; such loops can be randomized to generate a library of potential binders without disrupting the structure of the framework (as described above). In addition, as with 10F3, any of these loops (or similar loops from other proteins) may be immobilized alone or in combination with other loops onto a solid support surface.
  • Immobilization of Fn3-Based Antibody Mimics
  • To immobilize antibody mimics, such as Fn3-based antibody mimics, to a chip surface, a number of exemplary techniques may be utilized. For example, such antibody mimics may be immobilized as RNA-protein fusions by Watson-Crick hybridization of the RNA moiety of the fusion to a base complementary DNA immobilized on the chip surface (as described, for example, in Addressable Protein Arrays, U.S. Ser. No. 60/080,686; U.S. Ser. No. 09/282,734; and WO 99/51773; and Methods for Encoding and Sorting In Vitro Translated Proteins, U.S. Ser. No. 60/151,261 and U.S. Ser. No. 09/648,040). Alternatively, antibody mimics can be immobilized as free proteins directly on a chip surface. Manual as well as robotic devices may be used for deposition of the antibody mimics on the chip surface. Spotting robots can be used for deposition of antibody mimics with high density in an array format (for example, by the method of Lueking et al., Anal Biochem. 1999 May 15; 270(1): 103-11). Different methods may also be utilized for anchoring the antibody mimic on the chip surface. A number of standard immobilization procedures may be used including those described in Methods in Enzymology (K. Mosbach and B. Danielsson, eds.), vols. 135 and 136, Academic Press, Orlando, Fla., 1987; Nilsson et al., Protein Expr. Purif. 1997 October; 11(1):1-16; and references therein. Oriented immobilization of antibody mimics can help to increase the binding capacity of chip-bound antibody mimics. Exemplary approaches for achieving oriented coupling are described in Lu et al., The Analyst (1996), vol. 121, p. 29R-32R; and Turkova, J Chromatogr B Biomed Sci App. 1999 Feb. 5; 722(1-2): 11-31. In addition, any of the methods described herein for anchoring antibody mimics to chip surfaces can also be applied to the immobilization of antibody mimics on beads, or other supports.
  • Target Protein Capture and Detection
  • Selected populations of scaffold-binders may be used for detection and/or quantitation of analyte targets, for example, in samples such as biological samples. To carry out this type of diagnostic assay, selected scaffold-binders to targets of interest are immobilized on an appropriate support to form multi-featured protein chips. Next, a sample is applied to the chip, and the components of the sample that associate with the binders are identified based on the target-specificity of the immobilized binders. Using this technique, one or more components may be simultaneously identified or quantitated in a sample (for example, as a means to carry out sample profiling).
  • Methods for target detection allow measuring the levels of bound protein targets and include, without limitation, radiography, fluorescence scanning, mass spectroscopy (MS), and surface plasmon resonance (SPR). Autoradiography using a phosphorimager system (Molecular Dynamics, Sunnyvale, Calif.) can be used for detection and quantification of target protein which has been radioactively labeled, e.g., using 35S methionine. Fluorescence scanning using a laser scanner (see below) may be used for detection and quantification of fluorescently labeled targets. Alternatively, fluorescence scanning may be used for the detection of fluorescently labeled ligands which themselves bind to the target protein (e.g., fluorescently labeled target-specific antibodies or fluorescently labeled streptavidin binding to target-biotin, as described below).
  • Mass spectroscopy can be used to detect and identify bound targets based on their molecular mass. Desorption of bound target protein can be achieved with laser assistance directly from the chip surface as described below. Mass detection also allows determinations, based on molecular mass, of target modifications including post-translational modifications like phosophorylation or glycosylation. Surface plasmon resonance can be used for quantification of bound protein targets where the scaffold-binder(s) are immobilized on a suitable gold-surface (for example, as obtained from Biacore, Sweden).
  • Described below are exemplary schemes for selecting binders (in this case, Fn-binders specific for the protein, TNF-α) and the use of those selected populations for detection on chips. This example is provided for the purpose of illustrating the invention, and should not be construed as limiting.
  • Selection of TNF-α Binders Based on 10Fn3 Scaffold
  • In one exemplary use for scaffold selection on chips, an 10Fn3-based selection was performed against TNF-α, using a library of human 10Fn3 variants with randomized loops BC, DE, and FG. The library was constructed from three DNA fragments, each of which contained nucleotide sequences that encoded approximately one third of human 10Fn3, including one of the randomized loops. The DNA sequences that encoded the loop residues listed above were rebuilt by oligonucleotide synthesis, so that the codons for the residues of interest were replaced by (NNS)n, where N represents any of the four deoxyribonucleotides (A, C, G, or T), and S represents either C or G. The C-terminus of each fragment contained the sequence for the FLAG purification tag.
  • Once extended by Klenow, each DNA fragment was transcribed, ligated to a puromycin-containing DNA linker, and translated in vitro, as described by Szostak et al. (Roberts and Szostak, Proc. Natl. Acad. Sci. USA 94:12297, 1997; Szostak et al., U.S. Ser. No. 09/007,005 and U.S. Ser. No. 09/247,190; Szostak et al., WO98/31700), to generate an mRNA-peptide fusion, which was then reverse-transcribed into a DNA-mRNA-peptide fusion. The binding of the FLAG-tagged peptide to M2 agarose separated full-length fusion molecules from those containing frameshifts or superfluous stop codons; the DNA associated with the purified full-length fusion was amplified by PCR, then the three DNA fragments were cut by Ear I restriction endonuclease and ligated to form the full length template. The template was transcribed, ligated to puromycin-containing DNA linkers, and translated to generate a 10Fn3-RNA/cDNA library, which was then reverse-transcribed to yield the DNA-mRNA-peptide fusion library which was subsequently used in the selection.
  • Selection for TNF-α binders took place in 50 mM HEPES, pH 7.4, 0.02% Triton-X, 0.1 mg/mL salmon sperm DNA. The PROfusion™ library was incubated with Sepharose-immobilized TNF-α; after washing, the DNA associated with the tightest binders was eluted with 0.1 M KOH, amplified by PCR, and transcribed, ligated, translated, and reverse-transcribed into the starting material for the next round of selection.
  • Ten rounds of such selection were performed (as shown in FIG. 13); they resulted in a PROfusion™ pool that bound to TNF-α-Sepharose with the apparent average Kd of 120 nM. Specific clonal components of the pool that were characterized showed TNF-α binding in the range of 50-500 nM.
  • Immobilization, Target Protein Capture, and MALDI-TOF Detection
  • As a first step toward immobilizing Fn3 fusions to a chip surface, an oligonucleotide capture probe was prepared with an automated DNA synthesizer (PE BioSystems Expedite 8909) using the solid-support phosphoramidite approach. All reagents were obtained from Glen Research. Synthesis was initiated with a solid support containing a disulfide bond to eventually provide a 3′-terminal thiol functionality. The first four monomers to be added were hexaethylene oxide units, followed by 20 T monomers. The 5′-terminal DMT group was not removed. The capture probe was cleaved from the solid support and deprotected with ammonium hydroxide, concentrated to dryness in a vacuum centrifuge, and purified by reverse-phase HPLC using an acetonitrile gradient in triethylammonium acetate buffer. Appropriate fractions from the HPLC were collected, evaporated to dryness in a vacuum centrifuge, and the 5′-terminal DMT group was removed by treatment with 80% AcOH for 30 minutes. The acid was removed by evaporation, and the oligonucleotide was then treated with 100 mM DTT for 30 minutes to cleave the disulfide bond. DTT was removed by repeated extraction with EtOAc. The oligonucleotide was ethanol precipitated from the remaining aqueous layer and checked for purity by reverse-phase HPLC.
  • The 3′-thiol capture probe was adjusted to 250 μM in degassed 1× PBS buffer and applied as a single droplet (75 μL) to a 9×9 mm gold-coated chip (Biacore) in an argon-flushed chamber containing a small amount of water. After 18 hours at room temperature, the capture probe solution was removed, and the functionalized chip was washed with 50 mL 1× PBS buffer (2× for 15 minutes each) with gentle agitation, and then rinsed with 50 mL water (2× for 15 minutes each) in the same fashion. Remaining liquid was carefully removed and the functionalized chips were either used immediately or stored at 4° C. under argon.
  • About 1 pmol of 10Fn3 fusion pool from the Round 10 TNF-α selection (above) was treated with RNAse A for several hours, adjusted to 5×SSC in 70 μL, and applied to a functionalized gold chip from above as a single droplet. A 50 μL volume gasket device was used to seal the fusion mixture with the functionalized chip, and the apparatus was continuously rotated at 4° C. After 18 hours the apparatus was disassembled, and the gold chip was washed with 50 mL 5×SSC for 10 minutes with gentle agitation. Excess liquid was carefully removed from the chip surface, and the chip was passivated with a blocking solution (1×TBS+0.02% Tween-20+0.25% BSA) for 10 minutes at 4° C. Excess liquid was carefully removed, and a solution containing 500 μg/mL TNF-α in the same composition blocking solution was applied to the chip as a single droplet and incubated at 4° C. for two hours with occasional mixing of the droplet via Pipetman. After removal of the binding solution, the chip was washed for 5 minutes at 4° C. with gentle agitation (50 mL 1×TBS+0.02% Tween-20) and then dried at room temperature. A second chip was prepared exactly as described above, except fusion was not added to the hybridization mix.
  • Next, MALDI-TOF matrix (15 mg/mL 3,5-dimethoxy-4-hydroxycinnamic acid in 1:1 ethanol/10% formic acid in water) was uniformly applied to the gold chips with a high-precision 3-axis robot (MicroGrid, BioRobotics). A 16-pin tool was used to transfer the matrix from a 384-well microtiter plate to the chips, producing 200 micron diameter features with a 600 micron pitch. The MALDI-TOF mass spectrometer (Voyager DE, PerSeptive Biosystems) instrument settings were as follows: Accelerating Voltage=25 k, Grid Voltage=92%, Guide Wire Voltage=0.05%, Delay=200 on, Laser Power=2400, Low Mass Gate=1500, Negative Ions=off. The gold chips were individually placed on a MALDI sample stage modified to keep the level of the chip the same as the level of the stage, thus allowing proper flight distance. The instrument's video monitor and motion control system were used to direct the laser beam to individual matrix features.
  • FIGS. 14 and 15 show the mass spectra from the 10Fn3 fusion chip and the non-fusion chip, respectively. In each case, a small number of 200 micron features were analyzed to collect the spectra, but FIG. 15 required significantly more acquisitions. The signal at 17.4 kDa corresponds to TNF-α monomer.
  • Immobilization, Target Protein Capture, and Fluorescence Detection
  • Pre-cleaned 1×3 inch glass microscope slides (Goldseal, #3010) were treated with Nanostrip (Cyantek) for 15 minutes, 10% aqueous NaOH at 70° C. for 3 minutes, and 1% aqueous HCl for 1 minute, thoroughly rinsing with deionized water after each reagent. The slides were then dried in a vacuum desiccator over anhydrous calcium sulfate for several hours. A 1% solution of aminopropytrimethoxysilane in 95% acetone/5% water was prepared and allowed to hydrolyze for 20 minutes. The glass slides were immersed in the hydrolyzed silane solution for 5 minutes with gentle agitation. Excess silane was removed by subjecting the slides to ten 5-minute washes, using fresh portions of 95% acetone/5% water for each wash, with gentle agitation. The slides were then cured by heating at 110° C. for 20 minutes. The silane treated slides were immersed in a freshly prepared 0.2% solution of phenylene 1,4-diisothiocyanate in 90% DMF/10% pyridine for two hours, with gentle agitation. The slides were washed sequentially with 90% DMF/10% pyridine, methanol, and acetone. After air drying, the functionalized slides were stored at 0° C. in a vacuum desiccator over anhydrous calcium sulfate. Similar results were obtained with commercial amine-reactive slides (3-D Link, Surmodics).
  • Oligonucleotide capture probes were prepared with an automated DNA synthesizer (PE BioSystems Expedite 8909) using conventional phosphoramidite chemistry. All reagents were from Glen Research. Synthesis was initiated with a solid support bearing an orthogonally protected amino functionality, whereby the 3′-terminal amine is not unmasked until final deprotection step. The first four monomers to be added were hexaethylene oxide units, followed by the standard A, G, C and T monomers. All capture oligo sequences were cleaved from the solid support and deprotected with ammonium hydroxide, concentrated to dryness, precipitated in ethanol, and purified by reverse-phase HPLC using an acetonitrile gradient in triethylammonium acetate buffer. Appropriate fractions from the HPLC were collected, evaporated to dryness in a vacuum centrifuge, and then coevaporated with a portion of water.
  • The purified, amine-labeled capture oligos were adjusted to a concentration of 250 μM in 50 mM sodium carbonate buffer (pH 9.0) containing 10% glycerol. The probes were spotted onto the amine-reactive glass surface at defined positions in a 5×5×6 array pattern with a 3-axis robot (MicroGrid, BioRobotics). A 16-pin tool was used to transfer the liquid from 384-well microtiter plates, producing 200 micron features with a 600 micron pitch. Each sub-grid of 24 features represents a single capture probe (i.e., 24 duplicate spots). The arrays were incubated at room temperature in a moisture-saturated environment for 12-18 hours. The attachment reaction was terminated by immersing the chips in 2% aqueous ammonium hydroxide for five minutes with gentle agitation, followed by rinsing with distilled water (3× for 5 minutes each). The array was finally soaked in 10×PBS solution for 30 minutes at room temperature, and then rinsed again for 5 minutes in distilled water.
  • Specific and thermodynamically isoenergetic sequences along the 10F3 mRNA were identified to serve as capture points to self-assemble and anchor the 10Fn3 protein. The software program HybSimulator v4.0 (Advanced Gene Computing Technology, Inc.) facilitated the identification and analysis of potential capture probes. Six unique capture probes were chosen and printed onto the chip, three of which are complementary to common regions of the 10F3 fusion pool's mRNA (CP3′, CP5′, and CPflag). The remaining three sequences (CPneg1, CPneg2, and CPneg3) are not complementary and function in part as negative controls. Each of the capture probes possesses a 3′-amino terminus and four hexaethylene oxide spacer units, as described above. The following is a list of the capture probe sequences that were employed (5′→3′):
    CP3′:
    (SEQ ID NO:22)
    TGTAAATAGTAATTGTCCC
    CP5′:
    (SEQ ID NO:23)
    TTTTTTTTTTTTTTTTTTTT
    CPneg1:
    (SEQ ID NO:24)
    CCTGTAGGTGTCCAT
    CPflag:
    (SEQ ID NO:25)
    CATCGTCCTTGTAGTC
    CPneg2:
    (SEQ ID NO:26)
    CGTCGTAGGGGTA
    CPneg3:
    (SEQ ID NO:27)
    CAGGTCTTCTTCAGAGA
  • About 1 pmol of 10F3 fusion pool from the Round 10 TNF-α selection was adjusted to 5×SSC containing 0.02% Tween-20 and 2 mM vanadyl ribonucleotide complex in a total volume of 350 μL. The entire volume was applied to the microarray under a 400 μL gasket device and the assembly was continuously rotated for 18 hours at room temperature. After hybridization the slide was washed sequentially with stirred 500 mL portions of 5×SSC, 2.5×SSC, and 1×SSC for 5 minutes each. Traces of liquid were removed by centrifugation and the slide was allowed to air-dry.
  • Recombinant human TNF-α (500 μg, lyophilized, from PreproTech) was taken up in 230 μL 1×PBS and dialyzed against 700 mL stirred 1×PBS at 4° C. for 18 hours in a Microdialyzer unit (3,500 MWCO, Pierce). The dialyzed TNF-α was treated with EZ-Link NHS-LC-LC biotinylation reagent (20 μg, Pierce) for 2 hours at 0° C., and again dialyzed against 700 mL stirred 1×PBS at 4° C. for 18 hours in a Microdialyzer unit (3,500 MWCO, Pierce). The resulting conjugate was analyzed by MALDI-TOF mass spectrometry and was found to be almost completely functionalized with a single biotin moiety.
  • Each of the following processes was conducted at 4° C. with continuous rotation or mixing. The protein microarray surface was passivated by treatment with 1×TBS containing 0.02% Tween-20 and 0.2% BSA (200 μL) for 60 minutes. Biotinylated TNF-α (100 nM concentration made up in the passivation buffer) was contacted with the microarray for 120 minutes. The microarray was washed with 1×TBS containing 0.02% Tween-20 (3×50 mL, 5 minutes each wash). Fluorescently labeled streptavidin (2.5 μg/mL Alexa 546-streptavidin conjugate from Molecular Probes, made up in the passivation buffer) was contacted with the microarray for 60 minutes. The microarray was washed with 1×TBS containing 0.02% Tween-20 (2×50 mL, 5 minutes each wash) followed by a 3 minute rinse with 1×TBS. Traces of liquid were removed by centrifugation, and the slide was allowed to air-dry at room temperature.
  • Fluorescence laser scanning was performed with a GSI Lumonics ScanArray 5000 system-using 10 μM pixel resolution and preset excitation and emission wavelengths for Alexa 546 dye. Phosphorimage analysis was performed with a Molecular Dynamics Storm system. Exposure time was 48 hours with direct contact between the microarray and the phosphor storage screen. Phosphorimage scanning was performed at the 50 micron resolution setting, and data was extracted with ImageQuant v.4.3 software.
  • FIGS. 16 and 17 are the phosphorimage and fluorescence scan, respectively, of the same array. The phosphorimage shows where the 10Fn3 fusion hybridized based on the 35S methionine signal. The fluorescence scan shows where the labeled TNF-α bound.
    <400> SEQUENCE: 8
    ggaattccta atacgactca ctatagggac aattactatt tacaattaca at
    #gcatcacc     60
    atcaccatca cctcttctat accatcactg tgtatgctgt c    
    #                  
    #  101
    <210> SEQ ID NO 9
    <211> LENGTH: 114
    <212> TYPE: DNA
    <213> ORGANISM: Homo sapiens
    <220> FEATURE:
    <221> NAME/KEY: misc_feature
    <222> LOCATION: (1)...(114)
    <223> OTHER INFORMATION: n = A,T,C or G
    <220> FEATURE:
    <221> NAME/KEY: misc_feature
    <222> LOCATION: (1)...(114)
    <223> OTHER INFORMATION: s = C or G
    <220> FEATURE:
    <221> NAME/KEY: misc_feature
    <222> LOCATION: 59, 60, 62, 63, 65, 66, 68, 
    #69, 71, 72, 74, 75, 77, 78,
          80, 81, 83, 84, 86, 87
    <223> OTHER INFORMATION: n = A,T,C or G
    <400> SEQUENCE: 9
    agcggatgcc ttgtcgtcgt cgtccttgta gtctgttcgg taattaatgg aa
    #attggsnn     60
    snnsnnsnns nnsnnsnnsn nsnnsnnagt gacagcatac acagtgatgg ta
    #ta          114
    <210> SEQ ID NO 10
    <211> LENGTH: 57
    <212> TYPE: DNA
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 10
    agcggatgcc ttgtcgtcgt cgtccttgta gtctgttcgg taattaatgg aa
    #attgg        57
    <210> SEQ ID NO 11
    <211> LENGTH: 45
    <212> TYPE: DNA
    <213> ORGANISM: T7 phage and tobacco mosaic vir
    #us
    <400> SEQUENCE: 11
    gcgtaatacg actcactata gggacaatta ctatttacaa ttaca   
    #                  
    #45
    <210> SEQ ID NO 12
    <211> LENGTH: 33
    <212> TYPE: DNA
    <213> ORGANISM: Artificial Sequence
    <220> FEATURE:
    <223> OTHER INFORMATION: Flag sequence
    <400> SEQUENCE: 12
    agcggatgcc ttgtcgtcgt cgtccttgta gtc       
    #                  
    #         33
    <210> SEQ ID NO 13
    <211> LENGTH: 19
    <212> TYPE: DNA
    <213> ORGANISM: Artificial Sequence
    <220> FEATURE:
    <223> OTHER INFORMATION: Splint oligonucleotide
    <220> FEATURE:
    <221> NAME/KEY: misc_feature
    <222> LOCATION: (1)...(19)
    <223> OTHER INFORMATION: n = A,T,C or G
    <220> FEATURE:
    <221> NAME/KEY: misc_feature
    <222> LOCATION: 10
    <223> OTHER INFORMATION: n = A,T,C or G
    <400> SEQUENCE: 13
    tttttttttn agcggatgc             
    #                  
    #                  
    # 19
    <210> SEQ ID NO 14
    <211> LENGTH: 20
    <212> TYPE: DNA
    <213> ORGANISM: Artificial Sequence
    <220> FEATURE:
    <223> OTHER INFORMATION: Puromycin linker oligonucleo
    #tide
    <400> SEQUENCE: 14
    aaaaaaaaaa aaaaaaaacc            
    #                  
    #                  
    # 20
    <210> SEQ ID NO 15
    <211> LENGTH: 30
    <212> TYPE: DNA
    <213> ORGANISM: Mus musculus
    <400> SEQUENCE: 15
    gcggcagggt ttgcttactg gggccaaggg         
    #                  
    #           30
    <210> SEQ ID NO 16
    <211> LENGTH: 27
    <212> TYPE: DNA
    <213> ORGANISM: Mus musculus
    <400> SEQUENCE: 16
    gggaggggtg gaggtaggtc acagtcc          
    #                  
    #             27
    <210> SEQ ID NO 17
    <211> LENGTH: 30
    <212> TYPE: DNA
    <213> ORGANISM: Mus musculus
    <400> SEQUENCE: 17
    tttgctagct ttaccaggag agtgggaggc         
    #                  
    #           30
    <210> SEQ ID NO 18
    <211> LENGTH: 33
    <212> TYPE: DNA
    <213> ORGANISM: Mus musculus
    <400> SEQUENCE: 18
    aaaaagcttg ccaaaacgac acccccatct gtc       
    #                  
    #         33
    <210> SEQ ID NO 19
    <211> LENGTH: 33
    <212> TYPE: DNA
    <213> ORGANISM: Mus musculus
    <400> SEQUENCE: 19
    catatggttt ctgatattcc gagagatctg gag       
    #                  
    #         33
    <210> SEQ ID NO 20
    <211> LENGTH: 43
    <212> TYPE: DNA
    <213> ORGANISM: Mus musculus
    <400> SEQUENCE: 20
    catatgcatc accatcacca tcacgtttct gatattccga gag    
    #                  
    # 43
    <210> SEQ ID NO 21
    <211> LENGTH: 30
    <212> TYPE: DNA
    <213> ORGANISM: Mus musculus
    <400> SEQUENCE: 21
    gaattcctat gttttataat tgatggaaac         
    #                  
    #           30
    <210> SEQ ID NO 22
    <211> LENGTH: 19
    <212> TYPE: DNA
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 22
    tgtaaatagt aattgtccc             
    #                  
    #                  
    # 19
    <210> SEQ ID NO 23
    <211> LENGTH: 20
    <212> TYPE: DNA
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 23
    tttttttttt tttttttttt            
    #                  
    #                  
    # 20
    <210> SEQ ID NO 24
    <211> LENGTH: 15
    <212> TYPE: DNA
    <213> ORGANISM: Artificial Sequence
    <220> FEATURE:
    <223> OTHER INFORMATION: Oligonucleotide
    <400> SEQUENCE: 24
    cctgtaggtg tccat              
    #                  
    #                  
    #    15
    <210> SEQ ID NO 25
    <211> LENGTH: 16
    <212> TYPE: DNA
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 25
    catcgtcctt gtagtc             
    #                  
    #                  
    #    16
    <210> SEQ ID NO 26
    <211> LENGTH: 13
    <212> TYPE: DNA
    <213> ORGANISM: Artificial Sequence
    <220> FEATURE:
    <223> OTHER INFORMATION: Oligonucleotide
    <400> SEQUENCE: 26
    cgtcgtaggg gta              
    #                  
    #                  
    #      13
    <210> SEQ ID NO 27
    <211> LENGTH: 17
    <212> TYPE: DNA
    <213> ORGANISM: Artificial Sequence
    <220> FEATURE:
    <223> OTHER INFORMATION: Oligonucleotide
    <400> SEQUENCE: 27
    caggtcttct tcagaga             
    #                  
    #                  
    #   17
    <210> SEQ ID NO 28
    <211> LENGTH: 24
    <212> TYPE: DNA
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 28
    catatggttt ctgatgttcc gagg          
    #                  
    #                24
    <210> SEQ ID NO 29
    <211> LENGTH: 32
    <212> TYPE: DNA
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 29
    gaattcctat gttcggtaat taatggaaat tg       
    #                  
    #          32
    <210> SEQ ID NO 30
    <211> LENGTH: 19
    <212> TYPE: DNA
    <213> ORGANISM: Artificial Sequence
    <220> FEATURE:
    <223> OTHER INFORMATION: DNA splint Oligonucleotide
    <220> FEATURE:
    <221> NAME/KEY: misc_feature
    <222> LOCATION: 10
    <223> OTHER INFORMATION: n = A,T,C or G
    <400> SEQUENCE: 30
    tttttttttn agcggatgc             
    #                  
    #                  
    # 19
    <210> SEQ ID NO 31
    <211> LENGTH: 7
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapien
    <400> SEQUENCE: 31
    Asn Arg Ser Gly Leu Gln Ser
     1               5
    <210> SEQ ID NO 32
    <211> LENGTH: 10
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapien
    <400> SEQUENCE: 32
    Ala Gln Thr Gly His His Leu His Asp Lys
     1               5  
    #                10
    <210> SEQ ID NO 33
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapien
    <400> SEQUENCE: 33
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Asp Ala Pro Ala Va
    #l Thr Val Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Ser Lys Ser Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Gly Arg Gly Asp
    65                  
    #70                  
    #75                  
    #80
    Ser Pro Ala Ser Ser Lys Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 34
    <211> LENGTH: 95
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 34
    Val Ser Glu Ile Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Phe Ser Trp Asp Ala Pro Ala Va
    #l Thr Val Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Leu Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Ser Lys Ser Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Asn Thr Ile Thr Gly Tyr Al
    #a Val Thr Thr Thr Tyr
    65                  
    #70                  
    #75                  
    #80
    Arg Thr Arg Ile Asp Lys Gln Pro Ile Ser Il
    #e Asn Tyr Arg Thr
                    85  
    #                90  
    #                95
    <210> SEQ ID NO 35
    <211> LENGTH: 90
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapien
    <400> SEQUENCE: 35
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Asp Ala Pro Ala Va
    #l Thr Val Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Lys Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Glu Leu Asn Pro Thr Ala Thr Il
    #e Ser Arg Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Gln Asn Gly Thr
    65                  
    #70                  
    #75                  
    #80
    Pro Arg Arg His Leu Arg Pro Asn Phe His
                    85  
    #                90
    <210> SEQ ID NO 36
    <211> LENGTH: 95
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapien
    <400> SEQUENCE: 36
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Gly Leu Leu Ile Ser Trp Asn Lys Ser Arg Me
    #t Thr Thr Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Val Thr Asp Ser Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Asn Thr Ile Ile Val His Al
    #a Val Thr Leu Thr Asn
    65                  
    #70                  
    #75                  
    #80
    Gln Asn Ser Asp His Thr Tyr Pro Ile Ser Il
    #e Asn Tyr Arg Thr
                    85  
    #                90  
    #                95
    <210> SEQ ID NO 37
    <211> LENGTH: 91
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapien
    <400> SEQUENCE: 37
    Val Ser Asp Val Pro Arg Asp Leu Asp Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Asp Ser Ser His Ar
    #g Tyr Tyr Arg Ile Thr
                20      
    #            25      
    #            30
    Tyr Gly Glu Thr Gly Gly Asn Ser Pro Val Gl
    #n Glu Phe Thr Ala Pro
            35          
    #        40          
    #        45
    Asn Asn Pro Pro Thr Ala Thr Ile Ser Gly Le
    #u Lys Pro Gly Val Asp
        50              
    #    55              
    #    60
    Tyr Thr Ile Thr Val Tyr Ala Val Thr Pro As
    #p Gly Ser Arg His Met
    65                  
    #70                  
    #75                  
    #80
    Leu Thr Lys Pro Ile Ser Ile Asn Tyr Arg Th
    #r
                    85  
    #                90
    <210> SEQ ID NO 38
    <211> LENGTH: 88
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 38
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp His Asn Asn His Il
    #e Asp Met Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ser Ala Asn Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Val Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gln Arg Arg Gln Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Pro Lys Asn Gln
    65                  
    #70                  
    #75                  
    #80
    Gly Arg Arg Arg Gln Gly Ile Arg
                    85
    <210> SEQ ID NO 39
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 39
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Ser Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Arg Thr Pro Ala Se
    #r Pro His Gly Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Glu Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Leu Leu Trp Pro Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Pro Thr His Met
    65                  
    #70                  
    #75                  
    #80
    Leu Lys Pro Gln Ser Met Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 40
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapien
    <400> SEQUENCE: 40
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Arg Thr Pro Ala Se
    #r Pro His Gly Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Glu Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Leu Leu Trp Pro Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Pro Thr His Met
    65                  
    #70                  
    #75                  
    #80
    Leu Lys Pro Gln Ser Met Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 41
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapien
    <400> SEQUENCE: 41
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Ala Ser Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Arg Pro Asn Pro Ar
    #g Leu Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Leu Phe Ser Thr Ala Thr Il
    #e Ser Gly Leu Asn Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Pro Lys Glu Thr
    65                  
    #70                  
    #75                  
    #80
    Ser Asn Ile Phe Ile Ala Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 42
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapien
    <400> SEQUENCE: 42
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Ser Thr
     1               5  
    #                10  
    #                15
    Cys Leu Leu Ile Ser Trp Arg Pro Asn Pro Ar
    #g Leu Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Leu Phe Ser Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Pro Lys Glu Thr
    65                  
    #70                  
    #75                  
    #80
    Ser Asn Ile Phe Ile Ala Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 43
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapien
    <400> SEQUENCE: 43
    Val Ser Asp Val Pro Arg Asp Pro Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Asp Pro Asn Ile Ar
    #g Leu Arg Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Phe Phe Ser Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ala Ser Arg Asn
    65                  
    #70                  
    #75                  
    #80
    Glu Asp Thr Arg Phe Gly Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 44
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 44
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Phe Arg Ser Leu Gl
    #n Arg Asp Arg Asp Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Phe Arg Met Lys Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Il
    #e Thr Pro Pro Asp Lys
    65                  
    #70                  
    #75                  
    #80
    Met Glu Pro Pro Lys Gly Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 45
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 45
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Tyr Arg His Thr Ty
    #r Arg Asp Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Ser
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Thr Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Ala Val Tyr Ala Va
    #l Thr Asp Thr Gly Tyr
    65                  
    #70                  
    #75                  
    #80
    Asp Val His Thr Lys Arg Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 46
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 46
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Ala Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Tyr Arg His Thr Ty
    #r Arg Asp Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Thr Thr Ala Ala Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Ala Ile Ala Val Tyr Ala Va
    #l Thr Asp Thr Gly Tyr
    65                  
    #70                  
    #75                  
    #80
    Asp Val His Thr Lys Arg Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 47
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 47
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Gln Leu Ile Ser Trp Pro Phe Gly Trp Ty
    #r Pro Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Arg Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Tyr Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Phe Ser Gln Val His Thr Pro Asn Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 48
    <211> LENGTH: 93
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 48
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Pro Gly Arg Th
    #r Tyr Ser Arg Tyr Arg
                20      
    #            25      
    #            30
    Ile Thr Tyr Gly Glu Thr Gly Gly Asn Ser Pr
    #o Val Gln Glu Phe Thr
            35          
    #        40          
    #        45
    Val Pro Pro Trp Ala Asn Thr Ala Thr Ile Se
    #r Gly Leu Lys Pro Gly
        50              
    #    55              
    #    60
    Val Asp Tyr Thr Ile Thr Val Tyr Ala Val Th
    #r Pro Leu Pro Ile Pro
    65                  
    #70                  
    #75                  
    #80
    Thr Leu Val His Gly Pro Ile Ser Ile Asn Ty
    #r Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 49
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 49
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Ala Ser Pro Pro Me
    #t Trp Cys Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Gl
    #y Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Thr Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Glu Tyr Leu Pro
    65                  
    #70                  
    #75                  
    #80
    Glu Trp Asn Met Thr Gln Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 50
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 50
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Ala Ser Pro Pro Me
    #t Trp Cys Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Thr Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Met Tyr Ala Va
    #l Thr Glu Tyr Leu Pro
    65                  
    #70                  
    #75                  
    #80
    Glu Trp Asn Met Thr Gln Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 51
    <211> LENGTH: 93
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 51
    Asn Thr Thr His Tyr Arg Lys Asn Asn Tyr Ty
    #r Ala Thr Pro Thr Ser
     1               5  
    #                10  
    #                15
    Arg Leu Ile Ser Trp Asn Arg Ser Gly Leu Gl
    #n Ser Arg Tyr Tyr Arg
                20      
    #            25      
    #            30
    Ile Thr Tyr Gly Glu Thr Gly Gly Asn Ser Pr
    #o Val Gln Glu Phe Thr
            35          
    #        40          
    #        45
    Val Pro Pro Trp Ala Ser Ile Ala Thr Ile Se
    #r Gly Leu Lys Pro Gly
        50              
    #    55              
    #    60
    Val Asp Tyr Thr Ile Thr Val Tyr Ala Val Th
    #r Asp Lys Ser Asp Thr
    65                  
    #70                  
    #75                  
    #80
    Tyr Lys Tyr Asp Asp Pro Ile Ser Ile Asn Ty
    #r Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 52
    <211> LENGTH: 93
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 52
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ser Ala Thr Arg
    65                  
    #70                  
    #75                  
    #80
    Thr Val Lys Arg Asp Pro Ile Ser Ile Asn Ty
    #r Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 53
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 53
    Val Ser Asp Ala Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Met Tyr Ala Va
    #l Thr Ser Asn Val Gly
    65                  
    #70                  
    #75                  
    #80
    Arg Leu Asp Thr Arg Tyr Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 54
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 54
    Val Ser Asp Val Pro Arg Asp Leu Asp Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Glu Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ser Asn Val Gly
    65                  
    #70                  
    #75                  
    #80
    Arg Leu Asp Thr Arg Tyr Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 55
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 55
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Lys Glu Pro Gln
    65                  
    #70                  
    #75                  
    #80
    Arg His Ala Leu Val Thr Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 56
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 56
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Glu Thr Pro Ser
    65                  
    #70                  
    #75                  
    #80
    Thr Lys Pro His Asn Val Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 57
    <211> LENGTH: 74
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 57
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn His Pro Gly Pr
    #o Phe Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Arg Thr Ala Ile Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala
    65                  
    #70
    <210> SEQ ID NO 58
    <211> LENGTH: 93
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 58
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Thr Ser
     1               5  
    #                10  
    #                15
    Leu Leu Ile Ser Trp Asp Tyr Asn Arg Thr Gl
    #y Asp Arg Tyr Tyr Arg
                20      
    #            25      
    #            30
    Ile Thr Tyr Gly Glu Thr Gly Gly Asn Ser Pr
    #o Val Gln Glu Phe Thr
            35          
    #        40          
    #        45
    Val Pro Pro Trp Ala Ser Ile Ala Thr Ile Gl
    #y Gly Leu Lys Pro Gly
        50              
    #    55              
    #    60
    Val Asp Tyr Thr Ile Thr Val Tyr Ala Val Th
    #r Ala Gln Thr Gly His
    65                  
    #70                  
    #75                  
    #80
    His Leu His Asp Lys Pro Ile Ser Ile Asn Ty
    #r Arg Ser
                    85  
    #                90
    <210> SEQ ID NO 59
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 59
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Ser Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp His Tyr Leu Arg Ar
    #g Gln Pro Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Gly Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ala Gln Thr Gly
    65                  
    #70                  
    #75                  
    #80
    His His Leu His Asp Glu Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 60
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 60
    Val Ser Asp Val Pro Arg Asp Leu Gln Ile Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Asp Ile Ser Arg Ty
    #r Lys His Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asp Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Ala Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Gly Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ala Gln Thr Gly
    65                  
    #70                  
    #75                  
    #80
    His His Leu His Asp Lys Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 61
    <211> LENGTH: 93
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 61
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Pro Thr Ser As
    #n Pro Pro Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Thr Ile Gl
    #y Gly Leu Lys Pro Gly
        50              
    #    55              
    #    60
    Val Asp Tyr Thr Ile Thr Val Tyr Ala Val Th
    #r Ala Gln Thr Gly His
    65                  
    #70                  
    #75                  
    #80
    His Leu His Asp Lys Pro Ile Ser Ile Asn Ty
    #r Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 62
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 62
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Cys Trp Arg Pro Thr Ser As
    #n Pro Pro Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Gly Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ala Gln Thr Gly
    65                  
    #70                  
    #75                  
    #80
    His His Leu His Asp Lys Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 63
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 63
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Pro Thr Ser As
    #n Pro Pro Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Ser Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Gly Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ala Gln Thr Gly
    65                  
    #70                  
    #75                  
    #80
    His His Leu His Asp Lys Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 64
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 64
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Gln Leu Ile Ser Trp Lys Thr Thr Asn Pr
    #o Thr Ala Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Thr Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asn Leu Thr Thr
    65                  
    #70                  
    #75                  
    #80
    Arg Arg Arg His Arg Ala Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 65
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 65
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Thr Thr Arg His Se
    #r Pro Val Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Ile Val Pro Pro Trp Ala Thr Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Met Pro Thr Asn
    65                  
    #70                  
    #75                  
    #80
    Trp Arg Phe Pro His Arg Pro Ile Ser Ile As
    #p Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 66
    <211> LENGTH: 90
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 66
    Val Ser Asp Val Pro Arg Asp Leu Glu Ala Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Arg Glu Arg Glu Arg Arg Ty
    #r Tyr Arg Ile Thr Tyr
                20      
    #            25      
    #            30
    Gly Glu Thr Gly Gly Asn Ser Pro Val Gln Gl
    #u Phe Thr Val Pro Gly
            35          
    #        40          
    #        45
    Ser Lys Ser Thr Ala Thr Ile Ser Gly Leu Gl
    #u Pro Gly Val Asp Tyr
        50              
    #    55              
    #    60
    Thr Ile Thr Val Tyr Ala Val Thr Pro His Hi
    #s Gly His Phe Asp Leu
    65                  
    #70                  
    #75                  
    #80
    Glu Leu Pro Ile Ser Ile Asn Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 67
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 67
    Val Ser Asp Val Pro Arg Asp Leu Glu Gly Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Arg Lys Asp Arg Val Se
    #r Ser Arg Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Ser Lys Ser Thr Ala Ile Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Ala Tyr Val Va
    #l Thr Pro His His Gly
    65                  
    #70                  
    #75                  
    #80
    His Phe Asp Leu Glu Leu Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 68
    <211> LENGTH: 95
    <212> TYPE: PRT
    <213> ORGANISM: Homo spiens
    <400> SEQUENCE: 68
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp His Met Ala Thr Pr
    #o Asn Thr Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Thr Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Ser Lys Ser Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Asn Thr Asn Thr Val Tyr Al
    #a Val Thr Ser Val Asn
    65                  
    #70                  
    #75                  
    #80
    Ala Phe Pro Tyr Glu Gly Met Pro Ile Ser Il
    #e Asn Tyr Arg Thr
                    85  
    #                90  
    #                95
    <210> SEQ ID NO 69
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 69
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Ala Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ser Ser Trp Tyr Leu Cys Thr Gl
    #y Asn Asn Arg Asp Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Ala Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Ser Lys Ser Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Ile Pro Ser Arg Cys Met Le
    #u Ser Leu Ala Ser Leu
    65                  
    #70                  
    #75                  
    #80
    Met Ser Thr Arg Asn Lys Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 70
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 70
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Arg Thr Pro Ala Se
    #r Pro His Gly Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Glu Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Leu Leu Trp Pro Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Ala Ile Thr Val Tyr Ala Va
    #l Thr Pro Thr His Met
    65                  
    #70                  
    #75                  
    #80
    Leu Lys Pro Leu Ser Met Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 71
    <211> LENGTH: 93
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 71
    Ile Ser Asp Val Pro Arg Asp Met Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Asn Met Ala His Pr
    #o His Asp Arg Asn Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Ser
            35          
    #        40          
    #        45
    Thr Val Pro Arg Tyr Leu Ser Thr Ala Thr Il
    #e Ser Gly Pro Lys Arg
        50              
    #    55              
    #    60
    Val Asp Tyr Thr Ile Ile Val Tyr Ala Val As
    #n Gln Pro Thr Val Ser
    65                  
    #70                  
    #75                  
    #80
    Ala His Asn His Ala Pro Ile Ser Ile Asn Ty
    #r Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 72
    <211> LENGTH: 93
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 72
    Val Ser Asp Val Pro Arg Asp Leu Lys Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Phe Pro Asp Asn Al
    #a Thr Pro Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Leu Phe Thr Thr Ala Thr Ile Se
    #r Gly Leu Lys Pro Gly
        50              
    #    55              
    #    60
    Val Asp Tyr Thr Ile Thr Val Tyr Ala Val Th
    #r Ser His Arg Asp Tyr
    65                  
    #70                  
    #75                  
    #80
    His Ser Thr Gly Arg Pro Ile Ser Ile Asn Ty
    #r Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 73
    <211> LENGTH: 95
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 73
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Met Leu Leu Arg As
    #p Asp Arg Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Thr Phe His Pro Thr Ala Thr Il
    #e Ser Gly Arg Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Asn Thr Ile Thr Val Tyr Al
    #a Val Thr Gln Ser Thr
    65                  
    #70                  
    #75                  
    #80
    Asn Gly Asn Arg Asn Asp Phe Pro Ile Ser Il
    #e Asn Tyr Arg Thr
                    85  
    #                90  
    #                95
    <210> SEQ ID NO 74
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 74
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Ser Pro Pro Asn As
    #p Ala His Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Ser Lys Ser Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Asn Thr Val Tyr Ala Va
    #l Thr Asp Gln Gln Ser
    65                  
    #70                  
    #75                  
    #80
    Tyr Thr Tyr Tyr Ser Asn Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 75
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 75
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Val Ile Ser Trp Ser Pro Pro Asn As
    #p Ala His Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Ser Lys Ser Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Asn Thr Val Tyr Ala Va
    #l Thr Asp Gln Gln Ser
    65                  
    #70                  
    #75                  
    #80
    Tyr Thr Tyr Tyr Ser Asn Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 76
    <211> LENGTH: 92
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 76
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Ser Pro Pro Asn As
    #p Ala His Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Thr Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Thr Me
    #t Pro Thr Asn Trp Arg
    65                  
    #70                  
    #75                  
    #80
    Phe Pro His Arg Pro Ile Ser Ile Asn Tyr Ar
    #g Thr
                    85  
    #                90
    <210> SEQ ID NO 77
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 77
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Gln Leu Ile Ser Trp Thr Thr Arg His Se
    #r Pro Val Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Thr Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Met Pro Thr Asn
    65                  
    #70                  
    #75                  
    #80
    Trp Arg Phe Pro His Arg Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 78
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 78
    Val Ser Asp Val Pro Arg Asp Leu Glu Ile Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Gly Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Th
    #r Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ser Asn Val Gly
    65                  
    #70                  
    #75                  
    #80
    Arg Leu Asp Thr Arg Tyr Pro Ile Ser Thr As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 79
    <211> LENGTH: 93
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <220> FEATURE:
    <221> NAME/KEY: VARIANT
    <222> LOCATION: 17, 30, 34
    <223> OTHER INFORMATION: Xaa = Any Amino Aci
    #d
    <400> SEQUENCE: 79
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Xaa Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Xaa Tyr Tyr
                20      
    #            25      
    #            30
    Arg Xaa Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Ile Se
    #r Gly Leu Lys Pro Gly
        50              
    #    55              
    #    60
    Val Asp Tyr Thr Ile Thr Val Tyr Ala Val Th
    #r Ser Asn Val Gly Arg
    65                  
    #70                  
    #75                  
    #80
    Leu Asp Thr Arg Tyr Pro Ile Phe Ile Asn Ty
    #r Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 80
    <211> LENGTH: 76
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 80
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr
    65                  
    #70                  
    #75
    <210> SEQ ID NO 81
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 81
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Lys Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 82
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 82
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Cys Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Lys Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 83
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 83
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Ser Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Pro Thr His Asn
    65                  
    #70                  
    #75                  
    #80
    Trp Asn Asp Gln Thr Arg Ser Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 84
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 84
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Pro Thr Ser As
    #n Pro Pro Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Gly Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ala Gln Thr Gly
    65                  
    #70                  
    #75                  
    #80
    Tyr His Leu His Asp Lys Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 85
    <211> LENGTH: 93
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <220> FEATURE:
    <221> NAME/KEY: VARIANT
    <222> LOCATION: 28, 51, 82
    <223> OTHER INFORMATION: Xaa = Any Amino Aci
    #d
    <400> SEQUENCE: 85
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Pro Thr Ser
     1               5  
    #                10  
    #                15
    Arg Leu Ile Ser Trp Arg Pro Gly Arg Thr Ty
    #r Xaa Arg Tyr Tyr Arg
                20      
    #            25      
    #            30
    Ile Thr Tyr Gly Glu Thr Gly Gly Asn Ser Pr
    #o Val Gln Glu Phe Thr
            35          
    #        40          
    #        45
    Val Pro Xaa Trp Ala Asn Thr Ala Thr Ile Se
    #r Gly Leu Lys Pro Gly
        50              
    #    55              
    #    60
    Val Asp Tyr Thr Ile Thr Val Tyr Ala Val Th
    #r Phe Pro Pro Gly Tyr
    65                  
    #70                  
    #75                  
    #80
    Pro Xaa Thr Glu Met Pro Ile Ser Ile Asn Ty
    #r Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 86
    <211> LENGTH: 92
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 86
    Ile Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Arg Arg Trp Pro Hi
    #s Phe Asp Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Thr Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asn Pro Leu Ser
    65                  
    #70                  
    #75                  
    #80
    Pro Thr Thr Leu His Pro Pro Ile Asn Tyr Ar
    #g Thr
                    85  
    #                90
    <210> SEQ ID NO 87
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 87
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Lys Pro Arg Arg Th
    #r Asn Thr Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Gly Thr Ile Ala Thr Il
    #e Asn Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Leu Gly Thr Gly
    65                  
    #70                  
    #75                  
    #80
    Val Tyr Thr Arg Ala Gln Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 88
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 88
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Gln Leu Ile Ser Trp Pro Phe Gly Trp Ty
    #r Pro Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Arg Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr His Phe Pro Glu
    65                  
    #70                  
    #75                  
    #80
    Ser Arg Arg Pro Ala Lys Pro Met Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 89
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo  sapiens
    <400> SEQUENCE: 89
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp His Thr Glu Arg Se
    #r Phe Pro Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Gly Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Glu His Tyr Arg
    65                  
    #70                  
    #75                  
    #80
    Asp Thr Gly Thr Gly His Pro Ile Pro Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 90
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 90
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp His Thr Glu Arg Se
    #r Phe Pro Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Gly Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Glu His Tyr Arg
    65                  
    #70                  
    #75                  
    #80
    Asp Thr Gly Thr Gly His Pro Ile Pro Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 91
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 91
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Gln Leu Ile Ser Trp Lys Ser His Thr Ph
    #e His Pro Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Thr Ala Ala Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Ala Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Leu Asn Arg Ser
    65                  
    #70                  
    #75                  
    #80
    Ser Pro Asn Ser Ala Arg Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 92
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 92
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Arg Pro Gln Val Va
    #l Ser Thr Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asn His Lys Ala
    65                  
    #70                  
    #75                  
    #80
    Asn His His Asp Ala Glu Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 93
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 93
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Pro Thr Ser As
    #n His Pro Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Gly Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Thr Thr Asn Glu
    65                  
    #70                  
    #75                  
    #80
    Asp His Val Tyr Ala Leu Pro Ile Ser Ile As
    #n Tyr Arg Ile
                    85  
    #                90
    <210> SEQ ID NO 94
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 94
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Leu
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Lys Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 95
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 95
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Gl
    #y Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Met Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 96
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <220> FEATURE:
    <221> NAME/KEY: VARIANT
    <222> LOCATION: 40
    <223> OTHER INFORMATION: Xaa = Any Amino Aci
    #d
    <400> SEQUENCE: 96
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Asp Thr His Asn Al
    #a Tyr Asn Gly Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Xaa Gly Asn Se
    #r Pro Val Arg Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro His Pro Glu Val Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Asp Thr Ile Thr Val Tyr Ala Va
    #l Thr Asn His His Met
    65                  
    #70                  
    #75                  
    #80
    Pro Leu Arg Ile Phe Gly Pro Ile Ser Ile As
    #n His Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 97
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <220> FEATURE:
    <221> NAME/KEY: VARIANT
    <222> LOCATION: 13, 21
    <223> OTHER INFORMATION: Xaa = Any Amino Aci
    #d
    <400> SEQUENCE: 97
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Xaa Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Xaa Trp Thr Arg Thr Asn Al
    #a Asn Thr Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Ala Pro Asn Asn Pro Pro Thr Ala Thr Il
    #e Gly Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Pro Asp Gly Ser
    65                  
    #70                  
    #75                  
    #80
    Arg His Met Leu Thr Lys Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 98
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 98
    Leu Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Ala Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Lys Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 99
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 99
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Leu
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Lys Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Ile Ser Ile As
    #n His Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 100
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 100
    Val Ser Asp Val Pro Arg Gly Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys His
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Lys Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 101
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 101
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Glu Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Met Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Lys Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 102
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 102
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Lys Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Thr Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 103
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 103
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Ala Asp Lys Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 104
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 104
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Lys Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 105
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 105
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Cys Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Gln Arg Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Arg Tyr Asp Asp Pro Ile Ser Thr As
    #n Cys Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 106
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 106
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Asn Ile Tyr Pr
    #o Ile Ala Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Ala Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Lys Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 107
    <211> LENGTH: 92
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <220> FEATURE:
    <221> NAME/KEY: VARIANT
    <222> LOCATION: 27
    <223> OTHER INFORMATION: Xaa = Any Amino Aci
    #d
    <400> SEQUENCE: 107
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Ala Thr
     1               5  
    #                10  
    #                15
    Ser Gln Leu Ile Ser Trp Pro Trp Pro Ser Xa
    #a Pro Thr Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Glu Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Thr Ala Thr Il
    #e Ser Gly Ile Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Ala Val Tyr Ala Va
    #l Thr Met Pro Glu Arg
    65                  
    #70                  
    #75                  
    #80
    Lys Tyr Asp Lys Pro Ile Ser Ile Asn Tyr Ar
    #g Thr
                    85  
    #                90
    <210> SEQ ID NO 108
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 108
    Val Ser Asp Val Ser Arg Asp Leu Glu Ala Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Asn Pro Asn Arg Se
    #r Phe Ala Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Gly Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Arg Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ala Gln Thr Gly
    65                  
    #70                  
    #75                  
    #80
    His His Leu His Asp Lys Ser Ile Pro Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 109
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 109
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Pro Gly Arg Th
    #r Tyr Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Ser
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Asn Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Phe Pro Pro Gly
    65                  
    #70                  
    #75                  
    #80
    Tyr Pro Leu Thr Glu Met Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 110
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 110
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Ser
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Pro Gly Arg Th
    #r Tyr Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Asn Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Ala Val Tyr Ala Va
    #l Thr Phe Pro Thr Gly
    65                  
    #70                  
    #75                  
    #80
    Tyr Pro Leu Thr Glu Met Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 111
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 111
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Pro Gly Arg Th
    #r Tyr Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Asn Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Ala Tyr Ala Va
    #l Thr Tyr Thr His Ser
    65                  
    #70                  
    #75                  
    #80
    Thr Pro Met Gln Asp Glu Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 112
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 112
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asp Asn Ser Arg Pr
    #o Asn Thr Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Gly Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Lys Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Thr Ser Glu Cys
    65                  
    #70                  
    #75                  
    #80
    His Lys Leu Ser Ser Thr Ser Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 113
    <211> LENGTH: 91
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 113
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Thr Arg Thr Asn Al
    #a Ser Thr Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Asn Phe Trp Trp Ile Ser Gly Le
    #u Lys Pro Gly Val Asp
        50              
    #    55              
    #    60
    Tyr Thr Ile Thr Val Tyr Ala Val Ala Ser Pr
    #o Asp Glu Thr Ser Ala
    65                  
    #70                  
    #75                  
    #80
    Tyr Ser Glu Pro Ile Ser Ile Asn Tyr Arg Th
    #r
                    85  
    #                90
    <210> SEQ ID NO 114
    <211> LENGTH: 92
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <220> FEATURE:
    <221> NAME/KEY: VARIANT
    <222> LOCATION: 3, 18, 23
    <223> OTHER INFORMATION: Xaa = Any Amino Aci
    #d
    <400> SEQUENCE: 114
    Val Ser Xaa Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Xaa Leu Ile Ser Trp Xaa Pro Arg Ser Hi
    #s His Asp Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Gly Thr Ile Ala Thr Il
    #e Asp Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Gly Tyr Thr Val Thr Val Tyr Ala Va
    #l Thr Asp Asn Pro Asn
    65                  
    #70                  
    #75                  
    #80
    Ser Ala Lys Ala Gln His Pro Ile Asn Ser Ar
    #g Thr
                    85  
    #                90
    <210> SEQ ID NO 115
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 115
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Val Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Gln Leu Ile Ser Trp Met Thr Pro His As
    #n His Val Arg Tyr Tyr
                20      
    #            25      
    #            30
    Gly Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Ser
            35          
    #        40          
    #        45
    Thr Val Pro Thr Gly Asn Ala Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Pro His His Gly
    65                  
    #70                  
    #75                  
    #80
    His Phe Asp Leu Glu Pro Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 116
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 116
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Ser Thr
     1               5  
    #                10  
    #                15
    Gly Leu Leu Ile Ser Trp Arg Thr Pro Ala Se
    #r Pro His Gly Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Glu Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Leu Leu Trp Pro Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Pro Thr His Met
    65                  
    #70                  
    #75                  
    #80
    Leu Lys Pro Gln Ser Met Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 117
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 117
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Ser Pro Pro Asn As
    #p Ala His Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Lys Thr Gly Gly Asp Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Ser Lys Ser Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ser Val Val Tyr Ala Va
    #l Thr Asp Gln Gln Ser
    65                  
    #70                  
    #75                  
    #80
    Tyr Thr Tyr Tyr Ser Asn Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 118
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <220> FEATURE:
    <221> NAME/KEY: VARIANT
    <222> LOCATION: 28
    <223> OTHER INFORMATION: Xaa = Any Amino Aci
    #d
    <400> SEQUENCE: 118
    Val Ser Asp Val Pro Ser Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Glu Gln Ser Pro Th
    #r Xaa Gly Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Ser Lys Ser Thr Ala Thr Il
    #e Ser Gly Arg Lys Pro
        50              
    #    55              
    #    60
    Gly Ala Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ile Glu Lys Asp
    65                  
    #70                  
    #75                  
    #80
    Arg Ile Pro Leu Phe Gly Pro Ile Ser Ile Se
    #r Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 119
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 119
    Val Ser Asp Val Pro Ser Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Glu Gln Ser Pro Th
    #r Tyr Gly Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Ser Lys Ser Thr Ala Thr Il
    #e Ser Gly Arg Lys Pro
        50              
    #    55              
    #    60
    Gly Ala Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ile Glu Lys Asp
    65                  
    #70                  
    #75                  
    #80
    Arg Ile Pro Leu Phe Gly Pro Ile Ser Ile Se
    #r Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 120
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <220> FEATURE:
    <221> NAME/KEY: VARIANT
    <222> LOCATION: 28
    <223> OTHER INFORMATION: Xaa = Any Amino Aci
    #d
    <400> SEQUENCE: 120
    Val Ser Asp Val Pro Ser Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Glu Gln Ser Pro Th
    #r Xaa Gly Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Ser Lys Ser Thr Ala Thr Il
    #e Ser Gly Arg Lys Pro
        50              
    #    55              
    #    60
    Gly Ala Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ile Glu Lys Asp
    65                  
    #70                  
    #75                  
    #80
    Arg Ile Pro Leu Phe Gly Pro Ile Ser Ile Se
    #r Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 121
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 121
    Val Pro Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Asp Thr His Asn Al
    #a Tyr Asn Gly Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Ser Se
    #r Pro Ala Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro His Pro Glu Val Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Asp Thr Ile Thr Val Tyr Ala Va
    #l Thr Ile His His Met
    65                  
    #70                  
    #75                  
    #80
    Pro Leu Arg Ile Phe Gly Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 122
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 122
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Glu Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Val Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 123
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 123
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Glu Ser Asp
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Lys Tyr Asp Asp Pro Val Ser Thr As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 124
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 124
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Gly Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Pro Asn Val Gly
    65                  
    #70                  
    #75                  
    #80
    Arg Leu Asp Thr Arg Tyr Pro Ile Ser Ile As
    #p Cys Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 125
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <220> FEATURE:
    <221> NAME/KEY: VARIANT
    <222> LOCATION: 86
    <223> OTHER INFORMATION: Xaa = Any Amino Aci
    #d
    <400> SEQUENCE: 125
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Asn Arg Ser Gly Le
    #u Gln Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Thr Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Ser Asn Val Gly
    65                  
    #70                  
    #75                  
    #80
    Arg Leu Asp Thr Arg Xaa Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 126
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <220> FEATURE:
    <221> NAME/KEY: VARIANT
    <222> LOCATION: 75
    <223> OTHER INFORMATION: Xaa = Any Amino Aci
    #d
    <400> SEQUENCE: 126
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Thr Met Pro Va
    #l Thr Ala Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asp Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Ala Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Ala Asp Tyr Thr Ile Thr Val Tyr Ala Xa
    #a Thr Ser Ala Thr Pro
    65                  
    #70                  
    #75                  
    #80
    Ser Arg Pro Asn Val His Pro Ile Ser Ile As
    #n Leu Thr Thr
                    85  
    #                90
    <210> SEQ ID NO 127
    <211> LENGTH: 88
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 127
    Val Ser Asp Val Pro Gly Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Gly Trp Ser Met Thr Pro As
    #n Trp Pro Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Ala Tyr Gly Glu Thr Gly Gly Asp Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Ile Il
    #e Gly Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr His Arg Asp Thr
    65                  
    #70                  
    #75                  
    #80
    Pro Ile Ser Ile Asn Tyr Arg Thr
                    85
    <210> SEQ ID NO 128
    <211> LENGTH: 90
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 128
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Ile
     1               5  
    #                10  
    #                15
    Ser Gln Leu Thr Ser Trp Gln Pro Gln Pro As
    #n Gly Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Ala Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Arg Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Ala Arg Glu Gln Thr Ala Thr Se
    #r Gly Leu Lys Pro Gly
        50              
    #    55              
    #    60
    Val Asp Tyr Ala Ile Thr Val Tyr Ala Ala Th
    #r His Gly Lys Pro Pro
    65                  
    #70                  
    #75                  
    #80
    His Ile His Phe Thr Ile Asn Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 129
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <220> FEATURE:
    <221> NAME/KEY: VARIANT
    <222> LOCATION: 2, 9, 12, 22
    <223> OTHER INFORMATION: Xaa = Any Amino Aci
    #d
    <400> SEQUENCE: 129
    Val Xaa Asp Val Pro Arg Asp Leu Xaa Val Va
    #l Xaa Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Xaa Arg Ser Gly Asn Ar
    #g Thr Thr Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Asp Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Met Pro Pro Trp Ala Thr Val Ala Ala Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Thr His Asn Ser
    65                  
    #70                  
    #75                  
    #80
    Thr Ala Gln Pro Glu Tyr Pro Ile Pro Phe As
    #n Arg Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 130
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 130
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Pro Gly Arg Th
    #r Tyr Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Asn Thr Ala Thr Il
    #e Ser Cys Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Ala Phe Pro Pro Gly
    65                  
    #70                  
    #75                  
    #80
    Tyr Pro Leu Thr Glu Met Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 131
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 131
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Pro Gly Arg Al
    #a Tyr Ser Arg Tyr Phe
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Asn Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Ala Val Tyr Ala Va
    #l Thr Phe Pro Pro Arg
    65                  
    #70                  
    #75                  
    #80
    Tyr Pro Leu Thr Glu Met Pro Ile Ser Ile As
    #n Tyr Arg Ala
                    85  
    #                90
    <210> SEQ ID NO 132
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 132
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Pro Gly Arg Th
    #r Tyr Ser Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Ala Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Asp Lys Ser Gly
    65                  
    #70                  
    #75                  
    #80
    Thr Tyr Arg Tyr Asp Asp Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 133
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 133
    Val Ser Asp Val Pro Arg Asp Leu Arg Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Arg Pro Ala Ser As
    #n Pro Ala Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Val Ala Thr Il
    #e Gly Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Val Thr Val Tyr Ala Va
    #l Thr Ala Gln Thr Gly
    65                  
    #70                  
    #75                  
    #80
    His Arg Leu His Asp Lys Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 134
    <211> LENGTH: 87
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 134
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Ser
     1               5  
    #                10  
    #                15
    Leu Leu Ile Ser Trp Arg Pro Pro Ala Asp Le
    #u Asn Arg Tyr Tyr Arg
                20      
    #            25      
    #            30
    Ile Thr Tyr Gly Glu Thr Gly Gly Asn Ser Pr
    #o Val Gln Glu Phe Thr
            35          
    #        40          
    #        45
    Val Pro Pro Trp Gly Thr Val Ala Thr Val As
    #n Gly Leu Lys Pro Gly
        50              
    #    55              
    #    60
    Val Gly Tyr Thr Ile Thr Val Tyr Ala Val Th
    #r His Arg Asp Thr Pro
    65                  
    #70                  
    #75                  
    #80
    Ile Ser Ile Asn Tyr Arg Ala
                    85
    <210> SEQ ID NO 135
    <211> LENGTH: 93
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 135
    Val Thr Asp Val Pro Arg Gly Leu Lys Ile Va
    #l Ala Ala Thr Pro Ser
     1               5  
    #                10  
    #                15
    Leu Leu Ile Ser Trp Arg Asn Ala Lys Asp Pr
    #o Gly Arg Tyr Tyr Arg
                20      
    #            25      
    #            30
    Ile Thr Tyr Gly Glu Thr Gly Gly Ser Ser Pr
    #o Val Gln Glu Phe Thr
            35          
    #        40          
    #        45
    Val Pro Pro Trp Gly Thr Ile Ala Ala Ile As
    #n Gly Leu Lys Pro Gly
        50              
    #    55              
    #    60
    Val Asp Tyr Thr Ile Thr Val Tyr Ala Val Th
    #r Ala Thr Asn Pro Gly
    65                  
    #70                  
    #75                  
    #80
    Pro Thr Gln His Arg Pro Ile Pro Ile Asn Ty
    #r Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 136
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 136
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Asp Pro His
     1               5  
    #                10  
    #                15
    Gln Pro Leu Ile Cys Trp Ala Ser Pro Pro Me
    #t Trp Cys Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Ser Gly Gly Asn Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Thr Ala Ala Ala Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val His Ala Va
    #l Thr Asp Glu Ser Trp
    65                  
    #70                  
    #75                  
    #80
    Ser Asp Arg Ser Met Asp Pro Ile Ser Ile As
    #n Cys Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 137
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 137
    Val Ser Asp Val Pro Arg Asp Leu Lys Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Arg Leu Ile Ser Trp Thr His Asp Asn Va
    #l Pro Ala Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Pro Val Gln Glu Leu
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Ser Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Leu Tyr Thr Gly
    65                  
    #70                  
    #75                  
    #80
    Asn His Arg Pro Glu His Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 138
    <211> LENGTH: 93
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 138
    Val Ser Asp Val Pro Arg Asp Pro Val Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Tyr Arg His Thr Ty
    #r Arg Asp Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Val Thr Tyr Gly Glu Thr Arg Gly Asn Se
    #r Pro Ile Arg Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Pro Trp Ala Thr Ile Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Ala Val Tyr Ala Va
    #l Thr Asp Ala Gly Tyr
    65                  
    #70                  
    #75                  
    #80
    Asp Val His Thr Lys Arg Pro Ile Ser Ile As
    #n Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 139
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 139
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Gly Leu Leu Ile Ser Trp Arg Asn Asn Gln Ty
    #r Thr Pro Arg His Tyr
                20      
    #            25      
    #            30
    Gly Ile Thr Tyr Gly Glu Thr Gly Gly Lys Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Glu Leu Asn Pro Thr Ala Thr Il
    #e Ser Arg Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Gln Asn Gly Thr
    65                  
    #70                  
    #75                  
    #80
    Pro Arg Val Ile Tyr Gly Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 140
    <211> LENGTH: 89
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 140
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Asn Val Pro Ile Ile Arg Tyr Ty
    #r Arg Ile Thr Tyr Gly
                20      
    #            25      
    #            30
    Glu Thr Gly Gly Asn Ser Pro Val Gln Glu Ph
    #e Thr Val Pro Ala Pro
            35          
    #        40          
    #        45
    Lys Ala Ile Ala Thr Thr Ser Gly Leu Lys Pr
    #o Gly Val Asp Tyr Thr
        50              
    #    55              
    #    60
    Ile Thr Val Tyr Gly Val Thr Ser His Arg As
    #n His Phe His Val Glu
    65                  
    #70                  
    #75                  
    #80
    Thr Pro Ile Ser Ile Asn Tyr Gln Ala
                    85
    <210> SEQ ID NO 141
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 141
    Asp Ala Pro Ala Val Thr Val Gly Ser Lys Se
    #r Gly Arg Gly Asp Ser
     1               5  
    #                10  
    #                15
    Pro Ala Ser Ser Lys
                20
    <210> SEQ ID NO 142
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 142
    Ala Ser Pro Pro Met Trp Cys Pro Trp Ala Th
    #r Glu Tyr Leu Pro Glu
     1               5  
    #                10  
    #                15
    Trp Asn Met Thr Gln
                20
    <210> SEQ ID NO 143
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 143
    Asn Arg Ser Gly Leu Gln Ser Pro Trp Ala Se
    #r Asp Lys Ser Asp Thr
     1               5  
    #                10  
    #                15
    Tyr Lys Tyr Asp Asp
                20
    <210> SEQ ID NO 144
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 144
    Arg Pro Thr Ser Asn Pro Pro Pro Trp Ala Se
    #r Ala Gln Thr Gly His
     1               5  
    #                10  
    #                15
    His Leu His Asp Lys
                20
    <210> SEQ ID NO 145
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 145
    His Thr Glu Arg Ser Phe Pro Pro Trp Gly Se
    #r Glu His Tyr Arg Asp
     1               5  
    #                10  
    #                15
    Thr Gly Thr Gly His
                20
    <210> SEQ ID NO 146
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 146
    Thr Thr Arg His Ser Pro Val Pro Trp Ala Th
    #r Met Pro Thr Asn Trp
     1               5  
    #                10  
    #                15
    Arg Phe Pro His Arg
                20
    <210> SEQ ID NO 147
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 147
    Arg Pro Asn Pro Arg Leu Ser Gly Leu Phe Se
    #r Pro Lys Glu Thr Ser
     1               5  
    #                10  
    #                15
    Asn Ile Phe Ile Ala
                20
    <210> SEQ ID NO 148
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 148
    Ser Pro Pro Asn Asp Ala His Gly Ser Lys Se
    #r Asp Gln Gln Ser Tyr
     1               5  
    #                10  
    #                15
    Thr Tyr Tyr Ser Asn
                20
    <210> SEQ ID NO 149
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 149
    Arg Thr Pro Ala Ser Pro His Leu Leu Trp Pr
    #o Pro Thr His Met Leu
     1               5  
    #                10  
    #                15
    Lys Pro Gln Ser Met
                20
    <210> SEQ ID NO 150
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 150
    Tyr Arg His Thr Tyr Arg Asp Pro Trp Ala Th
    #r Asp Thr Gly Tyr Asp
     1               5  
    #                10  
    #                15
    Val His Thr Lys Arg
                20
    <210> SEQ ID NO 151
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 151
    Asn Arg Ser Gly Leu Gln Ser Pro Trp Ala Se
    #r Ser Asn Val Gly Arg
     1               5  
    #                10  
    #                15
    Leu Asp Thr Arg Tyr
                20
    <210> SEQ ID NO 152
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 152
    Asp Thr His Asn Ala Tyr Asn His Pro Glu Va
    #l Asn His His Met Pro
     1               5  
    #                10  
    #                15
    Leu Arg Ile Phe Gly
                20
    <210> SEQ ID NO 153
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 153
    Arg Pro Thr Ser Asn Pro Pro Pro Trp Ala Se
    #r Pro Val Tyr Pro Met
     1               5  
    #                10  
    #                15
    His Ser Met Leu Ser
                20
    <210> SEQ ID NO 154
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 154
    Arg Asn Ile Tyr Pro Ile Ala Pro Trp Ala Se
    #r Asp Lys Ser Asp Thr
     1               5  
    #                10  
    #                15
    Tyr Lys Tyr Asp Asp
                20
    <210> SEQ ID NO 155
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 155
    Asn Arg Ser Gly Leu Gln Cys Pro Trp Ala Se
    #r Asp Gln Arg Asp Thr
     1               5  
    #                10  
    #                15
    Tyr Lys Tyr Asp Asp
                20
    <210> SEQ ID NO 156
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 156
    Arg Pro Gly Arg Thr Tyr Ser Pro Trp Ala As
    #n Phe Pro Thr Gly Tyr
     1               5  
    #                10  
    #                15
    Pro Leu Thr Glu Met
                20
    <210> SEQ ID NO 157
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 157
    Arg Pro Gly Arg Thr Tyr Ser Pro Trp Ala As
    #n Phe Pro Pro Gly Tyr
     1               5  
    #                10  
    #                15
    Pro Leu Thr Glu Met
                20
    <210> SEQ ID NO 158
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 158
    Met Thr Pro His Asn His Val Thr Gly Asn Al
    #a Pro His His Gly His
     1               5  
    #                10  
    #                15
    Phe Asp Leu Glu Pro
                20
    <210> SEQ ID NO 159
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 159
    Thr Arg Thr Asn Ala Ser Thr Asn Phe Trp Tr
    #p Ser Pro Asp Glu Thr
     1               5  
    #                10  
    #                15
    Ser Ala Tyr Ser Glu
                20
    <210> SEQ ID NO 160
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 160
    Asn Arg Ser Gly Leu Gln Ser Pro Trp Ala Se
    #r Asp Lys Ser Asp Thr
     1               5  
    #                10  
    #                15
    Tyr Lys Tyr Asp Asp
                20
    <210> SEQ ID NO 161
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 161
    Arg Pro Gly Arg Thr Tyr Ser Pro Trp Ala As
    #n Tyr Thr His Ser Thr
     1               5  
    #                10  
    #                15
    Pro Met Gln Asp Glu
                20
    <210> SEQ ID NO 162
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 162
    Arg Thr Pro Ala Ser Pro His Leu Leu Trp Pr
    #o Pro Thr His Met Leu
     1               5  
    #                10  
    #                15
    Lys Pro Gln Ser Met
                20
    <210> SEQ ID NO 163
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 163
    Thr Arg Thr Asn Ala Asn Thr Asn Asn Pro Pr
    #o Pro Asp Gly Ser Arg
     1               5  
    #                10  
    #                15
    His Met Leu Thr Lys
                20
    <210> SEQ ID NO 164
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 164
    Asp Asn Ser Arg Pro Asn Thr Pro Trp Gly Se
    #r Thr Ser Glu Cys His
     1               5  
    #                10  
    #                15
    Lys Leu Ser Ser Thr
                20
    <210> SEQ ID NO 165
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 165
    Asn Pro Asn Arg Ser Phe Ala Pro Trp Ala Se
    #r Ala Gln Thr Gly His
     1               5  
    #                10  
    #                15
    His Leu His Asp Lys
                20
    <210> SEQ ID NO 166
    <211> LENGTH: 15
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 166
    Ser Met Thr Pro Asn Trp Pro Pro Trp Ala Se
    #r His Arg Asp Thr
     1               5  
    #                10  
    #                15
    <210> SEQ ID NO 167
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 167
    Asp Thr His Asn Ala Tyr Asn His Pro Glu Va
    #l Ile His His Met Pro
     1               5  
    #                10  
    #                15
    Leu Arg Ile Phe Gly
                20
    <210> SEQ ID NO 168
    <211> LENGTH: 20
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 168
    Ala Ser Pro Pro Met Trp Pro Trp Ala Thr As
    #p Glu Ser Trp Ser Asp
     1               5  
    #                10  
    #                15
    Arg Ser Met Asp
                20
    <210> SEQ ID NO 169
    <211> LENGTH: 15
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 169
    Arg Pro Pro Ala Asp Leu Asn Pro Trp Gly Th
    #r His Arg Asp Thr
     1               5  
    #                10  
    #                15
    <210> SEQ ID NO 170
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 170
    Glu Gln Ser Pro Thr Tyr Gly Gly Ser Lys Se
    #r Ile Glu Lys Asp Arg
     1               5  
    #                10  
    #                15
    Ile Pro Leu Phe Gly
                20
    <210> SEQ ID NO 171
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 171
    Arg Pro Gly Arg Thr Tyr Ser Pro Trp Ala As
    #n Phe Pro Pro Gly Tyr
     1               5  
    #                10  
    #                15
    Pro Leu Thr Glu Met
                20
    <210> SEQ ID NO 172
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 172
    Arg Pro Gly Arg Thr Tyr Ser Pro Trp Ala Se
    #r Asp Lys Ser Gly Thr
     1               5  
    #                10  
    #                15
    Tyr Arg Tyr Asp Asp
                20
    <210> SEQ ID NO 173
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 173
    Tyr Arg His Thr Tyr Arg Asp Pro Trp Ala Th
    #r Asp Ala Gly Tyr Asp
     1               5  
    #                10  
    #                15
    Val His Thr Lys Arg
                20
    <210> SEQ ID NO 174
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 174
    Arg Thr Met Pro Val Thr Ala Pro Trp Ala Se
    #r Ser Ala Thr Pro Ser
     1               5  
    #                10  
    #                15
    Arg Pro Asn Val His
                20
    <210> SEQ ID NO 175
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 175
    Arg Pro Gly Arg Ala Tyr Ser Pro Trp Ala As
    #n Phe Pro Pro Arg Tyr
     1               5  
    #                10  
    #                15
    Pro Leu Thr Glu Met
                20
    <210> SEQ ID NO 176
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 176
    Ser Pro Pro Asn Asp Ala His Gly Ser Lys Se
    #r Asp Gln Gln Ser Tyr
     1               5  
    #                10  
    #                15
    Thr Tyr Tyr Ser Asn
                20
    <210> SEQ ID NO 177
    <211> LENGTH: 16
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 177
    Ile Ile Ala Pro Lys Ala Ser His Arg Asn Hi
    #s Phe His Val Glu Thr
     1               5  
    #                10  
    #                15
    <210> SEQ ID NO 178
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 178
    Arg Asn Asn Gln Tyr Thr Pro Glu Leu Asn Pr
    #o Gln Asn Gly Thr Pro
     1               5  
    #                10  
    #                15
    Arg Val Ile Tyr Gly
                20
    <210> SEQ ID NO 179
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 179
    Arg Pro Ala Ser Asn Pro Ala Pro Trp Ala Se
    #r Ala Gln Thr Gly His
     1               5  
    #                10  
    #                15
    Arg Leu His Asp Lys
                20
    <210> SEQ ID NO 180
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 180
    Asn Arg Ser Gly Leu Gln Ser Pro Trp Ala Se
    #r Pro Asn Val Gly Arg
     1               5  
    #                10  
    #                15
    Leu Asp Thr Arg Tyr
                20
    <210> SEQ ID NO 181
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 181
    Asn Arg Ser Gly Leu Gln Ser Pro Trp Ala Se
    #r Asp Glu Ser Asp Thr
     1               5  
    #                10  
    #                15
    Tyr Lys Tyr Asp Asp
                20
    <210> SEQ ID NO 182
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 182
    Thr His Asp Asn Val Pro Ala Pro Trp Ala Se
    #r Leu Tyr Thr Gly Asn
     1               5  
    #                10  
    #                15
    His Arg Pro Glu His
                20
    <210> SEQ ID NO 183
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 183
    Arg Ser Gly Asn Arg Thr Thr Pro Trp Ala Th
    #r Thr His Asn Ser Thr
     1               5  
    #                10  
    #                15
    Ala Gln Pro Glu Tyr
                20
    <210> SEQ ID NO 184
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 184
    Asn Arg Ser Gly Leu Gln Ser Pro Trp Ala Se
    #r Ser Asn Val Gly Arg
     1               5  
    #                10  
    #                15
    Leu Asp Thr Arg Tyr
                20
    <210> SEQ ID NO 185
    <211> LENGTH: 21
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 185
    Arg Asn Ala Lys Asp Pro Gly Pro Trp Gly Th
    #r Ala Thr Asn Pro Gly
     1               5  
    #                10  
    #                15
    Pro Thr Gln His Arg
                20
    <210> SEQ ID NO 186
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Bovis taurus
    <400> SEQUENCE: 186
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Il
    #e Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Asp Ala Pro Ala Va
    #l Thr Val Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Ser Se
    #r Pro Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Ser Lys Ser Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Val Tyr Ala Va
    #l Thr Gly Arg Gly Asp
    65                  
    #70                  
    #75                  
    #80
    Ser Pro Ala Ser Ser Lys Pro Val Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90
    <210> SEQ ID NO 187
    <211> LENGTH: 92
    <212> TYPE: PRT
    <213> ORGANISM: Rattus norvegicus
    <400> SEQUENCE: 187
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Il
    #e Ala Ser Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Glu Pro Ala Val Se
    #r Val Arg Tyr Tyr Arg
                20      
    #            25      
    #            30
    Ile Thr Tyr Gly Glu Thr Gly Gly Asn Ser Pr
    #o Val Gln Glu Phe Thr
            35          
    #        40          
    #        45
    Val Pro Gly Ser Lys Ser Thr Ala Thr Ile As
    #n Ile Lys Pro Gly Ala
        50              
    #    55              
    #    60
    Asp Tyr Thr Ile Thr Leu Tyr Ala Val Thr Gl
    #y Arg Gly Asp Ser Pro
    65                  
    #70                  
    #75                  
    #80
    Ala Ser Ser Lys Pro Val Ser Ile Asn Tyr Gl
    #n Thr
                    85  
    #                90
    <210> SEQ ID NO 188
    <211> LENGTH: 92
    <212> TYPE: PRT
    <213> ORGANISM: Mus musculus
    <400> SEQUENCE: 188
    Val Ser Asp Ile Pro Arg Asp Leu Glu Val Il
    #e Ala Ser Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Glu Pro Ala Val Se
    #r Val Arg Tyr Tyr Arg
                20      
    #            25      
    #            30
    Ile Thr Tyr Gly Glu Thr Gly Gly Asn Ser Pr
    #o Val Gln Glu Phe Thr
            35          
    #        40          
    #        45
    Val Pro Gly Ser Lys Ser Thr Ala Thr Ile As
    #n Ile Lys Pro Gly Ala
        50              
    #    55              
    #    60
    Asp Tyr Thr Ile Thr Leu Tyr Ala Val Thr Gl
    #y Arg Gly Asp Ser Pro
    65                  
    #70                  
    #75                  
    #80
    Ala Ser Ser Lys Pro Val Ser Ile Asn Tyr Ly
    #s Thr
                    85  
    #                90
    <210> SEQ ID NO 189
    <211> LENGTH: 40
    <212> TYPE: PRT
    <213> ORGANISM: Oryctolagus cuniculuc
    <220> FEATURE:
    <221> NAME/KEY: VARIANT
    <222> LOCATION: 24
    <223> OTHER INFORMATION: Xaa = Any Amino Aci
    #d
    <400> SEQUENCE: 189
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Il
    #e Ala Ser Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Ile Ser Trp Glu Xaa Pro Ala Va
    #l Thr Val Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Asn
            35          
    #        40
    <210> SEQ ID NO 190
    <211> LENGTH: 92
    <212> TYPE: PRT
    <213> ORGANISM: Gallus gallus
    <400> SEQUENCE: 190
    Val Ser Asp Val Pro Arg Asp Leu Glu Val As
    #n Thr Ser Pro Thr Ser
     1               5  
    #                10  
    #                15
    Leu Glu Ile Ser Trp Asp Ala Pro Ala Val Th
    #r Val Arg Tyr Tyr Arg
                20      
    #            25      
    #            30
    Ile Thr Tyr Gly Glu Thr Gly Gly Ser Ser Pr
    #o Val Gln Glu Phe Thr
            35          
    #        40          
    #        45
    Val Pro Gly Thr Met Ser Ala Thr Ile Thr Gl
    #y Leu Lys Pro Gly Val
        50              
    #    55              
    #    60
    Asp Tyr Thr Ile Thr Val Tyr Ala Val Thr Gl
    #y Arg Gly Asp Ser Pro
    65                  
    #70                  
    #75                  
    #80
    Ala Ser Ser Lys Pro Val Thr Val Thr Tyr Ly
    #s Thr
                    85  
    #                90
    <210> SEQ ID NO 191
    <211> LENGTH: 93
    <212> TYPE: PRT
    <213> ORGANISM: Xenupus laevis
    <400> SEQUENCE: 191
    Val Ser Asp Val Pro Thr Asp Leu Glu Val Th
    #r Ser Ser Ser Pro Asn
     1               5  
    #                10  
    #                15
    Thr Leu Thr Ile Ser Trp Glu Ala Pro Ala Va
    #l Ser Val Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Ser Gln Thr Gly Gly Gly Pr
    #o Glu Lys Glu Phe Thr
            35          
    #        40          
    #        45
    Val Pro Gly Thr Ser Asn Thr Ala Thr Ile Ar
    #g Gly Leu Asn Pro Gly
        50              
    #    55              
    #    60
    Val Ser Tyr Thr Ile Thr Val Tyr Ala Val Th
    #r Gly Arg Gly Asp Ser
    65                  
    #70                  
    #75                  
    #80
    Pro Ala Ser Ser Lys Pro Leu Thr Ile Ile Hi
    #s Lys Thr
                    85  
    #                90
    <210> SEQ ID NO 192
    <211> LENGTH: 68
    <212> TYPE: PRT
    <213> ORGANISM: Canis familiaris
    <400> SEQUENCE: 192
    Ala Asp Ala Pro Ser Leu Phe Leu Ala Thr Th
    #r Pro Ser Leu Leu Val
     1               5  
    #                10  
    #                15
    Ser Trp Gln Pro Ala Ile Thr Gly Tyr Ile Il
    #e Lys Tyr Gly Ser Glu
                20      
    #            25      
    #            30
    Val Val Pro Gly Val Thr Ala Thr Ile Thr Gl
    #y Leu Pro Gly Thr Glu
            35          
    #        40          
    #        45
    Tyr Thr Ile Gln Val Ile Ala Leu Lys Asn Gl
    #n Lys Ser Leu Ile Gly
        50              
    #    55              
    #    60
    Lys Thr Glu Leu
    65
    <210> SEQ ID NO 193
    <211> LENGTH: 67
    <212> TYPE: PRT
    <213> ORGANISM: Equus caballis
    <400> SEQUENCE: 193
    Ala Asp Ala Pro Ser Leu Phe Leu Ala Thr Th
    #r Pro Ser Leu Leu Ile
     1               5  
    #                10  
    #                15
    Ser Trp Gln Pro Ala Ile Thr Gly Tyr Ile Il
    #e Lys Tyr Gly Ser Glu
                20      
    #            25      
    #            30
    Val Val Pro Gly Val Thr Ala Thr Ile Thr Gl
    #y Leu Pro Gly Thr Glu
            35          
    #        40          
    #        45
    Tyr Thr Ile Gln Val Ile Ala Ile Lys Asn Gl
    #n Lys Ser Leu Ile Gly
        50              
    #    55              
    #    60
    Lys Thr Glu
    65
    <210> SEQ ID NO 194
    <211> LENGTH: 70
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 194
    Val Ser Pro Pro Lys Asp Leu Val Thr Val Th
    #r Thr Val Asn Leu Ala
     1               5  
    #                10  
    #                15
    Trp Asp Met Val Thr Tyr Leu Val Val Tyr Th
    #r Pro Thr His Glu Gly
                20      
    #            25      
    #            30
    Gly Glu Met Gln Phe Val Pro Gly Asp Gln Th
    #r Ser Thr Ile Ile Gln
            35          
    #        40          
    #        45
    Leu Pro Gly Val Glu Tyr Phe Ile Arg Val Ph
    #e Ala Ile Leu Asn Lys
        50              
    #    55              
    #    60
    Lys Ser Val Ser Ala Val
    65                  
    #70
    <210> SEQ ID NO 195
    <211> LENGTH: 69
    <212> TYPE: PRT
    <213> ORGANISM: Sus scrofa
    <400> SEQUENCE: 195
    Val Ser Pro Pro Lys Asp Leu Val Thr Val Th
    #r Thr Val Asn Leu Ala
     1               5  
    #                10  
    #                15
    Trp Asp Met Val Thr Tyr Leu Ile Val Tyr Th
    #r Pro Thr His Glu Gly
                20      
    #            25      
    #            30
    Glu Met Gln Phe Val Pro Gly Asp Gln Thr Se
    #r Thr Thr Ile Arg Leu
            35          
    #        40          
    #        45
    Pro Gly Val Glu Tyr Phe Ile Arg Val Phe Al
    #a Ile Leu Asn Lys Lys
        50              
    #    55              
    #    60
    Ser Val Ser Ala Val
    65
    <210> SEQ ID NO 196
    <211> LENGTH: 75
    <212> TYPE: PRT
    <213> ORGANISM: Mus musculus
    <400> SEQUENCE: 196
    Met Asp Gly Pro Gln Asp Leu Val Val Ala Va
    #l Thr Pro Thr Thr Leu
     1               5  
    #                10  
    #                15
    Asp Leu Ser Trp Pro Gln Ala Val Asp Phe Va
    #l Val Ser Tyr Val Ser
                20      
    #            25      
    #            30
    Ala Gly Asn Arg Val Leu Val Pro Pro Glu Al
    #a Asp Thr Gln Leu Thr
            35          
    #        40          
    #        45
    Leu Met Pro Gly Val Glu Tyr Val Val Thr Va
    #l Thr Ala Glu Arg Gly
        50              
    #    55              
    #    60
    His Ala Val Ser Ala Ser Ile Ala Asn Thr Gl
    #y
    65                  
    #70                  
    #75
    <210> SEQ ID NO 197
    <211> LENGTH: 69
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 197
    Thr Val Pro Ser Leu Ile Tyr Val Gly Pro Th
    #r Thr Met His Val Gln
     1               5  
    #                10  
    #                15
    Trp Gln Val Gly Gly Ala Thr Gly Tyr Ile Le
    #u Ser Tyr Pro Val Asp
                20      
    #            25      
    #            30
    Thr Glu Thr Lys Glu Val Leu Gly Pro Thr Va
    #l Asn Met Gln Leu Thr
            35          
    #        40          
    #        45
    Leu Val Pro Asn Thr Glu Tyr Ala Val Thr Va
    #l Gln Ala Val Leu Leu
        50              
    #    55              
    #    60
    Thr Ser Val Thr Val
    65
    <210> SEQ ID NO 198
    <211> LENGTH: 44
    <212> TYPE: PRT
    <213> ORGANISM: Oryctolagus cuniculus
    <400> SEQUENCE: 198
    Thr Val Pro Ser Leu Asn Ile Tyr Val Gly Pr
    #o Thr Thr Met His Val
     1               5  
    #                10  
    #                15
    Gln Trp Gln Val Gly Gly Ala Thr Gly Tyr Il
    #e Leu Ser Tyr Pro Val
                20      
    #            25      
    #            30
    Asp Thr Glu Thr Lys Gln Val Leu Arg Val Th
    #r His
            35          
    #        40
    <210> SEQ ID NO 199
    <211> LENGTH: 66
    <212> TYPE: PRT
    <213> ORGANISM: Gallus gallus
    <400> SEQUENCE: 199
    Leu Ser Asp Leu Leu Tyr Val Ser Ser Ser Me
    #t Arg Ala Lys Trp Gly
     1               5  
    #                10  
    #                15
    Val Ala Gly Ala Thr Gly Tyr Met Ile Leu Ty
    #r Ala Pro Leu Thr Glu
                20      
    #            25      
    #            30
    Gly Leu Ala Ala Glu Lys Glu Ile Ile Gly Gl
    #u Ala Ser Thr Leu Glu
            35          
    #        40          
    #        45
    Leu Asp Gly Leu Leu Pro Asn Thr Glu Tyr Th
    #r Val Thr Val Tyr Ala
        50              
    #    55              
    #    60
    Met Phe
    65
    <210> SEQ ID NO 200
    <211> LENGTH: 72
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 200
    Leu Ser Asp Leu Leu Tyr Val Thr Ser Met Ar
    #g Val Lys Trp Asp Ala
     1               5  
    #                10  
    #                15
    Val Gly Ala Ser Gly Tyr Leu Ile Leu Tyr Al
    #a Pro Leu Thr Glu Gly
                20      
    #            25      
    #            30
    Leu Ala Gly Glu Lys Glu Met Ile Gly Glu Th
    #r His Thr Ile Glu Leu
            35          
    #        40          
    #        45
    Ser Gly Leu Leu Pro Asn Thr Glu Tyr Thr Va
    #l Thr Val Tyr Ala Met
        50              
    #    55              
    #    60
    Phe Gly Ala Ser Asp Val Thr Gly
    65                  
    #70
    <210> SEQ ID NO 201
    <211> LENGTH: 117
    <212> TYPE: PRT
    <213> ORGANISM: Lama glama
    <400> SEQUENCE: 201
    Asp Val Gln Leu Gln Glu Ser Gly Gly Gly Le
    #u Val Gln Ala Gly Gly
     1               5  
    #                10  
    #                15
    Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Ar
    #g Thr Gly Ser Thr Tyr
                20      
    #            25      
    #            30
    Asp Met Gly Trp Phe Arg Gln Ala Pro Gly Ly
    #s Glu Arg Glu Ser Val
            35          
    #        40          
    #        45
    Ala Ala Ile Asn Trp Asp Ser Ala Arg Thr Ty
    #r Tyr Ala Ser Ser Val
        50              
    #    55              
    #    60
    Arg Gly Arg Phe Thr Ile Ser Arg Asp Asn Al
    #a Lys Lys Thr Val Tyr
    65                  
    #70                  
    #75                  
    #80
    Leu Gln Met Asn Ser Leu Lys Pro Glu Asp Th
    #r Ala Val Tyr Thr Cys
                    85  
    #                90  
    #                95
    Gly Ala Gly Glu Gly Gly Thr Trp Asp Ser Tr
    #p Gly Gln Gly Thr Gln
                100      
    #           105      
    #           110
    Val Thr Val Ser Ser
            115
    <210> SEQ ID NO 202
    <211> LENGTH: 94
    <212> TYPE: PRT
    <213> ORGANISM: Homo sapiens
    <400> SEQUENCE: 202
    Val Ser Asp Val Pro Arg Asp Leu Glu Val Va
    #l Ala Ala Thr Pro Thr
     1               5  
    #                10  
    #                15
    Ser Leu Leu Phe Ser Trp Asp Ala Pro Ala Va
    #l Thr Val Arg Tyr Tyr
                20      
    #            25      
    #            30
    Arg Ile Thr Tyr Gly Glu Thr Gly Gly Asn Se
    #r Leu Val Gln Glu Phe
            35          
    #        40          
    #        45
    Thr Val Pro Gly Ser Lys Ser Thr Ala Thr Il
    #e Ser Gly Leu Lys Pro
        50              
    #    55              
    #    60
    Gly Val Asp Tyr Thr Ile Thr Gly Tyr Ala Va
    #l Thr Gly Arg Gly Asp
    65                  
    #70                  
    #75                  
    #80
    Ser Pro Ala Ser Ser Lys Pro Ile Ser Ile As
    #n Tyr Arg Thr
                    85  
    #                90

Claims (26)

1. A molecule comprising i) a protein comprising a fibronectin type III (Fn3) domain, wherein the Fn3 domain: (a) has at least one loop with a modified amino acid sequence relative to the sequence of the corresponding loop of a human Fn3 domain; and (b) binds to a target compound that is not bound by the corresponding human Fn3 domain; and ii) a nucleic acid, wherein the protein is bonded through a DNA-puromycin linker to the nucleic acid, and wherein the protein is encoded by said nucleic acid.
2. A molecule of claim 1, wherein the Fn3 domain binds to said target compound with a KD of 10 nM or less.
3. A molecule of claim 1, wherein the Fn3 domain is a tenth type Fn3 domain (10Fn3).
4. A molecule of claim 3, wherein the loop is selected from the group of the BC loop, the DE loop and the FG loop
5. A molecule of claim 4, wherein the 10Fn3 domain has at least two loops with a modified amino acid sequence relative to the sequence of the corresponding loop of a human 10Fn3 domain.
6. A molecule of claim 4, wherein the 10Fn3 domain has at least three loops with a modified amino acid sequence relative to the sequence of the corresponding loop of a human 10Fn3 domain.
7. The molecule of claim 4, wherein at least one of the modified loops in the 10Fn3 domain is extended in length relative to the corresponding loop of a human 10Fn3 domain.
8. The molecule of claim 7, wherein the DE loop in the 10Fn3 domain is extended by 10-13 amino acid residues relative to the corresponding loop of a human 10Fn3 domain.
9. A molecule of claim 3, wherein the integrin binding motif, RGD, of the 10Fn3 domain is replaced by an amino acid sequence as follows: basic amino acid-neutral amino acid-acidic amino acid.
10. The molecule of claim 1, wherein the at least one loop is randomized relative to the sequence of the corresponding loop of a human Fn3 domain.
11. The molecule of claim 1, wherein the nucleic acid is ribonucleic acid.
12. The molecule of claim 1 selected by the method comprising the steps of a) producing a population of candidate RNA molecules, each comprising a candidate fibronectin type III (Fn3) domain sequence which differs from human Fn3 domain coding sequence, said RNA molecules each comprising a translation initiation sequence and a start codon operably linked to said candidate Fn3 domain coding sequence and each being operably linked to a DNA-puromycin linker at the 3′ end; b) in vitro translating said candidate Fn3 domain coding sequences to produce a population of candidate RNA-Fn3 fusions; c) contacting said population of candidate RNA-Fn3 fusions with the target compound; and d) selecting an RNA-Fn3 fusion, the protein portion of which has a binding affinity or specificity for said target compound that is altered relative to the binding affinity or specificity of said human Fn3 for said target molecule.
13. A fibronectin type III (Fn3) domain scaffold-based protein that binds to a compound, selected by the method comprising the steps of: a) producing a population of candidate RNA molecules, each comprising a candidate (Fn3) domain scaffold-based protein sequence which differs from human Fn3 domain coding sequence, said RNA molecules each comprising a translation initiation sequence and a start codon operably linked to said candidate protein coding sequence and each being operably linked to a DNA-puromycin linker at the 3′ end; b) in vitro translating said candidate protein coding sequences to produce a population of candidate RNA-protein fusions; c) contacting said population of candidate RNA-protein fusions with said compound; and d) selecting an RNA-protein fusion, the protein portion of which has a binding affinity or specificity for said compound that is altered relative to the binding affinity or specificity of said human Fn3 for said compound.
14. The protein of claim 13, wherein the Fn3 domain contains no free sulfhydryl moieties and no disulfide bonds.
15. The protein of claim 13, wherein the Fn3 domain binds to said compound with a KD of 10 nM or less.
16. The protein of claim 13, wherein the Fn3 domain is a tenth domain (10Fn3).
17. The protein of claim 16, wherein the 10Fn3 domain: (a) has at least one loop with a modified amino acid sequence relative to the sequence of the corresponding loop of a human 10Fn3 domain, wherein the loop is selected from the group of the BC loop, the DE loop and the FG loop.
18. The protein of claim 17, wherein the 10Fn3 domain has at least two loops with a modified amino acid sequence relative to the sequence of the corresponding loop of a human 10Fn3 domain.
19. The protein of claim 17, wherein the 10Fn3 domain has at least three loops with a modified amino acid sequence relative to the sequence of the corresponding loop of a human 10Fn3 domain.
20. The protein of claim 17, wherein at least one of the modified loops in the 10Fn3 domain is extended in length relative to the corresponding loop of a human 10Fn3 domain.
21. The protein of claim 20, wherein the DE loop is extended by 10-13 amino acid residues relative to the corresponding loop of a human 10Fn3 domain.
22. The protein of claim 17, wherein the integrin binding motif, RGD, of the 10Fn3 domain is replaced by an amino acid sequence as follows: basic amino acid-neutral amino acid-acidic amino acid.
23. The protein of claim 17, wherein the 10Fn3 domain has an amino acid sequence that is at least 70% identical to the sequence of a human 10Fn3 domain.
24. The protein of claim 17, wherein the at least one loop is randomized relative to the sequence of the corresponding loop of a human 10Fn3 domain.
25. The protein of claim 19, wherein the at least three loops are randomized relative to the sequence of the corresponding loop of a human 10Fn3 domain.
26. The protein of claim 24, wherein the selection method further comprises: (e) repeating (c) and (d) using a further randomized 10Fn3 domain.
US11/890,627 1998-12-10 2007-08-06 Selection of fibronectin scaffolds using nucleic acid-protein fusions Abandoned US20080108798A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US11/890,627 US20080108798A1 (en) 1998-12-10 2007-08-06 Selection of fibronectin scaffolds using nucleic acid-protein fusions

Applications Claiming Priority (7)

Application Number Priority Date Filing Date Title
US11173798P 1998-12-10 1998-12-10
US45669399A 1999-12-09 1999-12-09
US09/515,260 US6818418B1 (en) 1998-12-10 2000-02-29 Protein scaffolds for antibody mimics and other binding proteins
US68856600A 2000-10-16 2000-10-16
US10/728,078 US7115396B2 (en) 1998-12-10 2003-12-03 Protein scaffolds for antibody mimics and other binding proteins
US11/483,918 US20060246059A1 (en) 1998-12-10 2006-07-07 Pharmaceutically acceptable FN3 polypeptides for human treatments
US11/890,627 US20080108798A1 (en) 1998-12-10 2007-08-06 Selection of fibronectin scaffolds using nucleic acid-protein fusions

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US11/483,918 Continuation US20060246059A1 (en) 1998-12-10 2006-07-07 Pharmaceutically acceptable FN3 polypeptides for human treatments

Publications (1)

Publication Number Publication Date
US20080108798A1 true US20080108798A1 (en) 2008-05-08

Family

ID=34139775

Family Applications (11)

Application Number Title Priority Date Filing Date
US10/728,078 Expired - Lifetime US7115396B2 (en) 1998-12-10 2003-12-03 Protein scaffolds for antibody mimics and other binding proteins
US11/483,918 Abandoned US20060246059A1 (en) 1998-12-10 2006-07-07 Pharmaceutically acceptable FN3 polypeptides for human treatments
US11/483,808 Abandoned US20060270604A1 (en) 1998-12-10 2006-07-07 Pharmaceutical preparations of Fn3 polypeptides for human treatments
US11/543,316 Expired - Lifetime US9605039B2 (en) 1998-12-10 2006-10-03 Protein scaffolds for antibody mimics and other binding proteins
US11/890,554 Abandoned US20080063651A1 (en) 1998-12-10 2007-08-06 Fibronectin derivative Fc fusions
US11/890,645 Abandoned US20080015339A1 (en) 1998-12-10 2007-08-06 High affinity fibronectin derivatives
US11/890,627 Abandoned US20080108798A1 (en) 1998-12-10 2007-08-06 Selection of fibronectin scaffolds using nucleic acid-protein fusions
US11/894,732 Abandoned US20080139791A1 (en) 1998-12-10 2007-08-20 Pharmaceutically acceptable Fn3 Polypeptides for human treatments
US14/022,827 Abandoned US20140094595A1 (en) 1998-12-10 2013-09-10 Protein Scaffolds for Antibody Mimics and Other Binding Proteins
US15/440,730 Abandoned US20170275342A1 (en) 1998-12-10 2017-02-23 Protein scaffolds for antibody mimics and other binding proteins
US15/583,561 Abandoned US20170334958A1 (en) 1998-12-10 2017-05-01 Protein scaffolds for antibody mimics and other binding proteins

Family Applications Before (6)

Application Number Title Priority Date Filing Date
US10/728,078 Expired - Lifetime US7115396B2 (en) 1998-12-10 2003-12-03 Protein scaffolds for antibody mimics and other binding proteins
US11/483,918 Abandoned US20060246059A1 (en) 1998-12-10 2006-07-07 Pharmaceutically acceptable FN3 polypeptides for human treatments
US11/483,808 Abandoned US20060270604A1 (en) 1998-12-10 2006-07-07 Pharmaceutical preparations of Fn3 polypeptides for human treatments
US11/543,316 Expired - Lifetime US9605039B2 (en) 1998-12-10 2006-10-03 Protein scaffolds for antibody mimics and other binding proteins
US11/890,554 Abandoned US20080063651A1 (en) 1998-12-10 2007-08-06 Fibronectin derivative Fc fusions
US11/890,645 Abandoned US20080015339A1 (en) 1998-12-10 2007-08-06 High affinity fibronectin derivatives

Family Applications After (4)

Application Number Title Priority Date Filing Date
US11/894,732 Abandoned US20080139791A1 (en) 1998-12-10 2007-08-20 Pharmaceutically acceptable Fn3 Polypeptides for human treatments
US14/022,827 Abandoned US20140094595A1 (en) 1998-12-10 2013-09-10 Protein Scaffolds for Antibody Mimics and Other Binding Proteins
US15/440,730 Abandoned US20170275342A1 (en) 1998-12-10 2017-02-23 Protein scaffolds for antibody mimics and other binding proteins
US15/583,561 Abandoned US20170334958A1 (en) 1998-12-10 2017-05-01 Protein scaffolds for antibody mimics and other binding proteins

Country Status (1)

Country Link
US (11) US7115396B2 (en)

Cited By (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090176654A1 (en) * 2007-08-10 2009-07-09 Protelix, Inc. Universal fibronectin type III binding-domain libraries
US20090192048A1 (en) * 2005-12-20 2009-07-30 Michael A Reeve Method of producing a multimeric capture agent for binding a ligand
US20090312192A1 (en) * 2005-12-20 2009-12-17 Reeve Michael A Method for functionalising a hydrophobic substrate
US20100105567A1 (en) * 2005-12-20 2010-04-29 Reeve Michael A Novel capture agents for binding a ligand
US20100121033A1 (en) * 2006-11-22 2010-05-13 Bristol-Myers Squibb Company Targeted therapeutics based on engineered proteins for tyrosine kinases receptors, including igf-ir
US20100144599A1 (en) * 2007-02-02 2010-06-10 Bristol-Myers Squibb Company Vegf pathway blockade
US20100152063A1 (en) * 2007-08-10 2010-06-17 Protelix, Inc. Universal fibronectin type iii binding-domain libraries
US20100179094A1 (en) * 2008-11-24 2010-07-15 Bristol-Myers Squibb Company Bispecific egfr/igfir binding molecules
US20100298541A1 (en) * 2007-10-31 2010-11-25 Medimmune, Llc Protein Scaffolds
US20110053842A1 (en) * 2008-02-14 2011-03-03 Bristol-Myers Squibb Company Targeted therapeutics based on engineered proteins that bind egfr
US8470966B2 (en) 2007-08-10 2013-06-25 Protelica, Inc. Universal fibronectin type III binding-domain libraries
US8728483B2 (en) 2008-05-22 2014-05-20 Bristol-Myers Squibb Company Multivalent fibronectin based scaffold domain proteins
WO2015066480A1 (en) * 2013-11-01 2015-05-07 Regents Of The University Of Minnesota Protein scaffolds and methods of use
US9139825B2 (en) 2009-10-30 2015-09-22 Novartis Ag Universal fibronectin type III bottom-side binding domain libraries
US9212231B2 (en) 2010-04-13 2015-12-15 Medimmune, Llc TRAIL R2-specific multimeric scaffolds
US9416170B2 (en) 2011-10-31 2016-08-16 Bristol-Myers Squibb Company Fibronectin binding domains with reduced immunogenicity
US9469676B2 (en) 2011-04-13 2016-10-18 Bristol-Myers Squibb Company Fc fusion proteins comprising novel linkers or arrangements
US9493546B2 (en) 2012-09-13 2016-11-15 Bristol-Myers Squibb Company Fibronectin based scaffold domain proteins that bind to myostatin
US9540424B2 (en) 2010-05-03 2017-01-10 Bristol-Myers Squibb Company Serum albumin binding molecules
US9562089B2 (en) 2010-05-26 2017-02-07 Bristol-Myers Squibb Company Fibronectin based scaffold proteins having improved stability
US9862758B2 (en) 2003-12-05 2018-01-09 Bristol-Myers Quibb Company Inhibitors of type 2 vascular endothelial growth factor receptors
US10065987B2 (en) 2013-02-12 2018-09-04 Bristol-Myers Squibb Company High pH protein refolding methods
US10121952B2 (en) 2009-04-23 2018-11-06 The University Of Chicago Materials and methods for the preparation of nanocomposites
US10442851B2 (en) 2014-03-20 2019-10-15 Bristol-Myers Squibb Company Serum albumin-binding fibronectin type III domains
US10766946B2 (en) 2015-09-23 2020-09-08 Bristol-Myers Squibb Company Fast-off rate serum albumin binding fibronectin type III domains
US10787498B2 (en) 2013-02-06 2020-09-29 Bristol-Myers Squibb Company Fibronectin type III domain proteins with enhanced solubility
US10898538B2 (en) 2011-05-17 2021-01-26 Bristol-Myers Squibb Company Methods for maintaining pegylation of polypeptides
US11447538B2 (en) 2013-02-01 2022-09-20 Bristol-Myers Squibb Company Fibronectin based scaffold proteins
US11555062B2 (en) 2011-10-11 2023-01-17 Viela Bio, Inc. Methods of administering a composition comprising a Tn3 scaffold and a CD40L-specific monomer

Families Citing this family (208)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0985039B1 (en) 1997-06-12 2008-02-20 Novartis International Pharmaceutical Ltd. Artificial antibody polypeptides
PT1137812E (en) * 1998-12-02 2007-05-31 Adnexus Therapeutics Inc Dna-protein fusions and uses thereof
US7115396B2 (en) * 1998-12-10 2006-10-03 Compound Therapeutics, Inc. Protein scaffolds for antibody mimics and other binding proteins
JP4578768B2 (en) * 2000-07-11 2010-11-10 リサーチ コーポレイション テクノロジーズ,インコーポレイテッド Artificial antibody polypeptide
AUPR811301A0 (en) * 2001-10-05 2001-10-25 Agresearch Limited Manipulation of flavonoid biosynthesis
AU2003243436A1 (en) * 2002-06-06 2003-12-22 Shohei Koide Reconstituted polypeptides
US9321832B2 (en) 2002-06-28 2016-04-26 Domantis Limited Ligand
US20080220049A1 (en) * 2003-12-05 2008-09-11 Adnexus, A Bristol-Myers Squibb R&D Company Compositions and methods for intraocular delivery of fibronectin scaffold domain proteins
KR101502920B1 (en) 2005-06-21 2015-03-17 조마 (유에스) 엘엘씨 IL-1β Binding antibodies and fragments thereof
JP2009539346A (en) * 2006-05-26 2009-11-19 ワイカトリンク リミテッド OB fold domain
JP2010500967A (en) 2006-07-20 2010-01-14 ザ ジェネラル ホスピタル コーポレイション Methods, compositions, and kits for selective activation of protoxins through combinatorial targeting
EP2423332A1 (en) 2006-08-25 2012-02-29 Oncotherapy Science, Inc. Prognostic markers and therapeutic targets for lung cancer
AU2007323541B2 (en) 2006-11-21 2012-01-19 Kalobios Pharmaceuticals, Inc. Methods of treating chronic inflammatory diseases using a GM-CSF antagonist
WO2008077145A2 (en) 2006-12-20 2008-06-26 Xoma Technology Ltd. Treatment of il-1-beta related diseases
WO2008103378A2 (en) 2007-02-20 2008-08-28 Merrimack Pharmaceuticals, Inc. Methods of treating multiple sclerosis by administration of alpha-fetoprotein in combination with an integrin antagonist
DK3199180T3 (en) 2007-03-08 2022-03-21 Humanigen Inc EPHA3 ANTIBODIES FOR THE TREATMENT OF FIXED TUMORS
US8709731B2 (en) 2007-08-24 2014-04-29 Oncotherapy Science, Inc. DKK1 oncogene as therapeutic target for cancer and a diagnosing marker
BRPI0815769A2 (en) 2007-08-24 2017-08-08 Oncotherapy Science Inc EBI3, DLX5, NPTX1 AND CDKN3 AS TARGET GENES FOR THE THERAPY AND DIAGNOSIS OF LUNG CANCER
WO2009028521A1 (en) 2007-08-24 2009-03-05 Oncotherapy Science, Inc. Pkib and naaladl2 for target genes of prostate cancer therapy and diagnosis
AU2008320823B2 (en) 2007-11-02 2013-01-17 Novartis Ag Molecules and methods for modulating low-density-lipoprotein receptor-related protein 6 (LRP6)
CN104650235A (en) 2007-11-30 2015-05-27 葛兰素集团有限公司 Antigen-Binding Constructs
PL2391650T3 (en) 2007-12-20 2015-03-31 Xoma Us Llc Methods for the treatment of gout
US20090181020A1 (en) 2008-01-15 2009-07-16 Kalobios Pharmaceuticals, Inc. Methods of Treating Bone-Loss Disorders Using a GM-CSF Antagonist
WO2009131909A2 (en) 2008-04-21 2009-10-29 Bio-Rad Laboratories, Inc. Recombinant deamidated gliadin antigen
KR101764081B1 (en) 2008-04-30 2017-08-01 이뮤노젠 아이엔씨 Cross-linkers and their uses
PT2274331E (en) 2008-05-02 2014-02-27 Novartis Ag Improved fibronectin-based binding molecules and uses thereof
JP2011522792A (en) 2008-05-06 2011-08-04 グラクソ グループ リミテッド Encapsulating bioactive agents
EP3629022A1 (en) 2008-07-25 2020-04-01 Richard W. Wagner Protein screening methods
EP2349327A4 (en) * 2008-10-23 2012-11-28 Massachusetts Inst Technology Directed engagement of activating fc receptors
US8415291B2 (en) 2008-10-31 2013-04-09 Centocor Ortho Biotech Inc. Anti-TNF alpha fibronectin type III domain based scaffold compositions, methods and uses
MX2011004550A (en) * 2008-10-31 2011-07-20 Centocor Ortho Biotech Inc Fibronectin type iii domain based scaffold compositions, methods and uses.
US8822213B2 (en) 2008-11-06 2014-09-02 University Of Washington Bispecific intracellular delivery vehicles
TW201029662A (en) 2008-12-19 2010-08-16 Glaxo Group Ltd Novel antigen binding proteins
WO2010093814A1 (en) 2009-02-11 2010-08-19 Kalobios Pharmaceuticals, Inc. Methods of treating dementia using a gm-csf antagonist
WO2010093627A2 (en) * 2009-02-12 2010-08-19 Centocor Ortho Biotech Inc. Fibronectin type iii domain based scaffold compositions, methods and uses
CA2753287A1 (en) 2009-02-24 2010-09-02 Glaxo Group Limited Antigen-binding constructs
CA2753332A1 (en) 2009-02-24 2010-09-02 Glaxo Group Limited Antigen-binding constructs
CA2753263A1 (en) 2009-02-24 2010-09-02 Glaxo Group Limited Multivalent and/or multispecific rankl-binding constructs
WO2010099536A2 (en) 2009-02-27 2010-09-02 Massachusetts Institute Of Technology Engineered proteins with high affinity for dota chelates
AU2010221159B2 (en) 2009-03-06 2015-11-26 Humanigen, Inc. Treatment of leukemias and chronic myeloproliferative diseases with antibodies to EphA3
WO2010104961A1 (en) 2009-03-11 2010-09-16 Promedior, Inc. Treatment methods for autoimmune disorders
DK2405928T3 (en) 2009-03-11 2017-01-30 Promedior Inc Treatment and diagnosis procedure for hypersensitive diseases
US8067201B2 (en) * 2009-04-17 2011-11-29 Bristol-Myers Squibb Company Methods for protein refolding
ES2629167T3 (en) 2009-07-20 2017-08-07 Bristol-Myers Squibb Company Combination of anti-CTLA4 antibody with etoposide for the synergistic treatment of proliferative diseases
SG177763A1 (en) 2009-07-28 2012-03-29 Hoffmann La Roche Non-invasive in vivo optical imaging method
EP2486023A4 (en) 2009-10-06 2014-05-07 Immunogen Inc Potent conjugates and hydrophilic linkers
WO2011051466A1 (en) 2009-11-02 2011-05-05 Novartis Ag Anti-idiotypic fibronectin-based binding molecules and uses thereof
CN102640001A (en) 2009-11-05 2012-08-15 诺瓦提斯公司 Biomarkers predictive of progression of fibrosis
US20110150885A1 (en) 2009-12-11 2011-06-23 Atyr Pharma, Inc. Aminoacyl trna synthetases for modulating hematopoiesis
KR20130028055A (en) 2010-01-28 2013-03-18 글락소 그룹 리미티드 Cd127 binding proteins
WO2011092233A1 (en) 2010-01-29 2011-08-04 Novartis Ag Yeast mating to produce high-affinity combinations of fibronectin-based binders
MX2012009318A (en) 2010-02-10 2012-09-07 Novartis Ag Methods and compounds for muscle growth.
CN102762591A (en) * 2010-02-18 2012-10-31 百时美施贵宝公司 Fibronectin based scaffold domain proteins that bind il-23
US20150231215A1 (en) 2012-06-22 2015-08-20 Randolph J. Noelle VISTA Antagonist and Methods of Use
WO2011120013A2 (en) 2010-03-26 2011-09-29 Trustees Of Dartmouth College Vista regulatory t cell mediator protein, vista binding agents and use thereof
US10745467B2 (en) 2010-03-26 2020-08-18 The Trustees Of Dartmouth College VISTA-Ig for treatment of autoimmune, allergic and inflammatory disorders
US8420098B2 (en) 2010-04-13 2013-04-16 Bristol-Myers Squibb Company Fibronectin based scaffold domain proteins that bind to PCSK9
JP6066900B2 (en) 2010-04-26 2017-01-25 エータイアー ファーマ, インコーポレイテッド Innovative discovery of therapeutic, diagnostic and antibody compositions related to protein fragments of cysteinyl tRNA synthetase
CA2797362C (en) 2010-04-27 2020-12-08 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of isoleucyl trna synthetases
JP6008837B2 (en) 2010-04-28 2016-10-19 エータイアー ファーマ, インコーポレイテッド Innovative discovery of therapeutic, diagnostic and antibody compositions related to protein fragments of alanyl tRNA synthetase
WO2011139854A2 (en) 2010-04-29 2011-11-10 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of asparaginyl trna synthetases
WO2011150279A2 (en) 2010-05-27 2011-12-01 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of glutaminyl-trna synthetases
CA2797393C (en) 2010-04-29 2020-03-10 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of valyl trna synthetases
MX339126B (en) 2010-04-30 2016-05-11 Janssen Biotech Inc Stabilized fibronectin domain compositions, methods and uses.
JP6008841B2 (en) 2010-05-03 2016-10-19 エータイアー ファーマ, インコーポレイテッド Innovative discovery of therapeutic, diagnostic and antibody compositions related to protein fragments of methionyl tRNA synthetase
EP2566495B1 (en) 2010-05-03 2017-03-01 aTyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of phenylalanyl-alpha-trna synthetases
CA2797277C (en) 2010-05-03 2021-02-23 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of arginyl-trna synthetases
EP2566499B1 (en) 2010-05-04 2017-01-25 aTyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of p38 multi-trna synthetase complex
WO2011138462A1 (en) 2010-05-07 2011-11-10 F. Hoffmann-La Roche Ag Diagnostic method for the detection of cells ex vivo
EP2569431B1 (en) 2010-05-13 2015-09-23 Sarepta Therapeutics, Inc. Methods for identifying compounds which modulate interleukins 17 and 23 signaling activity
CN103200953B (en) 2010-05-14 2017-02-15 Atyr 医药公司 Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of phenylalanyl-beta-trna synthetases
UY33421A (en) 2010-06-03 2011-12-30 Glaxo Wellcome House HUMANIZED ANTIGEN UNION PROTEINS
AU2011289831C1 (en) 2010-07-12 2017-06-15 Pangu Biopharma Limited Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of glycyl-tRNA synthetases
EP2608801B1 (en) 2010-08-25 2019-08-21 aTyr Pharma, Inc. INNOVATIVE DISCOVERY OF THERAPEUTIC, DIAGNOSTIC, AND ANTIBODY COMPOSITIONS RELATED TO PROTEIN FRAGMENTS OF TYROSYL-tRNA SYNTHETASES
EP3130605B1 (en) 2010-11-05 2020-01-08 Eisai Inc. Folate receptor alpha as a diagnostic and prognostic marker for folate receptor alpha-expressing cancers
AU2011333878B2 (en) 2010-11-23 2016-01-14 Glaxo Group Limited Antigen binding proteins to oncostatin M (OSM)
BR112013013003A2 (en) 2010-11-24 2016-08-09 Glaxo Group Ltd antigen binding protein and pharmaceutical composition
EP2655411A1 (en) 2010-12-22 2013-10-30 Bristol-Myers Squibb Company Fibronectin based scaffold domain proteins that bind il-23
SG191955A1 (en) 2011-02-15 2013-08-30 Immunogen Inc Cytotoxic benzodiazepine derivatives
CA2864868A1 (en) 2011-02-25 2012-11-15 Wellstat Diagnostics, Llc Assays for detecting enzymatic activity
WO2012140627A1 (en) 2011-04-15 2012-10-18 Compugen Ltd. Polypeptides and polynucleotides, and uses thereof for treatment of immune related disorders and cancer
US9347058B2 (en) 2011-05-17 2016-05-24 Bristol-Myers Squibb Company Methods for the selection of binding proteins
BR112013028779B8 (en) 2011-05-27 2021-04-20 Glaxo Group Ltd antigen-binding protein or immunoconjugate, immunoconjugate, pharmaceutical composition, and, use of a composition
JP2014516551A (en) 2011-06-02 2014-07-17 マサチューセッツ インスティテュート オブ テクノロジー Metakaryotic stem cell dsRNA / DNA hybrid genome replication intermediates
MX350954B (en) 2011-06-21 2017-09-26 Immunogen Inc Novel maytansinoid derivatives with peptide linker and conjugates thereof.
EP2729488A4 (en) * 2011-07-06 2015-01-14 Medimmune Llc Methods for making multimeric polypeptides
EP4151785A1 (en) 2011-09-27 2023-03-22 Janssen Biotech, Inc. Fibronectin type iii repeat based protein scaffolds with alternative binding surfaces
US9320567B2 (en) * 2011-09-30 2016-04-26 Becton, Dickinson And Company Medical waste containers and lids therefore
WO2013059886A1 (en) 2011-10-28 2013-05-02 Patrys Limited Pat-lm1 epitopes and methods for using same
CA2857177C (en) 2011-12-05 2020-06-23 Bio-Rad Laboratories, Inc. Recombinant deamidated gliadin antigen
EP2794657B1 (en) 2011-12-19 2017-10-11 Xoma (Us) Llc Methods for treating acne
PT2804878T (en) 2012-01-20 2018-11-29 Genzyme Corp Anti-cxcr3 antibodies
EP2809800A1 (en) 2012-01-30 2014-12-10 VIB vzw Means and method for diagnosis and treatment of alzheimer's disease
SG10201708562VA (en) 2012-02-13 2017-12-28 Agency Science Tech & Res IL-1β Neutralizing Human Monoclonal Antibodies
KR20140123571A (en) 2012-02-16 2014-10-22 에이티와이알 파마, 인코포레이티드 Histidyl-trna synthetases for treating autoimmune and inflammatory diseases
US9592289B2 (en) 2012-03-26 2017-03-14 Sanofi Stable IgG4 based binding agent formulations
GB2502127A (en) 2012-05-17 2013-11-20 Kymab Ltd Multivalent antibodies and in vivo methods for their production
US9890215B2 (en) 2012-06-22 2018-02-13 King's College London Vista modulators for diagnosis and treatment of cancer
WO2014031566A1 (en) 2012-08-22 2014-02-27 Immunogen, Inc. Cytotoxic benzodiazepine derivatives
CN109793893B (en) 2012-09-07 2023-05-26 达特茅斯大学理事会 VISTA modulators for diagnosis and treatment of cancer
WO2014072306A1 (en) 2012-11-08 2014-05-15 F. Hoffmann-La Roche Ag Her3 antigen binding proteins binding to the beta-hairpin of her3
WO2014126871A1 (en) 2013-02-12 2014-08-21 Bristol-Myers Squibb Company Tangential flow filtration based protein refolding methods
WO2014134486A2 (en) 2013-02-28 2014-09-04 Immunogen, Inc. Conjugates comprising cell-binding agents and cytotoxic agents
CA3175634A1 (en) 2013-02-28 2014-09-04 Caprion Proteomics Inc. Tuberculosis biomarkers and uses thereof
JP6494533B2 (en) 2013-02-28 2019-04-03 イミュノジェン・インコーポレーテッド Complexes comprising maytansinoids as cell binding agents and cytotoxic agents
EP2968587A2 (en) 2013-03-13 2016-01-20 Bristol-Myers Squibb Company Fibronectin based scaffold domains linked to serum albumin or a moiety binding thereto
WO2014159961A1 (en) 2013-03-14 2014-10-02 Beth Israel Deaconess Medical Center, Inc. Measurement of fgf21 as a biomarker of fructose metabolism and metabolic disease
US9157910B2 (en) 2013-03-15 2015-10-13 Abbott Laboratories Assay with increased dynamic range
US9005901B2 (en) 2013-03-15 2015-04-14 Abbott Laboratories Assay with internal calibration
EP2999796B1 (en) 2013-05-20 2017-10-25 BioVentures, LLC Gep5 model for multiple myeloma
WO2014194030A2 (en) 2013-05-31 2014-12-04 Immunogen, Inc. Conjugates comprising cell-binding agents and cytotoxic agents
AU2014329437B2 (en) 2013-10-06 2018-10-18 F. Hoffmann-La Roche Ag Modified Pseudomonas exotoxin A
ES2784234T3 (en) 2013-10-08 2020-09-23 Promedior Inc Methods for treating fibrotic cancers
PL3074038T3 (en) 2013-11-28 2019-08-30 Csl Limited Method of treating diabetic nephropathy
DK3082860T3 (en) 2013-12-18 2021-01-25 Csl Ltd PROCEDURE FOR WOUND TREATMENT
US11014987B2 (en) 2013-12-24 2021-05-25 Janssen Pharmaceutics Nv Anti-vista antibodies and fragments, uses thereof, and methods of identifying same
CA2935378C (en) 2013-12-24 2023-04-18 Janssen Pharmaceutica Nv Anti-vista antibodies and fragments
EP3647322B1 (en) 2014-03-20 2021-10-20 Bristol-Myers Squibb Company Stabilized fibronectin based scaffold molecules
MX2016016310A (en) 2014-06-11 2017-10-20 A Green Kathy Use of vista agonists and antagonists to suppress or enhance humoral immunity.
JP6789124B2 (en) 2014-07-03 2020-11-25 イエール ユニバーシティ Inhibition of Dicopf2 (Dkk2) that suppresses tumorigenesis
WO2016008112A1 (en) 2014-07-16 2016-01-21 Medshine Discovery Inc. Linkers and application towards adc thereof
CN106661578B (en) 2014-08-15 2020-08-04 埃迪恩克斯股份有限公司 Oligonucleotide decoys for pain treatment
MA54254A (en) 2014-09-03 2021-09-22 Immunogen Inc CYTOTOXIC BENZODIAZEPINE DERIVATIVES
JP2017527562A (en) 2014-09-03 2017-09-21 イミュノジェン・インコーポレーテッド Cytotoxic benzodiazepine derivatives
CA2962783A1 (en) 2014-10-01 2016-04-07 Medimmune, Llc Method of conjugating a polypeptide
WO2016059602A2 (en) 2014-10-16 2016-04-21 Glaxo Group Limited Methods of treating cancer and related compositions
BR112017009006A2 (en) 2014-11-14 2018-04-10 F. Hoffmann-La Roche Ag binding molecule, isolated polynucleotide, vector, host cell, binding molecule production method, pharmaceutical composition, use of the binding molecule, and method of treating a disease in an individual
JP6622814B2 (en) 2014-11-25 2019-12-18 ブリストル−マイヤーズ スクイブ カンパニーBristol−Myers Squibb Company Novel PD-L1 binding polypeptides for imaging
BR112017010414A2 (en) 2014-11-25 2018-05-15 Bristol-Myers Squibb Company methods and compositions for 18f radiolabeling of biological substances
JP2018505911A (en) 2014-12-05 2018-03-01 イミュネクスト,インコーポレーテッド Identification of VSIG8 as a putative VISTA receptor and its use to produce a VISTA / VSIG8 modulator
EP3061826A1 (en) 2015-02-27 2016-08-31 Novartis AG Flavivirus replicons
CN107614014B (en) 2015-05-28 2022-07-12 生物辐射实验室股份有限公司 Affinity ligands and methods relating thereto
CN107922497B (en) 2015-06-24 2022-04-12 詹森药业有限公司 anti-VISTA antibodies and fragments
EP3328427A4 (en) 2015-07-27 2018-12-12 The General Hospital Corporation Antibody derivatives with conditionally enabled effector function
ES2809125T3 (en) 2015-09-23 2021-03-03 Bristol Myers Squibb Co Glypican-3 binding fibronectin-based scaffold molecules
AR106188A1 (en) 2015-10-01 2017-12-20 Hoffmann La Roche ANTI-CD19 HUMANIZED HUMAN ANTIBODIES AND METHODS OF USE
LT3356411T (en) 2015-10-02 2021-09-10 F. Hoffmann-La Roche Ag Bispecific antibodies specific for pd1 and tim3
RU2761115C1 (en) 2015-10-02 2021-12-06 Ф. Хоффманн-Ля Рош Аг Bispecific antibodies specific relatively to costimulatory tnf-receptor
MY193013A (en) 2015-10-07 2022-09-22 Hoffmann La Roche Bispecific antibodies with tetravalency for a costimulatory tnf receptor
KR20180085723A (en) 2015-10-28 2018-07-27 예일 유니버시티 Humanized anti-DKK2 antibodies and uses thereof
TWI756204B (en) 2016-02-12 2022-03-01 比利時商楊森製藥公司 Anti-vista antibodies and fragments, uses thereof, and methods of identifying same
CN105671068B (en) * 2016-03-15 2019-03-26 大连大学 A method of plasma pro-brain natriuretic peptide levels antigen substitute is prepared based on source of people skelemin Fn3
EP3430051B1 (en) 2016-03-17 2021-01-13 The United States of America as represented by the Secretary of the Department of Health and Human Services Anti-py1235-met immunological binding reagent
WO2017165778A1 (en) 2016-03-24 2017-09-28 Millennium Pharmaceuticals, Inc. Methods of treating gastrointestinal immune-related adverse events in immune oncology treatments
WO2017165742A1 (en) 2016-03-24 2017-09-28 Millennium Pharmaceuticals, Inc. Methods of treating gastrointestinal immune-related adverse events in anti-ctla4 anti-pd-1 combination treatments
EP3231813A1 (en) 2016-03-29 2017-10-18 F. Hoffmann-La Roche AG Trimeric costimulatory tnf family ligand-containing antigen binding molecules
EP3450554B1 (en) 2016-03-30 2022-09-21 Epsilon Molecular Engineering Inc. High-speed in vitro screening method
US11634705B2 (en) 2016-03-30 2023-04-25 Epsilon Molecular Engineering Inc. High-speed in vitro screening method
SG11201808633RA (en) 2016-04-15 2018-10-30 Immunext Inc Anti-human vista antibodies and use thereof
JP7285076B2 (en) 2016-05-11 2023-06-01 エフ・ホフマン-ラ・ロシュ・アクチェンゲゼルシャフト Antigen-binding molecule comprising a TNF family ligand trimer and a tenascin-binding portion
EP3243832A1 (en) 2016-05-13 2017-11-15 F. Hoffmann-La Roche AG Antigen binding molecules comprising a tnf family ligand trimer and pd1 binding moiety
US11344639B2 (en) 2016-06-01 2022-05-31 Bristol-Myers Squibb Company PET imaging with PD-L1 binding polypeptides
WO2017210335A1 (en) 2016-06-01 2017-12-07 Bristol-Myers Squibb Company Imaging methods using 18f-radiolabeled biologics
JP2020501508A (en) 2016-09-15 2020-01-23 クアドルセプト バイオ リミテッド Multimers, tetramers and octamers
CA3034105A1 (en) 2016-09-23 2018-03-29 Csl Limited Coagulation factor binding proteins and uses thereof
WO2018136163A2 (en) 2016-12-09 2018-07-26 Theripion, Inc. Tandem apoa-1 fusion polypeptides
EP3554535A4 (en) 2016-12-14 2020-10-21 Janssen Biotech, Inc. Pd-l1 binding fibronectin type iii domains
EP3554561B1 (en) 2016-12-14 2023-06-28 Janssen Biotech, Inc. Cd137 binding fibronectin type iii domains
JP7104703B2 (en) 2016-12-14 2022-07-21 ヤンセン バイオテツク,インコーポレーテツド CD8A-binding fibronectin type III domain
TW201829469A (en) 2017-01-03 2018-08-16 瑞士商赫孚孟拉羅股份公司 Bispecific antigen binding molecules comprising anti-4-1bb clone 20h4.9
SG10202109376YA (en) 2017-02-28 2021-10-28 Immunogen Inc Maytansinoid derivatives with self-immolative peptide linkers and conjugates thereof
WO2018178076A1 (en) 2017-03-29 2018-10-04 F. Hoffmann-La Roche Ag Bispecific antigen binding molecule for a costimulatory tnf receptor
WO2018178055A1 (en) 2017-03-29 2018-10-04 F. Hoffmann-La Roche Ag Bispecific antigen binding molecule for a costimulatory tnf receptor
WO2018178074A1 (en) 2017-03-29 2018-10-04 F. Hoffmann-La Roche Ag Trimeric antigen binding molecules specific for a costimulatory tnf receptor
BR112019017753A2 (en) 2017-04-04 2020-04-07 Hoffmann La Roche bispecific molecule, polynucleotide, vector, cell, methods for the production of a molecule and for the treatment of an individual, composition and use of the bispecific molecule
KR102408873B1 (en) 2017-04-05 2022-06-15 에프. 호프만-라 로슈 아게 Bispecific antibodies specifically binding to pd1 and lag3
WO2018195243A1 (en) 2017-04-20 2018-10-25 Immunogen, Inc. Cytotoxic benzodiazepine derivatives and conjugates thereof
AR111963A1 (en) 2017-05-26 2019-09-04 Univ California METHOD AND MOLECULES
WO2019040808A1 (en) * 2017-08-25 2019-02-28 Janssen Biotech, Inc. FCγRII BINDING FIBRONECTIN TYPE III DOMAINS, THEIR CONJUGATES AND MULTISPECIFIC MOLECULES COMPRISING THEM
CN111247165B (en) 2017-10-18 2023-11-10 Csl有限公司 Human serum albumin variants and uses thereof
WO2019086500A2 (en) 2017-11-01 2019-05-09 F. Hoffmann-La Roche Ag Bispecific 2+1 contorsbodies
EP3703746A1 (en) 2017-11-01 2020-09-09 F. Hoffmann-La Roche AG Novel tnf family ligand trimer-containing antigen binding molecules
ES2939461T3 (en) 2017-11-29 2023-04-24 Csl Ltd Method for treating or preventing ischemia-reperfusion injury
EP3502140A1 (en) 2017-12-21 2019-06-26 F. Hoffmann-La Roche AG Combination therapy of tumor targeted icos agonists with t-cell bispecific molecules
TWI827575B (en) 2017-12-28 2024-01-01 美商伊繆諾金公司 Benzodiazepine derivatives
EP3774900A1 (en) 2018-04-13 2021-02-17 F. Hoffmann-La Roche AG Her2-targeting antigen binding molecules comprising 4-1bbl
CN112424228A (en) 2018-07-04 2021-02-26 豪夫迈·罗氏有限公司 Novel bispecific agonistic 4-1BB antigen-binding molecules
CN112654641A (en) 2018-10-01 2021-04-13 豪夫迈·罗氏有限公司 Bispecific antigen binding molecules with trivalent binding to CD40
US11242396B2 (en) 2018-10-01 2022-02-08 Hoffmann-La Roche Inc. Bispecific antigen binding molecules comprising anti-FAP clone 212
EP3898682A1 (en) 2018-12-21 2021-10-27 F. Hoffmann-La Roche AG Tumor-targeted agonistic cd28 antigen binding molecules
JP2022513495A (en) 2018-12-21 2022-02-08 エフ・ホフマン-ラ・ロシュ・アクチェンゲゼルシャフト Tumor targeting superagonist CD28 antigen binding molecule
AU2020216295A1 (en) 2019-01-28 2021-09-09 Maple Biotech Llc PSMP antagonists for use in treatment of fibrotic disease of the lung, kidney or liver
GB201903767D0 (en) 2019-03-19 2019-05-01 Quadrucept Bio Ltd Multimers, tetramers & octamers
WO2020191306A1 (en) 2019-03-21 2020-09-24 Immunogen, Inc. Methods of preparing cell-binding agent-drug conjugates
MA55520A (en) 2019-03-29 2022-02-09 Immunogen Inc CYTOTOXIC BIS-BENZODIAZEPINE DERIVATIVES AND THEIR CONJUGATE WITH CELL-BINDING AGENTS FOR INHIBITING ABNORMAL CELL GROWTH OR FOR TREATING PROLIFERATIVE DISEASES
JP7301155B2 (en) 2019-04-12 2023-06-30 エフ・ホフマン-ラ・ロシュ・アクチェンゲゼルシャフト Bispecific antigen-binding molecules containing lipocalin muteins
LT3958977T (en) 2019-04-26 2023-12-27 Immunogen, Inc. Camptothecin derivatives
AU2020304813A1 (en) 2019-06-26 2022-01-06 F. Hoffmann-La Roche Ag Fusion of an antibody binding CEA and 4-1BBL
JP7354306B2 (en) 2019-06-27 2023-10-02 エフ・ホフマン-ラ・ロシュ・アクチェンゲゼルシャフト Novel ICOS antibodies and tumor-targeting antigen-binding molecules containing them
WO2021076574A2 (en) 2019-10-14 2021-04-22 Aro Biotherapeutics Company Fn3 domain-sirna conjugates and uses thereof
WO2021076546A1 (en) 2019-10-14 2021-04-22 Aro Biotherapeutics Company Cd71 binding fibronectin type iii domains
JP2023515633A (en) 2020-02-28 2023-04-13 ブリストル-マイヤーズ スクイブ カンパニー Radiolabeled fibronectin-based scaffolds and antibodies and their theranostic uses
WO2021190980A1 (en) 2020-03-22 2021-09-30 Quadrucept Bio Limited Multimers for viral strain evolution
AR121706A1 (en) 2020-04-01 2022-06-29 Hoffmann La Roche OX40 AND FAP-TARGETED BSPECIFIC ANTIGEN-BINDING MOLECULES
EP4168448A1 (en) 2020-06-23 2023-04-26 F. Hoffmann-La Roche AG Agonistic cd28 antigen binding molecules targeting her2
JP2023531067A (en) 2020-06-25 2023-07-20 エフ・ホフマン-ラ・ロシュ・アクチェンゲゼルシャフト Anti-CD3/Anti-CD28 Bispecific Antigen Binding Molecules
JP2023549316A (en) 2020-11-16 2023-11-24 エフ・ホフマン-ラ・ロシュ・アクチェンゲゼルシャフト Combination therapy with CD40 agonists targeting FAP
WO2022148732A1 (en) 2021-01-06 2022-07-14 F. Hoffmann-La Roche Ag Combination therapy employing a pd1-lag3 bispecific antibody and a cd20 t cell bispecific antibody
WO2022178090A2 (en) 2021-02-19 2022-08-25 Theripion, Inc. Dnase fusion polypeptides and related compositions and methods
WO2022184659A1 (en) 2021-03-01 2022-09-09 Quadrucept Bio Limited Antibody domains & multimers
WO2022243261A1 (en) 2021-05-19 2022-11-24 F. Hoffmann-La Roche Ag Agonistic cd40 antigen binding molecules targeting cea
AR126009A1 (en) 2021-06-02 2023-08-30 Hoffmann La Roche CD28 ANTIGEN-BINDING AGONIST MOLECULES THAT TARGET EPCAM
KR20240028452A (en) 2021-07-02 2024-03-05 제넨테크, 인크. Methods and compositions for treating cancer
WO2023010095A1 (en) 2021-07-28 2023-02-02 F. Hoffmann-La Roche Ag Methods and compositions for treating cancer
AR128031A1 (en) 2021-12-20 2024-03-20 F Hoffmann La Roche Ag ANTI-LTBR AGONIST ANTIBODIES AND BISPECIFIC ANTIBODIES THAT COMPRISE THEM
WO2023186756A1 (en) 2022-03-28 2023-10-05 F. Hoffmann-La Roche Ag Interferon gamma variants and antigen binding molecules comprising these
WO2024044770A1 (en) 2022-08-26 2024-02-29 Core Biotherapeutics, Inc. Oligonucleotides for the treatment of breast cancer
WO2024056861A1 (en) 2022-09-15 2024-03-21 Avidicure Ip B.V. Multispecific antigen binding proteins for stimulating nk cells and use thereof

Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5235041A (en) * 1990-12-28 1993-08-10 Protein Polymer Technologies, Inc. Purification of structurally ordered recombinant protein polymers
US5514581A (en) * 1986-11-04 1996-05-07 Protein Polymer Technologies, Inc. Functional recombinantly prepared synthetic protein polymer
US5545620A (en) * 1993-01-19 1996-08-13 The United States Of America As Represented By The Department Of Health And Human Services Synthetic fibronectin fragments as inhibitors of retroviral infection
US5641648A (en) * 1986-11-04 1997-06-24 Protein Polymer Technologies, Inc. Methods for preparing synthetic repetitive DNA
US5770697A (en) * 1986-11-04 1998-06-23 Protein Polymer Technologies, Inc. Peptides comprising repetitive units of amino acids and DNA sequences encoding the same
US5792742A (en) * 1991-06-14 1998-08-11 New York University Fibrin-binding peptide fragments of fibronectin
US6018030A (en) * 1986-11-04 2000-01-25 Protein Polymer Technologies, Inc. Peptides comprising repetitive units of amino acids and DNA sequences encoding the same
US6207446B1 (en) * 1997-01-21 2001-03-27 The General Hospital Corporation Selection of proteins using RNA-protein fusions
US6261804B1 (en) * 1997-01-21 2001-07-17 The General Hospital Corporation Selection of proteins using RNA-protein fusions
US20020019517A1 (en) * 1997-06-12 2002-02-14 Shohei Koide Artifical antibody polypeptides
US6383775B1 (en) * 1996-09-11 2002-05-07 Interleukin Genetics, Inc. Designer proteases
US20030104520A1 (en) * 2000-06-15 2003-06-05 Ellington Andrew D. Regulatable, catalytically active nucleic acids
US6818418B1 (en) * 1998-12-10 2004-11-16 Compound Therapeutics, Inc. Protein scaffolds for antibody mimics and other binding proteins
US7115396B2 (en) * 1998-12-10 2006-10-03 Compound Therapeutics, Inc. Protein scaffolds for antibody mimics and other binding proteins

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5922676A (en) 1996-09-20 1999-07-13 The Burnham Institute Methods of inhibiting cancer by using superfibronectin
US6159722A (en) * 1997-12-03 2000-12-12 Boehringer Mannheim Gmbh Chimeric serine proteases
IL138668A0 (en) 1998-04-03 2001-10-31 Phylos Inc Addressable protein arrays
WO2000034784A1 (en) 1998-12-10 2000-06-15 Phylos, Inc. Protein scaffolds for antibody mimics and other binding proteins
JP4578768B2 (en) 2000-07-11 2010-11-10 リサーチ コーポレイション テクノロジーズ,インコーポレイテッド Artificial antibody polypeptide
EP1356075A4 (en) 2000-10-16 2005-04-13 Compound Therapeutics Inc Protein scaffolds for antibody mimics and other binding proteins
AU2002342672A1 (en) 2001-09-11 2003-03-24 Nascacell Gmbh Method for screening for inhibitors of protein/protein interaction and corresponding ribozymes
WO2008097497A2 (en) 2007-02-02 2008-08-14 Adnexus, A Bristol-Myers Squibb R & D Company Vegf pathway blockade
EP2197476A2 (en) 2007-08-20 2010-06-23 Bristol-Myers Squibb Company Use of vegfr-2 inhibitors for treating metastatic cancer
US20110034384A1 (en) 2007-11-28 2011-02-10 Bristol-Myers Squibb Company COMBINATION VEGFR2 THERAPY WITH mTOR INHIBITORS

Patent Citations (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5514581A (en) * 1986-11-04 1996-05-07 Protein Polymer Technologies, Inc. Functional recombinantly prepared synthetic protein polymer
US5641648A (en) * 1986-11-04 1997-06-24 Protein Polymer Technologies, Inc. Methods for preparing synthetic repetitive DNA
US5770697A (en) * 1986-11-04 1998-06-23 Protein Polymer Technologies, Inc. Peptides comprising repetitive units of amino acids and DNA sequences encoding the same
US6018030A (en) * 1986-11-04 2000-01-25 Protein Polymer Technologies, Inc. Peptides comprising repetitive units of amino acids and DNA sequences encoding the same
US5235041A (en) * 1990-12-28 1993-08-10 Protein Polymer Technologies, Inc. Purification of structurally ordered recombinant protein polymers
US5792742A (en) * 1991-06-14 1998-08-11 New York University Fibrin-binding peptide fragments of fibronectin
US5545620A (en) * 1993-01-19 1996-08-13 The United States Of America As Represented By The Department Of Health And Human Services Synthetic fibronectin fragments as inhibitors of retroviral infection
US6383775B1 (en) * 1996-09-11 2002-05-07 Interleukin Genetics, Inc. Designer proteases
US6261804B1 (en) * 1997-01-21 2001-07-17 The General Hospital Corporation Selection of proteins using RNA-protein fusions
US6258558B1 (en) * 1997-01-21 2001-07-10 The General Hospital Corporation Method for selection of proteins using RNA-protein fusions
US6214553B1 (en) * 1997-01-21 2001-04-10 Massachusetts General Hospital Libraries of protein encoding RNA-protein fusions
US6281344B1 (en) * 1997-01-21 2001-08-28 The General Hospital Corporation Nucleic acid-protein fusion molecules and libraries
US6207446B1 (en) * 1997-01-21 2001-03-27 The General Hospital Corporation Selection of proteins using RNA-protein fusions
US6518018B1 (en) * 1997-01-21 2003-02-11 The General Hospital Corporation RNA-antibody fusions and their selection
US20020019517A1 (en) * 1997-06-12 2002-02-14 Shohei Koide Artifical antibody polypeptides
US6462189B1 (en) * 1997-06-12 2002-10-08 Research Corporation Technologies Nucleic acids encoding artificial antibody polypeptides
US20030170753A1 (en) * 1997-06-12 2003-09-11 Research Corporation Technologies, Inc. Artificial antibody polypeptides
US6818418B1 (en) * 1998-12-10 2004-11-16 Compound Therapeutics, Inc. Protein scaffolds for antibody mimics and other binding proteins
US20050255548A1 (en) * 1998-12-10 2005-11-17 Phylos, Inc. Protein scaffolds for antibody mimics and other binding proteins
US7115396B2 (en) * 1998-12-10 2006-10-03 Compound Therapeutics, Inc. Protein scaffolds for antibody mimics and other binding proteins
US20030104520A1 (en) * 2000-06-15 2003-06-05 Ellington Andrew D. Regulatable, catalytically active nucleic acids

Cited By (73)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10995131B2 (en) 2003-12-05 2021-05-04 Bristol-Myers Squibb Company Libraries of modified fibronectin type III tenth domain-containing polypeptides
US9862758B2 (en) 2003-12-05 2018-01-09 Bristol-Myers Quibb Company Inhibitors of type 2 vascular endothelial growth factor receptors
US20090192048A1 (en) * 2005-12-20 2009-07-30 Michael A Reeve Method of producing a multimeric capture agent for binding a ligand
US20090312192A1 (en) * 2005-12-20 2009-12-17 Reeve Michael A Method for functionalising a hydrophobic substrate
US20100105567A1 (en) * 2005-12-20 2010-04-29 Reeve Michael A Novel capture agents for binding a ligand
US10221232B2 (en) 2006-11-22 2019-03-05 Bristol-Myers Squibb Company Methods of treating cancer by administering IGF-IR binding molecules
US20100121033A1 (en) * 2006-11-22 2010-05-13 Bristol-Myers Squibb Company Targeted therapeutics based on engineered proteins for tyrosine kinases receptors, including igf-ir
US8470332B2 (en) 2006-11-22 2013-06-25 Bristol-Myers Squibb Company Targeted therapeutics based on engineered proteins for tyrosine kinases receptors, including IGF-IR
US11149077B2 (en) 2006-11-22 2021-10-19 Bristol-Myers Squibb Company Targeted therapeutics based on engineered proteins for tyrosine kinases receptors, including IGF-IR
US20100144599A1 (en) * 2007-02-02 2010-06-10 Bristol-Myers Squibb Company Vegf pathway blockade
US20110124527A1 (en) * 2007-08-10 2011-05-26 Guido Cappuccilli Universal fibronectin type iii binding-domain libraries
US8470966B2 (en) 2007-08-10 2013-06-25 Protelica, Inc. Universal fibronectin type III binding-domain libraries
US20100152063A1 (en) * 2007-08-10 2010-06-17 Protelix, Inc. Universal fibronectin type iii binding-domain libraries
US20090176654A1 (en) * 2007-08-10 2009-07-09 Protelix, Inc. Universal fibronectin type III binding-domain libraries
US8680019B2 (en) 2007-08-10 2014-03-25 Protelica, Inc. Universal fibronectin Type III binding-domain libraries
US8697608B2 (en) 2007-08-10 2014-04-15 Protelica, Inc. Universal fibronectin type III binding-domain libraries
US9376483B2 (en) 2007-08-10 2016-06-28 Protelica, Inc. Universal fibronectin type III binding-domain libraries
US20100298541A1 (en) * 2007-10-31 2010-11-25 Medimmune, Llc Protein Scaffolds
US8633297B2 (en) * 2007-10-31 2014-01-21 Medimmune, Llc Protein scaffolds
US20140141994A1 (en) * 2007-10-31 2014-05-22 Medimmune, Llc Protein Scaffolds
AU2008319298B2 (en) * 2007-10-31 2014-07-31 Medimmune, Llc Protein scaffolds
US9176129B2 (en) * 2007-10-31 2015-11-03 Medimmune, Llc Protein scaffolds
US8524244B2 (en) 2008-02-14 2013-09-03 Bristol-Myers Squibb Company Targeted therapeutics based on engineered proteins that bind EGFR
US10781247B2 (en) 2008-02-14 2020-09-22 Bristol-Myers Squibb Company Targeted therapeutics based on engineered proteins that bind EGFR
US9920108B2 (en) 2008-02-14 2018-03-20 Bristol-Myers Squibb Company Targeted therapeutics based on engineered proteins that bind EGFR
US9234028B2 (en) 2008-02-14 2016-01-12 Bristol-Myers Squibb Company Targeted therapeutics based on engineered proteins that bind EGFR
US20110053842A1 (en) * 2008-02-14 2011-03-03 Bristol-Myers Squibb Company Targeted therapeutics based on engineered proteins that bind egfr
US10774130B2 (en) 2008-05-22 2020-09-15 Bristol-Myers Squibb Company Method of treating cancer by administering multivalent fibronectin based scaffold domain proteins
US8728483B2 (en) 2008-05-22 2014-05-20 Bristol-Myers Squibb Company Multivalent fibronectin based scaffold domain proteins
US9902762B2 (en) 2008-05-22 2018-02-27 Bristol-Myers Squibb Company Multivalent fibronectin based scaffold domain proteins
US9771411B2 (en) 2008-11-24 2017-09-26 Bristol-Myers Squibb Company Method of treating cancer by administering EGFR and EGFR/IGFIR binding molecules
US10183987B2 (en) 2008-11-24 2019-01-22 Bristol-Myers Squibb Company Polynucleotides encoding bispecific EGFR/IGF-IR binding molecules
US20100179094A1 (en) * 2008-11-24 2010-07-15 Bristol-Myers Squibb Company Bispecific egfr/igfir binding molecules
US8343501B2 (en) 2008-11-24 2013-01-01 Bristol-Myers Squibb Company Bispecific EGFR/IGFIR binding molecules
US10954286B2 (en) 2008-11-24 2021-03-23 Bristol-Myers Squibb Company Bispecific EGFR/IGFIR binding molecules
US9017655B2 (en) 2008-11-24 2015-04-28 Bristol-Myers Squibb Company Bispecific EGFR/IGFIR binding molecules
US10121952B2 (en) 2009-04-23 2018-11-06 The University Of Chicago Materials and methods for the preparation of nanocomposites
US9139825B2 (en) 2009-10-30 2015-09-22 Novartis Ag Universal fibronectin type III bottom-side binding domain libraries
US10253313B2 (en) 2009-10-30 2019-04-09 Novartis Ag Universal fibronectin type III bottom-side binding domain libraries
US9212231B2 (en) 2010-04-13 2015-12-15 Medimmune, Llc TRAIL R2-specific multimeric scaffolds
US9540424B2 (en) 2010-05-03 2017-01-10 Bristol-Myers Squibb Company Serum albumin binding molecules
US10934572B2 (en) 2010-05-03 2021-03-02 Bristol-Myers Squibb Company Serum albumin binding molecules
US10221438B2 (en) 2010-05-03 2019-03-05 Bristol-Myers Squibb Company Serum albumin binding molecules
US10273286B2 (en) 2010-05-26 2019-04-30 Bristol-Myers Squibb Company Fibronectin based scaffold proteins having improved stability
US11161893B2 (en) 2010-05-26 2021-11-02 Bristol-Myers Squibb Company Fibronectin based scaffold proteins having improved stability
US9562089B2 (en) 2010-05-26 2017-02-07 Bristol-Myers Squibb Company Fibronectin based scaffold proteins having improved stability
US10214579B2 (en) 2011-04-13 2019-02-26 Bristol-Myers Squibb Company Fc fusion proteins comprising novel linkers or arrangements
US9469676B2 (en) 2011-04-13 2016-10-18 Bristol-Myers Squibb Company Fc fusion proteins comprising novel linkers or arrangements
US10898538B2 (en) 2011-05-17 2021-01-26 Bristol-Myers Squibb Company Methods for maintaining pegylation of polypeptides
US11555062B2 (en) 2011-10-11 2023-01-17 Viela Bio, Inc. Methods of administering a composition comprising a Tn3 scaffold and a CD40L-specific monomer
US9522951B2 (en) 2011-10-31 2016-12-20 Bristol-Myers Squibb Company Fibronectin binding domains with reduced immunogenicity
US10464995B2 (en) 2011-10-31 2019-11-05 Bristol-Myers Squibb Company Fibronectin binding domains with reduced immunogenicity
US11408093B2 (en) 2011-10-31 2022-08-09 Bristol-Myers Squibb Company Fibronectin binding domains with reduced immunogenicity
US10604556B2 (en) 2011-10-31 2020-03-31 Bristol-Myers Squibb Company Fibronectin binding domains with reduced immunogenicity
US11279751B2 (en) 2011-10-31 2022-03-22 Bristol-Myers Squibb Company Fibronectin binding domains with reduced immunogenicity
US9416170B2 (en) 2011-10-31 2016-08-16 Bristol-Myers Squibb Company Fibronectin binding domains with reduced immunogenicity
US9765132B2 (en) 2011-10-31 2017-09-19 Bristol-Myers Squibb Company Fibronectin binding domains with reduced immunogenicity
US10245302B2 (en) 2012-09-13 2019-04-02 Bristol-Myers Squibb Company Fibronectin based scaffold domain proteins that bind to myostatin
US9662373B2 (en) 2012-09-13 2017-05-30 Bristol-Myers Squibb Company Fibronectin based scaffold domain proteins that bind to myostatin
US11813315B2 (en) 2012-09-13 2023-11-14 Bristol-Myers Squibb Company Fibronectin based scaffold domain proteins that bind to myostatin
US10406212B2 (en) 2012-09-13 2019-09-10 Bristol-Myers Squibb Company Fibronectin based scaffold domain proteins that bind to myostatin
US9493546B2 (en) 2012-09-13 2016-11-15 Bristol-Myers Squibb Company Fibronectin based scaffold domain proteins that bind to myostatin
US11447538B2 (en) 2013-02-01 2022-09-20 Bristol-Myers Squibb Company Fibronectin based scaffold proteins
US11512124B2 (en) 2013-02-06 2022-11-29 Bristol-Myers Squibb Company Fibronectin type III domain proteins with enhanced solubility
US10787498B2 (en) 2013-02-06 2020-09-29 Bristol-Myers Squibb Company Fibronectin type III domain proteins with enhanced solubility
US10065987B2 (en) 2013-02-12 2018-09-04 Bristol-Myers Squibb Company High pH protein refolding methods
US11345722B2 (en) 2013-02-12 2022-05-31 Bristol-Myers Squibb Company High pH protein refolding methods
WO2015066480A1 (en) * 2013-11-01 2015-05-07 Regents Of The University Of Minnesota Protein scaffolds and methods of use
US10287572B2 (en) 2013-11-01 2019-05-14 Regents Of The University Of Minnesota Protein scaffolds and methods of use
US11203630B2 (en) 2014-03-20 2021-12-21 Bristol-Myers Squibb Company Serum albumin-binding fibronectin type III domains
US10442851B2 (en) 2014-03-20 2019-10-15 Bristol-Myers Squibb Company Serum albumin-binding fibronectin type III domains
US11434275B2 (en) 2015-09-23 2022-09-06 Bristol-Myers Squibb Company Fast-off rate serum albumin binding fibronectin type III domains
US10766946B2 (en) 2015-09-23 2020-09-08 Bristol-Myers Squibb Company Fast-off rate serum albumin binding fibronectin type III domains

Also Published As

Publication number Publication date
US20060246059A1 (en) 2006-11-02
US20050038229A1 (en) 2005-02-17
US9605039B2 (en) 2017-03-28
US20080063651A1 (en) 2008-03-13
US20170275342A1 (en) 2017-09-28
US7115396B2 (en) 2006-10-03
US20070082365A1 (en) 2007-04-12
US20080139791A1 (en) 2008-06-12
US20080015339A1 (en) 2008-01-17
US20140094595A1 (en) 2014-04-03
US20170334958A1 (en) 2017-11-23
US20060270604A1 (en) 2006-11-30

Similar Documents

Publication Publication Date Title
US20170334958A1 (en) Protein scaffolds for antibody mimics and other binding proteins
AU2002213251B2 (en) Protein scaffolds for antibody mimics and other binding proteins
US6818418B1 (en) Protein scaffolds for antibody mimics and other binding proteins
AU2002213251A1 (en) Protein scaffolds for antibody mimics and other binding proteins
AU2001241850A1 (en) Protein scaffolds for antibody mimics and other binding proteins
AU2004218675B2 (en) Protein scaffolds for antibody mimics and other binding proteins

Legal Events

Date Code Title Description
AS Assignment

Owner name: ADNEXUS THERAPEUTICS, INC., MASSACHUSETTS

Free format text: CHANGE OF NAME;ASSIGNOR:COMPOUND THERAPEUTICS INC.;REEL/FRAME:020486/0693

Effective date: 20060620

Owner name: COMPOUND THERAPEUTICS, INC., MASSACHUSETTS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:PHYLOS, INC.;REEL/FRAME:020486/0688

Effective date: 20031215

Owner name: PHYLOS, INC, MASSACHUSETTS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:LIPOVSEK, DASA;WAGNER, RICHARD W.;KUIMELIS, ROBERT G.;REEL/FRAME:020486/0459

Effective date: 20010105

AS Assignment

Owner name: ADNEXUS, A BMS R&D COMPANY, MASSACHUSETTS

Free format text: MERGER;ASSIGNOR:ADNEXUS THERAPEUTICS, INC.;REEL/FRAME:021428/0782

Effective date: 20070921

Owner name: ADNEXUS, A BRISTOL-MYERS SQUIBB R&D COMPANY, MASSA

Free format text: CHANGE OF NAME;ASSIGNOR:ADNEXUS, A BMS R&D COMPANY;REEL/FRAME:021371/0781

Effective date: 20071111

AS Assignment

Owner name: BRISTOL-MYERS SQUIBB COMPANY, NEW JERSEY

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:ADNEXUS, A BRISTOL-MYERS SQUIBB R&D COMPANY;REEL/FRAME:023056/0255

Effective date: 20080513

STCB Information on status: application discontinuation

Free format text: ABANDONED -- AFTER EXAMINER'S ANSWER OR BOARD OF APPEALS DECISION