WO1985000369A1 - Hybrid dna synthesis of epidermal growth factor - Google Patents

Hybrid dna synthesis of epidermal growth factor Download PDF

Info

Publication number
WO1985000369A1
WO1985000369A1 PCT/US1984/001050 US8401050W WO8500369A1 WO 1985000369 A1 WO1985000369 A1 WO 1985000369A1 US 8401050 W US8401050 W US 8401050W WO 8500369 A1 WO8500369 A1 WO 8500369A1
Authority
WO
WIPO (PCT)
Prior art keywords
egf
dna
encoding
dna sequence
sequence
Prior art date
Application number
PCT/US1984/001050
Other languages
French (fr)
Inventor
Graeme I. Bell
Original Assignee
Chiron Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chiron Corporation filed Critical Chiron Corporation
Priority to JP84502699A priority Critical patent/JPS60501737A/en
Priority to DE1984902750 priority patent/DE148922T1/en
Publication of WO1985000369A1 publication Critical patent/WO1985000369A1/en

Links

Classifications

    • CCHEMISTRY; METALLURGY
    • C07ORGANIC CHEMISTRY
    • C07KPEPTIDES
    • C07K14/00Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
    • C07K14/435Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
    • C07K14/475Growth factors; Growth regulators
    • C07K14/485Epidermal growth factor [EGF] (urogastrone)

Definitions

  • Epidermal growth factor is a polypeptide of 53 amino acids that has been characterized in both mice and humans. It is a potent mitogen for a variety of cells, such as fibroblasts, glia, epithelial, endothelial and epidermal cells, both cultured and in vivo. EGF is also a potent inhibitor of gastric acid secretion. EGF was first isolated from male mouse submaxillary glands, where it exists in inexplicably high levels.
  • EGF In glandular homogenates, EGF is found as a 74,000 dalton complex of two molecules of EGF (Mr 6045) and two molecules of a binding protein (Mr 29,300), a kallikrein-like arginyl enteropeptidase.
  • the amino acid sequence of mouse submaxillary EGF has been determined and the synthesis of EGF and a larger 9000 dalton precursor with a carboxy terminal extension has been demonstrated in cultured submaxillary glands.
  • Human EGF which appears to be similar if not identical to urogastrone, is also found in urine in larger forms of 28,000 and 30,000 daltons that do not dissociate on sodium dodecyl sulfate-polyacrylamide gel electrophoresis.
  • DNA and RNA are provided encoding for mammalian EGF, polypeptide precursors thereof, and numerous other polypeptides also encoded for by the DNA sequence which includes the segment encoding for EGF.
  • the DNA sequences may be used for production of mammalian EGF, precursors of mammalian EGF, and associated related polypeptides.
  • messenger RNA encoding for the mouse EGF precursor is detected, isolated and used for the production of cDNA.
  • the cDNA is sequenced and a fragment employed for hybridization with human DNA under conditions where mismatched heterologous hybrids can be detected. In this manner, a DNA sequence encoding for a large precursor peptide encompassing human EGF or urogastrone and numerous, related polypeptides is detected and isolated.
  • DESCRIPTION OF SPECIFIC EMBODIMENTS DESCRIPTION OF SPECIFIC EMBODIMENTS
  • DNA and RNA sequences are provided which include a segment encoding for EGF, particularly mouse and human, as well as the expression products of these sequences, including propolypeptides and peptides, particularly peptides having one or more physiological functions of EGF, or peptides having one or more other hormonal or growth factor regulatory functions.
  • the DNA sequences of interest are single or double stranded ranging from about 60 bases or base pairs (bp) to about 5 thousand base pairs (kbp), where sequences encoding for a physiologically active polypeptide will generally range from about 60bp to about 1000bp, which may include introns.
  • the DNA sequences will have open reading frames (involving one or more exons) encoding for polypeptides ranging from about 20 amino acids to polypeptides about 1000 amino acids, where the sequence will include at least 2, usually at least 5, more usually at least 10bp, outside the segment encoding EGF.
  • Polypeptides of particular interest will generally be of from about 20 to 250 amino acids, more usually from about 20 to 175 amino acids, particularly 30 to 60 amino acids.
  • DNA segments encoding for polypeptides of interest or mature polypeptides may be located in any region in the DNA sequences described in this invention.
  • of particular interest are sequences bordered by basic amino acids, i.e. arginine and lysine, more particularly when joined to a second basic amino acid, or alanine, leucine, aspartic or glutamic acid or amide thereof.
  • these sequences of interest are seven previously unknown EGF-like polypeptides with amino acid sequences homologous to EGF.
  • the DNA sequences obtained in accordance with this invention were obtained by the following experimental design.
  • a mammalian cDNA or genomic DNA library is screened with a plurality of radiolabeled hybridization probes for detection of a sequence encoding for an amino acid sequence present in EGF.
  • a plurality of probes are employed providing for the various possible redundant codons. encoding for the oligopeptide.
  • a cDNA library from mouse submaxillary gland cells was probed. Plasmids binding strongly to the probes are isolated and the several overlapping cDNA inserts sequenced.
  • the mouse EGF encoding cDNA has about 4800 bases.
  • Mouse EGF is encoded for by nucleotides 3281-3440 ⁇ 5, with an open reading frame encoding for 1217+5 amino acid residues and a protein of approximately 130-140 kilodaltons (kdal), particularly about 133kdal.
  • the mouse cDNA may then be used to probe a human cDNA or genomic DNA bank.
  • a restriction fragment may be employed of about 500 to 1500bp.
  • a BstEII-PvuII fragment of about 1213+5bp, may be employed.
  • the hybridization is carried out under conditions which facilitate the detection of mismatched, heterologous hybrids.
  • the BstEII-PvuII fragment encodes mouse EGF (53 amino acids) and 286 amino acids before and 66 amino acids after the EGF moiety. Clones which hybridized to the probe were isolated and the human DNA inserts characterized.
  • DNA sequence can be used in a variety of ways: For production of synthetic DNA sequences, either in whole or in part, for replication, or for the production of messenger RNA or expression of the precursor protein incorporating EGF, fragments of such protein or of EGF, or analogs of EGF, differing by one or more amino acids, usually by not more than about five amino acids from the naturally occurring EGF amino acid sequence.
  • polypeptide sequences are of particular interest in encoding polypeptides which can be obtained from the cDNA sequence. These DNA sequences are set forth in the argument map set forth in the experimental section, along with the human EGF sequence.
  • the polypeptide sequences of interest include, but are not limited to, seven previously undescribed, EGF-like polypeptides identified on the basis of the homology of their amino acid sequences to EGF, especially the positional relationship (s) of the several cysteine residues. (See diagram in experimental section.) These sequences are frequently bounded by one or more basic amino acids.
  • DNA sequence encoding for a protein or peptide of interest e.g. EGF or its homologues
  • a wide variety of vectors are available for unicellular microorganisms, particularly for bacteria and fungi, where the DNA sequence encoding the poly (amino acid) of interest may be replicated and/or expressed.
  • Various hosts of interest include E. coli, S. cerevisiae, B. subtilis, mouse 3T3 cells, or the like.
  • Conventional vectors include replication systems derived from R6-5, ColEI, the 2 ⁇ m plasmid from yeast, RK plasmids, or the like.
  • Alternative replication systems may be derived from viruses or phage, such as lambda, SV40, etc.
  • shuttle vectors frequently employ a replication system for E . coli and a replication system for a higher organism, e.g. yeast, so that amplification of the gene or cloning may be achieved in the bacterium, while expression may be achieved in the higher organism with appropriate processing, e.g. glycosylation.
  • At least one marker which allows for selection or selective pressure to maintain the DNA construct containing the subject DNA sequence in the host.
  • Convenient markers include biocidal resistance, e.g. antibiotics, heavy metals and toxins; complementation in an auxotrophic host; immunity; etc.
  • the DNA sequence including the fragment encoding for a polypeptide having epidermal growth factor physiological properties or fragments of such sequence may be replicated in a cloning vector, which is capable of replication in a unicellular microorganism, such as bacteria and yeast.
  • the DNA may also be used in an expression vector for expression of a polypeptide of interest, which may be mammalian EGF, particularly mouse or human, other physiologically active polypeptides present in the sequence, e.g. other hormones or growth factors, fragments thereof or analogs thereof differing by from about one to five amino acids.
  • a polypeptide of interest which may be mammalian EGF, particularly mouse or human, other physiologically active polypeptides present in the sequence, e.g. other hormones or growth factors, fragments thereof or analogs thereof differing by from about one to five amino acids.
  • the open reading frame of the DNA sequence allows for the production of a large polypeptide.
  • the large polypeptide may be treated in a variety of ways.
  • the large polypeptide may be partially digested with a variety of proteases either individually or in combination.
  • Illustrative endopeptidases include trypsin, pepsin, membrane dipeptidases, esteropeptidases or the like.
  • the resulting fragments may then be separated by charge and/or molecular weight by any conventional means, e.g. filtration, sedimentation, chromatography, electrophoresis, or the like and then tested for physiological activity.
  • any conventional means e.g. filtration, sedimentation, chromatography, electrophoresis, or the like and then tested for physiological activity.
  • growth factors acting as mitogens or differentiation regulators.
  • the various fractions may be further purified by bioassays to obtain pure active factors.
  • DNA sequences of this invention can be used in a variety of ways. Fragments can be used as probes for detecting complementary sequences in genomic DNA or in messenger RNA for detecting mutations and/or deletions in genomic DNA of hosts.
  • the sequences can be used for expressing the polypeptides encoded for by the se ⁇ uence.
  • the following examples are offered by way of illustration and not by way of limitation:
  • coli HB101 were stored at -70°C in microtiter dishes (Gergen et al., Nucl. Acids Res. (1979) 7 : 2115 -2136 ; Ish-Horowicz and Burke, ibid. (1981) 9:2989-2998).
  • Oligonucleotides were then synthesized by solid-phase phosphoramidite methodology as described in copending application, Serial No. 457,412, followed by isolation from 20% acrylamide gels modification of the method described in Beancage and Camthers, Tetrahedron Lett. (1981) 22: 1859-1862. Dodecamers were prepared which were complementary to the strand coding for amino acids 17 to 23 (lacking the last 5'-nucleotide) of mouse EGF cDNA.
  • the fractions had the following sequences, where after the addition of the eleventh nucleotide, two pools were prepared, one terminating in A and the other terminating in G and after addition of the seventeenth nucleotide, the two pools were further divided with addition of the eighteenth nucleotide, with two of the pools now terminating in G and the other two pools terminating in A. In this manner, a total of four pools were obtained, where each pool had a plurality of eicosamers of differing compositions at positions 3, 6 and 9. The different sequences are required because of the uncertainty as to the specific codon usage due to the redundancy of the genetic code for amino acids.
  • the labeled oligonucleotides were separated from unincorporated ( ⁇ - 32 P) triphosphates by chromatography on a C-18 Sep-Pak TM column (Waters Associates, Inc.) as follows: The crude labeling mixture was applied (disposable syringe) to the Sep-Pak cartridge which was then washed wtih 20ml of water to elute the unincorporated adenosine 5'-( ⁇ - 32 P) triphosphate.
  • the radiolabeled oligonucleotide was then eluted with 1 : 1 (v/v) metnanol: 0.1M triethylammonium acetate (pH 7.3) and the eluate evaported to dryness.
  • the specific activity of the probe was of the order of 10 8 -10 9 cpm/ ⁇ g.
  • Transformants were grown on Whatman 541 filter paper, the plasmids amplified in situ with chloramphenicol and the DNA immobilized on the filters (Gergen et al., supra). 32 P end-labeled probes were used to search the library. Additional screening was with nick translated cloned cDNA, with the same filters being used repeatedly.
  • RNA from male and female mouse submaxillary glands was separated on 2% agarose gels, transferred to nitrocellulose and hybridized with a nick translated 32 P-labeled Pstl-PstI fragment of the EGF cDNA insert. After washing, the RNA was autoradiographed at -70°C using an intensifying screen. Glyoxylated HindiII ⁇ and ⁇ X174 RF Haelll digested DNA fragments were used as size markers.
  • mice EGF mRNA The size of mouse EGF mRNA was determined by
  • the human EGF gene was sequenced and the mouse and human sequences compared.
  • the amino acid sequences of EGF from the two species are described by Carpenter, In: Tissue Growth Factors, Handbook of Experimental Pharmacology, R. Baseraga (ed.), Vol. 57, Springer-Verlag, Berlin, 1981, p. 94.
  • polynucleotide sequences which encode for a large polypeptide which includes the amino acid sequence of EGF.
  • the large polypeptide can be used as a source of polypeptides having physiological activity.
  • seven additional EGF-like polypeptides are identified.
  • the DNA sequences can be used for production of the large polypeptide or fragments thereof by employing recombinant DNA technology and inserting the polypeptide sequence downstream from an appropriate promoter in a functioning episomal element. The episomal element may then be introduced into an appropriate host for replication and expression of the desired polypeptide.

Abstract

DNA sequences and methods of obtaining DNA sequences which include a sequence encoding for mammalian epidermal growth factor. The DNA sequences may be used in cloning and expression vectors for production of DNA and RNA for producing polypeptides including mammalian epidermal growth factor.

Description

HYBRID DNA SYNTHESIS OF EPIDERMAL GROWTH FACTOR BACKGROUND OF THE INVENTION Field of the Invention Epidermal growth factor (EGF) is a polypeptide of 53 amino acids that has been characterized in both mice and humans. It is a potent mitogen for a variety of cells, such as fibroblasts, glia, epithelial, endothelial and epidermal cells, both cultured and in vivo. EGF is also a potent inhibitor of gastric acid secretion. EGF was first isolated from male mouse submaxillary glands, where it exists in inexplicably high levels.
In glandular homogenates, EGF is found as a 74,000 dalton complex of two molecules of EGF (Mr 6045) and two molecules of a binding protein (Mr 29,300), a kallikrein-like arginyl enteropeptidase. The amino acid sequence of mouse submaxillary EGF has been determined and the synthesis of EGF and a larger 9000 dalton precursor with a carboxy terminal extension has been demonstrated in cultured submaxillary glands.
Human EGF, which appears to be similar if not identical to urogastrone, is also found in urine in larger forms of 28,000 and 30,000 daltons that do not dissociate on sodium dodecyl sulfate-polyacrylamide gel electrophoresis.
Isolating both the DNA or RNA encoding for EGF and particularly a putative EGF precursor protein is extremely difficult for a number of reasons. Even where the peptide is abundant, the amount of messenger RNA is extremely small. Immunoprecipitation of in vitro translation products, even under strongly denaturing conditions fails to detect a precursor protein, possibly due to the huge size of the precursor and/or the masking of its antigenic determinants on the native peptide to which antibodies were made. Because of the physiological importance of EGF there is substantial interest in being able to obtain DNA sequences encoding for EGF and the EGF polypeptide precursor. In addition, since it is known that a number of hormones are generated by proteolytic processing from larger precursors the cDNA and derived amino acid sequence of the EGF precursor could reveal "cryptic", previously unknown polypeptide hormones and/or growth factors. Description of the Prior Art A human genomic cDNA library in bacteriophage λ is described in Lawn et al., Cell (1978) 15:1157-1174. Savage et al., J. Biol. Chem. (1972) 247: 7612-7621 report the amino acid sequence of mouse EGF. Sporn et al., Science (1983) 219:1329-1331 and Assoian et al., J. Biol. Chem. (1983) 258:7155-7160 describe transforming growth factors (TGF). See particularly Gray et al., Nature (1983) 303:722-725.
SUMMARY OF THE INVENTION DNA and RNA are provided encoding for mammalian EGF, polypeptide precursors thereof, and numerous other polypeptides also encoded for by the DNA sequence which includes the segment encoding for EGF. The DNA sequences may be used for production of mammalian EGF, precursors of mammalian EGF, and associated related polypeptides. Employing radiolabeled hybridization probes, messenger RNA encoding for the mouse EGF precursor is detected, isolated and used for the production of cDNA. The cDNA is sequenced and a fragment employed for hybridization with human DNA under conditions where mismatched heterologous hybrids can be detected. In this manner, a DNA sequence encoding for a large precursor peptide encompassing human EGF or urogastrone and numerous, related polypeptides is detected and isolated. DESCRIPTION OF SPECIFIC EMBODIMENTS
In accordance with the subject invention, DNA and RNA sequences are provided which include a segment encoding for EGF, particularly mouse and human, as well as the expression products of these sequences, including propolypeptides and peptides, particularly peptides having one or more physiological functions of EGF, or peptides having one or more other hormonal or growth factor regulatory functions.
The DNA sequences of interest are single or double stranded ranging from about 60 bases or base pairs (bp) to about 5 thousand base pairs (kbp), where sequences encoding for a physiologically active polypeptide will generally range from about 60bp to about 1000bp, which may include introns. Generally, the DNA sequences will have open reading frames (involving one or more exons) encoding for polypeptides ranging from about 20 amino acids to polypeptides about 1000 amino acids, where the sequence will include at least 2, usually at least 5, more usually at least 10bp, outside the segment encoding EGF. Polypeptides of particular interest will generally be of from about 20 to 250 amino acids, more usually from about 20 to 175 amino acids, particularly 30 to 60 amino acids.
The DNA segments encoding for polypeptides of interest or mature polypeptides may be located in any region in the DNA sequences described in this invention. of particular interest are sequences bordered by basic amino acids, i.e. arginine and lysine, more particularly when joined to a second basic amino acid, or alanine, leucine, aspartic or glutamic acid or amide thereof. Amongst these sequences of interest are seven previously unknown EGF-like polypeptides with amino acid sequences homologous to EGF. The DNA sequences obtained in accordance with this invention were obtained by the following experimental design.
A mammalian cDNA or genomic DNA library is screened with a plurality of radiolabeled hybridization probes for detection of a sequence encoding for an amino acid sequence present in EGF. A plurality of probes are employed providing for the various possible redundant codons. encoding for the oligopeptide. In the subject method a cDNA library from mouse submaxillary gland cells was probed. Plasmids binding strongly to the probes are isolated and the several overlapping cDNA inserts sequenced. The mouse EGF encoding cDNA has about 4800 bases. Mouse EGF is encoded for by nucleotides 3281-3440±5, with an open reading frame encoding for 1217+5 amino acid residues and a protein of approximately 130-140 kilodaltons (kdal), particularly about 133kdal.
The mouse cDNA may then be used to probe a human cDNA or genomic DNA bank. Conveniently a restriction fragment may be employed of about 500 to 1500bp. Particularly, a BstEII-PvuII fragment of about 1213+5bp, may be employed. The hybridization is carried out under conditions which facilitate the detection of mismatched, heterologous hybrids. The BstEII-PvuII fragment encodes mouse EGF (53 amino acids) and 286 amino acids before and 66 amino acids after the EGF moiety. Clones which hybridized to the probe were isolated and the human DNA inserts characterized.
Once the DNA sequence is isolated, it can be used in a variety of ways: For production of synthetic DNA sequences, either in whole or in part, for replication, or for the production of messenger RNA or expression of the precursor protein incorporating EGF, fragments of such protein or of EGF, or analogs of EGF, differing by one or more amino acids, usually by not more than about five amino acids from the naturally occurring EGF amino acid sequence.
Various DNA sequences are of particular interest in encoding polypeptides which can be obtained from the cDNA sequence. These DNA sequences are set forth in the argument map set forth in the experimental section, along with the human EGF sequence. The polypeptide sequences of interest include, but are not limited to, seven previously undescribed, EGF-like polypeptides identified on the basis of the homology of their amino acid sequences to EGF, especially the positional relationship (s) of the several cysteine residues. (See diagram in experimental section.) These sequences are frequently bounded by one or more basic amino acids.
Once the desired DNA sequence encoding for a protein or peptide of interest, e.g. EGF or its homologues, has been isolated, it can be joined with other DNA sequences for replication and expression. A wide variety of vectors are available for unicellular microorganisms, particularly for bacteria and fungi, where the DNA sequence encoding the poly (amino acid) of interest may be replicated and/or expressed.
Various hosts of interest include E. coli, S. cerevisiae, B. subtilis, mouse 3T3 cells, or the like. Conventional vectors include replication systems derived from R6-5, ColEI, the 2μm plasmid from yeast, RK plasmids, or the like. Alternative replication systems may be derived from viruses or phage, such as lambda, SV40, etc. In some instances, it will be desirable to have two different replication systems, where different functions may be achieved in different hosts. These vectors, referred to as shuttle vectors, frequently employ a replication system for E . coli and a replication system for a higher organism, e.g. yeast, so that amplification of the gene or cloning may be achieved in the bacterium, while expression may be achieved in the higher organism with appropriate processing, e.g. glycosylation.
Conveniently included with the replication system is at least one marker, which allows for selection or selective pressure to maintain the DNA construct containing the subject DNA sequence in the host. Convenient markers include biocidal resistance, e.g. antibiotics, heavy metals and toxins; complementation in an auxotrophic host; immunity; etc. The DNA sequence including the fragment encoding for a polypeptide having epidermal growth factor physiological properties or fragments of such sequence may be replicated in a cloning vector, which is capable of replication in a unicellular microorganism, such as bacteria and yeast. The DNA may also be used in an expression vector for expression of a polypeptide of interest, which may be mammalian EGF, particularly mouse or human, other physiologically active polypeptides present in the sequence, e.g. other hormones or growth factors, fragments thereof or analogs thereof differing by from about one to five amino acids. The open reading frame of the DNA sequence allows for the production of a large polypeptide. The large polypeptide may be treated in a variety of ways. The large polypeptide may be partially digested with a variety of proteases either individually or in combination. Illustrative endopeptidases include trypsin, pepsin, membrane dipeptidases, esteropeptidases or the like. The resulting fragments may then be separated by charge and/or molecular weight by any conventional means, e.g. filtration, sedimentation, chromatography, electrophoresis, or the like and then tested for physiological activity. Of particular interest are growth factors acting as mitogens or differentiation regulators. Based on the activities observed, the various fractions may be further purified by bioassays to obtain pure active factors.
The DNA sequences of this invention can be used in a variety of ways. Fragments can be used as probes for detecting complementary sequences in genomic DNA or in messenger RNA for detecting mutations and/or deletions in genomic DNA of hosts. The sequences can be used for expressing the polypeptides encoded for by the seσuence. The following examples are offered by way of illustration and not by way of limitation:
EXPERIMENTAL Methods cDNA Synthesis and Construction of Recombinant Plasmids To construct the cDNA library, polyAcontaining RNA was isolated from the submaxillary glands of 60-day-old male Swiss-Webster mice. ds cDNAs were prepared and inserted into the PstI site of a pBR322 derivative using the dGdC tailing technique (Chirgwin et al., Biochemistry (1979) 18:5294-5299; Goodman and MacDonald, Methods in Enzymol. (1980) 68:75-90). Resultant tetracyline-resistant trans formants of E. coli HB101 were stored at -70°C in microtiter dishes (Gergen et al., Nucl. Acids Res. (1979) 7 : 2115 -2136 ; Ish-Horowicz and Burke, ibid. (1981) 9:2989-2998).
Oligonucleotides were then synthesized by solid-phase phosphoramidite methodology as described in copending application, Serial No. 457,412, followed by isolation from 20% acrylamide gels modification of the method described in Beancage and Camthers, Tetrahedron Lett. (1981) 22: 1859-1862. Dodecamers were prepared which were complementary to the strand coding for amino acids 17 to 23 (lacking the last 5'-nucleotide) of mouse EGF cDNA. The fractions had the following sequences, where after the addition of the eleventh nucleotide, two pools were prepared, one terminating in A and the other terminating in G and after addition of the seventeenth nucleotide, the two pools were further divided with addition of the eighteenth nucleotide, with two of the pools now terminating in G and the other two pools terminating in A. In this manner, a total of four pools were obtained, where each pool had a plurality of eicosamers of differing compositions at positions 3, 6 and 9.
Figure imgf000010_0001
The different sequences are required because of the uncertainty as to the specific codon usage due to the redundancy of the genetic code for amino acids.
Synthetic oligonucleotides were labeled with adenosine 5'-(γ-32P) triphosphate (ICN, crude preparation, 7000Ci/mmol, lCi=3.7x1010Bq) by a polynucleotide kinase reaction (Wallace et al., Nucl. Acids Res. (1981)
9_: 879-894). The labeled oligonucleotides were separated from unincorporated (γ-32P) triphosphates by chromatography on a C-18 Sep-PakTM column (Waters Associates, Inc.) as follows: The crude labeling mixture was applied (disposable syringe) to the Sep-Pak cartridge which was then washed wtih 20ml of water to elute the unincorporated adenosine 5'-(γ- 32P) triphosphate. The radiolabeled oligonucleotide was then eluted with 1 : 1 (v/v) metnanol: 0.1M triethylammonium acetate (pH 7.3) and the eluate evaported to dryness. The specific activity of the probe was of the order of 108-109 cpm/μg.
Transformants were grown on Whatman 541 filter paper, the plasmids amplified in situ with chloramphenicol and the DNA immobilized on the filters (Gergen et al., supra). 32P end-labeled probes were used to search the library. Additional screening was with nick translated cloned cDNA, with the same filters being used repeatedly.
The sequence of cDNA inserts were determined by the Maxam and Gilbert method. itiRNA Size Estimation
Glyoxylated total RNA from male and female mouse submaxillary glands was separated on 2% agarose gels, transferred to nitrocellulose and hybridized with a nick translated 32P-labeled Pstl-PstI fragment of the EGF cDNA insert. After washing, the RNA was autoradiographed at -70°C using an intensifying screen. Glyoxylated HindiII λ and ΦX174 RF Haelll digested DNA fragments were used as size markers.
Results Screening of the cDNA Library
5000 transformants were initially screened, where 11 colonies yielded strong signals with pool 4 probes, the pools having the nucleotides at positions 12 and 18 of G and A, respectively. Weaker, but definitely positive signals were obtained with pool 3 which had the nucleotides G and G, respectively. Pools 1 and 2 gave no positive signals. The largest clone was 1800bp. Terminal restriction and other fragments of this clone were used to screen the original 5000 plus 7500 additional colonies (12,500 total) and yielded additional overlapping cDNA colonies which did not contain the EGF sequence. Since it was subsequently determined by DNA sequence analysis (vide infra) that these overlapping clones lacked the 5'-terminal region of the mRNA, another cDNA library was synthesized using an oligonucleotide primer complementary to nucleotides 1032-1051 (see argument map, infra) as follows:
3'-CCGCTTCCTTCGGTGCGAAT-5' and this library was then screened as described above. The relative abundance of the cDNA clones in the initial library suggests that EGF mRNA comprises about 0.2% of the polyA mRNA from this tissue. mRNA Sequence
The size of mouse EGF mRNA was determined by
Northern analysis of mRNA from adult male and female glands to be about the same size as 28s ribosomal RNA, approximately 4800 bases. The mRNA in the male gland was at least ten-fold greater in abundance than in the female gland. The nucleotide sequence of overlapping cDNA clones provided 4750bp of sequence as follows:
Figure imgf000013_0001
Figure imgf000014_0001
Figure imgf000015_0001
Figure imgf000016_0001
This includes the exact 53 amino acid residue sequence of mouse EGF (nucleotides 3281-3440), a translational start codon AUG (nucleotides 354-356) and a stop codon TAG (nucleotides 4005-4007). An open reading frame throughout the sequence which encodes for 1217 amino acid residues and a protein of approximately 133kdal. Also, seven additional EGF-like polypeptides are identified on the basis of the homology of their amino acid sequences to EGF, especially the positional relationship of their cysteine residues, as shown below:
Figure imgf000017_0001
Human EGF Gene
A 32P-labeled (O'Farrell, Focus (1981) 3:1-3) 1,213bp BstEII-PvuII fragment of mouse submaxillary EGF cDNA clone, pmEGF10, was hybridized to a human genomic DNA library (Lawn et al., Cell (1978) 15:1157-1174) in bactεriophage λ (available from Dr. T. Maniatis, Harvard University) using conditions which facilitate the detection of mismatched-heterologous hybrids. The BstEII-PvuII fragment of pmEGFlO encoded mouse EGF (53 amino acids) and 286 amino acids amino terminal to and 66 amino acids carboxy terminal to the EGF moiety. The hybridization conditions were 50% formamide, 5X SSC, 10% dextran sulfate, 20mM sodium phosphate, pH 6.5, 100μg/ml sonicated, denatured salmon testes DNA, and
0.1% sodium dodecyl sulfate at 30°C (Wahl et al., Proc. Natl. Acad. Sci. USA (1979) 76: 3683-3687). The filters were washed for one hour at 50°C in IM NaCl (Perler et al., Cell (1980) 20:555-566) before autoradiography. Four of the approximately 10 phage screened hybridized to the probe. Characterization of the human DNA inserts in these phage indicated that they represented overlapping DNA segments from the same region of the human genome. The partial sequence of the human DNA in λhEGF35, corresponding to the exons encoding EGF or uragastrone, indicated that these phage contained portions of the human EGF gene.
The human EGF gene was sequenced and the mouse and human sequences compared. The amino acid sequences of EGF from the two species are described by Carpenter, In: Tissue Growth Factors, Handbook of Experimental Pharmacology, R. Baseraga (ed.), Vol. 57, Springer-Verlag, Berlin, 1981, p. 94.
In accordance with the subject invention, polynucleotide sequences are provided which encode for a large polypeptide which includes the amino acid sequence of EGF. The large polypeptide can be used as a source of polypeptides having physiological activity. In particular, seven additional EGF-like polypeptides are identified. The DNA sequences can be used for production of the large polypeptide or fragments thereof by employing recombinant DNA technology and inserting the polypeptide sequence downstream from an appropriate promoter in a functioning episomal element. The episomal element may then be introduced into an appropriate host for replication and expression of the desired polypeptide. Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be obvious that certain changes and modifications may be practiced within the scope of the appended claims.

Claims

WHAT IS CLAIMED IS:
1. A mammalian DNA sequence having a portion encoding for EGF and including at least a portion of the flanking coding regions in reading phase therewith.
2. A DNA sequence according to Claim 1, wherein said mammal is mouse.
3. A DNA sequence according to Claim 1, wherein said mammal is human.
4. A DNA fragment of at least about 60 nucleotides being a portion of the DNA sequence according to Claim 1 and including a sequence encoding for other than EGF.
5. A functional episomal element comprising a replication system and a DNA sequence or fragment of at least 60 nucleotides thereof according to Claim 1.
6. A DNA sequence having fewer than 5000 base pairs and greater than 159 base pairs and comprising at least 50% of the nucleotides of the gene encoding for a polypeptide including the amino acid sequence of human EGF.
7. A DNA sequence according to Claim 6 , having at least about 1000 nucleotides in an open reading frame.
8. A DNA sequence according to Claim 7, wherein said open reading frame includes a portion of said sequence encoding for EGF.
PCT/US1984/001050 1983-07-05 1984-07-02 Hybrid dna synthesis of epidermal growth factor WO1985000369A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP84502699A JPS60501737A (en) 1983-07-05 1984-07-02 Epidermal growth factor hybrid DNA synthesis
DE1984902750 DE148922T1 (en) 1983-07-05 1984-07-02 HYBRID DNA SYNTHESIS OF THE EPIDERM GROWTH FACTOR.

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US51137283A 1983-07-05 1983-07-05
US511,372 1983-07-05

Publications (1)

Publication Number Publication Date
WO1985000369A1 true WO1985000369A1 (en) 1985-01-31

Family

ID=24034614

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1984/001050 WO1985000369A1 (en) 1983-07-05 1984-07-02 Hybrid dna synthesis of epidermal growth factor

Country Status (3)

Country Link
EP (1) EP0148922A4 (en)
JP (1) JPS60501737A (en)
WO (1) WO1985000369A1 (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4717717A (en) * 1986-11-05 1988-01-05 Ethicon, Inc. Stabilized compositions containing epidermal growth factor
US4743679A (en) * 1986-02-24 1988-05-10 Creative Biomolecules, Inc. Process for producing human epidermal growth factor and analogs thereof
WO1989001489A1 (en) * 1987-08-10 1989-02-23 Commonwealth Scientific And Industrial Research Or Control of angiogenesis and compositions and methods therefor
EP0438200A1 (en) * 1990-01-16 1991-07-24 Centro De Ingenieria Genetica Y Biotecnologia Method for the expression of heterologous genes in the yeast Pichia pastoris, expression vectors and transformed microorganisms
US5087616A (en) * 1986-08-07 1992-02-11 Battelle Memorial Institute Cytotoxic drug conjugates and their delivery to tumor cells
US5102789A (en) * 1989-03-15 1992-04-07 The Salk Institute Biotechnology/Industrial Associates, Inc. Production of epideramal growth factor in pichia pastoris yeast cells
US5208144A (en) * 1988-08-23 1993-05-04 The General Hospital Corporation Method for detection of human dna containing the gene encoding low density lipoprotein receptor
US5366081A (en) * 1987-08-26 1994-11-22 United States Surgical Corporation Packaged synthetic absorbable surgical elements
US5472702A (en) * 1987-08-26 1995-12-05 United States Surgical Corporation Sterilization of growth factors
EP0910569A1 (en) * 1996-04-10 1999-04-28 Human Genome Sciences, Inc. Extracellular/epidermal growth factor-like protein
US5904716A (en) * 1995-04-26 1999-05-18 Gendler; El Method for reconstituting cartilage tissue using demineralized bone and product thereof
WO2001036632A2 (en) * 1999-11-17 2001-05-25 Compugen Ltd. Variants of alternative splicing
US6852506B1 (en) 1996-04-10 2005-02-08 Human Genome Sciences, Inc. Extracellular/epidermal growth factor-like protein
EP2093241A2 (en) 2008-01-29 2009-08-26 Confluent Surgical Inc. Bioabsorbable block copolymer

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0046039A1 (en) * 1980-08-05 1982-02-17 G.D. Searle & Co. Synthetic urogastrone gene, corresponding plasmid recombinants, transformed cells, production thereof and urogastrone expression
US4394443A (en) * 1980-12-18 1983-07-19 Yale University Method for cloning genes

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0046039A1 (en) * 1980-08-05 1982-02-17 G.D. Searle & Co. Synthetic urogastrone gene, corresponding plasmid recombinants, transformed cells, production thereof and urogastrone expression
US4394443A (en) * 1980-12-18 1983-07-19 Yale University Method for cloning genes

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
GRAY et al., Nature, Volume 303, 1983, pages 722-725 *
HOUGHTON et al., Nucleic Acids Research Volume 8, 1980, pages 2885-2894 *
RUTTER et al., CHEMICAL ABSTRACTS, Volume 101, 1984, Abstract No. 18357k, Biochem. Clin. Aspects Neuropept., Synth., Process, Gene Struct., 1983, KOCH et al., (ED.), Academic Press, Orlando, FL, pages 293-308 *
SAVAGE et al., CHEMICAL ABSTRACTS, Volume 78, 1973, Abstract No. 39662c, J. Biol. Chem., Vol 247, 1972, pages 7612-7621 *
SCOTT et al., Science, Volume 221, 1983 pages 236-240 *
See also references of EP0148922A4 *
ULLRICH et al., CHEMICAL ABSTRACTS, Volume 101, 1984, Abstract No. 18356j, Biochem. Clin. Aspects Neuropept., Synth., Process, Gene Struct., 1983, KOCH et al., (ED.), Academic Press, Orlando, FL, pages 277-291 *

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4743679A (en) * 1986-02-24 1988-05-10 Creative Biomolecules, Inc. Process for producing human epidermal growth factor and analogs thereof
AU595290B2 (en) * 1986-02-24 1990-03-29 Creative Biomolecules, Inc. Process for producing human epidermal growth factor and analogs thereof
US5004686A (en) * 1986-02-24 1991-04-02 Creative Biomolecules, Inc. Process for producing human epidermal growth factor and analogs thereof
US5087616A (en) * 1986-08-07 1992-02-11 Battelle Memorial Institute Cytotoxic drug conjugates and their delivery to tumor cells
US4717717A (en) * 1986-11-05 1988-01-05 Ethicon, Inc. Stabilized compositions containing epidermal growth factor
WO1989001489A1 (en) * 1987-08-10 1989-02-23 Commonwealth Scientific And Industrial Research Or Control of angiogenesis and compositions and methods therefor
US5472702A (en) * 1987-08-26 1995-12-05 United States Surgical Corporation Sterilization of growth factors
US5366081A (en) * 1987-08-26 1994-11-22 United States Surgical Corporation Packaged synthetic absorbable surgical elements
US5208144A (en) * 1988-08-23 1993-05-04 The General Hospital Corporation Method for detection of human dna containing the gene encoding low density lipoprotein receptor
US5102789A (en) * 1989-03-15 1992-04-07 The Salk Institute Biotechnology/Industrial Associates, Inc. Production of epideramal growth factor in pichia pastoris yeast cells
EP0438200A1 (en) * 1990-01-16 1991-07-24 Centro De Ingenieria Genetica Y Biotecnologia Method for the expression of heterologous genes in the yeast Pichia pastoris, expression vectors and transformed microorganisms
US5904716A (en) * 1995-04-26 1999-05-18 Gendler; El Method for reconstituting cartilage tissue using demineralized bone and product thereof
EP0910569A1 (en) * 1996-04-10 1999-04-28 Human Genome Sciences, Inc. Extracellular/epidermal growth factor-like protein
EP0910569A4 (en) * 1996-04-10 2002-05-08 Human Genome Sciences Inc Extracellular/epidermal growth factor-like protein
US6852506B1 (en) 1996-04-10 2005-02-08 Human Genome Sciences, Inc. Extracellular/epidermal growth factor-like protein
WO2001036632A2 (en) * 1999-11-17 2001-05-25 Compugen Ltd. Variants of alternative splicing
WO2001036632A3 (en) * 1999-11-17 2002-01-03 Compugen Ltd Variants of alternative splicing
EP2093241A2 (en) 2008-01-29 2009-08-26 Confluent Surgical Inc. Bioabsorbable block copolymer

Also Published As

Publication number Publication date
JPS60501737A (en) 1985-10-17
EP0148922A4 (en) 1987-06-15
EP0148922A1 (en) 1985-07-24

Similar Documents

Publication Publication Date Title
US7084262B2 (en) Analysis and separation of platelet-derived growth factor proteins
CA1339447C (en) Recombinant purified protease nexin
Richter et al. Biosynthesis of thyrotropin releasing hormone in the skin of Xenopus laevis: partial sequence of the precursor deduced from cloned cDNA.
EP0180615B1 (en) Novel atrial natriuretic/vasodilator polypeptides
EP0285448B1 (en) Leukaemia inhibitory factor
JP3276933B2 (en) Method for producing glycoprotein having erythropoietin activity
EP0593065B1 (en) Recombinant fibroblast growth factors
US4588684A (en) a-Factor and its processing signals
WO1985000369A1 (en) Hybrid dna synthesis of epidermal growth factor
EP0237966A2 (en) Human basic fibroblast growth factor
Abraham et al. Human basic fibroblast growth factor: nucleotide sequence, genomic organization, and expression in mammalian cells
EP0037723A2 (en) Expression of hormone genomic clones
US4992367A (en) Enhanced expression of human interleukin-2 in mammalian cells
JP2823690B2 (en) Protease resistant PDGF and methods of use
CA2011833C (en) Novel polypeptide and production thereof
US4783412A (en) Hybrid DNA synthesis of epidermal growth factor
US6342374B1 (en) Human collagenase inhibitor, recombinant vector system for using same and recombinant-DNA method for the manufacture of same
EP0355720B1 (en) Process for the preparation of motilin-like polypeptide and expression thereof
US5656458A (en) Expression and processing of amino terminus acetylated FGF's in yeast
JPH07316197A (en) Preparation of growth hormone release factor
Ivanov et al. Chemical synthesis and expression in E. coli of a human Val8‐calcitonin gene by fusion to a synthetic human interferon‐γ gene
JP2990162B2 (en) Composition of human acidic fibroblast growth factor, its preparation, DNA construct and yeast host cell
Francis et al. Carboxyl-terminal amino acid sequences of two variant forms of the gamma chain of human plasma fibrinogen.
JP2623807B2 (en) Serine protease and serine protease gene
EP0192738A1 (en) Expression of human connective tissue activator and related peptides

Legal Events

Date Code Title Description
AK Designated states

Designated state(s): JP

AL Designated countries for regional patents

Designated state(s): AT BE CH DE FR GB LU NL SE

WWE Wipo information: entry into national phase

Ref document number: 1984902750

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1984902750

Country of ref document: EP

EL Fr: translation of claims filed
WWW Wipo information: withdrawn in national office

Ref document number: 1984902750

Country of ref document: EP