US20020098503A1 - Methods for identifying nucleic acid sequences encoding agents that affect cellular phenotypes - Google Patents
Methods for identifying nucleic acid sequences encoding agents that affect cellular phenotypes Download PDFInfo
- Publication number
- US20020098503A1 US20020098503A1 US09/999,003 US99900301A US2002098503A1 US 20020098503 A1 US20020098503 A1 US 20020098503A1 US 99900301 A US99900301 A US 99900301A US 2002098503 A1 US2002098503 A1 US 2002098503A1
- Authority
- US
- United States
- Prior art keywords
- cells
- perturbagen
- expression
- reporter
- cell
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 107
- 150000007523 nucleic acids Chemical group 0.000 title claims abstract description 57
- 230000001413 cellular effect Effects 0.000 title abstract description 14
- 239000003795 chemical substances by application Substances 0.000 title description 4
- 230000014509 gene expression Effects 0.000 claims abstract description 116
- 210000004027 cell Anatomy 0.000 claims description 258
- 108020004707 nucleic acids Proteins 0.000 claims description 41
- 102000039446 nucleic acids Human genes 0.000 claims description 41
- 230000000694 effects Effects 0.000 claims description 28
- 238000004458 analytical method Methods 0.000 claims description 13
- 108091032973 (ribonucleotides)n+m Proteins 0.000 claims description 8
- 210000004962 mammalian cell Anatomy 0.000 claims description 8
- 230000004913 activation Effects 0.000 claims description 6
- 238000003556 assay Methods 0.000 claims description 5
- 108091062157 Cis-regulatory element Proteins 0.000 claims description 4
- 230000012010 growth Effects 0.000 claims description 3
- 230000004640 cellular pathway Effects 0.000 claims 6
- 108020005544 Antisense RNA Proteins 0.000 claims 1
- 230000000692 anti-sense effect Effects 0.000 claims 1
- 239000003184 complementary RNA Substances 0.000 claims 1
- 230000002779 inactivation Effects 0.000 claims 1
- 108700008625 Reporter Genes Proteins 0.000 abstract description 46
- 230000001747 exhibiting effect Effects 0.000 abstract 1
- 108090000623 proteins and genes Proteins 0.000 description 100
- 239000012634 fragment Substances 0.000 description 46
- 230000002068 genetic effect Effects 0.000 description 44
- 102000004169 proteins and genes Human genes 0.000 description 41
- 235000018102 proteins Nutrition 0.000 description 39
- 108020004414 DNA Proteins 0.000 description 37
- 230000001105 regulatory effect Effects 0.000 description 34
- 239000013604 expression vector Substances 0.000 description 30
- 230000037361 pathway Effects 0.000 description 24
- 239000013598 vector Substances 0.000 description 24
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 20
- 238000011144 upstream manufacturing Methods 0.000 description 19
- 230000008569 process Effects 0.000 description 17
- 201000001441 melanoma Diseases 0.000 description 16
- 238000002474 experimental method Methods 0.000 description 15
- 230000035772 mutation Effects 0.000 description 15
- 238000012546 transfer Methods 0.000 description 14
- 150000003384 small molecules Chemical class 0.000 description 12
- 241000588724 Escherichia coli Species 0.000 description 11
- 108091005948 blue fluorescent proteins Proteins 0.000 description 10
- 102000008607 Integrin beta3 Human genes 0.000 description 9
- 108010020950 Integrin beta3 Proteins 0.000 description 9
- 230000008238 biochemical pathway Effects 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 230000004927 fusion Effects 0.000 description 9
- 230000003993 interaction Effects 0.000 description 9
- 238000002955 isolation Methods 0.000 description 9
- 241000196324 Embryophyta Species 0.000 description 8
- 108091028043 Nucleic acid sequence Proteins 0.000 description 8
- 108020004999 messenger RNA Proteins 0.000 description 8
- 108700026226 TATA Box Proteins 0.000 description 7
- 108060008724 Tyrosinase Proteins 0.000 description 7
- 238000013459 approach Methods 0.000 description 7
- 230000001580 bacterial effect Effects 0.000 description 7
- 210000004748 cultured cell Anatomy 0.000 description 7
- 239000003623 enhancer Substances 0.000 description 7
- 238000012252 genetic analysis Methods 0.000 description 7
- 230000002829 reductive effect Effects 0.000 description 7
- 238000012216 screening Methods 0.000 description 7
- 206010028980 Neoplasm Diseases 0.000 description 6
- 102000003425 Tyrosinase Human genes 0.000 description 6
- 239000002299 complementary DNA Substances 0.000 description 6
- 238000010276 construction Methods 0.000 description 6
- 238000009826 distribution Methods 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 102000004196 processed proteins & peptides Human genes 0.000 description 6
- 108090000765 processed proteins & peptides Proteins 0.000 description 6
- 239000000047 product Substances 0.000 description 6
- 210000001082 somatic cell Anatomy 0.000 description 6
- 230000003612 virological effect Effects 0.000 description 6
- 241000894006 Bacteria Species 0.000 description 5
- 102100028501 Galanin peptides Human genes 0.000 description 5
- 108091023040 Transcription factor Proteins 0.000 description 5
- 102000040945 Transcription factor Human genes 0.000 description 5
- 230000006399 behavior Effects 0.000 description 5
- 210000000349 chromosome Anatomy 0.000 description 5
- 238000004520 electroporation Methods 0.000 description 5
- 238000000338 in vitro Methods 0.000 description 5
- 230000001965 increasing effect Effects 0.000 description 5
- 238000003780 insertion Methods 0.000 description 5
- 230000037431 insertion Effects 0.000 description 5
- 230000010076 replication Effects 0.000 description 5
- 230000019491 signal transduction Effects 0.000 description 5
- 210000001519 tissue Anatomy 0.000 description 5
- 108700028369 Alleles Proteins 0.000 description 4
- 241000233866 Fungi Species 0.000 description 4
- 241000238631 Hexapoda Species 0.000 description 4
- 230000008901 benefit Effects 0.000 description 4
- 201000011510 cancer Diseases 0.000 description 4
- 238000010367 cloning Methods 0.000 description 4
- 201000010099 disease Diseases 0.000 description 4
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 4
- 239000003814 drug Substances 0.000 description 4
- 230000002538 fungal effect Effects 0.000 description 4
- 229930182830 galactose Natural products 0.000 description 4
- 210000005260 human cell Anatomy 0.000 description 4
- 238000001727 in vivo Methods 0.000 description 4
- 239000003550 marker Substances 0.000 description 4
- 238000010369 molecular cloning Methods 0.000 description 4
- 238000005204 segregation Methods 0.000 description 4
- 230000010473 stable expression Effects 0.000 description 4
- 238000010561 standard procedure Methods 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 102100022005 B-lymphocyte antigen CD20 Human genes 0.000 description 3
- 108091026890 Coding region Proteins 0.000 description 3
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- 108700039887 Essential Genes Proteins 0.000 description 3
- 101150094690 GAL1 gene Proteins 0.000 description 3
- 101150066002 GFP gene Proteins 0.000 description 3
- 102100039556 Galectin-4 Human genes 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 101000897405 Homo sapiens B-lymphocyte antigen CD20 Proteins 0.000 description 3
- 101100121078 Homo sapiens GAL gene Proteins 0.000 description 3
- 101000608765 Homo sapiens Galectin-4 Proteins 0.000 description 3
- 102100031413 L-dopachrome tautomerase Human genes 0.000 description 3
- 241000700605 Viruses Species 0.000 description 3
- 230000004075 alteration Effects 0.000 description 3
- 238000002306 biochemical method Methods 0.000 description 3
- 210000000170 cell membrane Anatomy 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 230000001419 dependent effect Effects 0.000 description 3
- 108010051081 dopachrome isomerase Proteins 0.000 description 3
- 230000001976 improved effect Effects 0.000 description 3
- 230000006698 induction Effects 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 230000013011 mating Effects 0.000 description 3
- 230000007246 mechanism Effects 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 230000006798 recombination Effects 0.000 description 3
- 238000005215 recombination Methods 0.000 description 3
- 201000009030 Carcinoma Diseases 0.000 description 2
- 108700039691 Genetic Promoter Regions Proteins 0.000 description 2
- 101000606090 Homo sapiens Tyrosinase Proteins 0.000 description 2
- 108060001084 Luciferase Proteins 0.000 description 2
- XUMBMVFBXHLACL-UHFFFAOYSA-N Melanin Chemical compound O=C1C(=O)C(C2=CNC3=C(C(C(=O)C4=C32)=O)C)=C2C4=CNC2=C1C XUMBMVFBXHLACL-UHFFFAOYSA-N 0.000 description 2
- 102000018697 Membrane Proteins Human genes 0.000 description 2
- 108010052285 Membrane Proteins Proteins 0.000 description 2
- 102000013760 Microphthalmia-Associated Transcription Factor Human genes 0.000 description 2
- 108010050345 Microphthalmia-Associated Transcription Factor Proteins 0.000 description 2
- 229930193140 Neomycin Natural products 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 2
- 230000003321 amplification Effects 0.000 description 2
- 108010051210 beta-Fructofuranosidase Proteins 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 230000031018 biological processes and functions Effects 0.000 description 2
- 210000004556 brain Anatomy 0.000 description 2
- 230000010261 cell growth Effects 0.000 description 2
- 239000002458 cell surface marker Substances 0.000 description 2
- 108091092328 cellular RNA Proteins 0.000 description 2
- 230000002759 chromosomal effect Effects 0.000 description 2
- 230000001351 cycling effect Effects 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 238000012217 deletion Methods 0.000 description 2
- 230000037430 deletion Effects 0.000 description 2
- 238000006073 displacement reaction Methods 0.000 description 2
- 229940079593 drug Drugs 0.000 description 2
- 239000000975 dye Substances 0.000 description 2
- 230000002900 effect on cell Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000000684 flow cytometry Methods 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 238000012248 genetic selection Methods 0.000 description 2
- 230000006801 homologous recombination Effects 0.000 description 2
- 238000002744 homologous recombination Methods 0.000 description 2
- 239000003112 inhibitor Substances 0.000 description 2
- 230000002401 inhibitory effect Effects 0.000 description 2
- 230000005764 inhibitory process Effects 0.000 description 2
- 239000001573 invertase Substances 0.000 description 2
- 235000011073 invertase Nutrition 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 238000005259 measurement Methods 0.000 description 2
- 210000002752 melanocyte Anatomy 0.000 description 2
- 239000000693 micelle Substances 0.000 description 2
- 244000005700 microbiome Species 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000012544 monitoring process Methods 0.000 description 2
- 229960004927 neomycin Drugs 0.000 description 2
- 238000003199 nucleic acid amplification method Methods 0.000 description 2
- 230000002018 overexpression Effects 0.000 description 2
- 239000000049 pigment Substances 0.000 description 2
- 238000002360 preparation method Methods 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 230000001902 propagating effect Effects 0.000 description 2
- 230000004043 responsiveness Effects 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 238000010187 selection method Methods 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 239000000126 substance Substances 0.000 description 2
- 230000001225 therapeutic effect Effects 0.000 description 2
- 238000013518 transcription Methods 0.000 description 2
- 230000035897 transcription Effects 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000013519 translation Methods 0.000 description 2
- 238000001086 yeast two-hybrid system Methods 0.000 description 2
- BFSVOASYOCHEOV-UHFFFAOYSA-N 2-diethylaminoethanol Chemical compound CCN(CC)CCO BFSVOASYOCHEOV-UHFFFAOYSA-N 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 241000256118 Aedes aegypti Species 0.000 description 1
- 241000203069 Archaea Species 0.000 description 1
- 241000238421 Arthropoda Species 0.000 description 1
- 206010003571 Astrocytoma Diseases 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 101100064718 Borrelia bavariensis (strain ATCC BAA-2496 / DSM 23469 / PBi) fusA1 gene Proteins 0.000 description 1
- 101100209555 Caenorhabditis elegans vha-17 gene Proteins 0.000 description 1
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 241000701489 Cauliflower mosaic virus Species 0.000 description 1
- 206010061765 Chromosomal mutation Diseases 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 241000206602 Eukaryota Species 0.000 description 1
- 101150038178 FUS1 gene Proteins 0.000 description 1
- 238000012413 Fluorescence activated cell sorting analysis Methods 0.000 description 1
- 102000003688 G-Protein-Coupled Receptors Human genes 0.000 description 1
- 108090000045 G-Protein-Coupled Receptors Proteins 0.000 description 1
- 208000032612 Glial tumor Diseases 0.000 description 1
- 206010018338 Glioma Diseases 0.000 description 1
- 102100031547 HLA class II histocompatibility antigen, DO alpha chain Human genes 0.000 description 1
- 241000204946 Halobacterium salinarum Species 0.000 description 1
- 101000866278 Homo sapiens HLA class II histocompatibility antigen, DO alpha chain Proteins 0.000 description 1
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 1
- 241000254158 Lampyridae Species 0.000 description 1
- 229910013594 LiOAc Inorganic materials 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- 102000043136 MAP kinase family Human genes 0.000 description 1
- 108091054455 MAP kinase family Proteins 0.000 description 1
- 108010038049 Mating Factor Proteins 0.000 description 1
- 102000051089 Melanotransferrin Human genes 0.000 description 1
- 108700038051 Melanotransferrin Proteins 0.000 description 1
- 206010027480 Metastatic malignant melanoma Diseases 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241000699660 Mus musculus Species 0.000 description 1
- 206010061309 Neoplasm progression Diseases 0.000 description 1
- 206010029260 Neuroblastoma Diseases 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 238000012408 PCR amplification Methods 0.000 description 1
- 102000052575 Proto-Oncogene Human genes 0.000 description 1
- 108700020978 Proto-Oncogene Proteins 0.000 description 1
- 108091034057 RNA (poly(A)) Proteins 0.000 description 1
- 102000009572 RNA Polymerase II Human genes 0.000 description 1
- 108010009460 RNA Polymerase II Proteins 0.000 description 1
- 108091027981 Response element Proteins 0.000 description 1
- 101150006985 STE2 gene Proteins 0.000 description 1
- 241000242583 Scyphozoa Species 0.000 description 1
- 241000256251 Spodoptera frugiperda Species 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- 241000205098 Sulfolobus acidocaldarius Species 0.000 description 1
- 241000255588 Tephritidae Species 0.000 description 1
- 108700029229 Transcriptional Regulatory Elements Proteins 0.000 description 1
- 102000044209 Tumor Suppressor Genes Human genes 0.000 description 1
- 108700025716 Tumor Suppressor Genes Proteins 0.000 description 1
- 108700005077 Viral Genes Proteins 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000001464 adherent effect Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 210000004507 artificial chromosome Anatomy 0.000 description 1
- 210000001106 artificial yeast chromosome Anatomy 0.000 description 1
- 238000003287 bathing Methods 0.000 description 1
- 230000003851 biochemical process Effects 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000000981 bystander Effects 0.000 description 1
- 229910052791 calcium Inorganic materials 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000003990 capacitor Substances 0.000 description 1
- 101150055766 cat gene Proteins 0.000 description 1
- 229920006317 cationic polymer Polymers 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000025084 cell cycle arrest Effects 0.000 description 1
- 230000036978 cell physiology Effects 0.000 description 1
- 230000003833 cell viability Effects 0.000 description 1
- 230000033077 cellular process Effects 0.000 description 1
- 210000002230 centromere Anatomy 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 210000001072 colon Anatomy 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 230000001447 compensatory effect Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 230000006552 constitutive activation Effects 0.000 description 1
- 239000013078 crystal Substances 0.000 description 1
- 230000001086 cytosolic effect Effects 0.000 description 1
- 230000000881 depressing effect Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 230000001973 epigenetic effect Effects 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 230000001605 fetal effect Effects 0.000 description 1
- 210000002950 fibroblast Anatomy 0.000 description 1
- 108091006047 fluorescent proteins Proteins 0.000 description 1
- 102000034287 fluorescent proteins Human genes 0.000 description 1
- 238000003633 gene expression assay Methods 0.000 description 1
- 238000010359 gene isolation Methods 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 238000007429 general method Methods 0.000 description 1
- 238000010448 genetic screening Methods 0.000 description 1
- 229940094991 herring sperm dna Drugs 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000008676 import Effects 0.000 description 1
- 230000000977 initiatory effect Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000010354 integration Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000009878 intermolecular interaction Effects 0.000 description 1
- 230000009545 invasion Effects 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- XIXADJRWDQXREU-UHFFFAOYSA-M lithium acetate Chemical compound [Li+].CC([O-])=O XIXADJRWDQXREU-UHFFFAOYSA-M 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 230000007257 malfunction Effects 0.000 description 1
- 230000036210 malignancy Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000002503 metabolic effect Effects 0.000 description 1
- 230000004060 metabolic process Effects 0.000 description 1
- 208000021039 metastatic melanoma Diseases 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 238000007479 molecular analysis Methods 0.000 description 1
- 230000004001 molecular interaction Effects 0.000 description 1
- 238000002703 mutagenesis Methods 0.000 description 1
- 231100000350 mutagenesis Toxicity 0.000 description 1
- 239000003471 mutagenic agent Substances 0.000 description 1
- 231100000707 mutagenic chemical Toxicity 0.000 description 1
- 210000000633 nuclear envelope Anatomy 0.000 description 1
- 210000004940 nucleus Anatomy 0.000 description 1
- 235000016709 nutrition Nutrition 0.000 description 1
- 244000052769 pathogen Species 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- NBIIXXVUZAFLBC-UHFFFAOYSA-K phosphate Chemical compound [O-]P([O-])([O-])=O NBIIXXVUZAFLBC-UHFFFAOYSA-K 0.000 description 1
- 210000004694 pigment cell Anatomy 0.000 description 1
- 239000002244 precipitate Substances 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 210000002307 prostate Anatomy 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 108020001580 protein domains Proteins 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 238000004445 quantitative analysis Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 108091008146 restriction endonucleases Proteins 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 150000003839 salts Chemical class 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000009962 secretion pathway Effects 0.000 description 1
- 238000009394 selective breeding Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 230000014639 sexual reproduction Effects 0.000 description 1
- 239000013605 shuttle vector Substances 0.000 description 1
- 230000000392 somatic effect Effects 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 230000004083 survival effect Effects 0.000 description 1
- 101150003509 tag gene Proteins 0.000 description 1
- 238000002560 therapeutic procedure Methods 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 238000000844 transformation Methods 0.000 description 1
- 230000009261 transgenic effect Effects 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 230000010474 transient expression Effects 0.000 description 1
- 230000005751 tumor progression Effects 0.000 description 1
- 238000010396 two-hybrid screening Methods 0.000 description 1
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 1
- 241000701161 unidentified adenovirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 230000008189 vertebrate development Effects 0.000 description 1
- 230000008299 viral mechanism Effects 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1086—Preparation or screening of expression libraries, e.g. reporter assays
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1051—Gene trapping, e.g. exon-, intron-, IRES-, signal sequence-trap cloning, trap vectors
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/10—Processes for the isolation, preparation or purification of DNA or RNA
- C12N15/1034—Isolating an individual clone by screening libraries
- C12N15/1079—Screening libraries by altering the phenotype or phenotypic trait of the host
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6811—Selection methods for production or design of target specific oligonucleotides or binding molecules
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6897—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids involving reporter genes operably linked to promoters
Definitions
- the present invention comprises a general procedure applicable in virtually any cell type for identification of nucleic acid sequences that perturb specific biochemical pathways within a cell.
- Some organisms are especially tractable in genetic studies. These organisms typically are either unicellular, or have short life cycles, small genomes, and a variety of other useful features. Other organisms, such as humans, are less tractable.
- two basic approaches to mutant isolation are available. The first method, termed screening, involves the sometimes painstaking inspection of thousands of individual organisms or clones of cells. Those that have the appropriate mutant phenotype are separated from the others and permitted to grow in isolation. In this manner, homogeneous populations of mutants can be grown and analyzed.
- the second approach involves growth of organisms under conditions that favor the survival of variant phenotypes over the wild type phenotype. In the case of microorganisms, the selection conditions often involve nutritional requirements or resistance to drugs.
- the classical models for genetic studies include E. coli, S. cerevisiae, D. melanogastor, and M. musculus. These organisms share certain features that facilitate genetic studies. First, they can be used to screen and/or select for interesting phenotypic variants (mutants). Second, they can be manipulated in such a way that the underlying genes responsible for specific mutant phenotypes can be localized and isolated by molecular cloning methods. These features permit the analysis of genes in cases where detailed biochemical information about the process under study is unavailable. All that is required at the outset is a tractable experimental organism and a phenotype that can be scored or selected.
- Cancer Genes regulate some of the most medically and commercially important processes in biology.
- a long list of human diseases are caused by mutations or malfunctions of specific genes.
- Cancer may be the most familiar example, as it involves the sequential alteration of proto-oncogenes and tumor suppressor genes as tumors progress through stages of malignancy (Fearon E. R. and Vogelstein B., Cell 61: 759-767 (1990)).
- Methods capable of identifying the underlying genes that regulate important biological processes such as tumor progression would thus be of great value.
- the method should be simple, rapid, and permit identification of components of genetic pathways that regulate traits of interest. It should circumvent many of the obstacles that have interfered with genetic analysis in certain cells and organisms. It should not require an understanding of the detailed basis of a particular phenotype, or the mechanisms that underlie specific cellular behaviors.
- the method should be generally applicable to a great variety of cells, including cells cultured from somatic tissues of multicellular organisms, and it should sidestep certain disadvantages of somatic cell genetics, including the diploid character of most cells, the difficulty of isolating mutant genes once mutations have been induced, and the heterogeneity of many cell populations.
- the present invention is directed to a method of genetic analysis that satisfies the need for a simple, rapid, and general way to identify components of genetic pathways that regulate traits of interest.
- the method involves the use of three basic tools: (1) a reporter gene that reflects the phenotypic state of a particular cell; (2) a selection device or method that permits rapid quantitative measurement of the expression levels of the reporter molecule on a cell-by-cell basis; and (3) an expression library, preferably of proteins, protein fragments, or peptides (“perturbagens”), that can be introduced into the chosen cell population (host cells).
- the reporter gene is typically contained in a construct that places it under the control of a specific cis regulatory element whose activity correlates with the trait of interest.
- This construct is introduced into a population of host cells such that it is stably maintained and expressed.
- a genetic library constructed in a second expression vector is introduced into the host cells that harbor the reporter gene construct.
- This second expression library generates perturbagens in the host cells.
- the host cells are analyzed using a method or device that quantitatively detects reporter expression levels. Cells with reporter gene expression levels that are decreased or increased relative to the expression observed in cells that contain only the stably expressed reporter, without the perturbagens, are selected and their library inserts are isolated.
- the reporter serves as a surrogate for the cellular phenotype and thus must be chosen carefully to reflect the relevant phenotypic state as closely as possible.
- the reporter may be an endogenous gene, preferably encoding a cell surface marker, expressed by cells with the phenotype of interest, or it may be a foreign gene placed under the control of a cell-type-specific or cell-state-specific promoter that is active in the cells under study.
- the reporter is expressed in the host cells at a level sufficient to permit its rapid and quantitative determination.
- Perturbagens are molecules that act in a transdominant mode to interfere with the function of endogenous cellular components.
- perturbagens are typically proteinacious: proteins, protein fragments, or peptides; though perturbagens may also be nucleic acids.
- Perturbagen genetic libraries are introduced into the host cells that harbor the reporter expression construct in such a way that a single type of each perturbagen (or a small number of different perturbagens) is expressed in a host cell.
- the selection device or method is used to screen rapidly through millions of cells that harbor the reporter gene construct for variants that express altered levels of the reporter and to sort (or select) those variant cells away from the majority of cells that express normal levels.
- This selected population that expresses altered levels of the reporter is used in turn to isolate the resident perturbagens by, e.g., PCR (Ausubel F. M., Brent R., et al., Current Protocols in Molecular Biology, John Wiley and Sons, New York (1996)).
- the selection procedure results in enrichment of the initial population of cells harboring the perturbagen library for cells that contain perturbagen fragments that affect reporter gene expression.
- the sub-library of perturbagen fragments that influence reporter gene expression can be reintroduced into the host cells and the process of screening/selection can be repeated. The whole cycle is repeated as many times as necessary to obtain a relatively pure sub-library of perturbagen-encoding inserts which, when introduced into the host cells, causes altered reporter gene expression. Each of these perturbagen fragments can be isolated and studied individually.
- Perturbagen fragments isolated in this manner produce phenocopies; i.e. they generate the equivalent of genetic mutations.
- Each fragment encodes a perturbagen that affects expression of the reporter.
- the reporter gene may be expressed only in the presence of a specific transcription factor. If the perturbagen sequesters this factor, or acts upstream of the factor to reduce its activity, reporter gene expression will be reduced.
- the present invention also can be used to generate a perturbagen disruption that causes a phenotypic transformation such that the original cell type is converted into a different cell type in which the reporter gene is not expressed.
- Such a perturbagen identifies a master switch; a single molecule capable of dictating the phenotype of the cell.
- a cloned perturbagen-encoding sequence may rapidly give direct and indirect information about the pathway it affects. If the perturbagen is derived from a gene or gene fragment, it may be related to a previously identified component of the pathway and its sequence may reveal its identity. The target of the perturbagen may be a second component of that pathway, whose identity can be inferred. Alternatively, the target molecule can be identified by techniques known in the art such as the yeast two-hybrid screen (See Fields S. and Song O. -K., U.S. Pat. No. 5,283,173) or by “suppressor” perturbagen methods outlined infra (Jarvik J. and Botstein D., Proc. Natl. Acad. Sci.
- FIG. 1 Perturbagen cartoon. Three examples of intermolecular interactions illustrating: first, a complex between two native proteins in the cell; second, a complex between a small molecule inhibitor of the interaction between the native proteins; and third, a complex between a protein fragment perturbagen derived from the native protein that is the normal binding partner of the other. This perturbagen is expected to behave in a manner similar to the small molecule inhibitor. Furthermore, the perturbagen/target complex serves as the basis for a screen to identify small molecule mimics.
- FIG. 2 Mammalian perturbagen expression vectors that use autofluorescent proteins (GFP or BFP) as fusion partners for the expressed proteins.
- MCS is a multiple cloning site for insertion of individual sequences or genetic libraries. Either of the illustrated vectors may be used for perturbagen expression library construction.
- FIG. 3 Reporter gene expression vector for mammalian cells with a “crippled” promoter that contains only the TATA box from the CMV promoter and lacks an enhancer; cis regulatory elements can be inserted upstream of the TATA box in the Bg1II or BamH1 sites.
- FIG. 4 Flow chart of the process of genetic analysis disclosed in the invention.
- the reporter expression construct (in this case, the GFP gene) (1) is introduced into the chosen host cells and a stable expresser is selected (2). This reporter-expressing line is clonally expanded to generate a population that is, in this case, bright green (4).
- a perturbagen library (3) is introduced into the host cells to generate a population of reporter gene-containing cells (5), many of which also express perturbagens. This population is examined using a flow sorter device (6) and cells are sorted into two populations: cells (7) that continue to express reporter protein levels similar to the cell in #4.; and, cells (8) that express, in this case, reduced levels of the reporter.
- the perturbagen inserts (9) from such “dim” cells are isolated and either used to determine their DNA sequences (10), or reintroduced into the reporter-containing host cells (11) for another cycle of selection and enrichment.
- FIG. 5 Flow sorter profile diagram. A cartoon depicting the fluorescence intensity distribution of a population of host cells containing perturbagens prior to selection. This presorted population is used to select cells on the left tail of the distribution (in black fill) or on the right tail (in gray fill). If, for example, the dim cells on the left are selected and perturbagens from these cells are reintroduced into the original host cells, the fluorescence intensity distribution that ensues from cells that harbor such a sub-library of sequences becomes skewed to the left (i.e., the mean fluorescence intensity decreases).
- FIG. 6 Genetic pathway involved in a-factor arrest of S. cerevisiae a cells.
- the cell and nuclear membranes are represented as gray circles; the protein components of the pathway are depicted as rounded objects of various types including rectangles and triangles.
- a-factor is the triangle labeled with “a” outside the a cell.
- Interactions among components that lead to activation are represented as arrows; interactions that lead to inhibition are depicted as blunt-ended lines.
- FIG. 7 Expression vector used in yeast as a reporter to identify perturbagens that affect a-factor responsiveness. Three possible inserts upstream of the GFP gene are depicted which depend on the strategy used. The first strategy involves use of four tandemly arrayed a-factor response elements; the second uses the promoter of the FUS1 gene; the third uses genomic DNA selected to confer a-factor responsiveness.
- FIG. 8 Expression vector to express perturbagens in yeast.
- Genemic DNA refers to the perturbagen-encoding inserts.
- Four strategies are used to generate fusion proteins with the perturbagen inserts: 1. Blue Fluorescent Protein (BFP); 2. GAL4 sequences; 3. invertase sequences; 4. no fusion partner sequence.
- BFP Blue Fluorescent Protein
- GAL4 sequences 3. invertase sequences; 4. no fusion partner sequence.
- genomic library or “library” are interchangeably used to refer to a collection of nucleic acid fragments that may individually range in size from about a few base pairs to about a million base pairs. These fragments are contained as inserts in vectors capable of propagating in certain host cells such as bacterial, fungal, plant, insect, or mammalian cells.
- sub-library refers to a portion of a genetic library that has been isolated by application of a specific screening or selection procedure.
- cover in the context of a genetic library refers to the level of redundancy of the library. This redundancy is in turn related to the probability that a specific sequence within the nucleic acid sequences that the library is intended to represent is actually present. Coverage is the ratio of the number of library inserts multiplied by the average insert size to the total complexity of the nucleic acid sequences that the library represents.
- vector refers to a nucleic acid sequence that is capable of propagating in particular host cells and can accommodate inserts of foreign nucleic acid.
- vectors can be manipulated in vitro to insert foreign nucleic acids and the vectors can be introduced into host cells such that the inserted nucleic acid is transiently or stably present in the host cells.
- expression vector refers to a vector designed to express inserted nucleic acid sequences. Such vectors may contain a powerful promoter located upstream of the insertion site.
- nucleic acids refers to transcription and/or translation of nucleic acids into mRNA and/or protein products.
- expression library refers to a library of nucleic acid fragments contained as inserts in an expression vector.
- stable expression refers to the continued presence and expression of a nucleic acid sequence in a host cell for a period of time that is at least as long as that required to carry out the methods of the present invention.
- Stable expression can be achieved through integration of the construct into a host cell chromosome, or engineering the construct so that it possesses elements that ensure its continued replication and segregation within the host (i.e., an artificial chromosome), or alternatively, the construct may contain a selectable marker (e.g., a drug resistance gene) so that stable expression of the construct is ensured by growing the host cells under selective conditions (e.g., in drug-containing media).
- a selectable marker e.g., a drug resistance gene
- nucleic acid fragments refers to a set of nucleic acid molecules from any source.
- a collection of nucleic acid fragments may comprise total genomic DNA, genomic DNA from one or more chromosomes, cDNA that has been reverse-transcribed from total cellular RNA or from messenger RNA (mRNA), total cellular RNA, mRNA, or a set of nucleic acid molecules synthesized in vitro either individually, or using combinatorial methods.
- mRNA messenger RNA
- the term encompasses nucleic acid molecules comprising known analogs of natural nucleotides that can function in a similar manner as naturally occurring nucleotides.
- insert in the context of a library refers to an individual DNA fragment that constitutes a single member of the library.
- the term “host cell” refers to a cell of prokaryotic, archaebacterial, or eukaryotic origin that can serve as a recipient for a vector that is introduced by any one of several procedures.
- the host cell often allows replication and segregation of the vector that resides within. In certain cases, however, replication and/or segregation are irrelevant; expression of vector or insert DNA is the objective.
- Typical bacterial host cells include E. coli and B. subtilis; archaebacterial host cells include S. acidocaldarius and H. salinarium; fungal host cells include S. cerevisiae and S. pombe; plant cells include those isolated from A. thaliana, and Z. maize; insect host cells include those isolated from D.
- melanogastor A. aegypti, and S. frugiperda
- mammalian cells include those isolated from human tissues and cancers including melanocyte (melanoma), colon (carcinoma), prostate (carcinoma), and brain (glioma, neuroblastoma, astrocytoma).
- reporter gene refers to nucleic acid sequences for which screens or selections can be devised. Reporter genes may encode proteins (“reporters”) capable of emitting light such as GFP (Chalfie M., Tu Y, et al., Science February 11; 263 :802-805 (1994)), or luciferase (Gould S. J., and Subramani S., Anal. Biochem. November 15; 175: 5-13 (1988)), or genes that encode intracellular or cell surface proteins detectable by antibodies such as CD20 (Koh J., Enders G. H., et al., Nature 375: 506-510 (1995)).
- proteins capable of emitting light
- GFP Chalfie M., Tu Y, et al., Science February 11; 263 :802-805 (1994)
- luciferase Gould S. J., and Subramani S., Anal. Biochem. November 15; 175: 5-13 (1988)
- the reporters allow the activity of cis regulatory sequences to be monitored in a quantitative manner.
- reporter genes can confer antibiotic resistance such as hygromycin or neomycin resistance (Santerre R. F., et al., Gene 30: 147-156 (1984)).
- Bright cells have high intensity emission relative to the bulk population of cells, and by inference, high levels of reporter gene expression; dim cells have low intensity emission relative to the bulk population.
- the term “genetic pathway” refers to a set of proteins (or the genes that encode them) that act in concert, or sequentially, to accomplish a specific biochemical function or cellular behavior.
- cis regulatory sequence cis sequence
- regulatory sequence a nucleic acid sequence that affects the expression of itself or other sequences physically linked on the same nucleic acid molecule. Such sequences may alter gene expression by affecting such things as transcription, translation, or RNA stability.
- cis regulatory sequences include promoters, enhancers, or negative regulatory sequences (Alberts B., Bray D., et al. (Eds.), Molecular Biology of the Cell, Second Edition, Garland Publishing, Inc., New York and London, (1989); Lewin B, Gene V, Oxford University Press, Oxford, U.K. (1994)).
- perturbagens refers to an agent that acts in a transdominant mode to interfere with specific biochemical processes in cells.
- perturbagens are typically either proteins, protein fragments, or peptides, although the term also encompasses nucleic acids and other organic molecules with similar properties.
- transdominant describes a type of interaction whereby the agent (most typically a perturbagen) is a diffusable substance that can bind its target in solution.
- the agent most typically a perturbagen
- a transdominant agent is dominant as opposed to recessive in a genetic sense, because, e.g., it acts on gene products and not on alleles of genes.
- the effects of a perturbagen are visible in the presence of wild type alleles of its target.
- phenocopy refers to a phenotypic state or appearance that mimics or resembles the state induced by mutation of a specific gene or genes. This state may, for example, be induced by expression of perturbagens within a particular host cell.
- target in the context of a perturbagen refers to the molecule in the cell (typically a protein) to which the perturbagen binds to exert its effect on cellular phenotype.
- flow sorter refers to a machine that analyzes light emission intensity from cells or other objects and separates these cells or objects according to parameters such as light emission intensity.
- the present invention comprises methods to identify components of genetic pathways in cultured cells from plants and animals, or unicellular organisms such as yeast, bacteria, and fungi.
- Three basic tools are involved: (1) a reporter gene under the control of a specific cis regulatory element that reflects the phenotypic state of a particular cell; (2) a selection device or method that permits rapid quantitative measurement of the expression levels of the reporter molecule on a cell-by-cell basis; and (3) an expression library of proteins, protein fragments, or peptides (“perturbagens”) that can be introduced into the chosen cell population. Sequences are isolated from the expression library based on their ability to alter the activity of the cis regulatory sequence, as read out by the reporter expression level.
- the method thus comprises a set of tools and techniques that together permit the identification of components of genetic pathways using a pseudo-genetic approach.
- the method of the invention can be used in human cells, but it can also be modified easily for use in other mammalian cells, in plant cells, in arthropod cells, and in fungi, archaebacteria and bacteria.
- reporter genes have been appropriated for use in expression monitoring and in promoter/enhancer trapping.
- a reporter comprises any gene product for which screens or selections can be applied.
- Reporter genes used in the art include the LacZ gene from E. coli (Shapiro S. K., Chou J., et al., Gene November; 25: 71-82 (1983)), the CAT gene from bacteria (Thiel G., Petersohn D., and Schoch S., Gene February 12; 168: 173-176 (1996)), the luciferase gene from firefly (Gould S. J., and Subramani S., 1988), and the GFP gene from jellyfish (Chalfie M. and Prashner D. C., U.S. Pat.
- autofluorescent proteins e.g., GFP
- the cell surface reporters are potentially of greatest use in monitoring living cells, because they act as “vital dyes.” Their expression can be evaluated in living cells, and the cells can be recovered intact for subsequent analysis. Vital dyes, however, are not specifically required by the methods of the present invention. It is also very useful to employ reporters whose expression can be quantified rapidly and with high sensitivity. Thus, fluorescent reporters (or reporters that can be labeled directly or indirectly with a fluorophore) are especially preferred. This trait permits high throughput screening on a flow sorter machine such as a fluorescence activated cell sorter (FACS).
- FACS fluorescence activated cell sorter
- GFP is a member of a family of naturally occurring fluorescent proteins, whose fluorescence is primarily in the green region of the spectrum. GFP has been developed extensively for use as a reporter and several mutant forms of the protein have been characterized that have altered spectral properties (Cormack B. P., Valdivia R. H., and Falkow S., Gene 173: 33-38 (1996)). High levels of GFP expression have been obtained in cells ranging from yeast to human cells. It is a robust, all-purpose reporter, whose expression in the cytoplasm can be measured quantitatively using a flow sorter instrument such as a FACS.
- a flow sorter instrument such as a FACS.
- Genetic libraries typically involve a collection of DNA fragments, usually genomic DNA or cDNA, but sometimes synthetic DNA or RNA, that together represent all or some portion of a genome, a population of mRNAs, or some other set of nucleic acids that contain sequences of interest.
- genetic libraries represent sequences in a form that can be manipulated.
- a total genomic DNA library in principle includes all the sequences present in the genome of an organism propagated as a collection of cloned sequences. It is often desirable to generate a library that is as representative of the input population of nucleic acids as possible. For example, sequences that are present at one to one ratios in the input population (e.g., genome) are present in the library in the same proportion.
- a library should have at least 5-fold coverage; that is, the library should contain at least 5-fold excess of total inserts beyond the total number required theoretically to cover the collection of nucleic acid sequences one time.
- the coverage i.e., the total number of inserts multiplied by the mean insert size divided by the genome complexity, must be at least five.
- libraries are propagated in vectors that grow in bacterial cells, although eukaryotic cells such as yeast and even human cells can also serve as hosts.
- the mean insert size of a library is a variable that can be manipulated within rather broad limits that depend on vector and cell types, among other things.
- some vectors such as bacterial plasmids accommodate small inserts ranging from a few nucleotides to a few kilobase pairs, whereas others such as yeast artificial chromosomes can accommodate insert sizes that exceed 1,000 kilobase pairs.
- the present invention preferably uses genetic libraries that contain inserts on the smaller end of the spectrum. These inserts would most typically be derived from genomes or transcripts of particular organisms, or from synthetic DNA, and would range from, e.g., 10 base pairs to 10 kilobase pairs. The libraries most typically would have coverage that, if possible, exceeded five-fold.
- the details of library construction, manipulation, and maintenance are known in the art (Ausubel F., Brent R., et al., 1996; Sambrook J., Fritsch E. F., and Maniatis, T., Molecular Cloning: A Laboratory Manual, Second Edition, CHSL Press, New York (1989)).
- a library is created according to the following procedure using methods that are well-known in the art.
- Double stranded cDNA is prepared from random primed MRNA isolated from a particular cell type or tissue. These fragments are treated with enzymes to repair their ends and are ligated into the expression vector described infra. The ligated material is introduced into E. coli and clones are selected. A number of individual clones sufficient to achieve reasonable coverage of the mRNA population (e.g., one million clones) is collected, and grown in mass culture for isolation of the resident vectors and their inserts. This process allows large quantities of the library DNA to be obtained in preparation for subsequent procedures described infra.
- non-natural nucleic acid it is preferable to use non-natural nucleic acid as the starting material for the library.
- a population of synthetic oligonucleotides e.g., representing all possible sequences of length N, or a subset of all possible sequences, as the input nucleic acid for the library.
- mixtures of natural and non-natural nucleic acids for library inserts it may be desirable to use mixtures of natural and non-natural nucleic acids for library inserts.
- the introduced nucleic acid may remain as an extrachromosomal element (e.g., adenoviruses, Amalfitano A., Begy C. R., and Chamberlain J. S., Proc. Natl. Acad. Sci. ( USA ) 93: 3352-3356 (1996)) or maybe incorporated into a host chromosome (e.g., retroviruses, Iida A., Chen S. T., et al., J. Virol. 70: 6054-6059 (1996)).
- extrachromosomal element e.g., adenoviruses, Amalfitano A., Begy C. R., and Chamberlain J. S., Proc. Natl. Acad. Sci. ( USA ) 93: 3352-3356 (1996)
- a host chromosome e.g., retroviruses, Iida A., Chen S. T., et al., J. Virol. 70
- nucleic acid transfer In the case of non-viral nucleic acid transfer, many methods are available (Ausubel F., Brent R., et al., 1996).
- One technique for nucleic acid transfer is CaPO 4 coprecipitation of nucleic acid. This method relies on the ability of nucleic acid to coprecipitate with calcium and phosphate ions into a relatively insoluble CaPO 4 grit, which settles onto the surface of adherent cells on the culture dish bottom. The precipitate is, for reasons that are not clearly understood, absorbed by some cells and the coprecipitated nucleic acid is liberated inside the cell and expressed.
- a second class of methods employs lipophilic cations that are able to bind DNA by charge interactions while forming lipid micelles.
- a third method of nucleic acid transfer is electroporation, a technique that involves discharge of voltage from the plates of a capacitor through a buffer containing DNA and host cells. This process disturbs the bilayer sufficiently that DNA contained in the bathing solution is able to penetrate the cell membrane.
- a fourth method involves cationic polymers such as DEAE dextran which mediate DNA entry and expression in cultured cells.
- a fifth method employs ballistic delivery of DNA contained in ice crystals or adsorbed to the surface of miniature projectiles that are shot into cells. Finally, microinjection of DNA can be used, though it is typically quite slow and labor intensive.
- Electroporation is a particularly flexible method for nucleic acid delivery applicable to most cell types including prokaryotes, fungi, plant and animal cells.
- certain mixtures of specific salts can be used with some cells to facilitate DNA entry.
- CaCl 2 works well with E. coli
- LiOAc works well with S. cerevisiae.
- somatic cell genetics involves the difficulty with which recessive mutations can be observed.
- the problem can be formulated in statistical terms. If mutations occur in one allele at a frequency of, e.g., one in one million, then the chance that two independent mutations will occur, one in each allele, is the product: one in a trillion. Thus, dominant or codominant mutations are much more readily observed in general. Because of the recessive nature of the vast majority of mutations, somatic cell genetics is limited largely to study of dominant alterations such as overexpression.
- Perturbagens typically are proteins, protein fragments, or peptides (though they may be nucleic acids) that bind other proteins in the cell and thereby disrupt specific biochemical pathways (see FIG. 1). Nature generates perturbagen-like molecules by chance in the case of a certain class of dominant, gain-of-function mutations and in specific cases dominant negative mutant genes have been designed (Herskowitz I., Nature 329: 219-222 (1987)). In the present invention, this mode of biochemical/genetic disruption is harnessed and applied in a directed fashion to identify and recover important genes.
- Perturbagens can be constructed in a variety of ways. They may be generated from randomly-primed, size-selected cDNA, sheared or digested genomic DNA, synthetic DNA or other sources of nucleic acid. They may be expressed in cells without any additional protein sequences joined to them. Alternatively, they may be fused to other proteins, e.g., GFP or yeast GAL4, by standard methods of molecular cloning (Ausubel et al., 1996). In addition, they may be presented as insertion sequences within specific proteins.
- Perturbagen libraries can be constructed using techniques similar to construction of conventional gene and expression libraries as described supra. Such libraries, when introduced into cells with standard vectors such as viruses or by other means, act in a manner analogous to mutagens; that is, the perturbagens induce a phenocopy state in the host cells which mimics the mutant state, but does not directly involve alterations to host cell DNA sequences.
- the value of perturbagens is based on the ease with which they can be generated and screened, and the readiness with which the perturbagen sequences can be recovered and used to identify elements in the genetic pathways of interest. Furthermore, they act in a mode similar to small molecule therapeutics. Indeed, they are simply the protein equivalent of a small molecule, and they can be used in combination with their targets (binding partners) to screen for small molecule mimics that affect cells in a manner similar to the original protein perturbagen.
- perturbagen expression libraries comprised of, e.g., fragmented genomic DNA, random-primed cDNA, or synthetic DNA of random sequence are introduced into host cells engineered to contain a reporter gene under the control of a cell-type-specific cis regulatory sequence.
- a natural reporter consisting of a membrane protein (or intracellular protein) for which good specific antibodies are available may be used, provided the expression of this protein correlates with a phenotype of interest.
- Cells harboring perturbagens are screened by a rapid and quantitative method or device, such as a flow sorter, e.g., a FACS, to identify the population of cells that have altered expression of the reporter. These are collected for analysis as described infra.
- a generic promoter capable of conferring robust, high or moderately high expression is required.
- These promoters are typically derived from housekeeping genes that are expressed at reasonably high levels in most or all cell types in the organism, or from viruses. Numerous such cis regulatory sequences are known in the art, suitable for driving expression in mammalian cells, insect cells, plant cells, fungi or bacteria (Ausubel et al., 1996; vector database located at: http://www.atcg.com/vectordb/).
- the promoter for beta actin is useful (Qin Z., Kruger-Krasagakes S., et al., J. Exp.
- the goal is to choose a cis regulatory sequence that is active under the conditions of interest, either by genetic methods, biochemical methods, or by reference to known genes that have the desired expression characteristics. For example, if one desires to study the process of pathogenesis in a particular pathogenic organism, it may be useful to commandeer a promoter that is only active in cells competent for pathogenic invasion of the host.
- Expression vectors are used in the invention to produce RNA, proteins, protein fragments, or peptides derived from sequences (genes and gene fragments) that are introduced into host cells.
- the sequences include reporter genes used as a surrogate for the phenotypic state of the cell, and sequences that encode the perturbagens.
- There are numerous expression vectors known in the art which are readily available for use in the present invention (Ausubel F. M., Brent R., et al., 1996; Sambrook J. et al., 1989). Some of these are tailored for use in specific cell types, but most are designed to be used in a wide variety of cell types. In mammalian cells, viral transcriptional regulatory elements are a typical choice for driving expression of exogenous genes.
- an expression vector that contains a reporter gene flanked downstream by a poly(A) addition sequence may be used.
- This type of expression vector is illustrated in FIG. 2.
- the perturbagen-encoding sequence may be flanked upstream of its initiation codon by a TATA box, capable of binding RNA polymerase II (Pol II), and by an enhancer that preferably confers high expression on the linked perturbagen-encoding sequences.
- the expression vector preferably includes a site appropriate for insertion of perturbagen-encoding library sequences.
- Such library sequences preferably involve generation of a fusion protein with, e.g., BFP, though native protein domains or protein fragments may also be employed.
- BFP native protein domains or protein fragments
- the choice of which, if any, perturbagen fusion partner to use depends on, e.g., if cytoplasmic, nuclear, or extracellular expression of the perturbagen is desired.
- the vector if it is of viral origin, may not require propagation in a bacterial host.
- the vector requires propagation in, e.g., E. coli, and contains sequences necessary for replication and selection in E. coli such as a colE1 replicon and an antibiotic resistance gene.
- cis regulatory sequences are chosen according to similar criteria as discussed above.
- cis regulatory sequences are included upstream of the perturbagen-encoding sequences that cause robust, preferably high expression levels. These sequences are thus, preferably, of a generic type present, e.g., upstream of housekeeping genes.
- a suitable sequence is the consensus promoter that consists of a ⁇ 10 box and a ⁇ 35 box (Alberts B., Bray D., et al., 1989; Lewin B., 1994).
- the reporter vector is customized so that reporter expression reflects as closely as possible the phenotypic state of the host cell under study.
- the expression vector is designed such that the reporter gene (e.g., GFP) is placed under the control of cis regulatory sequences that confer cell-type specific expression, and/or reflect the activation of specific biochemical pathways within the cell.
- the reporter gene e.g., GFP
- FIG. 3 shows a mammalian expression vector that can be used to insert foreign cis regulatory sequences upstream of the TATA box from the CMV promoter, generating GFP expression under the control of the chosen regulatory element.
- the combination of genetic libraries and genetic selection or screening techniques permits identification of specific sequences from libraries based on their functions in living cells.
- This strategy has been used frequently in molecular biology to clone genes based on expression, e.g., by complementation of a mutant phenotype (e.g., Yocum R. R. and Johnston M., Gene 32: 75-82 (1984)).
- the premise of the strategy is that an appropriately constructed library can be introduced into suitable host cells and the effects of the library sequences can be monitored. For example, a particular host cell may die in a particular environment in the absence of a certain gene; the host cell will only grow when a library insert that includes the gene is present.
- screens can be employed to pick out the library sequences that confer a particular phenotype.
- T8 Leu-2
- the T8 (Leu-2) gene was isolated by a protocol that involved expression in cultured cells, labeling by a fluorescent antibody, and enrichment by FACS of T8-expressing cells (Kavanthes P., Sukhatme V. P., et al., Proc. Natl. Acad. Sci. ( USA ) 81: 7688-7692 (1984)).
- the present invention may use a flow sorter such as a FACS or equivalent device to screen through large numbers of host cells harboring perturbagen library inserts to identify those that have a particular phenotype; namely, cells that have reduced or elevated levels of reporter molecule expression.
- a flow sorter such as a FACS or equivalent device to screen through large numbers of host cells harboring perturbagen library inserts to identify those that have a particular phenotype; namely, cells that have reduced or elevated levels of reporter molecule expression.
- the reporter e.g., GFP
- the large majority of cells that are analyzed by FACS are expected to have normal (e.g., high) levels of reporter expression.
- a small number may exhibit reduced expression, detected on the FACS as cells that fall on the dimmer side of the cell fluorescence distribution. These dim cells can be collected and grown in isolation of the others. See FIGS. 3 and 4.
- Such a procedure results in enrichment from the starting population of perturbagen-containing cells for those that contain perturbagens that reduce the level of reporter expression.
- These selected, dim cells can be used to reisolate the perturbagen fragments by, e.g., PCR using primer sites that flank the library inserts, so as to build a sub-library of perturbagen fragments enriched for those that cause reduced reporter expression.
- the sub-library of fragments can be recloned (using e.g., the same expression vector) and reintroduced into the host cells, and the screening/selection process can be repeated as many times as necessary.
- the targets of perturbagens in cells are as interesting as the perturbagens themselves. It is expected that most perturbagens exert their phenotypic effect on cells by binding another specific protein, thus inhibiting its function.
- the other protein may be a wild type counterpart of the perturbagen (e.g., in the case of protein homomultimers), or it may be another unrelated protein. In either case, the perturbagen provides a critical probe for isolation of the target protein.
- perturbagens will be derived from components of a well established biochemical pathway, and strong candidates for the perturbagens' targets may be deduced from the identity of the perturbagens themselves.
- additional perturbagen experiments may reveal the identities of targets.
- a second perturbagen experiment using cells that express a perturbagen that inhibits reporter gene expression may provide a clue. If cells that harbor the reporter construct plus the initial perturbagen (now expressed stably using methods similar to those employed to generate the original reporter-containing host cells) are used as host cells for another round of perturbagen genetics, it is sometimes possible to select revertants that express high levels of reporter once again.
- This revertant phenotype may be caused by, among other things, the presence of a second perturbagen in the cells that mimics the behavior of the first perturbagen's target; i.e., a compensatory effect that involves overexpression of the target or a fragment of the target.
- the set of revertant perturbagens (“anti-perturbagens”) may provide clues as to the nature of perturbagen targets.
- the perturbagen approach used in the present invention has the capacity to identify several components of specific genetic pathways in a single selection experiment. This is because the assay is performed using a population of cells, without the need to isolate and grow individual mutants. All cells that harbor perturbagens capable of increasing or decreasing reporter gene expression are collected together, and the family of resident perturbagens can be amplified, e.g., by PCR, for subsequent analysis. Cloning individual nucleic acid fragments is much faster than cloning individual cells and localizing chromosomal mutations within them. In a sense, genetics is performed on the library of perturbagens rather than on the host cells themselves.
- perturbagen-encoding fragments can be examined in further detail using assays other than the reporter gene expression assay used for their isolation.
- the mechanistic basis for perturbagen activity is likely to be of considerable interest.
- the perturbagen may interfere with reporter gene expression by inhibiting the activity of a transcription factor required for reporter gene expression.
- it may interfere upstream of the transcription factor in a biochemical pathway that leads to activation of the set of transcription factors required for reporter gene expression.
- the perturbagen may cause a transformation in cell fate, such that the host cell no longer resembles the original parental cell type, but instead has been converted into a different cell type.
- Other possible modes of perturbagen disruption that lead to decreased or increased reporter gene expression can be envisioned. These can be sorted out later using cell biological, genetic, and biochemical methods known in the art (Ausubel, et al., 1994; Sambrook et al., 1989).
- perturbagen inserts may be isolated that affect reporter gene expression.
- groups askin to classical “complementation groups”
- anti-perturbagens may be selected as bright revertants of dim cells containing a perturbagen isolated during the first round of selection experiments described supra.
- Example 1 infra If the original reporter gene is inducible (see Example 1 infra), it may be simpler to select perturbagens that are bright in the absence of the inducing signal (i.e., they promote constitutive activation). In either case there are now two sets of perturbagens with opposite phenotypes; one class makes cells dim and the other reverses this phenotype. By introducing all possible pairs of “dim-” and “bright-” inducing perturbagens into the host cells and examining the resulting reporter expression levels, it is possible to group perturbagens (and thus their cellular targets) by common response. If it is desirable to order the pathway in detail, methods using conditional perturbagens (hot and cold sensitive) may be employed according to the strategy described by Jarvik J. and Botstein D. ( Proc. Natl. Acad. Sci ( USA ). 70: 2046-2050 (1973); Proc. Natl. Acad. Sci. ( USA ) 72: 2738-2742 (1975)).
- perturbagens isolated in the fashion described herein may lead directly to new therapeutic molecules.
- the goal is not necessarily to identify perturbagens that have a single specific effect on expression of the reporter gene, e.g., by interfering with the function of the reporter itself. Rather, the goal is through this means to identify perturbagens that have more general effects on cell physiology, including but not limited to cell type transformations.
- Such perturbagens may be relevant to disease therapy because they disrupt specific pathways in cells which have profound phenotypic and physiological consequences.
- These perturbagens and their associated cellular targets may serve to identify novel therapeutic targets in cells, an extremely valuable commodity in the medical arena.
- Perturbagens isolated using the procedures described supra may be further refined in two senses.
- perturbagens that are improved variants of members of the original perturbagen library may be isolated by accidental or deliberate mutation or recombination during the process of selection and enrichment.
- the perturbagens may be passed through additional genetic screens and selections that enrich for those that have more desirable properties in terms of cell-specific activity.
- Such conditions are known in the art (Ausubel et al, 1994) and provide a means for evolving perturbagens that, e.g., are active at lower concentrations and/or demonstrate increased selectivity in cells compared to perturbagens expressed by the original library; thus, they perform better as perturbagens.
- reporter linked e.g., to a second tissue- or cell-type-specific promoter that behaves in the host cells in a manner similar to the first reporter gene promoter may be used to reject perturbagens that affect the host cells in a reporter- or promoter-specific manner, and do not have a more profound effect on the state of the cell.
- a different reporter joined to the first promoter may be used.
- perturbagens that have general, non-specific effects on gene expression may be identified and/or removed by passing perturbagen sub-libraries or individual perturbagen-encoding sequences through a different host cell, unrelated to the first host cell, with a different host-cell-specific promoter.
- Perturbagens isolated as described supra behave in a transdominant mode similar to traditional small molecule pharmaceutical compounds. Thus, in certain cases they may serve much the same function as small molecule therapeutics though it may be necessary to ensure intracellular delivery and expression by gene therapy technology.
- perturbagens in association with their cellular targets, provide the basis for high-throughput in vitro screens for small molecule mimics that have properties similar to the original perturbagen; namely, they bind specifically to the perturbagen target and disrupt the target's function in vivo. Such molecules may have effects on cells similar to the perturbagens used in the screen.
- a system for assessing protein-protein interactions and their inhibition in a cell in vivo, e.g., in a bacterial, fungal, plant, insect, or mammalian cell, or in vitro.
- This system referred to as a small molecule displacement assay, can be used to screen libraries of small molecules to identify specific compounds that disrupt perturbagen/target interactions. This use of perturbagens and their cognate targets is described in detail in co-pending U.S. patent application of Kamb, C. A. (Docket No. 8835-004-999).
- yeast mating pheromone a-factor to a specific 7-transmembrane-domain-containing G-protein-coupled receptor (the product of the STE2 gene) on the surface of yeast cells of a mating type activates a signaling pathway that culminates in cell-cycle arrest and the preparation of the cell for mating to an a cell (FIG. 6).
- This well-characterized signaling pathway (reviewed in Bardwell L., Cook J. G., Inouye C. J., Thomer J., Dev. Biol.
- the promoterless yeast plasmid pRS416-GFP (disclosed in the co-pending application by Carl Alexander Kamb filed Feb. 14, 1997 titled, “Methods for identifying, characterizing, and evolving cell-type specific cis regulatory elements”) contains the GAL1 TATA box (minus the GAL upstream activation sequences, UAS) upstream of the coding sequence of a GFP variant which expresses well in yeast.
- This plasmid can replicate and be selected in yeast (CEN and ARS, URA3) and E. coli (ColE1, AmpR) and has a unique Bg1II site upstream of the GAL1 TATA box for inserting DNA promoter-containing fragments.
- the GFP expression is rendered a-factor responsive by cloning into the Bg1II site 4 copies of the a-factor-responsive element (as a synthetic oligo), a PCR fragment containing bases ⁇ 259 to upstream of the Fus1 gene (Hagen D. C., McCaffrey G., Sprague G. F. Jr., Mol. Cell Biol. 11: 6, 2952-61 (1991)) or, alternatively, any other a-factor-responsive cis regulatory element isolated from a genomic library that has been screened to identify such elements according to the methods described in the co-pending U.S. patent application by Carl Alexander Kamb filed Feb.
- the construct can be introduced into the yeast genome using techniques known in the art (Ausubel et al., 1996; Rothstein R. J., Methods Enzymol. 101: 202-211, (1983)). Briefly, endogenous pathways of homologous recombination are used in vivo to insert an expression vector that lacks an ARS/CEN but contains a selectable marker in addition to the reporter expression cassette. A region of yeast DNA homology is introduced into the vector and the vector is cut with a restriction enzyme that produces a linear molecule, the ends of which contain homology with a yeast chromosomal region.
- Transformation with this linear material results in recruitment of homologous recombination machinery and generates a large number of transformants that contain the expression vector inserted into the chromosomal region of homology.
- Such an expression vector is inherited stably along with the chromosome within which it resides. Individual transformants can be tested to ensure that they continue to express the reporter as they were intended.
- Standard techniques are used to construct a library of yeast genomic DNA fragments in a yeast/ E. coli shuttle vector such as pRS315 (Sikorski R. S., Hieter P., Genetics 122: 1, 19-27 (1989)).
- This vector contains LEU2 as a selectable marker in yeast.
- Four separate libraries may be made to present the perturbagen in different contexts or cellular compartments. In all four cases there is a GAL1 promoter upstream of the inserted genomic fragment in order to drive its expression in a galactose-dependent fashion.
- the coding sequence for Blue Fluorescent Protein (BFP) (Quantum Biotechnologies, Inc., Laval, Canada; Anderson M. T., Tjioe I. M., Lorincz M. C., Parks D. R., Seas D. R., Seas D. R., Seas D. R., Seas D. R., Seas D. R., Seas D. R., Seas D. R., Seas D. R., Miberg L. A., Nolan G. P., Miberg L. A., Proc. Natl. Acad. Sci. ( USA ) 93: 16, 8508-8511 (1996)) is located downstream of the GAL promoter and upstream of the insertion site to allow translational fusions between BFP and the inserted coding sequence (see FIG. 8).
- BFP Blue Fluorescent Protein
- the secreted form of invertase is the fusion partner; this allows export into the secretion pathway of the perturbagens and may provide a mechanism for isolating perturbagens that have activity when secreted outside the cell or when otherwise consigned to the secretory pathway.
- the GAL4 protein a well established fusion partner (Fields S. and Song O.-K., U.S. Pat. No. 5,283,173), is fused to the perturbagen; this facilitates import of the perturbagen into the nucleus.
- there is no fusion partner for the perturbagen sequence this allows production of “stand alone” perturbagens.
- Each of the perturbagen libraries described above is introduced into separate cell populations containing the a-factor-responsive GFP vector.
- the selectable markers used on the perturbagen and reporter plasmids are different so that both can be maintained in the same cell (e.g., URA3 and LEU2).
- the reporter construct can be integrated into the chromosome (which has advantages due to more uniform levels of reporter gene expression in the population of cells).
- a perturbagen that specifically blocks the a-factor signaling pathway should reduce fluorescence of these cells in a galactose-dependent fashion.
- the perturbagen sub-library can be further tested to ensure that, e.g., expression of particular perturbagens does not simply kill cells. This manipulation provides a convenient counterscreen to increase the probability that the perturbagens are specific for the targeted biochemical pathway involving a-factor arrest.
- perturbagens that have the opposite effect; namely, they increase reporter expression in the absence of a-factor and the presence of galactose.
- Such perturbagens may be isolated by screening for perturbagen-containing cells that are bright in the presence of galactose and the absence of a-factor.
- TRP-2 DOPAchrome tautomerase/tyrosinase-related protein 2
- TRP-2 DOPAchrome tautomerase/tyrosinase-related protein 2
- Mtf melanotransferrin
- MITF microphthalmia-associated transcription factor
- the associated regulatory elements of these genes provide the basis for designing melanoma cell-specific reporters that involve fusion of a reporter gene to the cis regulatory sequences of a melanoma-specific gene.
- Tyrosinase encodes an enzyme involved in the conversion of tyrosine into the polymeric, light-absorbing pigment melanin. Regulatory sequences in the human tyrosinase gene are particularly well characterized. Transfection experiments have determined that a promoter fragment located between 1.8-2.7 kilobase pairs upstream of the tyrosinase transcriptional initiation site is sufficient to confer expression specifically in melanoma pigment-producing cells (Shibata K., Muraosa Y., et al., 1992). Further deletion analysis identified a pigment-cell specific enhancer contained on a 200 base pair fragment located 1.8-2.0 kilobase pairs upstream of the start site. A 39-base pair core element was sufficient to confer melanoma cell-specific expression.
- the promoter region defined in the series of experiments described supra is used to direct expression of a reporter gene (GFP in this case) specifically in human melanoma cells.
- a reporter gene GFP in this case
- Numerous such cultured cell lines are available (Satyamoorthy K., DeJesus E., et al., Melanoma Res. (In press)), many of which (e.g., HS294T) grow well in culture and can be used in the experiments described in this example.
- the promoter region may include the entire 2.7 kilobase pairs upstream of the human tyrosinase gene, or the 200 base pair fragment located upstream of a TATA box sequence (FIG. 3). Based on the published literature, such a construct should be selectively active in melanoma cells and not in, e.g., fibroblast cells.
- the fusion construct consisting of tyrosinase regulatory sequences joined to the GFP reporter will be introduced in an expression vector such that GFP is expressed at high levels in the host cells. Selection for stable expressers will be applied using, e.g., the dominantly selectable marker for neomycin resistance carried on the expression vector such as that shown in FIG. 3. Stable expressers will be selected using techniques known in the art (Ausubel et al., 1996), and the population of GFP-expressing cells will be verified by flow cytometry. A suitable clone, characterized by high, stable expression of GFP will be employed in subsequent experiments.
- the library consists of cDNA fragments (derived from, e.g., randomly primed human fetal brain mRNA) or random peptide-encoding sequences carried on an expression vector that, e.g., may be derived from a typical mammalian expression vector such as that shown in FIG. 2.
- the library is under control of CMV sequences.
- the library is introduced into the host cells using standard protocols for electroporation (Ausubel et al., 1996). The specific conditions are chosen to optimize nucleic acid transfer (see Example 3).
- the cells are then passed through a flow sorter device such as a FACS to collect cells that are dim (i.e., express levels of GFP that are lower than the mean level of GFP expression in the host cells that lack perturbagens, or are lower than the mean level of GFP expression exhibited by the bulk population of host cells, many of which express perturbagens).
- the resident perturbagen-encoding DNA inserts contained within the dim cells are recovered by, e.g., PCR amplification using primer sites that flank the perturbagen insert sequences. These perturbagen fragments are recloned in the expression vector and the sub-library is reintroduced into the reporter-bearing host cells.
- This cycling process is continued a sufficient number of times to generate a reasonably pure set of perturbagen fragments that have the effect, when introduced singly into host cells, of depressing GFP expression.
- Such fragments can be characterized further, including determination of their DNA sequences and examination of their effects on the gross phenotype of the cell.
- a common feature of advanced melanomas is high level expression of the adhesion molecule beta-3 integrin (Varner J. A. and Cheresh D. A., Curr. Opin. Cell Biol. 8: 724-730 (1996)). This provides an example of how the invention disclosed herein can be used to identify perturbagens (and perturbagen targets) involved in the expression of specific cell surface molecules.
- Standard GFP expression vectors such as those sold by Clontech (Palo Alto, Calif.) provide a convenient method to assess the results of different electroporation conditions.
- the GFP expression vectors are introduced into the cells using a variety of voltages and capacitances and the cells are returned to culture for a period (typically one day) sufficient to permit recovery of the cells and expression of the transferred DNA.
- the cells are then analyzed by a flow sorter such as a FACS to determine the percentage of cells that are bright; i.e., the fraction that have accepted the transferred DNA. Conditions are selected that maximize this number for further experiments.
- a perturbagen expression library of the type described in Example 2 is introduced into the melanoma host cells using the conditions defined above. After one to three days, the cells are collected, stained with the monoclonal antibody directed against beta-3 integrin, and labeled with a secondary fluorescently-labeled antibody that allows indirect visualization of the beta-3 integrin on the cells by binding the Fc domain of the first antibody (Robinson J. P., Darzynkiewicz Z., et al., (Eds.), Current Protocols in Flow Cytometry, John Wiley and Sons, New York (1997); Ausubel et al., 1996).
- a flow sorter e.g., a FACS
- the collected cells are lysed and their perturbagen inserts are recovered by PCR for either another cycle of enrichment or for sequence analysis. In either case the inserts are recloned in E. coli before proceeding.
- Individual perturbagen fragments identified through the above procedure are analyzed further to ensure that many (preferably the majority) have the expected properties when tested singly, as opposed to being part of a population. The majority of such fragments, when introduced alone into the melanoma cells, should cause a decrease in the level of beta-3 integrin protein expressed at the cell surface.
- the DNA sequences of these fragments can be determined and used to explore the public sequence databases to check if they match a known protein.
- the results of such a search may provide valuable information about the nature of the perturbagen interaction in cells (i.e., the mechanism of the effect) and may point to the perturbagen target in vivo.
- the perturbagen target may also be found using the method of two-hybrid analysis in S. cerevisiae as described in (Fields S. and Song O.-K., U.S. Pat. No. 5,283,173; Serrano et al., 1993).
Abstract
Methods for identifying nucleic acid sequences that affect a cellular phenotype are disclosed. The method uses a reporter gene whose level of expression correlates with the phenotype in conjunction with a method or device for measuring the level of reporter expression. An expression library is introduced into the cells, and those cells exhibiting changes in reporter expression level are selected. Expression library inserts from the selected cells are isolated, thereby providing a sub-library enriched for sequences that affect the phenotype reflected by the reporter. Further rounds of sub-library introduction and cell selection may be carried out to provide additional enrichment. Sequences identified using this method may be used to ascertain the identity of additional molecules involved in generating the cellular phenotype.
Description
- This application is a continuation of Ser. No. 09/320,080, filed on May 26, 1999, which is a continuation of Ser. No. 08/812,994, filed Mar. 4, 1997 (U.S. Pat. No. 5,955,275 issued Sep. 21, 1999 which is a continuation-in part of Ser. No. 08/800,664, filed Feb. 14, 1997.
- The present invention comprises a general procedure applicable in virtually any cell type for identification of nucleic acid sequences that perturb specific biochemical pathways within a cell.
- Genetic methods have played a major role in efforts to understand the molecular basis for biological phenomena. For example, genetic analysis of the fruit fly,D. melanogastor, provided the entry point for isolation of numerous genes that regulate the formation of the fly body. These genes in turn served as probes for isolation of mammalian homologs that have been the primary tools in molecular studies of vertebrate development.
- A variety of genetic and biochemical studies have proved that virtually any biological process (i.e., cell behaviors and the like) can be broken down into components. This reductionist approach to biological inquiry aims to understand the greater part of life's complexity in the relatively simple chemical terms of molecules and molecular interactions. In the middle part of the twentieth century, several scientists, perhaps most notably George Beadle, showed that metabolism can be understood as a series of enzymes that act sequentially to convert precursor compounds into the final metabolic products. This insight gave rise to the notion of genetic or biochemical pathways that control cellular processes. More complicated cellular behaviors such as differentiation have recently been defined in terms of genetic programs and pathways. Even disease processes can be thought of in such terms. For example, cancer is a disease characterized by loss of cellular growth control. An effective strategy to study cancer involves the elucidation of cellular growth regulation pathways. Many genes involved in growth control have been identified and substantial progress has been made in understanding the genetic/biochemical circuitry of these component genes.
- Some organisms are especially tractable in genetic studies. These organisms typically are either unicellular, or have short life cycles, small genomes, and a variety of other useful features. Other organisms, such as humans, are less tractable. For tractable experimental organisms, two basic approaches to mutant isolation are available. The first method, termed screening, involves the sometimes painstaking inspection of thousands of individual organisms or clones of cells. Those that have the appropriate mutant phenotype are separated from the others and permitted to grow in isolation. In this manner, homogeneous populations of mutants can be grown and analyzed. The second approach involves growth of organisms under conditions that favor the survival of variant phenotypes over the wild type phenotype. In the case of microorganisms, the selection conditions often involve nutritional requirements or resistance to drugs.
- The classical models for genetic studies includeE. coli, S. cerevisiae, D. melanogastor, and M. musculus. These organisms share certain features that facilitate genetic studies. First, they can be used to screen and/or select for interesting phenotypic variants (mutants). Second, they can be manipulated in such a way that the underlying genes responsible for specific mutant phenotypes can be localized and isolated by molecular cloning methods. These features permit the analysis of genes in cases where detailed biochemical information about the process under study is unavailable. All that is required at the outset is a tractable experimental organism and a phenotype that can be scored or selected.
- In certain organisms such as humans which are of great interest, but in which classical genetic methods of selective breeding cannot be applied, it is still possible to use genetic analysis to identify genes. The techniques are somewhat different and involve retrospective phenotypic and genotypic analysis of kindreds that segregate traits of interest. Such kindreds can be used to determine the approximate location of genes that affect the trait of interest. This approach relies heavily on aspects of heredity that involve sexual reproduction, segregation, and recombination. From rough mapping information, the responsible gene(s) can often be isolated (Miki Y., Swensen J., et al.,Science 266: 66-71 (1994)).
- Cultured cells from multicellular organisms, as well as single-celled organisms, offer the great advantage that genetic studies can be performed on the simplest unit of life, the cell. In many microorganisms, genetic methods are suitably advanced so that detailed genetic analysis of a wide variety of phenotypic traits is possible. In other organisms such as humans, however, genetic studies in cultured cells are still very difficult. Though cultured somatic cells have provided the route to identification of several important human genes, somatic cells have traits that seriously limit their utility. They are diploid; hence mutants with a recessive phenotype are rarely observed. They reproduce clonally; hence it is not possible generally to map interesting mutations. They are often heterogeneous; hence, each cell in a supposedly identical population of cells may differ slightly in phenotype from another cell for a variety of genetic and epigenetic reasons. They do not lend themselves to a large variety of selection schemes. Genetic methods that can mitigate against these problems in human cells would be particularly valuable.
- Genes regulate some of the most medically and commercially important processes in biology. A long list of human diseases are caused by mutations or malfunctions of specific genes. Cancer may be the most familiar example, as it involves the sequential alteration of proto-oncogenes and tumor suppressor genes as tumors progress through stages of malignancy (Fearon E. R. and Vogelstein B.,Cell 61: 759-767 (1990)). Methods capable of identifying the underlying genes that regulate important biological processes such as tumor progression would thus be of great value.
- For the foregoing reasons, a general method of genetic analysis in cultured cells is needed. The method should be simple, rapid, and permit identification of components of genetic pathways that regulate traits of interest. It should circumvent many of the obstacles that have interfered with genetic analysis in certain cells and organisms. It should not require an understanding of the detailed basis of a particular phenotype, or the mechanisms that underlie specific cellular behaviors. The method should be generally applicable to a great variety of cells, including cells cultured from somatic tissues of multicellular organisms, and it should sidestep certain disadvantages of somatic cell genetics, including the diploid character of most cells, the difficulty of isolating mutant genes once mutations have been induced, and the heterogeneity of many cell populations.
- The present invention is directed to a method of genetic analysis that satisfies the need for a simple, rapid, and general way to identify components of genetic pathways that regulate traits of interest. The method involves the use of three basic tools: (1) a reporter gene that reflects the phenotypic state of a particular cell; (2) a selection device or method that permits rapid quantitative measurement of the expression levels of the reporter molecule on a cell-by-cell basis; and (3) an expression library, preferably of proteins, protein fragments, or peptides (“perturbagens”), that can be introduced into the chosen cell population (host cells). The reporter gene is typically contained in a construct that places it under the control of a specific cis regulatory element whose activity correlates with the trait of interest. This construct is introduced into a population of host cells such that it is stably maintained and expressed. A genetic library constructed in a second expression vector is introduced into the host cells that harbor the reporter gene construct. This second expression library generates perturbagens in the host cells. The host cells are analyzed using a method or device that quantitatively detects reporter expression levels. Cells with reporter gene expression levels that are decreased or increased relative to the expression observed in cells that contain only the stably expressed reporter, without the perturbagens, are selected and their library inserts are isolated.
- The reporter serves as a surrogate for the cellular phenotype and thus must be chosen carefully to reflect the relevant phenotypic state as closely as possible. The reporter may be an endogenous gene, preferably encoding a cell surface marker, expressed by cells with the phenotype of interest, or it may be a foreign gene placed under the control of a cell-type-specific or cell-state-specific promoter that is active in the cells under study. The reporter is expressed in the host cells at a level sufficient to permit its rapid and quantitative determination.
- Perturbagens are molecules that act in a transdominant mode to interfere with the function of endogenous cellular components. In the present invention, perturbagens are typically proteinacious: proteins, protein fragments, or peptides; though perturbagens may also be nucleic acids. By expressing perturbagens in cells, it is possible to disrupt specific normal interactions, thus generating a “phenocopy” of a mutant phenotype; that is, although no mutations are created by the method, the function of specific cellular constituents is affected as if the genes encoding these proteins were altered by mutation. Perturbagen genetic libraries are introduced into the host cells that harbor the reporter expression construct in such a way that a single type of each perturbagen (or a small number of different perturbagens) is expressed in a host cell.
- The selection device or method is used to screen rapidly through millions of cells that harbor the reporter gene construct for variants that express altered levels of the reporter and to sort (or select) those variant cells away from the majority of cells that express normal levels. This selected population that expresses altered levels of the reporter is used in turn to isolate the resident perturbagens by, e.g., PCR (Ausubel F. M., Brent R., et al.,Current Protocols in Molecular Biology, John Wiley and Sons, New York (1996)). The selection procedure results in enrichment of the initial population of cells harboring the perturbagen library for cells that contain perturbagen fragments that affect reporter gene expression. The sub-library of perturbagen fragments that influence reporter gene expression can be reintroduced into the host cells and the process of screening/selection can be repeated. The whole cycle is repeated as many times as necessary to obtain a relatively pure sub-library of perturbagen-encoding inserts which, when introduced into the host cells, causes altered reporter gene expression. Each of these perturbagen fragments can be isolated and studied individually.
- Because the selection occurs at the population level, and further enrichment cycles are simple to perform, the time associated with gene isolation is greatly reduced. In addition, this approach diminishes the chance that a particular perturbagen isolated according to the methods described herein acts idiosyncratically in a minority of host cells. Screens/selections for virtually any phenotype are possible, limited only by the fidelity with which the reporter represents the cell phenotype of interest.
- Perturbagen fragments isolated in this manner produce phenocopies; i.e. they generate the equivalent of genetic mutations. Each fragment encodes a perturbagen that affects expression of the reporter. In principle, any component of the genetic pathway that leads to reporter gene expression is vulnerable to perturbagen disruption. For example, the reporter gene may be expressed only in the presence of a specific transcription factor. If the perturbagen sequesters this factor, or acts upstream of the factor to reduce its activity, reporter gene expression will be reduced. The present invention also can be used to generate a perturbagen disruption that causes a phenotypic transformation such that the original cell type is converted into a different cell type in which the reporter gene is not expressed. Such a perturbagen identifies a master switch; a single molecule capable of dictating the phenotype of the cell.
- A cloned perturbagen-encoding sequence may rapidly give direct and indirect information about the pathway it affects. If the perturbagen is derived from a gene or gene fragment, it may be related to a previously identified component of the pathway and its sequence may reveal its identity. The target of the perturbagen may be a second component of that pathway, whose identity can be inferred. Alternatively, the target molecule can be identified by techniques known in the art such as the yeast two-hybrid screen (See Fields S. and Song O. -K., U.S. Pat. No. 5,283,173) or by “suppressor” perturbagen methods outlined infra (Jarvik J. and Botstein D.,Proc. Natl. Acad. Sci. (USA) 72: 2738-2742 (1975)). Thus, a few selection experiments performed on several millions of cells should enable identification of most or all of the components of a particular pathway which are vulnerable to this type of disruption. Finally, if these components are involved in a process of commercial significance, the perturbagen provides a tool to develop valuable reagents either directly, or as a substrate for screening.
- These and other features, aspects, and advantages of the present invention will become better understood with regard to the following description, appended claims, and accompanying drawings.
- FIG. 1: Perturbagen cartoon. Three examples of intermolecular interactions illustrating: first, a complex between two native proteins in the cell; second, a complex between a small molecule inhibitor of the interaction between the native proteins; and third, a complex between a protein fragment perturbagen derived from the native protein that is the normal binding partner of the other. This perturbagen is expected to behave in a manner similar to the small molecule inhibitor. Furthermore, the perturbagen/target complex serves as the basis for a screen to identify small molecule mimics.
- FIG. 2: Mammalian perturbagen expression vectors that use autofluorescent proteins (GFP or BFP) as fusion partners for the expressed proteins. MCS is a multiple cloning site for insertion of individual sequences or genetic libraries. Either of the illustrated vectors may be used for perturbagen expression library construction.
- FIG. 3: Reporter gene expression vector for mammalian cells with a “crippled” promoter that contains only the TATA box from the CMV promoter and lacks an enhancer; cis regulatory elements can be inserted upstream of the TATA box in the Bg1II or BamH1 sites.
- FIG. 4: Flow chart of the process of genetic analysis disclosed in the invention. The reporter expression construct (in this case, the GFP gene) (1) is introduced into the chosen host cells and a stable expresser is selected (2). This reporter-expressing line is clonally expanded to generate a population that is, in this case, bright green (4). A perturbagen library (3) is introduced into the host cells to generate a population of reporter gene-containing cells (5), many of which also express perturbagens. This population is examined using a flow sorter device (6) and cells are sorted into two populations: cells (7) that continue to express reporter protein levels similar to the cell in #4.; and, cells (8) that express, in this case, reduced levels of the reporter. The perturbagen inserts (9) from such “dim” cells are isolated and either used to determine their DNA sequences (10), or reintroduced into the reporter-containing host cells (11) for another cycle of selection and enrichment.
- FIG. 5: Flow sorter profile diagram. A cartoon depicting the fluorescence intensity distribution of a population of host cells containing perturbagens prior to selection. This presorted population is used to select cells on the left tail of the distribution (in black fill) or on the right tail (in gray fill). If, for example, the dim cells on the left are selected and perturbagens from these cells are reintroduced into the original host cells, the fluorescence intensity distribution that ensues from cells that harbor such a sub-library of sequences becomes skewed to the left (i.e., the mean fluorescence intensity decreases).
- FIG. 6: Genetic pathway involved in a-factor arrest ofS. cerevisiae a cells. The cell and nuclear membranes are represented as gray circles; the protein components of the pathway are depicted as rounded objects of various types including rectangles and triangles. a-factor is the triangle labeled with “a” outside the a cell. Interactions among components that lead to activation are represented as arrows; interactions that lead to inhibition are depicted as blunt-ended lines.
- FIG. 7: Expression vector used in yeast as a reporter to identify perturbagens that affect a-factor responsiveness. Three possible inserts upstream of the GFP gene are depicted which depend on the strategy used. The first strategy involves use of four tandemly arrayed a-factor response elements; the second uses the promoter of the FUS1 gene; the third uses genomic DNA selected to confer a-factor responsiveness.
- FIG. 8: Expression vector to express perturbagens in yeast. “Genomic DNA” refers to the perturbagen-encoding inserts. Four strategies are used to generate fusion proteins with the perturbagen inserts: 1. Blue Fluorescent Protein (BFP); 2. GAL4 sequences; 3. invertase sequences; 4. no fusion partner sequence.
- The terms “genetic library” or “library” are interchangeably used to refer to a collection of nucleic acid fragments that may individually range in size from about a few base pairs to about a million base pairs. These fragments are contained as inserts in vectors capable of propagating in certain host cells such as bacterial, fungal, plant, insect, or mammalian cells.
- The term “sub-library” refers to a portion of a genetic library that has been isolated by application of a specific screening or selection procedure.
- The term “coverage” in the context of a genetic library refers to the level of redundancy of the library. This redundancy is in turn related to the probability that a specific sequence within the nucleic acid sequences that the library is intended to represent is actually present. Coverage is the ratio of the number of library inserts multiplied by the average insert size to the total complexity of the nucleic acid sequences that the library represents.
- The term “vector” refers to a nucleic acid sequence that is capable of propagating in particular host cells and can accommodate inserts of foreign nucleic acid. Typically, vectors can be manipulated in vitro to insert foreign nucleic acids and the vectors can be introduced into host cells such that the inserted nucleic acid is transiently or stably present in the host cells.
- The term “expression vector” refers to a vector designed to express inserted nucleic acid sequences. Such vectors may contain a powerful promoter located upstream of the insertion site.
- The term “expression” in the context of nucleic acids refers to transcription and/or translation of nucleic acids into mRNA and/or protein products.
- The term “expression library” refers to a library of nucleic acid fragments contained as inserts in an expression vector.
- The term “stable expression” refers to the continued presence and expression of a nucleic acid sequence in a host cell for a period of time that is at least as long as that required to carry out the methods of the present invention. Stable expression can be achieved through integration of the construct into a host cell chromosome, or engineering the construct so that it possesses elements that ensure its continued replication and segregation within the host (i.e., an artificial chromosome), or alternatively, the construct may contain a selectable marker (e.g., a drug resistance gene) so that stable expression of the construct is ensured by growing the host cells under selective conditions (e.g., in drug-containing media).
- The term “collection of nucleic acid fragments” refers to a set of nucleic acid molecules from any source. For example, a collection of nucleic acid fragments may comprise total genomic DNA, genomic DNA from one or more chromosomes, cDNA that has been reverse-transcribed from total cellular RNA or from messenger RNA (mRNA), total cellular RNA, mRNA, or a set of nucleic acid molecules synthesized in vitro either individually, or using combinatorial methods. Unless otherwise limited, the term encompasses nucleic acid molecules comprising known analogs of natural nucleotides that can function in a similar manner as naturally occurring nucleotides.
- The term “insert” in the context of a library refers to an individual DNA fragment that constitutes a single member of the library.
- The term “host cell” refers to a cell of prokaryotic, archaebacterial, or eukaryotic origin that can serve as a recipient for a vector that is introduced by any one of several procedures. The host cell often allows replication and segregation of the vector that resides within. In certain cases, however, replication and/or segregation are irrelevant; expression of vector or insert DNA is the objective. Typical bacterial host cells includeE. coli and B. subtilis; archaebacterial host cells include S. acidocaldarius and H. salinarium; fungal host cells include S. cerevisiae and S. pombe; plant cells include those isolated from A. thaliana, and Z. maize; insect host cells include those isolated from D. melanogastor, A. aegypti, and S. frugiperda; and mammalian cells include those isolated from human tissues and cancers including melanocyte (melanoma), colon (carcinoma), prostate (carcinoma), and brain (glioma, neuroblastoma, astrocytoma).
- The term “reporter gene” refers to nucleic acid sequences for which screens or selections can be devised. Reporter genes may encode proteins (“reporters”) capable of emitting light such as GFP (Chalfie M., Tu Y, et al.,Science February 11; 263 :802-805 (1994)), or luciferase (Gould S. J., and Subramani S., Anal. Biochem. November 15; 175: 5-13 (1988)), or genes that encode intracellular or cell surface proteins detectable by antibodies such as CD20 (Koh J., Enders G. H., et al., Nature 375: 506-510 (1995)). Preferably, the reporters allow the activity of cis regulatory sequences to be monitored in a quantitative manner. Alternatively, reporter genes can confer antibiotic resistance such as hygromycin or neomycin resistance (Santerre R. F., et al., Gene 30: 147-156 (1984)).
- The terms “bright” and “dim” in the context of a cell sorter refer to the intensity levels of fluorescence (or other modes of light emission) exhibited by particular cells. Bright cells have high intensity emission relative to the bulk population of cells, and by inference, high levels of reporter gene expression; dim cells have low intensity emission relative to the bulk population.
- The term “genetic pathway” refers to a set of proteins (or the genes that encode them) that act in concert, or sequentially, to accomplish a specific biochemical function or cellular behavior.
- The terms “cis regulatory sequence,” “cis sequence,” “regulatory sequence,” or “regulatory element” are interchangeably used to refer to a nucleic acid sequence that affects the expression of itself or other sequences physically linked on the same nucleic acid molecule. Such sequences may alter gene expression by affecting such things as transcription, translation, or RNA stability. Examples of cis regulatory sequences include promoters, enhancers, or negative regulatory sequences (Alberts B., Bray D., et al. (Eds.),Molecular Biology of the Cell, Second Edition, Garland Publishing, Inc., New York and London, (1989); Lewin B, Gene V, Oxford University Press, Oxford, U.K. (1994)).
- The term “perturbagen” refers to an agent that acts in a transdominant mode to interfere with specific biochemical processes in cells. In the context of the present invention, perturbagens are typically either proteins, protein fragments, or peptides, although the term also encompasses nucleic acids and other organic molecules with similar properties.
- The term “transdominant” describes a type of interaction whereby the agent (most typically a perturbagen) is a diffusable substance that can bind its target in solution. Thus, a transdominant agent is dominant as opposed to recessive in a genetic sense, because, e.g., it acts on gene products and not on alleles of genes. The effects of a perturbagen are visible in the presence of wild type alleles of its target.
- The term “phenocopy” refers to a phenotypic state or appearance that mimics or resembles the state induced by mutation of a specific gene or genes. This state may, for example, be induced by expression of perturbagens within a particular host cell.
- The term “target” in the context of a perturbagen refers to the molecule in the cell (typically a protein) to which the perturbagen binds to exert its effect on cellular phenotype.
- The term “flow sorter” refers to a machine that analyzes light emission intensity from cells or other objects and separates these cells or objects according to parameters such as light emission intensity.
- Overview
- The present invention comprises methods to identify components of genetic pathways in cultured cells from plants and animals, or unicellular organisms such as yeast, bacteria, and fungi. Three basic tools are involved: (1) a reporter gene under the control of a specific cis regulatory element that reflects the phenotypic state of a particular cell; (2) a selection device or method that permits rapid quantitative measurement of the expression levels of the reporter molecule on a cell-by-cell basis; and (3) an expression library of proteins, protein fragments, or peptides (“perturbagens”) that can be introduced into the chosen cell population. Sequences are isolated from the expression library based on their ability to alter the activity of the cis regulatory sequence, as read out by the reporter expression level. The method thus comprises a set of tools and techniques that together permit the identification of components of genetic pathways using a pseudo-genetic approach. The method of the invention can be used in human cells, but it can also be modified easily for use in other mammalian cells, in plant cells, in arthropod cells, and in fungi, archaebacteria and bacteria.
- Reporter Genes
- Numerous reporter genes have been appropriated for use in expression monitoring and in promoter/enhancer trapping. A reporter comprises any gene product for which screens or selections can be applied. Reporter genes used in the art include the LacZ gene fromE. coli (Shapiro S. K., Chou J., et al., Gene November; 25: 71-82 (1983)), the CAT gene from bacteria (Thiel G., Petersohn D., and Schoch S., Gene February 12; 168: 173-176 (1996)), the luciferase gene from firefly (Gould S. J., and Subramani S., 1988), and the GFP gene from jellyfish (Chalfie M. and Prashner D. C., U.S. Pat. No. 5,491,084). This set has been primarily used to monitor expression of genes in the cytoplasm. A different family of genes has been used to monitor expression at the cell surface, e.g., the gene for lymphocyte antigen CD20. Normally a labeled antibody is used that binds to the cell surface marker (e.g., CD20) to quantify the level of reporter (Koh J., Enders G. H., et al., 1995).
- Of these reporters, autofluorescent proteins (e.g., GFP) and the cell surface reporters are potentially of greatest use in monitoring living cells, because they act as “vital dyes.” Their expression can be evaluated in living cells, and the cells can be recovered intact for subsequent analysis. Vital dyes, however, are not specifically required by the methods of the present invention. It is also very useful to employ reporters whose expression can be quantified rapidly and with high sensitivity. Thus, fluorescent reporters (or reporters that can be labeled directly or indirectly with a fluorophore) are especially preferred. This trait permits high throughput screening on a flow sorter machine such as a fluorescence activated cell sorter (FACS).
- GFP is a member of a family of naturally occurring fluorescent proteins, whose fluorescence is primarily in the green region of the spectrum. GFP has been developed extensively for use as a reporter and several mutant forms of the protein have been characterized that have altered spectral properties (Cormack B. P., Valdivia R. H., and Falkow S.,Gene 173: 33-38 (1996)). High levels of GFP expression have been obtained in cells ranging from yeast to human cells. It is a robust, all-purpose reporter, whose expression in the cytoplasm can be measured quantitatively using a flow sorter instrument such as a FACS.
- Genetic Libraries
- Genetic libraries typically involve a collection of DNA fragments, usually genomic DNA or cDNA, but sometimes synthetic DNA or RNA, that together represent all or some portion of a genome, a population of mRNAs, or some other set of nucleic acids that contain sequences of interest. Typically, genetic libraries represent sequences in a form that can be manipulated. A total genomic DNA library in principle includes all the sequences present in the genome of an organism propagated as a collection of cloned sequences. It is often desirable to generate a library that is as representative of the input population of nucleic acids as possible. For example, sequences that are present at one to one ratios in the input population (e.g., genome) are present in the library in the same proportion. To achieve reasonable (e.g., >99% predicted) representation of the nucleic acid sequences that the library is intended to contain, a library should have at least 5-fold coverage; that is, the library should contain at least 5-fold excess of total inserts beyond the total number required theoretically to cover the collection of nucleic acid sequences one time. For example, if the library is intended to represent the genome of an organism, the coverage, i.e., the total number of inserts multiplied by the mean insert size divided by the genome complexity, must be at least five. Typically libraries are propagated in vectors that grow in bacterial cells, although eukaryotic cells such as yeast and even human cells can also serve as hosts.
- The mean insert size of a library is a variable that can be manipulated within rather broad limits that depend on vector and cell types, among other things. For example, some vectors such as bacterial plasmids accommodate small inserts ranging from a few nucleotides to a few kilobase pairs, whereas others such as yeast artificial chromosomes can accommodate insert sizes that exceed 1,000 kilobase pairs.
- The present invention preferably uses genetic libraries that contain inserts on the smaller end of the spectrum. These inserts would most typically be derived from genomes or transcripts of particular organisms, or from synthetic DNA, and would range from, e.g., 10 base pairs to 10 kilobase pairs. The libraries most typically would have coverage that, if possible, exceeded five-fold. The details of library construction, manipulation, and maintenance are known in the art (Ausubel F., Brent R., et al., 1996; Sambrook J., Fritsch E. F., and Maniatis, T.,Molecular Cloning: A Laboratory Manual, Second Edition, CHSL Press, New York (1989)). In one embodiment of the invention, a library is created according to the following procedure using methods that are well-known in the art. Double stranded cDNA is prepared from random primed MRNA isolated from a particular cell type or tissue. These fragments are treated with enzymes to repair their ends and are ligated into the expression vector described infra. The ligated material is introduced into E. coli and clones are selected. A number of individual clones sufficient to achieve reasonable coverage of the mRNA population (e.g., one million clones) is collected, and grown in mass culture for isolation of the resident vectors and their inserts. This process allows large quantities of the library DNA to be obtained in preparation for subsequent procedures described infra.
- In specific embodiments of the invention, it is preferable to use non-natural nucleic acid as the starting material for the library. For example, it may be desirable to use a population of synthetic oligonucleotides, e.g., representing all possible sequences of length N, or a subset of all possible sequences, as the input nucleic acid for the library. In addition, it may be desirable to use mixtures of natural and non-natural nucleic acids for library inserts.
- Nucleic Acid Transfer
- During the last two decades several basic methods have evolved for transferring exogenous nucleic acid into cultured host cells. These methods are well-known in the art (Ausubel F., Brent R.; et al., 1996; Sambrook J., et al., 1989). Some methods give rise primarily to transient expression in host cells; i.e., the expression is gradually lost from the cell population. Other methods can also generate cells that stably express the transferred nucleic acid, though the percentage of stable expressers is typically lower than transient expressers. Such methods include viral and non-viral mechanisms for nucleic acid transfer. In the case of viral transfer, a viral vector is used to carry nucleic acid inserts into the host cell. Depending on the specific virus type, the introduced nucleic acid may remain as an extrachromosomal element (e.g., adenoviruses, Amalfitano A., Begy C. R., and Chamberlain J. S.,Proc. Natl. Acad. Sci. (USA) 93: 3352-3356 (1996)) or maybe incorporated into a host chromosome (e.g., retroviruses, Iida A., Chen S. T., et al., J. Virol. 70: 6054-6059 (1996)).
- In the case of non-viral nucleic acid transfer, many methods are available (Ausubel F., Brent R., et al., 1996). One technique for nucleic acid transfer is CaPO4 coprecipitation of nucleic acid. This method relies on the ability of nucleic acid to coprecipitate with calcium and phosphate ions into a relatively insoluble CaPO4 grit, which settles onto the surface of adherent cells on the culture dish bottom. The precipitate is, for reasons that are not clearly understood, absorbed by some cells and the coprecipitated nucleic acid is liberated inside the cell and expressed. A second class of methods employs lipophilic cations that are able to bind DNA by charge interactions while forming lipid micelles. These micelles can fuse with cell membranes, dumping their DNA cargo into the host cell where it is expressed. A third method of nucleic acid transfer is electroporation, a technique that involves discharge of voltage from the plates of a capacitor through a buffer containing DNA and host cells. This process disturbs the bilayer sufficiently that DNA contained in the bathing solution is able to penetrate the cell membrane. A fourth method involves cationic polymers such as DEAE dextran which mediate DNA entry and expression in cultured cells. A fifth method employs ballistic delivery of DNA contained in ice crystals or adsorbed to the surface of miniature projectiles that are shot into cells. Finally, microinjection of DNA can be used, though it is typically quite slow and labor intensive.
- Several of these methods often result in the transfer of multiple DNA fragments into individual cells. It is often difficult to limit the quantity of DNA taken up by a single cell to one fragment. However, methods are known in the art to minimize transfer of multiple fragments. For example, by using “carrier” nucleic acid (e.g., DNA such as herring sperm DNA that contains no sequences relevant to the experiment), or reducing the total amount of DNA applied to the host cells, the problem of multiple fragment entry can be reduced. In addition, the invention does not specifically require that each recipient cell have a single type of library sequence. Multiple passages of the library through the host cells (see below), permit sequences of interest to be separated ultimately from sequences that may be present initially as bystanders. Alternatively, it may be useful to take advantage of the feature that many methods of gene transfer into somatic cells deliver multiple copies. This trait may permit more library sequences to be screened in a smaller number of cells, especially since perturbagens act in a transdominant mode; i.e. if a particular cell contains several different perturbagens, one of which alters expression of the reporter, this cell should be collected during screening and the active perturbagen should be recovered (along with others which have no effect).
- If it is desirable to carry out genetic experiments on bacterial or fungal cells, a variety of techniques are also available for gene transfer. Electroporation is a particularly flexible method for nucleic acid delivery applicable to most cell types including prokaryotes, fungi, plant and animal cells. In addition, certain mixtures of specific salts can be used with some cells to facilitate DNA entry. For example, CaCl2 works well with E. coli and LiOAc works well with S. cerevisiae.
- Perturbagens
- One of the great shortcomings of somatic cell genetics involves the difficulty with which recessive mutations can be observed. The problem can be formulated in statistical terms. If mutations occur in one allele at a frequency of, e.g., one in one million, then the chance that two independent mutations will occur, one in each allele, is the product: one in a trillion. Thus, dominant or codominant mutations are much more readily observed in general. Because of the recessive nature of the vast majority of mutations, somatic cell genetics is limited largely to study of dominant alterations such as overexpression.
- Perturbagens typically are proteins, protein fragments, or peptides (though they may be nucleic acids) that bind other proteins in the cell and thereby disrupt specific biochemical pathways (see FIG. 1). Nature generates perturbagen-like molecules by chance in the case of a certain class of dominant, gain-of-function mutations and in specific cases dominant negative mutant genes have been designed (Herskowitz I.,Nature 329: 219-222 (1987)). In the present invention, this mode of biochemical/genetic disruption is harnessed and applied in a directed fashion to identify and recover important genes.
- Perturbagens can be constructed in a variety of ways. They may be generated from randomly-primed, size-selected cDNA, sheared or digested genomic DNA, synthetic DNA or other sources of nucleic acid. They may be expressed in cells without any additional protein sequences joined to them. Alternatively, they may be fused to other proteins, e.g., GFP or yeast GAL4, by standard methods of molecular cloning (Ausubel et al., 1996). In addition, they may be presented as insertion sequences within specific proteins.
- Perturbagen libraries can be constructed using techniques similar to construction of conventional gene and expression libraries as described supra. Such libraries, when introduced into cells with standard vectors such as viruses or by other means, act in a manner analogous to mutagens; that is, the perturbagens induce a phenocopy state in the host cells which mimics the mutant state, but does not directly involve alterations to host cell DNA sequences. The value of perturbagens is based on the ease with which they can be generated and screened, and the readiness with which the perturbagen sequences can be recovered and used to identify elements in the genetic pathways of interest. Furthermore, they act in a mode similar to small molecule therapeutics. Indeed, they are simply the protein equivalent of a small molecule, and they can be used in combination with their targets (binding partners) to screen for small molecule mimics that affect cells in a manner similar to the original protein perturbagen.
- In the present invention, perturbagen expression libraries comprised of, e.g., fragmented genomic DNA, random-primed cDNA, or synthetic DNA of random sequence are introduced into host cells engineered to contain a reporter gene under the control of a cell-type-specific cis regulatory sequence. Alternatively, a natural reporter consisting of a membrane protein (or intracellular protein) for which good specific antibodies are available may be used, provided the expression of this protein correlates with a phenotype of interest. Cells harboring perturbagens are screened by a rapid and quantitative method or device, such as a flow sorter, e.g., a FACS, to identify the population of cells that have altered expression of the reporter. These are collected for analysis as described infra.
- Cis Regulatory Elements
- In order to drive perturbagen expression in host cells of a particular type, a generic promoter capable of conferring robust, high or moderately high expression is required. These promoters are typically derived from housekeeping genes that are expressed at reasonably high levels in most or all cell types in the organism, or from viruses. Numerous such cis regulatory sequences are known in the art, suitable for driving expression in mammalian cells, insect cells, plant cells, fungi or bacteria (Ausubel et al., 1996; vector database located at: http://www.atcg.com/vectordb/). For example, in eukaryotes the promoter for beta actin is useful (Qin Z., Kruger-Krasagakes S., et al.,J. Exp. Med. 178: 355-360); in plants the Cauliflower Mosaic Virus 35S promoter (Goddijn O. J., Pennings E. J., et al., Transgenic Res. 4: 315-323 (1995)); and in general, a promoter that drives high level expression of, e.g., a housekeeping or viral gene can be identified with relative ease using current molecular genetic methods.
- To identify cis regulatory sequences that drive reporter gene expression, it is necessary to choose or select sequences that have the appropriate characteristics; that is, because the reporter is intended to act as a surrogate for the phenotypic trait(s) under study, it must be regulated in a manner that approximates the phenotype as closely as possible. Many such sequences are known in the art as tissue-specific regulatory elements (Lewin B., (1994)). Alternatively, such regulatory sequences can be identified by standard procedures that involve: first, isolation of cell- or tissue-specific genes using procedures of differential display, subtractive hybridization, representative difference analysis, and others (Ausubel et al., 1996; and see for discussion: Kamb A., Feldhaus M. J., “Method for the comparative assessment of relative amounts of nucleic acids,” U.S. patent application Attorney Docket No. 8835-0005-999); second, the cis regulatory elements that are responsible for the pattern of gene expression can be elucidated by application of standard methods of promoter/enhancer analysis including generation of deletion and linker scanned mutants, and expression assays in cells (Lewin B., 1994; Latchman D. S.,Eukaryotic Transcription Factors, Second Edition, Academic Press, London (1996); McKnight S. L. and Yamamoto K. R., Transcriptional Regulation, CHSL Press, New York (1992)). In addition, genetic methods that fall under the general name of enhancer/promoter traps can be employed to find cis sequences with particular characteristics (see discussion in Ruley and von Melchner, U.S. Pat. No. 5,364,783; Bellen H. J., O'Kane C. J., et al., Genes Dev. 3: 1288-1300 (1989)). Finally, methods for genetic selections of regulatory sequences that have predetermined characteristics as described in the co-pending United States patent application of Kamb C. A. titled, “Methods for identifying, characterizing, and evolving cell-type specific cis regulatory elements” (Attorney Docket No. 20410-701), may also be applied to identify useful cis sequences for driving reporter gene expression. The goal is to choose a cis regulatory sequence that is active under the conditions of interest, either by genetic methods, biochemical methods, or by reference to known genes that have the desired expression characteristics. For example, if one desires to study the process of pathogenesis in a particular pathogenic organism, it may be useful to commandeer a promoter that is only active in cells competent for pathogenic invasion of the host.
- Expression Vectors
- Expression vectors are used in the invention to produce RNA, proteins, protein fragments, or peptides derived from sequences (genes and gene fragments) that are introduced into host cells. The sequences include reporter genes used as a surrogate for the phenotypic state of the cell, and sequences that encode the perturbagens. There are numerous expression vectors known in the art which are readily available for use in the present invention (Ausubel F. M., Brent R., et al., 1996; Sambrook J. et al., 1989). Some of these are tailored for use in specific cell types, but most are designed to be used in a wide variety of cell types. In mammalian cells, viral transcriptional regulatory elements are a typical choice for driving expression of exogenous genes. For the purposes of the present invention involving perturbagen expression in mammalian host cells, an expression vector that contains a reporter gene flanked downstream by a poly(A) addition sequence, e.g., derived from the SV40 TAg gene, may be used. This type of expression vector is illustrated in FIG. 2. The perturbagen-encoding sequence may be flanked upstream of its initiation codon by a TATA box, capable of binding RNA polymerase II (Pol II), and by an enhancer that preferably confers high expression on the linked perturbagen-encoding sequences. In addition to cis regulatory sequences that are constitutively active such as those in powerful viral promoters, the expression vector preferably includes a site appropriate for insertion of perturbagen-encoding library sequences. Such library sequences preferably involve generation of a fusion protein with, e.g., BFP, though native protein domains or protein fragments may also be employed. The choice of which, if any, perturbagen fusion partner to use depends on, e.g., if cytoplasmic, nuclear, or extracellular expression of the perturbagen is desired. The vector, if it is of viral origin, may not require propagation in a bacterial host.
- However, more typically the vector requires propagation in, e.g.,E. coli, and contains sequences necessary for replication and selection in E. coli such as a colE1 replicon and an antibiotic resistance gene.
- For prokaryotic and archaebacterial host cells, cis regulatory sequences are chosen according to similar criteria as discussed above. For the perturbagen expression vector, cis regulatory sequences are included upstream of the perturbagen-encoding sequences that cause robust, preferably high expression levels. These sequences are thus, preferably, of a generic type present, e.g., upstream of housekeeping genes. InE. coli for example, a suitable sequence is the consensus promoter that consists of a −10 box and a −35 box (Alberts B., Bray D., et al., 1989; Lewin B., 1994).
- In contrast to the perturbagen expression vector, the reporter vector is customized so that reporter expression reflects as closely as possible the phenotypic state of the host cell under study. Thus, the expression vector is designed such that the reporter gene (e.g., GFP) is placed under the control of cis regulatory sequences that confer cell-type specific expression, and/or reflect the activation of specific biochemical pathways within the cell. For example, FIG. 3 shows a mammalian expression vector that can be used to insert foreign cis regulatory sequences upstream of the TATA box from the CMV promoter, generating GFP expression under the control of the chosen regulatory element. Such regulatory sequences are known in the art (Lewin B., 1994 and see supra), or they can be identified using methods disclosed in the co-pending U.S. patent application by Carl Alexander Kamb filed Feb. 14, 1997 titled, “Methods for identifying, characterizing, and evolving cell-type specific cis regulatory elements,” Attorney Docket No. 20410-701.
- Enrichment for Phenocopy Variants Induced by Perturbagen Expression
- The combination of genetic libraries and genetic selection or screening techniques permits identification of specific sequences from libraries based on their functions in living cells. This strategy has been used frequently in molecular biology to clone genes based on expression, e.g., by complementation of a mutant phenotype (e.g., Yocum R. R. and Johnston M.,Gene 32: 75-82 (1984)). The premise of the strategy is that an appropriately constructed library can be introduced into suitable host cells and the effects of the library sequences can be monitored. For example, a particular host cell may die in a particular environment in the absence of a certain gene; the host cell will only grow when a library insert that includes the gene is present. Alternatively, screens can be employed to pick out the library sequences that confer a particular phenotype. For example, the T8 (Leu-2) gene was isolated by a protocol that involved expression in cultured cells, labeling by a fluorescent antibody, and enrichment by FACS of T8-expressing cells (Kavanthes P., Sukhatme V. P., et al., Proc. Natl. Acad. Sci. (USA) 81: 7688-7692 (1984)).
- The present invention may use a flow sorter such as a FACS or equivalent device to screen through large numbers of host cells harboring perturbagen library inserts to identify those that have a particular phenotype; namely, cells that have reduced or elevated levels of reporter molecule expression. If the perturbagen library is introduced into host cells engineered to express the reporter (e.g., GFP) in a stable context, the large majority of cells that are analyzed by FACS are expected to have normal (e.g., high) levels of reporter expression. However, a small number may exhibit reduced expression, detected on the FACS as cells that fall on the dimmer side of the cell fluorescence distribution. These dim cells can be collected and grown in isolation of the others. See FIGS. 3 and 4. Such a procedure results in enrichment from the starting population of perturbagen-containing cells for those that contain perturbagens that reduce the level of reporter expression. These selected, dim cells can be used to reisolate the perturbagen fragments by, e.g., PCR using primer sites that flank the library inserts, so as to build a sub-library of perturbagen fragments enriched for those that cause reduced reporter expression. The sub-library of fragments can be recloned (using e.g., the same expression vector) and reintroduced into the host cells, and the screening/selection process can be repeated as many times as necessary.
- After a sufficient number of cycles, a substantial difference should be observed in the fluorescence intensity distribution of the original reporter-containing host cells as compared to the host cells harboring the enriched perturbagen sub-library inserts. Preferably, the procedure should be repeated until a minimal overlap is observed between these two fluorescence intensity distributions. Ultimately, the process of FACS sorting and cycling should result in a population of perturbagen fragments that, e.g., inhibit expression of the reporter. These can be isolated and studied individually by molecular cloning and DNA sequence analysis. If a sufficient number of cycles has been carried out, many, preferably most, separate fragments should produce roughly the same effect on reporter expression in the host cells as the effect produced by the enriched population from which they were isolated.
- Perturbagen Targets
- The targets of perturbagens in cells are as interesting as the perturbagens themselves. It is expected that most perturbagens exert their phenotypic effect on cells by binding another specific protein, thus inhibiting its function. The other protein may be a wild type counterpart of the perturbagen (e.g., in the case of protein homomultimers), or it may be another unrelated protein. In either case, the perturbagen provides a critical probe for isolation of the target protein.
- With present technology such as the yeast two-hybrid system or other genetic or biochemical approaches known in the art, it is possible to identify the relevant target molecules in the cell (Fields S. and Song O.-K., U.S. Pat. No. 5,283,173; Serrano M., Hannon G. J., et al.,Nature 366, 704-707 (1993); Ausubel, et al., 1996). It is also possible that the perturbagen sequence may reveal the probable identity of the target, based on existing knowledge of biochemical pathways and comparisons with sequence databases; for example the sequence of a specific perturbagen can be used to search a public database such as GenBank. Any “hits” that reveal database sequences with homologies that exceed a threshold for statistical significance can be carefully studied, and their biological roles can be investigated in the published literature. In some cases perturbagens will be derived from components of a well established biochemical pathway, and strong candidates for the perturbagens' targets may be deduced from the identity of the perturbagens themselves.
- In certain cases, additional perturbagen experiments may reveal the identities of targets. For example, a second perturbagen experiment using cells that express a perturbagen that inhibits reporter gene expression may provide a clue. If cells that harbor the reporter construct plus the initial perturbagen (now expressed stably using methods similar to those employed to generate the original reporter-containing host cells) are used as host cells for another round of perturbagen genetics, it is sometimes possible to select revertants that express high levels of reporter once again. This revertant phenotype may be caused by, among other things, the presence of a second perturbagen in the cells that mimics the behavior of the first perturbagen's target; i.e., a compensatory effect that involves overexpression of the target or a fragment of the target. Thus, the set of revertant perturbagens (“anti-perturbagens”) may provide clues as to the nature of perturbagen targets.
- Genetic Pathways
- The perturbagen approach used in the present invention has the capacity to identify several components of specific genetic pathways in a single selection experiment. This is because the assay is performed using a population of cells, without the need to isolate and grow individual mutants. All cells that harbor perturbagens capable of increasing or decreasing reporter gene expression are collected together, and the family of resident perturbagens can be amplified, e.g., by PCR, for subsequent analysis. Cloning individual nucleic acid fragments is much faster than cloning individual cells and localizing chromosomal mutations within them. In a sense, genetics is performed on the library of perturbagens rather than on the host cells themselves.
- Individual perturbagen-encoding fragments can be examined in further detail using assays other than the reporter gene expression assay used for their isolation. The mechanistic basis for perturbagen activity is likely to be of considerable interest. For example, the perturbagen may interfere with reporter gene expression by inhibiting the activity of a transcription factor required for reporter gene expression. Alternatively, it may interfere upstream of the transcription factor in a biochemical pathway that leads to activation of the set of transcription factors required for reporter gene expression. Finally, the perturbagen may cause a transformation in cell fate, such that the host cell no longer resembles the original parental cell type, but instead has been converted into a different cell type. Other possible modes of perturbagen disruption that lead to decreased or increased reporter gene expression can be envisioned. These can be sorted out later using cell biological, genetic, and biochemical methods known in the art (Ausubel, et al., 1994; Sambrook et al., 1989).
- During the course of a particular experiment, several perturbagen inserts may be isolated that affect reporter gene expression. Using further rounds of perturbagen selection it is possible to place the perturbagens into groups (akin to classical “complementation groups”) based on the step in the pathway that they affect, and even to order those steps. The first stage in this process involves generating a new set of anti-perturbagens that act to increase the reporter expression. If the original reporter gene is constitutively expressed in the absence of perturbagens, for instance, then anti-perturbagens may be selected as bright revertants of dim cells containing a perturbagen isolated during the first round of selection experiments described supra. If the original reporter gene is inducible (see Example 1 infra), it may be simpler to select perturbagens that are bright in the absence of the inducing signal (i.e., they promote constitutive activation). In either case there are now two sets of perturbagens with opposite phenotypes; one class makes cells dim and the other reverses this phenotype. By introducing all possible pairs of “dim-” and “bright-” inducing perturbagens into the host cells and examining the resulting reporter expression levels, it is possible to group perturbagens (and thus their cellular targets) by common response. If it is desirable to order the pathway in detail, methods using conditional perturbagens (hot and cold sensitive) may be employed according to the strategy described by Jarvik J. and Botstein D. (Proc. Natl. Acad. Sci (USA). 70: 2046-2050 (1973); Proc. Natl. Acad. Sci. (USA) 72: 2738-2742 (1975)).
- Note that perturbagens isolated in the fashion described herein may lead directly to new therapeutic molecules. The goal is not necessarily to identify perturbagens that have a single specific effect on expression of the reporter gene, e.g., by interfering with the function of the reporter itself. Rather, the goal is through this means to identify perturbagens that have more general effects on cell physiology, including but not limited to cell type transformations. Such perturbagens may be relevant to disease therapy because they disrupt specific pathways in cells which have profound phenotypic and physiological consequences. These perturbagens and their associated cellular targets may serve to identify novel therapeutic targets in cells, an extremely valuable commodity in the medical arena.
- Additional Manipulations Designed to Improve Perturbagen Specificity
- Perturbagens isolated using the procedures described supra may be further refined in two senses. First, perturbagens that are improved variants of members of the original perturbagen library may be isolated by accidental or deliberate mutation or recombination during the process of selection and enrichment. Second, the perturbagens may be passed through additional genetic screens and selections that enrich for those that have more desirable properties in terms of cell-specific activity.
- In the first case, amplification of DNA by, e.g., PCR is known to introduce sequence changes during the replication process (Cline J., Braman J. C., et al.,Nucleic Acids Res. 24, 3546-3551 (1996)). This can lead to sequence variants in subsequent experiments, some of which may have useful properties. For example, they may interfere more effectively with reporter gene expression than the original perturbagen in the library. These perturbagens will be identified by conferring a phenotype of, e.g., even lower reporter expression. Alternatively, it may be desirable to evolve improved variants of existing perturbagens by deliberately subjecting the amplification process to conditions that enhance mutation and/or recombination of the nucleic acid by, e.g., in vitro mutagenesis, error-prone PCR, or recombinational PCR (Stemmer W. P., Nature 370, 389-391 (1994)). Such conditions are known in the art (Ausubel et al, 1994) and provide a means for evolving perturbagens that, e.g., are active at lower concentrations and/or demonstrate increased selectivity in cells compared to perturbagens expressed by the original library; thus, they perform better as perturbagens.
- In the second case, it may be desirable to passage the sub-library of perturbagen fragments that have been isolated by application of the principles described supra through additional screens to enrich for those with improved selectivity for particular biochemical pathways. For instance, trivial effects on reporter expression, or general effects on gene expression and/or cell viability may be detected or eliminated by appropriate secondary screens. If desired, reporters linked, e.g., to a second tissue- or cell-type-specific promoter that behaves in the host cells in a manner similar to the first reporter gene promoter may be used to reject perturbagens that affect the host cells in a reporter- or promoter-specific manner, and do not have a more profound effect on the state of the cell. Alternatively, a different reporter joined to the first promoter may be used. In addition, perturbagens that have general, non-specific effects on gene expression may be identified and/or removed by passing perturbagen sub-libraries or individual perturbagen-encoding sequences through a different host cell, unrelated to the first host cell, with a different host-cell-specific promoter.
- Small Molecule Displacement Screen Based on Perturbagen-target Interactions
- Perturbagens isolated as described supra behave in a transdominant mode similar to traditional small molecule pharmaceutical compounds. Thus, in certain cases they may serve much the same function as small molecule therapeutics though it may be necessary to ensure intracellular delivery and expression by gene therapy technology. In addition, perturbagens, in association with their cellular targets, provide the basis for high-throughput in vitro screens for small molecule mimics that have properties similar to the original perturbagen; namely, they bind specifically to the perturbagen target and disrupt the target's function in vivo. Such molecules may have effects on cells similar to the perturbagens used in the screen.
- In a specific embodiment of the invention, a system is used for assessing protein-protein interactions and their inhibition in a cell in vivo, e.g., in a bacterial, fungal, plant, insect, or mammalian cell, or in vitro. This system, referred to as a small molecule displacement assay, can be used to screen libraries of small molecules to identify specific compounds that disrupt perturbagen/target interactions. This use of perturbagens and their cognate targets is described in detail in co-pending U.S. patent application of Kamb, C. A. (Docket No. 8835-004-999).
- Identification of Perturbagens that Modulate the a-Factor Signaling Pathway in Yeast a Cells
- The binding of yeast mating pheromone a-factor to a specific 7-transmembrane-domain-containing G-protein-coupled receptor (the product of the STE2 gene) on the surface of yeast cells of a mating type activates a signaling pathway that culminates in cell-cycle arrest and the preparation of the cell for mating to an a cell (FIG. 6). This well-characterized signaling pathway (reviewed in Bardwell L., Cook J. G., Inouye C. J., Thomer J.,Dev. Biol. 166: 2, 363-379 (1994); Herskowitz I., Cell 80: 2, 187-197, (1995)) involves activation of a MAP kinase cascade and the transcriptional induction of at least 6 genes. Analysis of the promoters of some of these genes has identified a sequence element that is necessary and sufficient for induction. The method of the invention can be applied to identify perturbagens that block the a-factor signaling pathway and thus prevent the a-factor-dependent induction of specific genes.
- Construction of an a-Factor-responsive GFP Reporter Plasmid
- The promoterless yeast plasmid pRS416-GFP (disclosed in the co-pending application by Carl Alexander Kamb filed Feb. 14, 1997 titled, “Methods for identifying, characterizing, and evolving cell-type specific cis regulatory elements”) contains the GAL1 TATA box (minus the GAL upstream activation sequences, UAS) upstream of the coding sequence of a GFP variant which expresses well in yeast. This plasmid can replicate and be selected in yeast (CEN and ARS, URA3) andE. coli (ColE1, AmpR) and has a unique Bg1II site upstream of the GAL1 TATA box for inserting DNA promoter-containing fragments. The GFP expression is rendered a-factor responsive by cloning into the
Bg1II site 4 copies of the a-factor-responsive element (as a synthetic oligo), a PCR fragment containing bases−259 to upstream of the Fus1 gene (Hagen D. C., McCaffrey G., Sprague G. F. Jr., Mol. Cell Biol. 11: 6, 2952-61 (1991)) or, alternatively, any other a-factor-responsive cis regulatory element isolated from a genomic library that has been screened to identify such elements according to the methods described in the co-pending U.S. patent application by Carl Alexander Kamb filed Feb. 14, 1997 titled, “Methods for identifying, characterizing, and evolving cell-type specific cis regulatory elements,” (Attorney Docket Number 20410-701) (see FIG. 7). When this construct is introduced into yeast and the cells are exposed to a-factor, they show increased fluorescence either by microscopy or FACS analysis compared to the same cells grown in the absence of a-factor. Thus this construct satisfies the conditions necessary for a reporter that can be employed in the invention disclosed herein; namely, the reporter responds in a manner that reflects the relevant phenotypic state of the cell and/or cell environment. - As an alternative to carrying the reporter gene on a centromere-containing plasmid in yeast, the construct can be introduced into the yeast genome using techniques known in the art (Ausubel et al., 1996; Rothstein R. J.,Methods Enzymol. 101: 202-211, (1983)). Briefly, endogenous pathways of homologous recombination are used in vivo to insert an expression vector that lacks an ARS/CEN but contains a selectable marker in addition to the reporter expression cassette. A region of yeast DNA homology is introduced into the vector and the vector is cut with a restriction enzyme that produces a linear molecule, the ends of which contain homology with a yeast chromosomal region. Transformation with this linear material results in recruitment of homologous recombination machinery and generates a large number of transformants that contain the expression vector inserted into the chromosomal region of homology. Such an expression vector is inherited stably along with the chromosome within which it resides. Individual transformants can be tested to ensure that they continue to express the reporter as they were intended.
- Construction of a Yeast Genomic DNA Perturbagen Library
- Standard techniques are used to construct a library of yeast genomic DNA fragments in a yeast/E. coli shuttle vector such as pRS315 (Sikorski R. S., Hieter P., Genetics 122: 1, 19-27 (1989)). This vector contains LEU2 as a selectable marker in yeast. Four separate libraries may be made to present the perturbagen in different contexts or cellular compartments. In all four cases there is a GAL1 promoter upstream of the inserted genomic fragment in order to drive its expression in a galactose-dependent fashion.
- In one vector the coding sequence for Blue Fluorescent Protein (BFP) (Quantum Biotechnologies, Inc., Laval, Canada; Anderson M. T., Tjioe I. M., Lorincz M. C., Parks D. R., Herzenberg L. A., Nolan G. P., Herzenberg L. A.,Proc. Natl. Acad. Sci. (USA) 93: 16, 8508-8511 (1996)) is located downstream of the GAL promoter and upstream of the insertion site to allow translational fusions between BFP and the inserted coding sequence (see FIG. 8). In a second case the secreted form of invertase is the fusion partner; this allows export into the secretion pathway of the perturbagens and may provide a mechanism for isolating perturbagens that have activity when secreted outside the cell or when otherwise consigned to the secretory pathway. In a third case the GAL4 protein, a well established fusion partner (Fields S. and Song O.-K., U.S. Pat. No. 5,283,173), is fused to the perturbagen; this facilitates import of the perturbagen into the nucleus. In a fourth case there is no fusion partner for the perturbagen sequence; this allows production of “stand alone” perturbagens.
- Analysis of the Library in (a-Factor-responsive) GFP Reporter-bearing a Cells
- Each of the perturbagen libraries described above is introduced into separate cell populations containing the a-factor-responsive GFP vector. The selectable markers used on the perturbagen and reporter plasmids are different so that both can be maintained in the same cell (e.g., URA3 and LEU2). Alternatively the reporter construct can be integrated into the chromosome (which has advantages due to more uniform levels of reporter gene expression in the population of cells).
- A perturbagen that specifically blocks the a-factor signaling pathway should reduce fluorescence of these cells in a galactose-dependent fashion. The perturbagen sub-library can be further tested to ensure that, e.g., expression of particular perturbagens does not simply kill cells. This manipulation provides a convenient counterscreen to increase the probability that the perturbagens are specific for the targeted biochemical pathway involving a-factor arrest.
- It is also possible to reverse the selection process and identify perturbagens that have the opposite effect; namely, they increase reporter expression in the absence of a-factor and the presence of galactose. Such perturbagens may be isolated by screening for perturbagen-containing cells that are bright in the presence of galactose and the absence of a-factor.
- Note that it is possible to use the BFP perturbagen library in this sort (the second case above) because levels of GFP expression in a cell can be monitored independently of the BFP expression in the same cell by the appropriate use of bandpass filters in the FACS machine. Because the excitation and emission maxima of GFP differ from those of BFP, it is necessary to employ appropriate filters and lasers (Anderson, et al., 1996).
- Pathway that Leads to Expression of the Tyrosinase Gene in Melanoma Cells
- A variety of human melanoma-specific genes have been identified including DOPAchrome tautomerase/tyrosinase-related protein 2 (TRP-2) (Yokoyama K., Yasumoto K., et al.,J. Biol. Chem. 269: 27080-27087 (1994)), melanotransferrin (Mtf) (Duchange N., Ochoa A., et al., Nucleic Acids Res. 20: 2853-2859 (1992)), microphthalmia-associated transcription factor (MITF) (Fuse N., Yasumoto K., et al., Biochem. Biophys. Res. Commun. 219: 702-707 (1996), and tyrosinase (Shibata K., Muraosa Y., et al., J. Biol. Chem. 267:
-
- Construction of a GFP Expression Vector with Tyrosinase Regulatory Elements
- Tyrosinase encodes an enzyme involved in the conversion of tyrosine into the polymeric, light-absorbing pigment melanin. Regulatory sequences in the human tyrosinase gene are particularly well characterized. Transfection experiments have determined that a promoter fragment located between 1.8-2.7 kilobase pairs upstream of the tyrosinase transcriptional initiation site is sufficient to confer expression specifically in melanoma pigment-producing cells (Shibata K., Muraosa Y., et al., 1992). Further deletion analysis identified a pigment-cell specific enhancer contained on a 200 base pair fragment located 1.8-2.0 kilobase pairs upstream of the start site. A 39-base pair core element was sufficient to confer melanoma cell-specific expression.
- The promoter region defined in the series of experiments described supra is used to direct expression of a reporter gene (GFP in this case) specifically in human melanoma cells. Numerous such cultured cell lines are available (Satyamoorthy K., DeJesus E., et al., Melanoma Res. (In press)), many of which (e.g., HS294T) grow well in culture and can be used in the experiments described in this example. The promoter region may include the entire 2.7 kilobase pairs upstream of the human tyrosinase gene, or the 200 base pair fragment located upstream of a TATA box sequence (FIG. 3). Based on the published literature, such a construct should be selectively active in melanoma cells and not in, e.g., fibroblast cells.
- The fusion construct consisting of tyrosinase regulatory sequences joined to the GFP reporter will be introduced in an expression vector such that GFP is expressed at high levels in the host cells. Selection for stable expressers will be applied using, e.g., the dominantly selectable marker for neomycin resistance carried on the expression vector such as that shown in FIG. 3. Stable expressers will be selected using techniques known in the art (Ausubel et al., 1996), and the population of GFP-expressing cells will be verified by flow cytometry. A suitable clone, characterized by high, stable expression of GFP will be employed in subsequent experiments.
- Screen for Perturbagens that Inhibit Tyrosinase Expression
- This host cell line will be used as a recipient for transfer of a perturbagen library of the type described supra. Briefly, the library consists of cDNA fragments (derived from, e.g., randomly primed human fetal brain mRNA) or random peptide-encoding sequences carried on an expression vector that, e.g., may be derived from a typical mammalian expression vector such as that shown in FIG. 2. In this case, the library is under control of CMV sequences. The library is introduced into the host cells using standard protocols for electroporation (Ausubel et al., 1996). The specific conditions are chosen to optimize nucleic acid transfer (see Example 3). The cells are then passed through a flow sorter device such as a FACS to collect cells that are dim (i.e., express levels of GFP that are lower than the mean level of GFP expression in the host cells that lack perturbagens, or are lower than the mean level of GFP expression exhibited by the bulk population of host cells, many of which express perturbagens). The resident perturbagen-encoding DNA inserts contained within the dim cells are recovered by, e.g., PCR amplification using primer sites that flank the perturbagen insert sequences. These perturbagen fragments are recloned in the expression vector and the sub-library is reintroduced into the reporter-bearing host cells. This cycling process is continued a sufficient number of times to generate a reasonably pure set of perturbagen fragments that have the effect, when introduced singly into host cells, of depressing GFP expression. Such fragments can be characterized further, including determination of their DNA sequences and examination of their effects on the gross phenotype of the cell.
- Pathway that Leads to Expression of Beta-3 Integrin in Metastatic Melanoma
- A common feature of advanced melanomas is high level expression of the adhesion molecule beta-3 integrin (Varner J. A. and Cheresh D. A.,Curr. Opin. Cell Biol. 8: 724-730 (1996)). This provides an example of how the invention disclosed herein can be used to identify perturbagens (and perturbagen targets) involved in the expression of specific cell surface molecules.
- Transfer of Perturbagen Libraries into Melanoma Cells
- Melanoma cells that over express beta-3 integrin are used as the departure point for these experiments. When stained with a monoclonal antibody that binds beta-3 integrin, these cells reveal a reproducible high level of expression that is quantitatively distinct from a variety of other cell types that express either low levels of beta-3 integrin, or none at all, e.g., normal melanocytes. The cell line chosen from among the set of high beta-3-integrin-expressing lines described in Satyamoorthy K., DeJesus E., et al.,Melanoma Res. (in press) is first tested to optimize nucleic acid transfer using, e.g., electroporation. Standard GFP expression vectors such as those sold by Clontech (Palo Alto, Calif.) provide a convenient method to assess the results of different electroporation conditions. The GFP expression vectors are introduced into the cells using a variety of voltages and capacitances and the cells are returned to culture for a period (typically one day) sufficient to permit recovery of the cells and expression of the transferred DNA. The cells are then analyzed by a flow sorter such as a FACS to determine the percentage of cells that are bright; i.e., the fraction that have accepted the transferred DNA. Conditions are selected that maximize this number for further experiments.
- Flow Sorter Analysis and Selection of Dim Cells
- A perturbagen expression library of the type described in Example 2 is introduced into the melanoma host cells using the conditions defined above. After one to three days, the cells are collected, stained with the monoclonal antibody directed against beta-3 integrin, and labeled with a secondary fluorescently-labeled antibody that allows indirect visualization of the beta-3 integrin on the cells by binding the Fc domain of the first antibody (Robinson J. P., Darzynkiewicz Z., et al., (Eds.),Current Protocols in Flow Cytometry, John Wiley and Sons, New York (1997); Ausubel et al., 1996). These stained cells are passed through a flow sorter, e.g., a FACS, and the dim fraction of cells is collected. The collected cells are lysed and their perturbagen inserts are recovered by PCR for either another cycle of enrichment or for sequence analysis. In either case the inserts are recloned in E. coli before proceeding. Individual perturbagen fragments identified through the above procedure are analyzed further to ensure that many (preferably the majority) have the expected properties when tested singly, as opposed to being part of a population. The majority of such fragments, when introduced alone into the melanoma cells, should cause a decrease in the level of beta-3 integrin protein expressed at the cell surface. The DNA sequences of these fragments can be determined and used to explore the public sequence databases to check if they match a known protein. The results of such a search may provide valuable information about the nature of the perturbagen interaction in cells (i.e., the mechanism of the effect) and may point to the perturbagen target in vivo. The perturbagen target may also be found using the method of two-hybrid analysis in S. cerevisiae as described in (Fields S. and Song O.-K., U.S. Pat. No. 5,283,173; Serrano et al., 1993).
- The above examples are provided to illustrate the invention but not to limit its scope. Other variants of the invention will be readily apparent to one of ordinary skill in the art and encompassed by the appended claims. All publications, patents, and patent applications cited herein are hereby incorporated by reference.
-
1 1 1 11 DNA Artificial Sequence Hypothetical sequence for illustrative purposes 1 acggtgcata c 11
Claims (10)
1. An assay for a nucleic acid that exerts an effect on a cellular pathway, comprising the steps of:
(a) providing an initial cell population transformed with a reporter construct under the control of a cis-regulatory element related to said cellular pathway;
(b) transfecting said cell population with an expression library;
(c) evaluating reporter expression levels in said cells transfected with said expression library;
(d) selecting a subpopulation of cells with a desired reporter expression level; and
(e) obtaining a sublibrary from said subpopulation of cells, wherein said sublibrary contains at least one nucleic acid encoding a perturbagen that exerts an effect on said cellular pathway.
2. The method of claim 1 , wherein said initial cell population is a mammalian cell population.
3. The method of claim 1 , wherein said step of evaluating comprises fluorescence activated cell sorter analysis of a fluorescent reporter.
4. The method of claim 1 , wherein said effect is activation of said cellular pathway.
5. The method of claim 1 , wherein said effect is inactivation of said cellular pathway.
6. The method of claim 1 , wherein said cellular pathway is growth-related.
7. The method of claim 1 , wherein said perturbagen is an RNA perturbagen.
8. The method of claim 7 , wherein said RNA perturbagen is a non-antisense RNA perturbagen.
9. The method of claim 7 , wherein said RNA perturbagen is a randomly generated RNA perturbagen.
10. The method of claim 9 , wherein said RNA perturbagen is an antisense perturbagen.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/999,003 US20020098503A1 (en) | 1997-02-14 | 2001-11-15 | Methods for identifying nucleic acid sequences encoding agents that affect cellular phenotypes |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US08/800,664 US6623922B1 (en) | 1997-02-14 | 1997-02-14 | Methods for identifying, characterizing, and evolving cell-type specific CIS regulatory elements |
US08/812,994 US5955275A (en) | 1997-02-14 | 1997-03-04 | Methods for identifying nucleic acid sequences encoding agents that affect cellular phenotypes |
US09/320,080 US6579675B2 (en) | 1997-02-14 | 1999-05-26 | Methods for identifying nucleic acid sequences encoding agents that effect cellular phenotypes |
US09/999,003 US20020098503A1 (en) | 1997-02-14 | 2001-11-15 | Methods for identifying nucleic acid sequences encoding agents that affect cellular phenotypes |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/320,080 Continuation US6579675B2 (en) | 1997-02-14 | 1999-05-26 | Methods for identifying nucleic acid sequences encoding agents that effect cellular phenotypes |
Publications (1)
Publication Number | Publication Date |
---|---|
US20020098503A1 true US20020098503A1 (en) | 2002-07-25 |
Family
ID=25211168
Family Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/812,994 Expired - Fee Related US5955275A (en) | 1996-08-19 | 1997-03-04 | Methods for identifying nucleic acid sequences encoding agents that affect cellular phenotypes |
US09/320,080 Expired - Fee Related US6579675B2 (en) | 1997-02-14 | 1999-05-26 | Methods for identifying nucleic acid sequences encoding agents that effect cellular phenotypes |
US09/999,003 Pending US20020098503A1 (en) | 1997-02-14 | 2001-11-15 | Methods for identifying nucleic acid sequences encoding agents that affect cellular phenotypes |
Family Applications Before (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US08/812,994 Expired - Fee Related US5955275A (en) | 1996-08-19 | 1997-03-04 | Methods for identifying nucleic acid sequences encoding agents that affect cellular phenotypes |
US09/320,080 Expired - Fee Related US6579675B2 (en) | 1997-02-14 | 1999-05-26 | Methods for identifying nucleic acid sequences encoding agents that effect cellular phenotypes |
Country Status (11)
Country | Link |
---|---|
US (3) | US5955275A (en) |
EP (1) | EP0973943B1 (en) |
JP (1) | JP2001514510A (en) |
AT (1) | ATE224457T1 (en) |
AU (1) | AU745827B2 (en) |
CA (1) | CA2283261C (en) |
DE (1) | DE69808060T2 (en) |
DK (1) | DK0973943T3 (en) |
IL (1) | IL131583A0 (en) |
NO (1) | NO994290L (en) |
WO (1) | WO1998039483A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017083874A1 (en) * | 2015-11-11 | 2017-05-18 | Serimmune Inc. | Methods and compositions for assessing antibody specificities |
Families Citing this family (81)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6365344B1 (en) * | 1996-01-23 | 2002-04-02 | The Board Of Trustees Of The Leland Stanford Junior University | Methods for screening for transdominant effector peptides and RNA molecules |
AU725716C (en) * | 1996-01-23 | 2003-02-20 | Board Of Trustees Of The Leland Stanford Junior University | Methods for screening for transdominant effector peptides and RNA molecules |
US5955275A (en) * | 1997-02-14 | 1999-09-21 | Arcaris, Inc. | Methods for identifying nucleic acid sequences encoding agents that affect cellular phenotypes |
US20040219569A1 (en) * | 1999-07-06 | 2004-11-04 | Fruma Yehiely | Gene identification method |
US20020086386A1 (en) * | 1997-03-04 | 2002-07-04 | Kamb Carl Alexander | B-catenin assays, and compositions therefrom |
US20020081688A1 (en) * | 1997-03-04 | 2002-06-27 | Kamb Carl Alexander | Retinoid pathway assays, and compositions therefrom |
JPH10260223A (en) * | 1997-03-19 | 1998-09-29 | Fujitsu Ltd | Semiconductor inspection device and inspection method using the same |
US6589730B1 (en) * | 1997-08-29 | 2003-07-08 | Selective Genetics, Inc. | Methods for identifying protein-protein interactions by selective transduction |
US6451527B1 (en) | 1997-08-29 | 2002-09-17 | Selective Genetics, Inc. | Methods using genetic package display for selecting internalizing ligands for gene delivery |
EP1009819A1 (en) * | 1997-08-29 | 2000-06-21 | Selective Genetics, Inc. | Methods using phage display for selecting internalizing ligands for gene delivery |
WO1999024563A1 (en) * | 1997-11-07 | 1999-05-20 | Iconix Pharmaceuticals, Inc. | Surrogate genetics target characterization method |
US7037648B1 (en) * | 1997-11-07 | 2006-05-02 | John Hopkins University | Somatic transfer of modified genes to predict drug effects |
US6228579B1 (en) | 1997-11-14 | 2001-05-08 | San Diego State University Foundation | Method for identifying microbial proliferation genes |
GB9726431D0 (en) * | 1997-12-15 | 1998-02-11 | Dower Steven | Expression cloning and single cell detection of phenotype |
US6897031B1 (en) * | 1998-04-17 | 2005-05-24 | Rigel Pharmaceuticals, Inc. | Multiparameter FACS assays to detect alterations in exocytosis |
US8334238B2 (en) * | 1998-04-17 | 2012-12-18 | Rigel, Inc. | Multiparameter FACS assays to detect alterations in cellular parameters and to screen small molecule libraries |
US6472151B1 (en) * | 1998-08-19 | 2002-10-29 | Bristol-Myers Squibb Company | Method of screening for compounds that modulate the activity of a molecular target |
US7297482B2 (en) | 1998-10-08 | 2007-11-20 | Rigel Pharmaceuticals, Inc. | Structurally biased random peptide libraries based on different scaffolds |
US6180343B1 (en) | 1998-10-08 | 2001-01-30 | Rigel Pharmaceuticals, Inc. | Green fluorescent protein fusions with random peptides |
US6936421B2 (en) | 1998-10-08 | 2005-08-30 | Rigel Pharmaceuticals, Inc. | Structurally biased random peptide libraries based on different scaffolds |
JP2002530074A (en) * | 1998-11-17 | 2002-09-17 | アーカリス | Methods for validating polypeptide targets that correlate with cell phenotype |
WO2000039346A1 (en) * | 1998-12-31 | 2000-07-06 | Iconix Pharmaceuticals, Inc. | Method for generating a pathway reporter system |
US6720139B1 (en) | 1999-01-27 | 2004-04-13 | Elitra Pharmaceuticals, Inc. | Genes identified as required for proliferation in Escherichia coli |
AU774332B2 (en) * | 1999-03-12 | 2004-06-24 | Gpc Biotech Inc. | Methods and reagents for identifying synthetic genetic elements |
US7270969B2 (en) * | 1999-05-05 | 2007-09-18 | Phylogica Limited | Methods of constructing and screening diverse expression libraries |
US7803765B2 (en) * | 1999-05-05 | 2010-09-28 | Phylogica Limited | Methods of constructing biodiverse gene fragment libraries and biological modulators isolated therefrom |
CA2721199A1 (en) * | 1999-05-05 | 2000-11-16 | Phylogica Limited | Isolating biological modulators from biodiverse gene fragment libraries |
AU2041901A (en) * | 1999-11-09 | 2001-06-06 | Elitra Pharmaceuticals, Inc. | Genes essential for microbial proliferation and antisense thereto |
EP1238107A2 (en) | 1999-12-16 | 2002-09-11 | Iconix Pharmaceuticals, Inc. | Random domain mapping |
EP1254212A2 (en) | 2000-02-09 | 2002-11-06 | Genvec, Inc. | Methods of preparing and using a viral vector library |
US6582899B1 (en) * | 2000-02-15 | 2003-06-24 | Deltagen Proteomics, Inc. | Methods for identifying agents that cause a lethal phenotype, and agents thereof |
IL151872A0 (en) * | 2000-03-24 | 2003-04-10 | Micromet Ag | mRNA AMPLIFICATION |
AU2001289284A1 (en) * | 2000-04-04 | 2001-10-15 | Enanta Pharmaceuticals, Inc. | Methods for identifying peptide aptamers capable of altering a cell phenotype |
JP2003534806A (en) * | 2000-05-31 | 2003-11-25 | ジェンベク、インコーポレイティッド | Methods and compositions for targeting adenovirus vectors |
US6410271B1 (en) | 2000-06-23 | 2002-06-25 | Genetastix Corporation | Generation of highly diverse library of expression vectors via homologous recombination in yeast |
US6406863B1 (en) * | 2000-06-23 | 2002-06-18 | Genetastix Corporation | High throughput generation and screening of fully human antibody repertoire in yeast |
US6410246B1 (en) | 2000-06-23 | 2002-06-25 | Genetastix Corporation | Highly diverse library of yeast expression vectors |
CA2429515A1 (en) * | 2000-11-17 | 2002-05-23 | Deltagen Proteomics, Inc. | Retinoid pathway assays, and compositions therefrom |
AU2002236451A1 (en) * | 2000-11-27 | 2002-07-24 | Deltagen Proteomics Inc. | Human rhinovirus assays, and compositions comprising anti-rhinoviral perturbagens |
AU2002230372B2 (en) * | 2001-02-05 | 2008-03-13 | Shen Quan Pan | Methods and kits for identifying scavengers of reactive oxygen species (ROS) |
WO2002070650A2 (en) * | 2001-02-13 | 2002-09-12 | Massachusetts Institute Of Technology | Dynamic whole genome screening methodology and systems |
US20030175685A1 (en) * | 2001-02-22 | 2003-09-18 | Praecis Pharmaceuticals Inc. | Methods for identifying peptides which modulate a biological process |
ES2360205T3 (en) | 2001-03-02 | 2011-06-01 | Agennix Ag | THREE HYBRID TEST SYSTEM. |
AU2002254212A1 (en) * | 2001-03-12 | 2002-09-24 | Irm, Llc | Identification of cellular targets for biologically active molecules |
WO2002072789A2 (en) * | 2001-03-12 | 2002-09-19 | Irm, Llc. | Genomics-driven high speed cellular assays, development thereof, and collections of cellular reporters |
WO2002079493A2 (en) * | 2001-03-29 | 2002-10-10 | Hybrigen, Inc. | Improved hybrid gene libraries and uses thereof |
US7026123B1 (en) | 2001-08-29 | 2006-04-11 | Pioneer Hi-Bred International, Inc. | UTR tag assay for gene function discovery |
US20030134329A1 (en) * | 2001-10-09 | 2003-07-17 | Thea Norman | Cross-species bioactive peptides |
US20040029129A1 (en) * | 2001-10-25 | 2004-02-12 | Liangsu Wang | Identification of essential genes in microorganisms |
US20030170694A1 (en) * | 2001-12-21 | 2003-09-11 | Daniel Wall | Stabilized nucleic acids in gene and drug discovery and methods of use |
US20030143547A1 (en) * | 2002-01-24 | 2003-07-31 | Xianqiang Li | Method for identifying multiple activated transcription factors |
US20030219723A1 (en) * | 2002-05-20 | 2003-11-27 | Lu Henry H. | Compositions and methods for screening and identifying anti-HCV agents |
US20040067532A1 (en) | 2002-08-12 | 2004-04-08 | Genetastix Corporation | High throughput generation and affinity maturation of humanized antibody |
WO2005044978A2 (en) * | 2003-07-15 | 2005-05-19 | Washington University | Methods of treating, preventing and inhibiting cancer metastasis and tumor formation |
ITRM20030376A1 (en) | 2003-07-31 | 2005-02-01 | Univ Roma | PROCEDURE FOR THE ISOLATION AND EXPANSION OF CARDIOC STAMIN CELLS FROM BIOPSIA. |
US7388976B2 (en) * | 2004-03-09 | 2008-06-17 | Siemens Medical Solutions Usa, Inc. | Time-based system to link periodic X-ray images |
US20110218118A1 (en) * | 2004-06-03 | 2011-09-08 | Phylogica Limited | Peptide modulators of cellular phenotype and bi-nucleic acid fragment library |
US11660317B2 (en) | 2004-11-08 | 2023-05-30 | The Johns Hopkins University | Compositions comprising cardiosphere-derived cells for use in cell therapy |
CA2950465A1 (en) | 2006-02-20 | 2007-08-30 | Phylogica Limited | Method of constructing and screening libraries of peptide structures |
EP2074138A4 (en) * | 2006-09-19 | 2009-12-30 | Phylogica Ltd | Neuroprotective peptide inhibitors of ap-1 signaling and uses therefor |
WO2008058273A2 (en) * | 2006-11-09 | 2008-05-15 | The Johns Hopkins University | Dedifferentiation of adult mammalian cardiomyocytes into cardiac stem cells |
EP2170932A4 (en) * | 2007-06-20 | 2012-10-10 | Phylogica Ltd | Compositions and uses thereof for the treatment of acute respiratory distress syndrome (ards) and clinical disorders associated with therewith |
US7550323B2 (en) * | 2007-08-08 | 2009-06-23 | International Business Machines Corporation | Electrical fuse with a thinned fuselink middle portion |
US20090040006A1 (en) * | 2007-08-08 | 2009-02-12 | International Business Machines Corporation | Electrical fuse with enhanced programming current divergence |
BRPI0816785A2 (en) | 2007-09-14 | 2017-05-02 | Adimab Inc | rationally designed synthetic antibody libraries, and uses thereof |
US8877688B2 (en) | 2007-09-14 | 2014-11-04 | Adimab, Llc | Rationally designed, synthetic antibody libraries and uses therefor |
US7838963B2 (en) * | 2007-10-26 | 2010-11-23 | International Business Machines Corporation | Electrical fuse having a fully silicided fuselink and enhanced flux divergence |
EP2424986A1 (en) * | 2009-04-27 | 2012-03-07 | Roswell Park Cancer Institute | Reagents and methods for producing bioactive secreted peptides |
US20130035255A1 (en) | 2010-03-26 | 2013-02-07 | Integratech Proteomics, Llc | Controlled release hybrid systems |
US9845457B2 (en) | 2010-04-30 | 2017-12-19 | Cedars-Sinai Medical Center | Maintenance of genomic stability in cultured stem cells |
US9249392B2 (en) | 2010-04-30 | 2016-02-02 | Cedars-Sinai Medical Center | Methods and compositions for maintaining genomic stability in cultured stem cells |
EP4219805A1 (en) | 2010-07-16 | 2023-08-02 | Adimab, LLC | Antibody libraries |
JP2015521054A (en) | 2012-06-05 | 2015-07-27 | カプリコール,インコーポレイテッド | Optimized methods for generating cardiac stem cells from heart tissue and their use in cardiac therapy |
CA2881394A1 (en) | 2012-08-13 | 2014-02-20 | Cedars-Sinai Medical Center | Exosomes and micro-ribonucleic acids for tissue regeneration |
CA2962444C (en) | 2014-10-03 | 2023-09-05 | Cedars-Sinai Medical Center | Cardiosphere-derived cells and exosomes secreted by such cells in the treatment of muscular dystrophy |
EP3402543B1 (en) | 2016-01-11 | 2021-09-08 | Cedars-Sinai Medical Center | Cardiosphere-derived cells and exosomes secreted by such cells in the treatment of heart failure with preserved ejection fraction |
WO2017210652A1 (en) | 2016-06-03 | 2017-12-07 | Cedars-Sinai Medical Center | Cdc-derived exosomes for treatment of ventricular tachyarrythmias |
EP3515459A4 (en) | 2016-09-20 | 2020-08-05 | Cedars-Sinai Medical Center | Cardiosphere-derived cells and their extracellular vesicles to retard or reverse aging and age-related disorders |
US11043823B2 (en) * | 2017-04-06 | 2021-06-22 | Tesla, Inc. | System and method for facilitating conditioning and testing of rechargeable battery cells |
EP3612191A4 (en) | 2017-04-19 | 2020-12-30 | Cedars-Sinai Medical Center | Methods and compositions for treating skeletal muscular dystrophy |
US11660355B2 (en) | 2017-12-20 | 2023-05-30 | Cedars-Sinai Medical Center | Engineered extracellular vesicles for enhanced tissue delivery |
Family Cites Families (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5135854A (en) * | 1987-10-29 | 1992-08-04 | Zymogenetics, Inc. | Methods of regulating protein glycosylation |
US5283173A (en) * | 1990-01-24 | 1994-02-01 | The Research Foundation Of State University Of New York | System to detect protein-protein interactions |
US5364783A (en) * | 1990-05-14 | 1994-11-15 | Massachusetts Institute Of Technology | Retrovirus promoter-trap vectors |
US5401629A (en) | 1990-08-07 | 1995-03-28 | The Salk Institute Biotechnology/Industrial Associates, Inc. | Assay methods and compositions useful for measuring the transduction of an intracellular signal |
US5217889A (en) * | 1990-10-19 | 1993-06-08 | Roninson Igor B | Methods and applications for efficient genetic suppressor elements |
AU2808392A (en) * | 1991-10-30 | 1993-06-07 | General Hospital Corporation, The | C-myc dna binding partners, motifs, screening assays, and uses thereof |
GB9123987D0 (en) | 1991-11-12 | 1992-01-02 | Primm Srl | New reporter genes |
US5491084A (en) * | 1993-09-10 | 1996-02-13 | The Trustees Of Columbia University In The City Of New York | Uses of green-fluorescent protein |
JPH09506768A (en) * | 1993-12-16 | 1997-07-08 | コールド スプリング ハーバー ラボラトリー | Origin of replication complex genes, proteins and methods |
ES2146749T3 (en) * | 1994-01-21 | 2000-08-16 | Icos Corp | MATERIALS AND METHODS IN RELATION TO PROTEINS THAT INTERACT WITH CASEIN QUINASE I. |
US6117679A (en) * | 1994-02-17 | 2000-09-12 | Maxygen, Inc. | Methods for generating polynucleotides having desired characteristics by iterative selection and recombination |
US5691137A (en) * | 1994-08-30 | 1997-11-25 | Brandeis University | Methods of screening candidate agents for biological activity using yeast cells |
US5569588A (en) * | 1995-08-09 | 1996-10-29 | The Regents Of The University Of California | Methods for drug screening |
AU725716C (en) | 1996-01-23 | 2003-02-20 | Board Of Trustees Of The Leland Stanford Junior University | Methods for screening for transdominant effector peptides and RNA molecules |
US5945276A (en) | 1996-04-10 | 1999-08-31 | Signal Pharmaceuticals, Inc. | Reporter cell line system for detecting cytomegalovirus and identifying modulators of viral gene expression |
US5783431A (en) * | 1996-04-24 | 1998-07-21 | Chromaxome Corporation | Methods for generating and screening novel metabolic pathways |
US6004808A (en) * | 1996-06-21 | 1999-12-21 | Aurora Biosciences Corporation | Promiscuous G-protein compositions and their use |
US5998136A (en) * | 1996-08-19 | 1999-12-07 | Arcaris, Inc. | Selection systems and methods for identifying genes and gene products involved in cell proliferation |
US5955275A (en) * | 1997-02-14 | 1999-09-21 | Arcaris, Inc. | Methods for identifying nucleic acid sequences encoding agents that affect cellular phenotypes |
US5928888A (en) | 1996-09-26 | 1999-07-27 | Aurora Biosciences Corporation | Methods and compositions for sensitive and rapid, functional identification of genomic polynucleotides and secondary screening capabilities |
-
1997
- 1997-03-04 US US08/812,994 patent/US5955275A/en not_active Expired - Fee Related
-
1998
- 1998-02-27 JP JP53883398A patent/JP2001514510A/en active Pending
- 1998-02-27 IL IL13158398A patent/IL131583A0/en unknown
- 1998-02-27 WO PCT/US1998/004376 patent/WO1998039483A1/en active IP Right Grant
- 1998-02-27 DE DE69808060T patent/DE69808060T2/en not_active Expired - Fee Related
- 1998-02-27 AU AU65438/98A patent/AU745827B2/en not_active Ceased
- 1998-02-27 AT AT98911497T patent/ATE224457T1/en not_active IP Right Cessation
- 1998-02-27 EP EP98911497A patent/EP0973943B1/en not_active Expired - Lifetime
- 1998-02-27 CA CA002283261A patent/CA2283261C/en not_active Expired - Fee Related
- 1998-02-27 DK DK98911497T patent/DK0973943T3/en active
-
1999
- 1999-05-26 US US09/320,080 patent/US6579675B2/en not_active Expired - Fee Related
- 1999-09-03 NO NO994290A patent/NO994290L/en not_active Application Discontinuation
-
2001
- 2001-11-15 US US09/999,003 patent/US20020098503A1/en active Pending
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2017083874A1 (en) * | 2015-11-11 | 2017-05-18 | Serimmune Inc. | Methods and compositions for assessing antibody specificities |
US10386373B2 (en) | 2015-11-11 | 2019-08-20 | Serimmune Inc. | Methods and compositions for assessing antibody specificities |
US10871494B2 (en) | 2015-11-11 | 2020-12-22 | Serimmune Inc. | Methods and compositions for assessing antibody specificities |
US11828762B2 (en) | 2015-11-11 | 2023-11-28 | Serimmune Inc. | Methods and compositions for assessing antibody specificities |
Also Published As
Publication number | Publication date |
---|---|
DE69808060D1 (en) | 2002-10-24 |
EP0973943A1 (en) | 2000-01-26 |
AU745827B2 (en) | 2002-04-11 |
NO994290D0 (en) | 1999-09-03 |
EP0973943B1 (en) | 2002-09-18 |
CA2283261A1 (en) | 1998-09-11 |
US5955275A (en) | 1999-09-21 |
NO994290L (en) | 1999-11-03 |
AU6543898A (en) | 1998-09-22 |
US20020018992A1 (en) | 2002-02-14 |
DE69808060T2 (en) | 2003-04-30 |
ATE224457T1 (en) | 2002-10-15 |
JP2001514510A (en) | 2001-09-11 |
DK0973943T3 (en) | 2003-01-27 |
US6579675B2 (en) | 2003-06-17 |
IL131583A0 (en) | 2001-01-28 |
WO1998039483A1 (en) | 1998-09-11 |
CA2283261C (en) | 2002-07-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6579675B2 (en) | Methods for identifying nucleic acid sequences encoding agents that effect cellular phenotypes | |
WO1998039483A9 (en) | Methods for identifying nucleic acid sequences encoding agents that affect cellular phenotypes | |
Vidalain et al. | Increasing specificity in high-throughput yeast two-hybrid experiments | |
EP0830459B1 (en) | Reverse two-hybrid systems | |
US6566057B1 (en) | Methods and compositions for peptide libraries displayed on light-emitting scaffolds | |
US6623922B1 (en) | Methods for identifying, characterizing, and evolving cell-type specific CIS regulatory elements | |
WO1998046796A1 (en) | A method of screening nucleotide sequences to identify disruptors or effectors of biological processes or pathways | |
WO1999014319A1 (en) | An improved yeast interaction trap assay | |
US6509153B1 (en) | Genetic markers of toxicity preparation and uses | |
Koloteva-Levine et al. | Interaction of hnRNP-C1/C2 proteins with RNA: analysis using the yeast three-hybrid system | |
CN108474796B (en) | Method of screening | |
US20020090605A1 (en) | Methods for identifying, characterizing, and evolving cell-type specific cis regulatory elements | |
JP2003535576A (en) | Bar-coded synthetic lethal screening to identify drug targets | |
Hampton | Fusion‐Based Strategies to Identify Genes Involved in Degradation of a Specific Substrate | |
Geyer | Peptide aptamers: Dominant “genetic” agents for forward and reverse analysis of cellular processes | |
Conklin et al. | Use of a Novel, Stable Gene Silencing Technology to Determine the Contribution of the Receptor Tyrosine Kinase to the Breast Cancer Phenotype |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: DELTAGEN PROTEOMICS, INC., DELAWARE Free format text: MERGER AND CHANGE OF NAME;ASSIGNOR:ARCARIS, INC.;REEL/FRAME:013447/0883 Effective date: 20010713 |
|
STPP | Information on status: patent application and granting procedure in general |
Free format text: ADVISORY ACTION MAILED |