US20070067111A1 - Computer-aided visualization of expression comparison - Google Patents

Computer-aided visualization of expression comparison Download PDF

Info

Publication number
US20070067111A1
US20070067111A1 US11/489,292 US48929206A US2007067111A1 US 20070067111 A1 US20070067111 A1 US 20070067111A1 US 48929206 A US48929206 A US 48929206A US 2007067111 A1 US2007067111 A1 US 2007067111A1
Authority
US
United States
Prior art keywords
probes
expression level
expression
genes
computer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/489,292
Inventor
David Mack
Kurt Gish
David Balaban
Elina Khurgin
Josie Dai
Jim Snyder
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Affymetrix Inc
Original Assignee
Affymetrix Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/122,167 external-priority patent/US6229911B1/en
Application filed by Affymetrix Inc filed Critical Affymetrix Inc
Priority to US11/489,292 priority Critical patent/US20070067111A1/en
Publication of US20070067111A1 publication Critical patent/US20070067111A1/en
Assigned to GENERAL ELECTRIC CAPITAL CORPORATION, AS AGENT reassignment GENERAL ELECTRIC CAPITAL CORPORATION, AS AGENT SECURITY AGREEMENT Assignors: AFFYMETRIX, INC.
Priority to US13/626,773 priority patent/US20130169645A1/en
Assigned to AFFYMETRIX, INC. reassignment AFFYMETRIX, INC. RELEASE BY SECURED PARTY (SEE DOCUMENT FOR DETAILS). Assignors: GENERAL ELECTRIC CAPITAL CORPORATION, AS AGENT
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression
    • G16B25/10Gene or protein expression profiling; Expression-ratio estimation or normalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • G06T11/20Drawing from basic elements, e.g. lines or circles
    • G06T11/206Drawing of charts or graphs
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12NMICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
    • C12N15/00Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
    • C12N15/09Recombinant DNA-technology
    • C12N15/10Processes for the isolation, preparation or purification of DNA or RNA
    • C12N15/1034Isolating an individual clone by screening libraries
    • C12N15/1089Design, preparation, screening or analysis of libraries using computer algorithms
    • CCHEMISTRY; METALLURGY
    • C12BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
    • C12QMEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
    • C12Q1/00Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
    • C12Q1/68Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B45/00ICT specially adapted for bioinformatics-related data visualisation, e.g. displaying of maps or networks
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B25/00ICT specially adapted for hybridisation; ICT specially adapted for gene or protein expression

Definitions

  • the present invention relates to the field of computer systems. More specifically, the present invention relates to computer systems for visualizing analysis results.
  • an array of nucleic acid probes is fabricated at known locations on a substrate or chip.
  • a fluorescently labeled nucleic acid is then brought into contact with the chip and a scanner generates an image file (which is processed into a cell file) indicating the locations where the labeled nucleic acids bound to the chip.
  • image file which is processed into a cell file
  • Such systems have been used to form, for example, arrays of DNA that may be used to study and detect mutations relevant to cystic fibrosis, the P53 gene (relevant to certain cancers), HIV, and other genetic characteristics.
  • the present invention provides innovative systems and methods for visualizing information collected from analyzing samples.
  • the samples may include nucleic acids, proteins, or other polymers.
  • Gene expression level as determined from analysis of a nucleic acid sample is one possible analysis result that may be visualized.
  • a computer system may display the expression levels of multiple genes simultaneously in a way that facilitates user identification of genes whose expression is significant to a characteristic such as disease or resistance to disease. Additionally, the computer system may facilitate display of further information about relevant genes once they are identified.
  • a first aspect of the invention provides a computer implemented method for presenting expression level information as collected from first and second samples.
  • the method includes steps of: displaying a first axis corresponding to expression level in the first sample, and displaying a second axis substantially perpendicular to the first axis, the second axis corresponding to expression level in the second sample.
  • the method further includes a step of: for a selected expressed sequence, displaying a mark at a position. The position is selected relative to the first axis in accordance with an expression level of the selected expressed sequence in the first sample and relative to the second axis in accordance with an expression level of the selected expressed sequence in the second sample.
  • a particularly useful application is displaying many marks simultaneously for many selected genes to discover which ones of the selected genes may be relevant to the characteristic.
  • a second aspect of the invention provides a computer-implemented method of presenting sample analysis information.
  • the method includes steps of: displaying a first axis corresponding to a concentration of a compound in a first sample as determined by monitoring binding of the compound to a selected polymer having binding affinity to the compound, and displaying a second axis substantially perpendicular to the first axis.
  • the second axis corresponds to a concentration of the compound in the second sample as determined by monitoring binding of the compound to the selected polymer.
  • the method further preferably includes a step of displaying a mark at a position. The position is selected relative to the first axis in accordance with the concentration in the first sample and relative to the second axis in accordance with the concentration in the second sample.
  • FIG. 1 illustrates an example of a computer system that may be used to execute software embodiments of the present invention.
  • FIG. 2 shows a system block diagram of a typical computer system.
  • FIG. 3 illustrates an overall system for forming and analyzing arrays of polymers including biological materials such as DNA or RNA.
  • FIG. 4 is an illustration of an embodiment of software for the overall system.
  • FIG. 5 shows a flowchart of a process of monitoring the expression of a gene by comparing hybridization intensities of pairs of perfect match and mismatch probes.
  • FIG. 6 shows a screen display illustrating gene expression levels for multiple genes as collected from both normal and diseased tissue.
  • FIGS. 7A-7B show screen displays illustrating information (SEQ ID NOS:1 and 2) about a particular gene selected from the display of FIG. 6 .
  • the present invention provides innovative methods of monitoring visualizing gene expression.
  • the invention will be described in reference to preferred embodiments. However, the description is provided for purposes of illustration and not for limiting the spirit and scope of the invention.
  • FIG. 1 illustrates an example of a computer system that may be used to execute software embodiments of the present invention.
  • FIG. 1 shows a computer system 1 which includes a monitor 3 , screen 5 , cabinet 7 , keyboard 9 , and mouse 11 .
  • Mouse 11 may have one or more buttons such as mouse buttons 13 .
  • Cabinet 7 houses a CD-ROM drive 15 and a hard drive (not shown) that may be utilized to store and retrieve software programs including computer code incorporating the present invention.
  • a CD-ROM 17 is shown as the computer readable medium, other computer readable media including floppy disks, DRAM, hard drives, flash memory, tape, and the like may be utilized.
  • Cabinet 7 also houses familiar computer components (not shown) such as a processor, memory, and the like.
  • FIG. 2 shows a system block diagram of computer system 1 used to execute software embodiments of the present invention.
  • computer system 1 includes monitor 3 and keyboard 9 .
  • Computer system 1 further includes subsystems such as a central processor 50 , system memory 52 , I/O controller 54 , display adapter 56 , removable disk 58 , fixed disk 60 , network interface 62 , and speaker 64 .
  • Removable disk 58 is representative of removable computer readable media like floppies, tape, CD-ROM, removable hard drive, flash memory, and the like.
  • Fixed disk 60 is representative of an internal hard drive or the like.
  • Other computer systems suitable for use with the present invention may include additional or fewer subsystems.
  • another computer system could include more than one processor 50 (i.e., a multi processor system) or memory cache.
  • Arrows such as 66 represent the system bus architecture of computer system 1 . However, these arrows are illustrative of any interconnection scheme serving to link the subsystems.
  • display adapter 56 may be connected to central processor 50 through a local bus or the system may include a memory cache.
  • Computer system 1 shown in FIG. 2 is but an example of a computer system suitable for use with the present invention. Other configurations of subsystems suitable for use with the present invention will be readily apparent to one of ordinary skill in the art. In one embodiment, the computer system is an IBM compatible personal computer.
  • VLSIPSTM and GeneChipTM technologies provide methods of making and using very large arrays of polymers, such as nucleic acids, on very small chips. See U.S. Pat. No. 5,143,854 and PCT Patent Publication Nos. WO 90/15070 and 92/10092, each of which is hereby incorporated by reference for all purposes. Nucleic acid probes on the chip are used to detect complementary nucleic acid sequences in a sample nucleic acid of interest (the “target” nucleic acid).
  • probes need not be nucleic acid probes but may also be other receptors, such as antibodies, or polymers such as peptides.
  • Peptide probes may be used to detect the concentration of other peptides, proteins, or other compounds in a sample. The probes must be carefully selected to have bonding affinity to the compound whose concentration they are to be used to measure.
  • the present invention provides methods of visualizing information relating to the concentration of compounds in a sample as measured by monitoring affinity of the compounds to probes.
  • the concentration information is generated by analysis of hybridization intensity files for a chip containing hybridized nucleic acid probes.
  • the hybridization of a nucleic acid sample to certain probes may represent the expression level of one more genes or expressed sequence tags (ESTs).
  • ESTs expressed sequence tags
  • Expression level information visualized by virtue of the present invention need not be obtained from probes but may originate from any source. If the expression information is collected from a probe array, the probe array need not meet any particular criteria for size and density. Furthermore, the present invention is not limited to visualizing fluorescent measurements of bondings such as hybridizations but may be readily utilized to visualize other measurements.
  • a probe array may include peptide probes which may be exposed to protein samples, polypeptide samples, or other compounds which may or may not bond to the peptide probes. By appropriate selection of the peptide probes, one may detect the presence or absence of particular compounds which would bond to the peptide probes.
  • the present invention is described as being part of a system that designs a chip mask, synthesizes the probes on the chip, labels nucleic acids from a target sample, and scans the hybridized probes.
  • a system is set forth in U.S. Pat. No. 5,571,639 which is hereby incorporated by reference for all purposes.
  • the present invention may be used separately from the overall system for analyzing data generated by such systems, such as at remote locations, or for visualizing the results of other systems for generating expression information, or for visualizing concentrations of polymers other than nucleic acids.
  • FIG. 3 illustrates a computerized system for forming and analyzing arrays of biological materials such as RNA or DNA.
  • a computer 100 is used to design arrays of biological polymers such as RNA or DNA.
  • the computer 100 may be, for example, an appropriately programmed IBM personal computer compatible running Windows NT including appropriate memory and a CPU as shown in FIGS. 1 and 2 .
  • the computer system 100 obtains inputs from a user regarding characteristics of a gene of interest, and other inputs regarding the desired features of the array.
  • the computer system may obtain information regarding a specific genetic sequence of interest from an external or internal database 102 such as GenBank.
  • the output of the computer system 100 is a set of chip design computer files 104 in the form of, for example, a switch matrix, as described in PCT application WO 92/10092, and other associated computer files.
  • the chip design files are provided to a system 106 that designs the lithographic masks used in the fabrication of arrays of molecules such as DNA.
  • the system or process 106 may include the hardware necessary to manufacture masks 110 and also the necessary computer hardware and software 108 necessary to lay the mask patterns out on the mask in an efficient manner. As with the other features in FIG. 3 , such equipment may or may not be located at the same physical site, but is shown together for ease of illustration in FIG. 3 .
  • the system 106 generates masks 110 or other synthesis patterns such as chrome on glass masks for use in the fabrication of polymer arrays.
  • Synthesis system 112 includes the necessary hardware and software used to fabricate arrays of polymers on a substrate or chip 114 .
  • synthesizer 112 includes a light source 116 and a chemical flow cell 118 on which the substrate or chip 114 is placed.
  • Mask 110 is placed between the light source and the substrate/chip, and the two are translated relative to each other at appropriate times for deprotection of selected regions of the chip.
  • Selected chemical reagents are directed through flow cell 118 for coupling to deprotected regions, as well as for washing and other operations. All operations are preferably directed by an appropriately programmed computer 119 , which may or may not be the same computer as the computer(s) used in mask design and mask making.
  • the substrates fabricated by synthesis system 112 are optionally diced into smaller chips and exposed to marked targets.
  • the targets may or may not be complementary to one or more of the molecules on the substrate.
  • the targets are marked with a label such as a fluorescein label (indicated by an asterisk in FIG. 3 ) and placed in scanning system 120 .
  • Scanning system 120 again operates under the direction of an appropriately programmed digital computer 122 , which also may or may not be the same computer as the computers used in synthesis, mask making, and mask design.
  • the scanner 120 includes a detection device 124 such as a confocal microscope or CCD (charge coupled device) that is used to detect the location where labeled target has bound to the substrate.
  • a detection device 124 such as a confocal microscope or CCD (charge coupled device) that is used to detect the location where labeled target has bound to the substrate.
  • the output of scanner 120 is an image files) 124 indicating, in the case of fluorescein labeled target, the fluorescence intensity (photon counts or other related measurements, such as voltage) as a function of position on the substrate. Since higher photon counts will be observed where the labeled target has bound more strongly to the array of polymers, and since the monomer sequence of the polymers on the substrate is known as a function of position, it becomes possible to determine the sequence(s) of polymer(s) on the substrate that are complementary to the target.
  • the image file 124 is provided as input to an analysis system 126 that incorporates the visualization and analysis methods of the present invention.
  • the analysis system may be any one of a wide variety of computer system.
  • the present invention provides various methods of analyzing and visualizing the chip design files and the image files, providing appropriate output 128 .
  • the chip design need not include any particular number of probes. It should be understood that the present invention does not require any particular source of expression level information.
  • FIG. 4 provides a simplified illustration of the overall software system used in the operation of one embodiment of the invention.
  • the system first identifies the nucleotide sequence(s) or targets that would be of interest in a particular expression level analysis at step 202 .
  • the sequences of interest correspond to mRNA transcripts of one or more genes, ESTs or nucleic acids derived from the mRNA transcripts. Sequence selection may be provided via manual input of text files or may be from external sources such as GenBank.
  • the system evaluates the sequences of interest to determine or assist the user in determining which probes would be desirable on the chip, and provides an appropriate “layout” on the chip for the probes.
  • the process of selecting probes for an expression level analysis is explained in PCT Publication No. WO 97/10365, the contents of which are herein incorporated by reference.
  • An alternative probe selection process that does not require prior knowledge of sequences of interest is explained in PCT Publication No. WO97/27317 (Attorney Docket No. 18547-019410PC), the contents of which are herein incorporated by reference. Further general background on probe selection is found in PCT Publication No. WO95/11995 (Attorney Docket No.
  • perfect match probe refers to a probe that has a sequence that is perfectly complementary to a particular target sequence.
  • the test probe is typically perfectly complementary to a portion (subsequence) of the target sequence.
  • mismatch control or “mismatch probe” refer to probes whose sequence is deliberately selected not to be perfectly complementary to a particular target sequence. For each mismatch (MM) control in an array there typically exists a corresponding perfect match (PM) probe that is perfectly complementary to the same particular target sequence.
  • the process compares hybridization intensities of pairs of perfect match and mismatch probes that are preferably covalently attached to the surface of a substrate or chip.
  • the nucleic acid probes have a density greater than about 60 different nucleic acid probes per 1 cm 2 of the substrate.
  • nucleic acid probes are selected that are complementary to the target sequence. These probes are the perfect match probes. Another set of probes is specified that are intended to be not perfectly complementary to the target sequence. These probes are the mismatch probes and each mismatch probe includes at least one nucleotide mismatch from a perfect match probe. Accordingly, a mismatch probe and the perfect match probe to which it is identical except for one base make up a pair. As mentioned earlier, the nucleotide mismatch is preferably near the center of the mismatch probe.
  • the probe lengths of the perfect match probes are typically chosen to exhibit detectably greater hybridization with the target sequence relative to the mismatch probes.
  • the nucleic acid probes may be all 20-mers.
  • probes of varying lengths may also be synthesized on the substrate for any number of reasons including resolving ambiguities.
  • the masks for the synthesis are designed.
  • the software utilizes the mask design and layout information to make the DNA or other polymer chips. This step 208 will control, among other things, relative translation of a substrate and the mask, the flow of desired reagents through a flow cell, the synthesis temperature of the flow cell, and other parameters.
  • another piece of software is used in scanning a chip thus synthesized and exposed to a labeled target. The software controls the scanning of the chip, and stores the data thus obtained in a file that may later be utilized to extract hybridization information.
  • a computer system utilizes the layout information and the fluorescence information to evaluate the hybridized nucleic acid probes on the chip.
  • the important pieces of information obtained from DNA chips are the relative fluorescent intensities obtained from the perfect match probes and mismatch probes. These intensity levels are used to estimate an expression level for a gene or EST.
  • the computer system used for analysis will preferably have available other details of the experiment including possibly the gene name, gene sequence, probe sequences, probe locations on the substrate, and the like.
  • the same computer system used for analysis or another one displays the expression level information in a format useful for identifying genes of interest.
  • the visualized expression level information may include information collected from multiple applications of one or more previous steps of FIG. 4 .
  • FIG. 5 is a flowchart describing steps of estimating an expression level for a particular gene and determining whether the expression level is sufficiently high to be displayed.
  • the computer system receives raw scan data of N pairs of perfect match and mismatch probes.
  • the hybridization intensities are photon counts from a fluorescein labeled target that has hybridized to the probes on the substrate.
  • the hybridization intensity of a perfect match probe will be designed “I pm ” and the hybridization intensity of a mismatch probe will be designed “I mm .”
  • Hybridization intensities for a pair of probes are retrieved at step 954 .
  • the background signal intensity is subtracted from each of the hybridization intensities of the pair at step 956 . Background subtraction can also be performed on all the raw scan data at the same time.
  • the hybridization intensities of the pair of probes are compared to a difference threshold (D) and a ratio threshold (R). It is determined if the difference between the hybridization intensities of the pair (I pm ⁇ I mm ) is greater than or equal to the difference threshold AND the quotient of the hybridization intensities of the pair (I pm /I mm ) is greater than or equal to the ratio threshold.
  • the difference thresholds are typically user defined values that have been determined to produce accurate expression monitoring of a gene or genes. In one embodiment, the difference threshold is 20 and the ratio threshold is 1.2.
  • NPOS is a value that indicates the number of pairs of probes which have hybridization intensities indicating that the gene is likely expressed. NPOS is utilized in a determination of the expression of the gene.
  • NNEG is a value that indicates the number of pairs of probes which have hybridization intensities indicating that the gene is likely not expressed.
  • NNEG like NPOS, is utilized in a determination of the expression of the gene.
  • LR log ratio value
  • IDIF intensity difference value
  • a decision matrix is utilized to indicate if the gene is expressed.
  • the decision matrix utilizes the values N, NPOS, NNEG, LR (multiple LRs), and IDIF (multiple IDIFs).
  • P 1 NPOS/NNEG
  • P 2 NPOS/N
  • P 3 SUM( LR )/ N
  • P 4 SUM( IDIF )/ N
  • an average of the IDIF values for the probes that incremented NPOS or NNEG is calculated at step 975 , which is utilized as an expression level.
  • other values including one of P1 through P4 could be used to indicate expression level.
  • FIG. 5 was described in reference to a single gene or EST.
  • the visualization system of the present invention displays expression results for many genes to facilitate discovery of genes of interest or ESTs.
  • the present invention contemplates display of expression levels of a single gene or ESTs as collected from two or more different samples such as tissue samples.
  • the sample sources preferably differ in some characteristic. It will be understood that when the term “sample” is used herein, measurements made on a single “sample” can be based on an aggregation of multiple sample collection events or even multiple organisms.
  • FIG. 6 shows a screen display illustrating gene expression levels for multiple genes as collected from two tissue samples.
  • a displayed horizontal axis 1002 represents expression level measured in one or more nucleic acid samples taken from the first tissue sample.
  • a displayed vertical axis 1004 represents expression level in one or more nucleic acid samples taken from the second tissue sample.
  • Each of marks 1006 represent a particular gene whose expression level has been measured in both the first and second tissue samples. Each mark 1006 is placed at a distance from vertical axis 1004 corresponding to expression level in the first tissue sample and at a distance from the horizontal axis 1002 corresponding to expression level in the second tissue sample.
  • the expression levels used for determining the position of marks 1006 are preferably taken from the result of step 975 .
  • the position of each of marks 1006 depends on two iterations of the steps of FIG. 5 , once for the sample taken from the first tissue sample and once for the sample taken from the second tissue sample. However, a mark is preferably displayed only if one of the samples meets the threshold criteria at step 972 .
  • the first tissue sample is a cancerous tissue sample and the second tissue sample is a normal tissue sample.
  • the individual marks represent the expression levels of selected genes in both cancerous and normal tissue.
  • a first group of marks 1008 represent genes that are neither tumor suppressors nor oncogenes since their expression levels are roughly similar for both normal and cancerous tissue. These marks 1008 fall roughly along a line which is rotated 45 degrees from each of the axes.
  • a second group of marks 1010 represent genes that are likely oncogenes since their expression levels are found to be significantly higher in cancerous tissue than in normal tissue.
  • a third group of marks 1012 represent genes that are likely tumor suppressors since their expression levels are found to be significantly higher in normal tissue than in cancerous tissue. It will be appreciated that expression levels for large numbers of genes can be reviewed at once to discover the oncogenes and tumor suppressors.
  • the present invention would aid in the discovery of genes whose expression is associated with any characteristic that varies among tissue samples. For example, once can compare expression results from tissue from individuals who have been exposed to HIV but remain infected to tissue obtained from infected individuals to identify genes conferring resistance to HIV. One can compare expression results between tissue from plants that survive drought to plants that do not. One can compare expression levels among tissue samples at successive stages or severity levels of the same disease, among tissue samples where different ultimate outcomes of the disease (e.g., patient death or remission) are known, among diseased tissue samples that have been subject to different treatment regimes including e.g., chemotherapy, antisense RNA, etc. For cancers, one can compare expression levels between malignant cells and non malignant cells. Also expression levels can be compared among different organs, between species, and among different stages of development of an organ.
  • a third visual dimension can be used to illustrate expression level from a third tissue sample.
  • the time dimension can also be used to illustrate successive groups of two or three tissue samples at successive time periods.
  • the time dimension can be also used to correspond to tissue samples obtained at, e.g., successive stages of a disease.
  • senses can also be incorporated within the presentation system of the present invention.
  • the senses may correspond to additional dimensions.
  • marks can be displayed in succession accompanies by a sound having characteristics corresponding to expression level in another tissue sample.
  • the user can employ a cursor 1014 to identify a particular mark as being of interest.
  • Cursor 1014 can be moved to a particular mark by use of, e.g., mouse 11 .
  • the mark can be selected by, e.g., depression of one of mouse buttons 13 . Selection of a particular mark can be facilitated by use of a zoom display feature (not shown).
  • a special mouse can transmit a tactile sensation back to the user corresponding to expression level in a tissue sample as the user passes the mouse over a corresponding mark.
  • each mark may correspond to a different polymer, polypeptide, or other compound.
  • the distance of the mark from each axis would correspond to a measure of presence of the particular polymer in the sample corresponding to the axis.
  • One possible measure is produced by fluorescently tagging polymer samples such as protein samples and exposing a probe array such as a peptide probe array to the protein samples. The fluorescent intensity of the probes will then correspond to the bonding affinity of the sample to the probes. The intensity measurement or a measurement derived from the intensity measurement may then be used to position the marks of FIG. 6 .
  • FIG. 7A shows a screen display giving information about a particular gene selected from the display of FIG. 6 .
  • a cluster number 702 , a GenBank accession number 704 , and a verbal description 706 for the selected gene are displayed.
  • the user can also select a number of marks 1006 by circling them with cursor 1014 . Then a list of information as shown in FIG. 7A is displayed for all the genes corresponding to the selected marks.
  • GenBank accession number 704 By selecting GenBank accession number 704 with another cursor (not shown), the user can direct retrieval of the GenBank information for the selected gene. If the GenBank information is not available locally, the retrieval process can include formulating a query and transmitting the query to a GenBank web site. Once the GenBank information is retrieved, it can also be displayed. FIG. 7B depicts the GenBank information for the gene identified in FIG. 7A .

Abstract

Innovative systems and methods for visualizing information collected from analyzing samples are provided. The samples may include nucleic acids, proteins, or other polymers. Gene expression level as determined from analysis of a nucleic acid sample is one possible analysis result that may be visualized. In one embodiment, a computer system may display the expression levels of multiple genes simultaneously in a way that facilitates user identification of genes whose expression is significant to a characteristic such as disease or resistance to disease. Additionally, the computer system may facilitate display of further information about relevant genes once they are identified.

Description

    CROSS REFERENCE TO RELATED APPLICATIONS
  • This application is a divisional application of U.S. application Ser. No. 10/028,748, which is a continuation application of U.S. application Ser. No. 09/020,743, which claims priority to U.S. Provisional No. 60/069,436. U.S. application Ser. Nos. 10/028,748 and 09/020,743 are incorporated by reference herein, and U.S. application Ser. No. 09/020,743 has been issued into U.S. Pat. No. 6,420,108.
  • BACKGROUND OF THE INVENTION
  • The present invention relates to the field of computer systems. More specifically, the present invention relates to computer systems for visualizing analysis results.
  • Devices and computer systems for forming and using arrays of materials on a substrate are known. For example, PCT Publication No. WO 92/10588, incorporated herein by reference for all purposes, describes techniques for sequencing or sequence checking nucleic acids and other materials. Arrays for performing these operations may be formed according to the methods of, for example, the pioneering techniques disclosed in U.S. Pat. No. 5,143,854 and U.S. Pat. No. 5,593,839 both incorporated herein by reference for all purposes.
  • According to one aspect of the techniques described therein, an array of nucleic acid probes is fabricated at known locations on a substrate or chip. A fluorescently labeled nucleic acid is then brought into contact with the chip and a scanner generates an image file (which is processed into a cell file) indicating the locations where the labeled nucleic acids bound to the chip. Based upon the cell file and identities of the probes at specific locations, it becomes possible to extract information such as the monomer sequence of DNA or RNA. Such systems have been used to form, for example, arrays of DNA that may be used to study and detect mutations relevant to cystic fibrosis, the P53 gene (relevant to certain cancers), HIV, and other genetic characteristics.
  • Computer aided techniques for monitoring gene expression using such arrays of probes have also been developed as disclosed in U.S. patent application Ser. No. 08/828,952 (Attorney Docket No. 16528X-028900US) and PCT Publication No. WO 97/10365 (Attorney Docket No. 16528X-01711OPC), the contents of which are herein incorporated by reference. Many disease states are characterized by differences in the expression levels of various genes either through changes in the copy number of the genetic DNA or through changes in levels of transcription (e.g., through control of initiation, provision of RNA precursors, RNA processing, etc.) of particular genes. For example, losses and gains of genetic material play an important role in malignant transformation and progression. Furthermore, changes in the expression (transcription) levels of particular genes (e.g., oncogenes or tumor suppressors), serve as signposts for the presence and progression of various cancers.
  • It is desirable to identify genes having expression levels relevant to diagnosis of a diseased state by analyzing the expression levels of large numbers of genes in both diseased and normal individuals. Methods for collecting the expression level information have been developed. However, the user interfaces for gene expression monitoring systems that have been developed until now are designed to clearly present the expression of particular pre-selected genes. A user seeking to identify, e.g., an oncogene or a tumor suppressor gene, must individually review the expression level of large numbers of genes and compare the expression levels between diseased and normal individuals. What is needed is a user interface that takes advantage of collected gene expression information to help the user to identify particular genes of interest.
  • BRIEF SUMMARY OF THE INVENTION
  • The present invention provides innovative systems and methods for visualizing information collected from analyzing samples. The samples may include nucleic acids, proteins, or other polymers. Gene expression level as determined from analysis of a nucleic acid sample is one possible analysis result that may be visualized. In one embodiment, a computer system may display the expression levels of multiple genes simultaneously in a way that facilitates user identification of genes whose expression is significant to a characteristic such as disease or resistance to disease. Additionally, the computer system may facilitate display of further information about relevant genes once they are identified.
  • A first aspect of the invention provides a computer implemented method for presenting expression level information as collected from first and second samples. The method includes steps of: displaying a first axis corresponding to expression level in the first sample, and displaying a second axis substantially perpendicular to the first axis, the second axis corresponding to expression level in the second sample. The method further includes a step of: for a selected expressed sequence, displaying a mark at a position. The position is selected relative to the first axis in accordance with an expression level of the selected expressed sequence in the first sample and relative to the second axis in accordance with an expression level of the selected expressed sequence in the second sample. A particularly useful application is displaying many marks simultaneously for many selected genes to discover which ones of the selected genes may be relevant to the characteristic.
  • A second aspect of the invention provides a computer-implemented method of presenting sample analysis information. The method includes steps of: displaying a first axis corresponding to a concentration of a compound in a first sample as determined by monitoring binding of the compound to a selected polymer having binding affinity to the compound, and displaying a second axis substantially perpendicular to the first axis. The second axis corresponds to a concentration of the compound in the second sample as determined by monitoring binding of the compound to the selected polymer. The method further preferably includes a step of displaying a mark at a position. The position is selected relative to the first axis in accordance with the concentration in the first sample and relative to the second axis in accordance with the concentration in the second sample.
  • A further understanding of the nature and advantages of the inventions herein may be realized by reference to the remaining portions of the specification and the attached drawings.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 illustrates an example of a computer system that may be used to execute software embodiments of the present invention.
  • FIG. 2 shows a system block diagram of a typical computer system.
  • FIG. 3 illustrates an overall system for forming and analyzing arrays of polymers including biological materials such as DNA or RNA.
  • FIG. 4 is an illustration of an embodiment of software for the overall system.
  • FIG. 5 shows a flowchart of a process of monitoring the expression of a gene by comparing hybridization intensities of pairs of perfect match and mismatch probes.
  • FIG. 6 shows a screen display illustrating gene expression levels for multiple genes as collected from both normal and diseased tissue.
  • FIGS. 7A-7B show screen displays illustrating information (SEQ ID NOS:1 and 2) about a particular gene selected from the display of FIG. 6.
  • DETAILED DESCRIPTION OF THE INVENTION
  • The present invention provides innovative methods of monitoring visualizing gene expression. In the description that follows, the invention will be described in reference to preferred embodiments. However, the description is provided for purposes of illustration and not for limiting the spirit and scope of the invention.
  • FIG. 1 illustrates an example of a computer system that may be used to execute software embodiments of the present invention. FIG. 1 shows a computer system 1 which includes a monitor 3, screen 5, cabinet 7, keyboard 9, and mouse 11. Mouse 11 may have one or more buttons such as mouse buttons 13. Cabinet 7 houses a CD-ROM drive 15 and a hard drive (not shown) that may be utilized to store and retrieve software programs including computer code incorporating the present invention. Although a CD-ROM 17 is shown as the computer readable medium, other computer readable media including floppy disks, DRAM, hard drives, flash memory, tape, and the like may be utilized. Cabinet 7 also houses familiar computer components (not shown) such as a processor, memory, and the like.
  • FIG. 2 shows a system block diagram of computer system 1 used to execute software embodiments of the present invention. As in FIG. 1, computer system 1 includes monitor 3 and keyboard 9. Computer system 1 further includes subsystems such as a central processor 50, system memory 52, I/O controller 54, display adapter 56, removable disk 58, fixed disk 60, network interface 62, and speaker 64. Removable disk 58 is representative of removable computer readable media like floppies, tape, CD-ROM, removable hard drive, flash memory, and the like. Fixed disk 60 is representative of an internal hard drive or the like. Other computer systems suitable for use with the present invention may include additional or fewer subsystems. For example, another computer system could include more than one processor 50 (i.e., a multi processor system) or memory cache.
  • Arrows such as 66 represent the system bus architecture of computer system 1. However, these arrows are illustrative of any interconnection scheme serving to link the subsystems. For example, display adapter 56 may be connected to central processor 50 through a local bus or the system may include a memory cache. Computer system 1 shown in FIG. 2 is but an example of a computer system suitable for use with the present invention. Other configurations of subsystems suitable for use with the present invention will be readily apparent to one of ordinary skill in the art. In one embodiment, the computer system is an IBM compatible personal computer.
  • The VLSIPS™ and GeneChip™ technologies provide methods of making and using very large arrays of polymers, such as nucleic acids, on very small chips. See U.S. Pat. No. 5,143,854 and PCT Patent Publication Nos. WO 90/15070 and 92/10092, each of which is hereby incorporated by reference for all purposes. Nucleic acid probes on the chip are used to detect complementary nucleic acid sequences in a sample nucleic acid of interest (the “target” nucleic acid).
  • It should be understood that the probes need not be nucleic acid probes but may also be other receptors, such as antibodies, or polymers such as peptides. Peptide probes may be used to detect the concentration of other peptides, proteins, or other compounds in a sample. The probes must be carefully selected to have bonding affinity to the compound whose concentration they are to be used to measure.
  • In one embodiment, the present invention provides methods of visualizing information relating to the concentration of compounds in a sample as measured by monitoring affinity of the compounds to probes. In a particular application, the concentration information is generated by analysis of hybridization intensity files for a chip containing hybridized nucleic acid probes. The hybridization of a nucleic acid sample to certain probes may represent the expression level of one more genes or expressed sequence tags (ESTs). The expression level of a gene or EST is herein understood to be the concentration within a sample of mRNA or protein that would result from the transcription of the gene or EST.
  • Expression level information visualized by virtue of the present invention need not be obtained from probes but may originate from any source. If the expression information is collected from a probe array, the probe array need not meet any particular criteria for size and density. Furthermore, the present invention is not limited to visualizing fluorescent measurements of bondings such as hybridizations but may be readily utilized to visualize other measurements.
  • Concentration of compounds other than nucleic acids may be visualized according to one embodiment of the present invention. For example, a probe array may include peptide probes which may be exposed to protein samples, polypeptide samples, or other compounds which may or may not bond to the peptide probes. By appropriate selection of the peptide probes, one may detect the presence or absence of particular compounds which would bond to the peptide probes.
  • For purposes of illustration, the present invention is described as being part of a system that designs a chip mask, synthesizes the probes on the chip, labels nucleic acids from a target sample, and scans the hybridized probes. Such a system is set forth in U.S. Pat. No. 5,571,639 which is hereby incorporated by reference for all purposes. However, the present invention may be used separately from the overall system for analyzing data generated by such systems, such as at remote locations, or for visualizing the results of other systems for generating expression information, or for visualizing concentrations of polymers other than nucleic acids.
  • FIG. 3 illustrates a computerized system for forming and analyzing arrays of biological materials such as RNA or DNA. A computer 100 is used to design arrays of biological polymers such as RNA or DNA. The computer 100 may be, for example, an appropriately programmed IBM personal computer compatible running Windows NT including appropriate memory and a CPU as shown in FIGS. 1 and 2. The computer system 100 obtains inputs from a user regarding characteristics of a gene of interest, and other inputs regarding the desired features of the array. Optionally, the computer system may obtain information regarding a specific genetic sequence of interest from an external or internal database 102 such as GenBank. The output of the computer system 100 is a set of chip design computer files 104 in the form of, for example, a switch matrix, as described in PCT application WO 92/10092, and other associated computer files.
  • The chip design files are provided to a system 106 that designs the lithographic masks used in the fabrication of arrays of molecules such as DNA. The system or process 106 may include the hardware necessary to manufacture masks 110 and also the necessary computer hardware and software 108 necessary to lay the mask patterns out on the mask in an efficient manner. As with the other features in FIG. 3, such equipment may or may not be located at the same physical site, but is shown together for ease of illustration in FIG. 3. The system 106 generates masks 110 or other synthesis patterns such as chrome on glass masks for use in the fabrication of polymer arrays.
  • The masks 110, as well as selected information relating to the design of the chips from system 100, are used in a synthesis system 112. Synthesis system 112 includes the necessary hardware and software used to fabricate arrays of polymers on a substrate or chip 114. For example, synthesizer 112 includes a light source 116 and a chemical flow cell 118 on which the substrate or chip 114 is placed. Mask 110 is placed between the light source and the substrate/chip, and the two are translated relative to each other at appropriate times for deprotection of selected regions of the chip. Selected chemical reagents are directed through flow cell 118 for coupling to deprotected regions, as well as for washing and other operations. All operations are preferably directed by an appropriately programmed computer 119, which may or may not be the same computer as the computer(s) used in mask design and mask making.
  • The substrates fabricated by synthesis system 112 are optionally diced into smaller chips and exposed to marked targets. The targets may or may not be complementary to one or more of the molecules on the substrate. The targets are marked with a label such as a fluorescein label (indicated by an asterisk in FIG. 3) and placed in scanning system 120. Scanning system 120 again operates under the direction of an appropriately programmed digital computer 122, which also may or may not be the same computer as the computers used in synthesis, mask making, and mask design. The scanner 120 includes a detection device 124 such as a confocal microscope or CCD (charge coupled device) that is used to detect the location where labeled target has bound to the substrate. The output of scanner 120 is an image files) 124 indicating, in the case of fluorescein labeled target, the fluorescence intensity (photon counts or other related measurements, such as voltage) as a function of position on the substrate. Since higher photon counts will be observed where the labeled target has bound more strongly to the array of polymers, and since the monomer sequence of the polymers on the substrate is known as a function of position, it becomes possible to determine the sequence(s) of polymer(s) on the substrate that are complementary to the target.
  • The image file 124 is provided as input to an analysis system 126 that incorporates the visualization and analysis methods of the present invention. Again, the analysis system may be any one of a wide variety of computer system. The present invention provides various methods of analyzing and visualizing the chip design files and the image files, providing appropriate output 128. The chip design need not include any particular number of probes. It should be understood that the present invention does not require any particular source of expression level information.
  • FIG. 4 provides a simplified illustration of the overall software system used in the operation of one embodiment of the invention. As shown in FIG. 4, the system first identifies the nucleotide sequence(s) or targets that would be of interest in a particular expression level analysis at step 202. The sequences of interest correspond to mRNA transcripts of one or more genes, ESTs or nucleic acids derived from the mRNA transcripts. Sequence selection may be provided via manual input of text files or may be from external sources such as GenBank.
  • At step 204 the system evaluates the sequences of interest to determine or assist the user in determining which probes would be desirable on the chip, and provides an appropriate “layout” on the chip for the probes. The process of selecting probes for an expression level analysis is explained in PCT Publication No. WO 97/10365, the contents of which are herein incorporated by reference. An alternative probe selection process that does not require prior knowledge of sequences of interest is explained in PCT Publication No. WO97/27317 (Attorney Docket No. 18547-019410PC), the contents of which are herein incorporated by reference. Further general background on probe selection is found in PCT Publication No. WO95/11995 (Attorney Docket No. 18547-004111PC) and PCT Publication No. WO97/29212 (Attorney Docket No. 18547-018540PC), the contents of which are herein incorporated by reference. The term “perfect match probe” refers to a probe that has a sequence that is perfectly complementary to a particular target sequence. The test probe is typically perfectly complementary to a portion (subsequence) of the target sequence. The term “mismatch control” or “mismatch probe” refer to probes whose sequence is deliberately selected not to be perfectly complementary to a particular target sequence. For each mismatch (MM) control in an array there typically exists a corresponding perfect match (PM) probe that is perfectly complementary to the same particular target sequence.
  • The process compares hybridization intensities of pairs of perfect match and mismatch probes that are preferably covalently attached to the surface of a substrate or chip. Most preferably, the nucleic acid probes have a density greater than about 60 different nucleic acid probes per 1 cm2 of the substrate.
  • Initially, nucleic acid probes are selected that are complementary to the target sequence. These probes are the perfect match probes. Another set of probes is specified that are intended to be not perfectly complementary to the target sequence. These probes are the mismatch probes and each mismatch probe includes at least one nucleotide mismatch from a perfect match probe. Accordingly, a mismatch probe and the perfect match probe to which it is identical except for one base make up a pair. As mentioned earlier, the nucleotide mismatch is preferably near the center of the mismatch probe.
  • The probe lengths of the perfect match probes are typically chosen to exhibit detectably greater hybridization with the target sequence relative to the mismatch probes. For example, the nucleic acid probes may be all 20-mers. However, probes of varying lengths may also be synthesized on the substrate for any number of reasons including resolving ambiguities.
  • Again referring to FIG. 4, at step 206 the masks for the synthesis are designed. At step 208 the software utilizes the mask design and layout information to make the DNA or other polymer chips. This step 208 will control, among other things, relative translation of a substrate and the mask, the flow of desired reagents through a flow cell, the synthesis temperature of the flow cell, and other parameters. At step 210, another piece of software is used in scanning a chip thus synthesized and exposed to a labeled target. The software controls the scanning of the chip, and stores the data thus obtained in a file that may later be utilized to extract hybridization information.
  • At step 212 a computer system utilizes the layout information and the fluorescence information to evaluate the hybridized nucleic acid probes on the chip. Among the important pieces of information obtained from DNA chips are the relative fluorescent intensities obtained from the perfect match probes and mismatch probes. These intensity levels are used to estimate an expression level for a gene or EST. The computer system used for analysis will preferably have available other details of the experiment including possibly the gene name, gene sequence, probe sequences, probe locations on the substrate, and the like.
  • According to the present invention, at step 214, the same computer system used for analysis or another one displays the expression level information in a format useful for identifying genes of interest. The visualized expression level information may include information collected from multiple applications of one or more previous steps of FIG. 4.
  • FIG. 5 is a flowchart describing steps of estimating an expression level for a particular gene and determining whether the expression level is sufficiently high to be displayed. At step 952, the computer system receives raw scan data of N pairs of perfect match and mismatch probes. In a preferred embodiment, the hybridization intensities are photon counts from a fluorescein labeled target that has hybridized to the probes on the substrate. For simplicity, the hybridization intensity of a perfect match probe will be designed “Ipm” and the hybridization intensity of a mismatch probe will be designed “Imm.”
  • Hybridization intensities for a pair of probes are retrieved at step 954. The background signal intensity is subtracted from each of the hybridization intensities of the pair at step 956. Background subtraction can also be performed on all the raw scan data at the same time.
  • At step 958, the hybridization intensities of the pair of probes are compared to a difference threshold (D) and a ratio threshold (R). It is determined if the difference between the hybridization intensities of the pair (Ipm−Imm) is greater than or equal to the difference threshold AND the quotient of the hybridization intensities of the pair (Ipm/Imm) is greater than or equal to the ratio threshold. The difference thresholds are typically user defined values that have been determined to produce accurate expression monitoring of a gene or genes. In one embodiment, the difference threshold is 20 and the ratio threshold is 1.2.
  • If Ipm−Imm>=D and Ipm/Imm>=R, the value NPOS is incremented at step 960. In general, NPOS is a value that indicates the number of pairs of probes which have hybridization intensities indicating that the gene is likely expressed. NPOS is utilized in a determination of the expression of the gene.
  • At step 962, it is determined if Imm−Ipm>=D and Imm/Ipm>=R. If these expressions are true, the value NNEG is incremented at step 964. In general, NNEG is a value that indicates the number of pairs of probes which have hybridization intensities indicating that the gene is likely not expressed. NNEG, like NPOS, is utilized in a determination of the expression of the gene.
  • For each pair that exhibits hybridization intensities either indicating the gene is expressed or not expressed, a log ratio value (LR) and intensity difference value (IDIF) are calculated at step 966. LR is calculated by the log of the quotient of the hybridization intensities of the pair (Ipm/Imm). The IDIF is calculated by the difference between the hybridization intensities of the pair (Ipm−Imm). If there is a next pair of hybridization intensities at step 968, they are retrieved at step 954.
  • At step 972, a decision matrix is utilized to indicate if the gene is expressed. The decision matrix utilizes the values N, NPOS, NNEG, LR (multiple LRs), and IDIF (multiple IDIFs). The following four assignments are performed:
    P1=NPOS/NNEG
    P2=NPOS/N
    P3=SUM(LR)/N
    P4=SUM(IDIF)/N
    These P values are then utilized to determine if the gene is expressed and if the expression level should be displayed. In a preferred embodiment, the expression level of a gene should be displayed if:
    P1>2.2
    P2>0.3
    P3>0.8
    P4>30
  • Once all the pairs of probes have been processed and the expression of the gene indicated, an average of the IDIF values for the probes that incremented NPOS or NNEG is calculated at step 975, which is utilized as an expression level. Of course, other values including one of P1 through P4 could be used to indicate expression level.
  • For simplicity, FIG. 5 was described in reference to a single gene or EST. However, the visualization system of the present invention displays expression results for many genes to facilitate discovery of genes of interest or ESTs. Furthermore, the present invention contemplates display of expression levels of a single gene or ESTs as collected from two or more different samples such as tissue samples. The sample sources preferably differ in some characteristic. It will be understood that when the term “sample” is used herein, measurements made on a single “sample” can be based on an aggregation of multiple sample collection events or even multiple organisms.
  • FIG. 6 shows a screen display illustrating gene expression levels for multiple genes as collected from two tissue samples. A displayed horizontal axis 1002 represents expression level measured in one or more nucleic acid samples taken from the first tissue sample. A displayed vertical axis 1004 represents expression level in one or more nucleic acid samples taken from the second tissue sample. Each of marks 1006 represent a particular gene whose expression level has been measured in both the first and second tissue samples. Each mark 1006 is placed at a distance from vertical axis 1004 corresponding to expression level in the first tissue sample and at a distance from the horizontal axis 1002 corresponding to expression level in the second tissue sample.
  • The expression levels used for determining the position of marks 1006 are preferably taken from the result of step 975. The position of each of marks 1006 depends on two iterations of the steps of FIG. 5, once for the sample taken from the first tissue sample and once for the sample taken from the second tissue sample. However, a mark is preferably displayed only if one of the samples meets the threshold criteria at step 972.
  • In the depicted representative screen display, the first tissue sample is a cancerous tissue sample and the second tissue sample is a normal tissue sample. The individual marks represent the expression levels of selected genes in both cancerous and normal tissue. A first group of marks 1008 represent genes that are neither tumor suppressors nor oncogenes since their expression levels are roughly similar for both normal and cancerous tissue. These marks 1008 fall roughly along a line which is rotated 45 degrees from each of the axes. A second group of marks 1010 represent genes that are likely oncogenes since their expression levels are found to be significantly higher in cancerous tissue than in normal tissue. A third group of marks 1012 represent genes that are likely tumor suppressors since their expression levels are found to be significantly higher in normal tissue than in cancerous tissue. It will be appreciated that expression levels for large numbers of genes can be reviewed at once to discover the oncogenes and tumor suppressors.
  • Although in the depicted display, the two types of tissue are normal tissue and cancerous tissue, the present invention would aid in the discovery of genes whose expression is associated with any characteristic that varies among tissue samples. For example, once can compare expression results from tissue from individuals who have been exposed to HIV but remain infected to tissue obtained from infected individuals to identify genes conferring resistance to HIV. One can compare expression results between tissue from plants that survive drought to plants that do not. One can compare expression levels among tissue samples at successive stages or severity levels of the same disease, among tissue samples where different ultimate outcomes of the disease (e.g., patient death or remission) are known, among diseased tissue samples that have been subject to different treatment regimes including e.g., chemotherapy, antisense RNA, etc. For cancers, one can compare expression levels between malignant cells and non malignant cells. Also expression levels can be compared among different organs, between species, and among different stages of development of an organ.
  • It will be appreciated that the present invention also encompasses displays with more than two dimensions. A third visual dimension can be used to illustrate expression level from a third tissue sample. The time dimension can also be used to illustrate successive groups of two or three tissue samples at successive time periods.
  • The time dimension can be also used to correspond to tissue samples obtained at, e.g., successive stages of a disease.
  • Other interface methods corresponding to human senses other than sight can also be incorporated within the presentation system of the present invention. The senses may correspond to additional dimensions. For example, marks can be displayed in succession accompanies by a sound having characteristics corresponding to expression level in another tissue sample.
  • The user can employ a cursor 1014 to identify a particular mark as being of interest. Cursor 1014 can be moved to a particular mark by use of, e.g., mouse 11. Once cursor 1014 is over a mark of interest, the mark can be selected by, e.g., depression of one of mouse buttons 13. Selection of a particular mark can be facilitated by use of a zoom display feature (not shown). Once a particular mark is selected, further information is displayed about the gene represented by the mark. A special mouse can transmit a tactile sensation back to the user corresponding to expression level in a tissue sample as the user passes the mouse over a corresponding mark.
  • It will be appreciated that the display of FIG. 6 is not limited to expression information. The two dimensions of FIG. 6 may correspond to indicators of the presence of various polymers other than nucleic acids in two different samples. For example, each mark may correspond to a different polymer, polypeptide, or other compound. The distance of the mark from each axis would correspond to a measure of presence of the particular polymer in the sample corresponding to the axis. One possible measure is produced by fluorescently tagging polymer samples such as protein samples and exposing a probe array such as a peptide probe array to the protein samples. The fluorescent intensity of the probes will then correspond to the bonding affinity of the sample to the probes. The intensity measurement or a measurement derived from the intensity measurement may then be used to position the marks of FIG. 6.
  • FIG. 7A shows a screen display giving information about a particular gene selected from the display of FIG. 6. A cluster number 702, a GenBank accession number 704, and a verbal description 706 for the selected gene are displayed. The user can also select a number of marks 1006 by circling them with cursor 1014. Then a list of information as shown in FIG. 7A is displayed for all the genes corresponding to the selected marks.
  • By selecting GenBank accession number 704 with another cursor (not shown), the user can direct retrieval of the GenBank information for the selected gene. If the GenBank information is not available locally, the retrieval process can include formulating a query and transmitting the query to a GenBank web site. Once the GenBank information is retrieved, it can also be displayed. FIG. 7B depicts the GenBank information for the gene identified in FIG. 7A.
  • In the foregoing specification, the invention has been described with reference to specific exemplary embodiments thereof. It will, however, be evident that various modifications and changes may be made thereunto without departing from the broader spirit and scope of the invention as set forth in the appended claims and their full scope of equivalents.

Claims (3)

1-48. (canceled)
49. A method for analyzing expression level information, the method comprising:
displaying a first axis indicative of a value of a first expression level for a first expressed sequence;
displaying a first mark at a first position, the first position associated with a first coordinate related to the first axis in accordance with the first expression level of the first expressed sequence;
generating a sound associated with the first mark, the sound indicative of a second expression level for the first expressed sequence.
50. A method for analyzing expression level information, the method comprising:
displaying a first axis indicative of a value of a first expression level for a first expressed sequence;
displaying a first mark at a first position, the first position associated with a first coordinate related to the first axis in accordance with the first expression level of the first expressed sequence;
generating a sound associated with the first mark, the sound indicative of a second expression level for the first expressed sequence;
receiving an input of selection of the first mark; and
in response to the input, displaying information associated with the first expressed sequence.
US11/489,292 1997-07-25 2006-07-18 Computer-aided visualization of expression comparison Abandoned US20070067111A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US11/489,292 US20070067111A1 (en) 1997-07-25 2006-07-18 Computer-aided visualization of expression comparison
US13/626,773 US20130169645A1 (en) 1997-12-11 2012-09-25 Computer-aided visualization of expression comparison

Applications Claiming Priority (10)

Application Number Priority Date Filing Date Title
US5384297P 1997-07-25 1997-07-25
US6919897P 1997-12-11 1997-12-11
US6943697P 1997-12-11 1997-12-11
US09/020,743 US6420108B2 (en) 1998-02-09 1998-02-09 Computer-aided display for comparative gene expression
US09/122,167 US6229911B1 (en) 1997-07-25 1998-07-24 Method and apparatus for providing a bioinformatics database
US09/836,867 US6567540B2 (en) 1997-07-25 2001-04-16 Method and apparatus for providing a bioinformatics database
US10/028,748 US20020150932A1 (en) 1997-12-11 2001-12-21 Computer-aided visualization of expression comparison
US10/374,170 US6882742B2 (en) 1997-07-25 2003-02-25 Method and apparatus for providing a bioinformatics database
US11/080,216 US7215804B2 (en) 1997-07-25 2005-03-14 Method and apparatus for providing a bioinformatics database
US11/489,292 US20070067111A1 (en) 1997-07-25 2006-07-18 Computer-aided visualization of expression comparison

Related Parent Applications (2)

Application Number Title Priority Date Filing Date
US10/028,748 Division US20020150932A1 (en) 1997-07-25 2001-12-21 Computer-aided visualization of expression comparison
US11/080,216 Continuation-In-Part US7215804B2 (en) 1997-07-25 2005-03-14 Method and apparatus for providing a bioinformatics database

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US13/626,773 Continuation US20130169645A1 (en) 1997-12-11 2012-09-25 Computer-aided visualization of expression comparison

Publications (1)

Publication Number Publication Date
US20070067111A1 true US20070067111A1 (en) 2007-03-22

Family

ID=21800291

Family Applications (4)

Application Number Title Priority Date Filing Date
US09/020,743 Expired - Lifetime US6420108B2 (en) 1997-07-25 1998-02-09 Computer-aided display for comparative gene expression
US10/028,748 Abandoned US20020150932A1 (en) 1997-07-25 2001-12-21 Computer-aided visualization of expression comparison
US11/489,292 Abandoned US20070067111A1 (en) 1997-07-25 2006-07-18 Computer-aided visualization of expression comparison
US13/626,773 Abandoned US20130169645A1 (en) 1997-12-11 2012-09-25 Computer-aided visualization of expression comparison

Family Applications Before (2)

Application Number Title Priority Date Filing Date
US09/020,743 Expired - Lifetime US6420108B2 (en) 1997-07-25 1998-02-09 Computer-aided display for comparative gene expression
US10/028,748 Abandoned US20020150932A1 (en) 1997-07-25 2001-12-21 Computer-aided visualization of expression comparison

Family Applications After (1)

Application Number Title Priority Date Filing Date
US13/626,773 Abandoned US20130169645A1 (en) 1997-12-11 2012-09-25 Computer-aided visualization of expression comparison

Country Status (4)

Country Link
US (4) US6420108B2 (en)
EP (1) EP0935210A3 (en)
JP (1) JPH11342000A (en)
CA (1) CA2259887A1 (en)

Families Citing this family (101)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5959098A (en) * 1996-04-17 1999-09-28 Affymetrix, Inc. Substrate preparation process
US6706875B1 (en) * 1996-04-17 2004-03-16 Affyemtrix, Inc. Substrate preparation process
US6420108B2 (en) * 1998-02-09 2002-07-16 Affymetrix, Inc. Computer-aided display for comparative gene expression
US20040111219A1 (en) * 1999-02-22 2004-06-10 Sandeep Gulati Active interferometric signal analysis in software
US6142681A (en) 1999-02-22 2000-11-07 Vialogy Corporation Method and apparatus for interpreting hybridized bioelectronic DNA microarray patterns using self-scaling convergent reverberant dynamics
US6245511B1 (en) * 1999-02-22 2001-06-12 Vialogy Corp Method and apparatus for exponentially convergent therapy effectiveness monitoring using DNA microarray based viral load measurements
US6136541A (en) 1999-02-22 2000-10-24 Vialogy Corporation Method and apparatus for analyzing hybridized biochip patterns using resonance interactions employing quantum expressor functions
EP1041514B1 (en) * 1999-03-30 2006-03-01 Fuji Photo Film Co., Ltd. Method and apparatus for selectively displaying measurement result and corresponding images
AU5751000A (en) * 1999-06-18 2001-01-09 Brook Biotechnologies, Inc. Groups of borrelia burgdorferi and borrelia afzelii that cause lyme disease in humans
JP3451035B2 (en) 1999-07-27 2003-09-29 日立ソフトウエアエンジニアリング株式会社 Microarray information display method
JP4320112B2 (en) * 2000-03-27 2009-08-26 日立ソフトウエアエンジニアリング株式会社 Gene experiment data display method
US7363165B2 (en) 2000-05-04 2008-04-22 The Board Of Trustees Of The Leland Stanford Junior University Significance analysis of microarrays
US7062092B2 (en) 2000-08-22 2006-06-13 Affymetrix, Inc. System, method, and computer software product for gain adjustment in biological microarray scanner
JP2004522216A (en) * 2000-10-12 2004-07-22 アイコニックス ファーマシューティカルズ インコーポレイテッド Cross-correlation between compound information and genome information
US6510391B2 (en) * 2000-11-22 2003-01-21 Affymetrix, Inc. Computer software products for nucleic acid hybridization analysis
JPWO2002048915A1 (en) * 2000-12-11 2004-07-02 東京大学長 Methods for detecting associations between genes
US20020183936A1 (en) * 2001-01-24 2002-12-05 Affymetrix, Inc. Method, system, and computer software for providing a genomic web portal
US20030009294A1 (en) * 2001-06-07 2003-01-09 Jill Cheng Integrated system for gene expression analysis
JP2003159074A (en) * 2001-11-26 2003-06-03 Kudo Norio Cancer-related gene
CN1281324C (en) * 2001-12-19 2006-10-25 阿菲梅特里克斯公司 Manufacturing process for array plate assembly
JP3563394B2 (en) * 2002-03-26 2004-09-08 株式会社日立製作所 Screen display system
US7006680B2 (en) * 2002-05-03 2006-02-28 Vialogy Corp. System and method for characterizing microarray output data
US9453251B2 (en) 2002-10-08 2016-09-27 Pfenex Inc. Expression of mammalian proteins in Pseudomonas fluorescens
BR0316111A (en) * 2002-11-21 2005-09-13 Wyeth Corp Methods to diagnose rcc and other solid tumors
BRPI0410511A (en) * 2003-05-01 2006-06-20 Japan Science & Tech Agency arrangement in which different types of biosubstances obtained from an organism of interest or synthetic substances interacting with those substances are arranged and immobilized on a support, in an orderly manner, process for producing an arrangement, genotype identification method, diagnostic method gene identification to identify human genotypes, screening method to select a variety of hybrid target trait transport, genotype analysis and display system, quantitative site analysis system, gene interaction analysis system, screening method for select a variety of hybrid target trait transport by crossing organisms, quantitative site analysis system, quantitative trait analysis method to analyze a quantitative trait of an organism, gene search method to search for a gene associated with expression of a trait of interest, met variety enhancement method for organisms, gene interaction analysis system, gene interaction analysis method for analyzing gene interaction, variety improvement method
US20050027460A1 (en) * 2003-07-29 2005-02-03 Kelkar Bhooshan Prafulla Method, program product and apparatus for discovering functionally similar gene expression profiles
US8321137B2 (en) * 2003-09-29 2012-11-27 Pathwork Diagnostics, Inc. Knowledge-based storage of diagnostic models
US20050069863A1 (en) * 2003-09-29 2005-03-31 Jorge Moraleda Systems and methods for analyzing gene expression data for clinical diagnostics
AU2004285103A1 (en) 2003-09-29 2005-05-12 Pathwork Diagnostics, Inc. Systems and methods for detecting biological features
CA2497324A1 (en) 2004-02-17 2005-08-17 Affymetrix, Inc. Methods for fragmenting and labelling dna
US7588892B2 (en) 2004-07-19 2009-09-15 Entelos, Inc. Reagent sets and gene signatures for renal tubule injury
US8603824B2 (en) * 2004-07-26 2013-12-10 Pfenex, Inc. Process for improved protein expression by strain engineering
US8484000B2 (en) * 2004-09-02 2013-07-09 Vialogy Llc Detecting events of interest using quantum resonance interferometry
US20060073506A1 (en) 2004-09-17 2006-04-06 Affymetrix, Inc. Methods for identifying biological samples
EP1645640B1 (en) 2004-10-05 2013-08-21 Affymetrix, Inc. Method for detecting chromosomal translocations
US7682782B2 (en) 2004-10-29 2010-03-23 Affymetrix, Inc. System, method, and product for multiple wavelength detection using single source excitation
EP1652580A1 (en) 2004-10-29 2006-05-03 Affymetrix, Inc. High throughput microarray, package assembly and methods of manufacturing arrays
JPWO2006088208A1 (en) * 2005-02-21 2008-07-10 大日本住友製薬株式会社 Method and apparatus for predicting physiological changes in living body
US8121793B2 (en) * 2005-06-03 2012-02-21 Eppendorf Ag Method and device for comparative display of biological data
JP2007034343A (en) 2005-07-21 2007-02-08 Fujitsu Ltd Genetic information display device, genetic information display method, genetic information display program and recording medium
US7707907B2 (en) 2005-11-17 2010-05-04 Socovar, Société En Commandite Planar parallel mechanism and method
US20070198653A1 (en) * 2005-12-30 2007-08-23 Kurt Jarnagin Systems and methods for remote computer-based analysis of user-provided chemogenomic data
US7467118B2 (en) 2006-01-12 2008-12-16 Entelos Inc. Adjusted sparse linear programming method for classifying multi-dimensional biological data
US8009889B2 (en) 2006-06-27 2011-08-30 Affymetrix, Inc. Feature intensity reconstruction of biological probe array
AU2007284651B2 (en) 2006-08-09 2014-03-20 Institute For Systems Biology Organ-specific proteins and methods of their use
US7996584B2 (en) * 2006-11-02 2011-08-09 Redmere Technology Ltd. Programmable cable with deskew and performance analysis circuits
EP2450456A3 (en) 2006-11-02 2012-08-01 Yale University Assessment of oocyte competence
WO2008134461A2 (en) 2007-04-27 2008-11-06 Dow Global Technologies, Inc. Method for rapidly screening microbial hosts to identify certain strains with improved yield and/or quality in the expression of heterologous proteins
US9580719B2 (en) 2007-04-27 2017-02-28 Pfenex, Inc. Method for rapidly screening microbial hosts to identify certain strains with improved yield and/or quality in the expression of heterologous proteins
EP3260123A1 (en) 2008-11-06 2017-12-27 University of Miami Role of soluble upar in the pathogenesis of proteinuric kidney disease
CA2749103A1 (en) 2009-01-07 2010-07-15 Steve Stone Cancer biomarkers
ES2805347T3 (en) 2009-02-11 2021-02-11 Caris Mpi Inc Molecular profiling of tumors
AU2010315400B2 (en) 2009-10-27 2016-07-21 Caris Mpi, Inc. Molecular profiling for personalized medicine
US20110201008A1 (en) * 2009-12-01 2011-08-18 University Of Miami Assays, methods and kits for measuring response to therapy and predicting clinical outcome in patients with b-cell lymphoma
US8835358B2 (en) 2009-12-15 2014-09-16 Cellular Research, Inc. Digital counting of individual molecules by stochastic attachment of diverse labels
US9798855B2 (en) 2010-01-07 2017-10-24 Affymetrix, Inc. Differential filtering of genetic data
WO2011091435A2 (en) 2010-01-25 2011-07-28 Mount Sinai School Of Medicine Methods of treating liver disease
WO2011139714A2 (en) 2010-04-26 2011-11-10 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of cysteinyl-trna synthetase
US8961960B2 (en) 2010-04-27 2015-02-24 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of isoleucyl tRNA synthetases
EP2563911B1 (en) 2010-04-28 2021-07-21 aTyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of alanyl trna synthetases
US9034320B2 (en) 2010-04-29 2015-05-19 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Valyl-tRNA synthetases
AU2011248490B2 (en) 2010-04-29 2016-11-10 Pangu Biopharma Limited Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of Asparaginyl tRNA synthetases
JP5976638B2 (en) 2010-05-03 2016-08-23 エータイアー ファーマ, インコーポレイテッド Innovative discovery of therapeutic, diagnostic and antibody compositions related to protein fragments of arginyl tRNA synthetase
US8981045B2 (en) 2010-05-03 2015-03-17 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of methionyl-tRNA synthetases
EP2566495B1 (en) 2010-05-03 2017-03-01 aTyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of phenylalanyl-alpha-trna synthetases
US9062302B2 (en) 2010-05-04 2015-06-23 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of p38 multi-tRNA synthetase complex
EP2568996B1 (en) 2010-05-14 2017-10-04 aTyr Pharma, Inc. Therapeutic, diagnostic, and antibody compositions related to protein fragments of phenylalanyl-beta-trna synthetases
AU2011258106B2 (en) 2010-05-27 2017-02-23 Pangu Biopharma Limited Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of glutaminyl-tRNA synthetases
CN103118694B (en) 2010-06-01 2016-08-03 Atyr医药公司 The discovery for the treatment of, diagnosis and the antibody compositions relevant to the protein fragments of lysyl-tRNA synzyme
US20120053253A1 (en) 2010-07-07 2012-03-01 Myriad Genetics, Incorporated Gene signatures for cancer prognosis
EP2593125B1 (en) 2010-07-12 2017-11-01 aTyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of glycyl-trna synthetases
US9029506B2 (en) 2010-08-25 2015-05-12 Atyr Pharma, Inc. Innovative discovery of therapeutic, diagnostic, and antibody compositions related to protein fragments of tyrosyl-tRNA synthetases
GB201021502D0 (en) 2010-12-20 2011-02-02 Cambridge Entpr Ltd Biomarkers
WO2013112216A1 (en) 2012-01-24 2013-08-01 Cd Diagnostics, Llc System for detecting infection in synovial fluid
CA2864300A1 (en) 2012-02-16 2013-08-22 Atyr Pharma, Inc. Histidyl-trna synthetases for treating autoimmune and inflammatory diseases
EP3321378B1 (en) 2012-02-27 2021-11-10 Becton, Dickinson and Company Compositions for molecular counting
EP2820174B1 (en) 2012-02-27 2019-12-25 The University of North Carolina at Chapel Hill Methods and uses for molecular tags
GB201210565D0 (en) 2012-06-14 2012-08-01 Cambridge Entpr Ltd Biomarkers
EP4190918A1 (en) 2012-11-16 2023-06-07 Myriad Genetics, Inc. Gene signatures for cancer prognosis
EP2925886B1 (en) 2012-11-27 2019-04-24 Pontificia Universidad Católica de Chile Compositions and methods for diagnosing thyroid tumors
DK2971156T3 (en) 2013-03-15 2020-10-19 Myriad Genetics Inc GENES AND GENSIGNATURES FOR DIAGNOSIS AND TREATMENT OF MELANOMA
US10535420B2 (en) 2013-03-15 2020-01-14 Affymetrix, Inc. Systems and methods for probe design to detect the presence of simple and complex indels
GB201404189D0 (en) 2014-03-10 2014-04-23 Cambridge Entpr Ltd Novel biomarkers
EP3143160B1 (en) 2014-05-13 2019-11-06 Myriad Genetics, Inc. Gene signatures for cancer prognosis
ES2946681T3 (en) 2014-07-02 2023-07-24 Myriad Mypath Llc Genes and gene signatures for the diagnosis and treatment of melanoma
WO2016042067A1 (en) * 2014-09-17 2016-03-24 Vito Nv Methods and tools for analyzing hybridization
GB201500729D0 (en) 2015-01-16 2015-03-04 Cambridge Entpr Ltd Novel Biomarkers
CA2994416A1 (en) 2015-08-04 2017-02-09 Cd Diagnostics, Inc. Methods for detecting adverse local tissue reaction (altr) necrosis
EP3377650A1 (en) 2015-11-19 2018-09-26 Susanne Wagner Signatures for predicting cancer immune therapy response
EP3400312A4 (en) 2016-01-06 2019-08-28 Myriad Genetics, Inc. Genes and gene signatures for diagnosis and treatment of melanoma
WO2017193062A1 (en) 2016-05-06 2017-11-09 Myriad Genetics, Inc. Gene signatures for renal cancer prognosis
MA50057A (en) 2017-09-01 2020-07-08 Juno Therapeutics Inc GENE EXPRESSION AND ASSESSMENT OF A RISK OF DEVELOPING TOXICITY FOLLOWING CELL THERAPY
WO2019246160A2 (en) 2018-06-18 2019-12-26 Igenomix, S.L. Methods, compositions, and kits for assessing endometrial transformation
WO2020002621A2 (en) 2018-06-29 2020-01-02 F. Hoffmann-La Roche Ag Detection of microsatellite instability
EP3864165A4 (en) 2018-10-09 2022-08-03 Genecentric Therapeutics, Inc. Detecting cancer cell of origin
MX2021006234A (en) 2018-11-30 2021-09-10 Caris Mpi Inc Next-generation molecular profiling.
EP3956476A1 (en) 2019-04-17 2022-02-23 Igenomix S.L. Improved methods for the early diagnosis of uterine leiomyomas and leiomyosarcomas
CA3142361A1 (en) 2019-06-12 2020-12-17 Juno Therapeutics, Inc. Combination therapy of a cell-mediated cytotoxic therapy and an inhibitor of a prosurvival bcl2 family protein
KR20220066892A (en) 2019-08-22 2022-05-24 주노 쎄러퓨티크스 인코퍼레이티드 Combination therapy of T cell therapy and Zest homologue 2 enhancer (EH2) inhibitor and related methods
IL293489A (en) 2019-12-02 2022-08-01 Caris Mpi Inc Pan-cancer platinum response predictor
GB201919092D0 (en) 2019-12-20 2020-02-05 Cambridge Entpr Ltd Method of determining risk of fetal size abnormality

Citations (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4617431A (en) * 1983-12-02 1986-10-14 Plantronics, Inc. Voice tube assemblies for post-auricle headsets
US4683202A (en) * 1985-03-28 1987-07-28 Cetus Corporation Process for amplifying nucleic acid sequences
US4845653A (en) * 1987-05-07 1989-07-04 Becton, Dickinson And Company Method of displaying multi-parameter data sets to aid in the analysis of data characteristics
US5143854A (en) * 1989-06-07 1992-09-01 Affymax Technologies N.V. Large scale photolithographic solid phase synthesis of polypeptides and receptor binding screening thereof
US5206137A (en) * 1988-09-08 1993-04-27 Lifecodes Corporation Compositions and methods useful for genetic analysis
US5492806A (en) * 1987-04-01 1996-02-20 Hyseq, Inc. Method of determining an ordered sequence of subfragments of a nucleic acid fragment by hybridization of oligonucleotide probes
US5524070A (en) * 1992-10-07 1996-06-04 The Research Foundation Of State University Of New York Local adaptive contrast enhancement
US5525464A (en) * 1987-04-01 1996-06-11 Hyseq, Inc. Method of sequencing by hybridization of oligonucleotide probes
US5571639A (en) * 1994-05-24 1996-11-05 Affymax Technologies N.V. Computer-aided engineering system for design of sequence arrays and lithographic masks
US5632282A (en) * 1993-07-20 1997-05-27 Hay; S. Hutson Ocular disease detection apparatus
US5700637A (en) * 1988-05-03 1997-12-23 Isis Innovation Limited Apparatus and method for analyzing polynucleotide sequences and method of generating oligonucleotide arrays
US5707808A (en) * 1996-04-15 1998-01-13 The Regents Of The University Of California Optical selection and collection of DNA fragments
US5777888A (en) * 1995-08-09 1998-07-07 Regents Of The University Of California Systems for generating and analyzing stimulus-response output signal matrices
US5800992A (en) * 1989-06-07 1998-09-01 Fodor; Stephen P.A. Method of detecting nucleic acids
US5843767A (en) * 1993-10-28 1998-12-01 Houston Advanced Research Center Microfabricated, flowthrough porous apparatus for discrete detection of binding reactions
US5854927A (en) * 1994-09-30 1998-12-29 U.S. Philips Corporation Multimedia system receptive for presentation of mass data comprising an application program inclusive of a multiplatform interpreter, and a platform subsystem arranged for interaction with said multiplatform interpreter and mass memory for use with such s
US5871697A (en) * 1995-10-24 1999-02-16 Curagen Corporation Method and apparatus for identifying, classifying, or quantifying DNA sequences in a sample without sequencing
US5871928A (en) * 1989-06-07 1999-02-16 Fodor; Stephen P. A. Methods for nucleic acid analysis
US5925525A (en) * 1989-06-07 1999-07-20 Affymetrix, Inc. Method of identifying nucleotide differences
US5961923A (en) * 1995-04-25 1999-10-05 Irori Matrices with memories and uses thereof
US5974164A (en) * 1994-10-21 1999-10-26 Affymetrix, Inc. Computer-aided visualization and analysis system for sequence evaluation
US6023659A (en) * 1996-10-10 2000-02-08 Incyte Pharmaceuticals, Inc. Database system employing protein function hierarchies for viewing biomolecular sequence data
US6028593A (en) * 1995-12-01 2000-02-22 Immersion Corporation Method and apparatus for providing simulated physical interactions within computer generated environments
US6040138A (en) * 1995-09-15 2000-03-21 Affymetrix, Inc. Expression monitoring by hybridization to high density oligonucleotide arrays
US6203977B1 (en) * 1988-11-15 2001-03-20 Yale University Delineation of individual human chromosomes in metaphase and interphase cells by in situ suppression hybridization
US6229911B1 (en) * 1997-07-25 2001-05-08 Affymetrix, Inc. Method and apparatus for providing a bioinformatics database
US6262079B1 (en) * 1992-09-25 2001-07-17 Neorx Corporation Prevention and treatment of cardiovascular pathologies
US20010018183A1 (en) * 1999-02-02 2001-08-30 Yijia Bao Simultaneous measurement of gene expression and genomic abnormalities using nucleic acid microarrays
US6284465B1 (en) * 1999-04-15 2001-09-04 Agilent Technologies, Inc. Apparatus, systems and method for locating nucleic acids bound to surfaces
US6331396B1 (en) * 1998-09-23 2001-12-18 The Cleveland Clinic Foundation Arrays for identifying agents which mimic or inhibit the activity of interferons
US6420108B2 (en) * 1998-02-09 2002-07-16 Affymetrix, Inc. Computer-aided display for comparative gene expression
US20020182610A1 (en) * 2000-09-19 2002-12-05 Tadashi Okamoto Method for making probe support and apparatus used for the method
US20020182659A1 (en) * 1993-05-13 2002-12-05 Neorx Corporation Method to determine TGF-beta
US20030036855A1 (en) * 1998-03-16 2003-02-20 Praelux Incorporated, A Corporation Of New Jersey Method and apparatus for screening chemical compounds
US6567570B1 (en) * 1998-10-30 2003-05-20 Hewlett-Packard Development Company, L.P. Optical image scanner with internal measurement of point-spread function and compensation for optical aberrations
US6569615B1 (en) * 2000-04-10 2003-05-27 The United States Of America As Represented By The Department Of Veteran's Affairs Composition and methods for tissue preservation
US6573039B1 (en) * 1997-02-27 2003-06-03 Cellomics, Inc. System for cell-based screening
US6600996B2 (en) * 1994-10-21 2003-07-29 Affymetrix, Inc. Computer-aided techniques for analyzing biological sequences

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
NO870613L (en) 1986-03-05 1987-09-07 Molecular Diagnostics Inc DETECTION OF MICROORGANISMS IN A SAMPLE CONTAINING NUCLEIC ACID.
EP0307476A4 (en) 1986-12-20 1990-12-12 Kukita, Takeshi Bilirubin antigen, monoclonal antibody therefor, process for their preparation, and their use
JP2897959B2 (en) 1988-05-20 1999-05-31 エフ.ホフマン―ラ ロシュ アクチェンゲゼルシャフト Immobilized sequence-specific probe
JPH02299598A (en) 1989-04-14 1990-12-11 Ro Inst For Molecular Genetics & Geneteic Res Determination by means of hybridization, together with oligonucleotide probe of all or part of extremely short sequence in sample of nucleic acid connecting with separate particle of microscopic size
DE69132843T2 (en) 1990-12-06 2002-09-12 Affymetrix Inc N D Ges D Staat Identification of nucleic acids in samples
JPH06504997A (en) 1990-12-06 1994-06-09 アフィメトリックス, インコーポレイテッド Synthesis of immobilized polymers on a very large scale
US6384847B1 (en) * 1992-03-20 2002-05-07 International Business Machines Corporation Interactive graphical method for analyzing many-dimensional data sets
EP0655090B1 (en) 1992-04-27 2000-12-27 The Trustees Of Dartmouth College Detection of gene sequences in biological fluids
DE69433180T2 (en) 1993-10-26 2004-06-24 Affymetrix, Inc., Santa Clara FIELDS OF NUCLEIC ACID PROBE ON ORGANIC CHIPS
JPH11501741A (en) 1995-01-27 1999-02-09 インサイト ファーマシューティカルズ インク. Computer system for storing and analyzing microbiological data
US5707806A (en) 1995-06-07 1998-01-13 Genzyme Corporation Direct sequence identification of mutations by cleavage- and ligation-associated mutation-specific sequencing
GB9522615D0 (en) 1995-11-03 1996-01-03 Pharmacia Spa 4-Phenyl-4-oxo-butanoic acid derivatives with kynurenine-3-hydroxylase inhibiting activity
US5778200A (en) 1995-11-21 1998-07-07 Advanced Micro Devices, Inc. Bus arbiter including aging factor counters to dynamically vary arbitration priority
JP2002515738A (en) 1996-01-23 2002-05-28 アフィメトリックス,インコーポレイティド Nucleic acid analysis
JP2000504575A (en) 1996-02-08 2000-04-18 アフィメトリックス,インコーポレイテッド Chip-based speciation and phenotypic characterization of microorganisms
US5687972A (en) * 1996-11-26 1997-11-18 Petrak; Gregory H. Unitary oil seal assembly

Patent Citations (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4617431A (en) * 1983-12-02 1986-10-14 Plantronics, Inc. Voice tube assemblies for post-auricle headsets
US4683202B1 (en) * 1985-03-28 1990-11-27 Cetus Corp
US4683202A (en) * 1985-03-28 1987-07-28 Cetus Corporation Process for amplifying nucleic acid sequences
US5695940A (en) * 1987-04-01 1997-12-09 Hyseq, Inc. Method of sequencing by hybridization of oligonucleotide probes
US5667972A (en) * 1987-04-01 1997-09-16 Hyseg, Inc. Method of sequencing of genoms by hybridization of oligonucleotide probes
US5525464A (en) * 1987-04-01 1996-06-11 Hyseq, Inc. Method of sequencing by hybridization of oligonucleotide probes
US5492806A (en) * 1987-04-01 1996-02-20 Hyseq, Inc. Method of determining an ordered sequence of subfragments of a nucleic acid fragment by hybridization of oligonucleotide probes
US4845653A (en) * 1987-05-07 1989-07-04 Becton, Dickinson And Company Method of displaying multi-parameter data sets to aid in the analysis of data characteristics
US5700637A (en) * 1988-05-03 1997-12-23 Isis Innovation Limited Apparatus and method for analyzing polynucleotide sequences and method of generating oligonucleotide arrays
US5206137A (en) * 1988-09-08 1993-04-27 Lifecodes Corporation Compositions and methods useful for genetic analysis
US6203977B1 (en) * 1988-11-15 2001-03-20 Yale University Delineation of individual human chromosomes in metaphase and interphase cells by in situ suppression hybridization
US5445934A (en) * 1989-06-07 1995-08-29 Affymax Technologies N.V. Array of oligonucleotides on a solid substrate
US5143854A (en) * 1989-06-07 1992-09-01 Affymax Technologies N.V. Large scale photolithographic solid phase synthesis of polypeptides and receptor binding screening thereof
US5871928A (en) * 1989-06-07 1999-02-16 Fodor; Stephen P. A. Methods for nucleic acid analysis
US5800992A (en) * 1989-06-07 1998-09-01 Fodor; Stephen P.A. Method of detecting nucleic acids
US5925525A (en) * 1989-06-07 1999-07-20 Affymetrix, Inc. Method of identifying nucleotide differences
US6262079B1 (en) * 1992-09-25 2001-07-17 Neorx Corporation Prevention and treatment of cardiovascular pathologies
US5524070A (en) * 1992-10-07 1996-06-04 The Research Foundation Of State University Of New York Local adaptive contrast enhancement
US20020182659A1 (en) * 1993-05-13 2002-12-05 Neorx Corporation Method to determine TGF-beta
US5632282A (en) * 1993-07-20 1997-05-27 Hay; S. Hutson Ocular disease detection apparatus
US5843767A (en) * 1993-10-28 1998-12-01 Houston Advanced Research Center Microfabricated, flowthrough porous apparatus for discrete detection of binding reactions
US5571639A (en) * 1994-05-24 1996-11-05 Affymax Technologies N.V. Computer-aided engineering system for design of sequence arrays and lithographic masks
US5856101A (en) * 1994-05-24 1999-01-05 Affymetrix, Inc. Computer-aided engineering system for design of sequence arrays and lithographic masks
US5593839A (en) * 1994-05-24 1997-01-14 Affymetrix, Inc. Computer-aided engineering system for design of sequence arrays and lithographic masks
US5854927A (en) * 1994-09-30 1998-12-29 U.S. Philips Corporation Multimedia system receptive for presentation of mass data comprising an application program inclusive of a multiplatform interpreter, and a platform subsystem arranged for interaction with said multiplatform interpreter and mass memory for use with such s
US5974164A (en) * 1994-10-21 1999-10-26 Affymetrix, Inc. Computer-aided visualization and analysis system for sequence evaluation
US6600996B2 (en) * 1994-10-21 2003-07-29 Affymetrix, Inc. Computer-aided techniques for analyzing biological sequences
US5961923A (en) * 1995-04-25 1999-10-05 Irori Matrices with memories and uses thereof
US5777888A (en) * 1995-08-09 1998-07-07 Regents Of The University Of California Systems for generating and analyzing stimulus-response output signal matrices
US6040138A (en) * 1995-09-15 2000-03-21 Affymetrix, Inc. Expression monitoring by hybridization to high density oligonucleotide arrays
US5871697A (en) * 1995-10-24 1999-02-16 Curagen Corporation Method and apparatus for identifying, classifying, or quantifying DNA sequences in a sample without sequencing
US6028593A (en) * 1995-12-01 2000-02-22 Immersion Corporation Method and apparatus for providing simulated physical interactions within computer generated environments
US5707808A (en) * 1996-04-15 1998-01-13 The Regents Of The University Of California Optical selection and collection of DNA fragments
US6023659A (en) * 1996-10-10 2000-02-08 Incyte Pharmaceuticals, Inc. Database system employing protein function hierarchies for viewing biomolecular sequence data
US6573039B1 (en) * 1997-02-27 2003-06-03 Cellomics, Inc. System for cell-based screening
US6229911B1 (en) * 1997-07-25 2001-05-08 Affymetrix, Inc. Method and apparatus for providing a bioinformatics database
US6882742B2 (en) * 1997-07-25 2005-04-19 Affymetrix, Inc. Method and apparatus for providing a bioinformatics database
US6420108B2 (en) * 1998-02-09 2002-07-16 Affymetrix, Inc. Computer-aided display for comparative gene expression
US20030036855A1 (en) * 1998-03-16 2003-02-20 Praelux Incorporated, A Corporation Of New Jersey Method and apparatus for screening chemical compounds
US6331396B1 (en) * 1998-09-23 2001-12-18 The Cleveland Clinic Foundation Arrays for identifying agents which mimic or inhibit the activity of interferons
US6567570B1 (en) * 1998-10-30 2003-05-20 Hewlett-Packard Development Company, L.P. Optical image scanner with internal measurement of point-spread function and compensation for optical aberrations
US20010018183A1 (en) * 1999-02-02 2001-08-30 Yijia Bao Simultaneous measurement of gene expression and genomic abnormalities using nucleic acid microarrays
US6284465B1 (en) * 1999-04-15 2001-09-04 Agilent Technologies, Inc. Apparatus, systems and method for locating nucleic acids bound to surfaces
US6569615B1 (en) * 2000-04-10 2003-05-27 The United States Of America As Represented By The Department Of Veteran's Affairs Composition and methods for tissue preservation
US20020182610A1 (en) * 2000-09-19 2002-12-05 Tadashi Okamoto Method for making probe support and apparatus used for the method

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
Anderson et al. Introduction to Statistics: Concepts and Applications, 2nd Edition. New York: West Publishing Company, 1991, pages 69-72. *
Guenther WC. Analysis of Variance. Englewood Cliffs, New Jersey: Prentice-Hall, Inc., 1964, pages 20-21. *
Okuda et al. Natural history of hepatocellular carcinoma and prognosis in relation to treatment: Study of 850 patients. Cancer, 1985, volume 56, pages 918-928. *
Schena et al. Parallel human genome analysis: Microarray-based expression monitoring of 1000 genes. PNAS, volume 93, October 1996, pages 10614-10619. *
Sehgal et al. Developmental expression patterns of CFTR in ferret tracheal surface airway and submucosal gland epithelia. Am. J. Respir. Cell Mol. Biol., 1996, volume 15, pages 122-131. *

Also Published As

Publication number Publication date
US20130169645A1 (en) 2013-07-04
US6420108B2 (en) 2002-07-16
JPH11342000A (en) 1999-12-14
EP0935210A3 (en) 2002-11-20
US20020150932A1 (en) 2002-10-17
CA2259887A1 (en) 1999-08-09
US20020015948A1 (en) 2002-02-07
EP0935210A2 (en) 1999-08-11

Similar Documents

Publication Publication Date Title
US20070067111A1 (en) Computer-aided visualization of expression comparison
US6600996B2 (en) Computer-aided techniques for analyzing biological sequences
US6308170B1 (en) Gene expression and evaluation system
EP1019536B1 (en) Polymorphism detection utilizing clustering analysis
US5733729A (en) Computer-aided probability base calling for arrays of nucleic acid probes on chips
US6242180B1 (en) Computer-aided visualization and analysis system for sequence evaluation
US6361937B1 (en) Computer-aided nucleic acid sequencing
Buchholz et al. Use of DNA arrays/microarrays in pancreatic research
JP4557609B2 (en) How to display splice variant sequence mapping
US6994965B2 (en) Method for displaying results of hybridization experiment
US20030220748A1 (en) Computer-aided techniques for analyzing biological sequences
US20040175718A1 (en) Computer-aided visualization and analysis system for sequence evaluation
EP1632579A2 (en) Computer-aided techniques for analyzing biological sequences
JP2003532367A (en) Polymer identification, validation, mapping and classification techniques

Legal Events

Date Code Title Description
AS Assignment

Owner name: GENERAL ELECTRIC CAPITAL CORPORATION, AS AGENT, MA

Free format text: SECURITY AGREEMENT;ASSIGNOR:AFFYMETRIX, INC.;REEL/FRAME:028465/0541

Effective date: 20120625

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

AS Assignment

Owner name: AFFYMETRIX, INC., CALIFORNIA

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:GENERAL ELECTRIC CAPITAL CORPORATION, AS AGENT;REEL/FRAME:037109/0132

Effective date: 20151028