WO1997011350A2 - A neural network assisted multi-spectral segmentation system - Google Patents

Publication number
WO1997011350A2
WO1997011350A2 PCT/CA1996/000619 CA9600619W WO9711350A2 WO 1997011350 A2 WO1997011350 A2 WO 1997011350A2 CA 9600619 W CA9600619 W CA 9600619W WO 9711350 A2 WO9711350 A2 WO 9711350A2
Authority
WO
WIPO (PCT)
Prior art keywords
images
neural network
nuclear
cellular material
map
Prior art date
Application number
PCT/CA1996/000619
Other languages
French (fr)
Other versions
WO1997011350A3 (en)
Inventor
Ryan S. Raz
Original Assignee
Morphometrix Technologies Inc.
Priority date
Filing date
Publication date
Application filed by Morphometrix Technologies Inc. filed Critical Morphometrix Technologies Inc.
Priority to JP9512263A priority Critical patent/JPH11515097A/en
Priority to AU69214/96A priority patent/AU726049B2/en
Priority to EP96929994A priority patent/EP0850405A2/en
Priority to CA002232164A priority patent/CA2232164A1/en
Publication of WO1997011350A2 publication Critical patent/WO1997011350A2/en
Publication of WO1997011350A3 publication Critical patent/WO1997011350A3/en
Priority to US09/040,378 priority patent/US6463425B2/en

Classifications

    • G01N15/1433
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/10 Segmentation; Edge detection
    • G06T7/11 Region-based segmentation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/10 Segmentation; Edge detection
    • G06T7/136 Segmentation; Edge detection involving thresholding
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/10 Segmentation; Edge detection
    • G06T7/155 Segmentation; Edge detection involving morphological operators
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/60 Type of objects
    • G06V20/69 Microscopic objects, e.g. biological cells or cellular parts
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/10 Image acquisition modality
    • G06T2207/10056 Microscopic image
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20036 Morphological image processing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20084 Artificial neural networks [ANN]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/30 Subject of image; Context of image processing
    • G06T2207/30004 Biomedical image processing
    • G06T2207/30024 Cell structures in vitro; Tissue sections in vitro

Definitions

  • the present invention relates to automated diagnostic techniques in medicine and biology, and more particularly to a neural network for multi-spectral segmentation of nuclear and cytoplasmic objects.
  • the segmentation is the delineation of the objects of interest within the micrographic image.
  • in addition to the cervical cells required for an analysis, there is a wide range of "background" material, debris and contamination that interferes with the identification of the cervical cells and therefore must be delineated. Also, for each cervical cell, it is necessary to delineate the nucleus with the cytoplasm.
  • Feature extraction operation is performed after the completion of the segmentation operation.
  • Feature extraction comprises characterizing the segmented regions as a series of descriptors based on the morphological, textural, densitometric and colorimetric attributes of these regions.
  • the Classification step is the final step in the image analysis.
  • the features extracted in the previous stage are used in some type of discriminant-based classification procedure.
  • the results of this classification are then translated into a "diagnosis" of the cells in the image.
  • segmentation is the most crucial and the most difficult. This is particularly true for the types of images typically encountered in medical or biological specimens.
  • the goal of segmentation is to accurately delineate the cervical cells and their nuclei.
  • the situation is complicated not only by the variety of cells found in the smear, but also by the alterations in morphology produced by the sample preparation technique and by the quantity of debris associated with these specimens.
  • Furthermore, during preparation it is difficult to control the way cervical cells are deposited on the surface of the slide which as a result leads to a large amount of cell overlap and distortion.
  • Papanicolaou Stain is a combination of several stains or dyes together with a specific protocol designed to emphasize and delineate cellular structures of importance for pathological analysis.
  • the stains or dyes included in the Papanicolaou Stain are Haematoxylin, Orange G and Eosin Azure (a mixture of two acid dyes, Eosin Y and Light Green SF Yellowish, together with Bismarck Brown).
  • Each stain component is sensitive to or binds selectively to a particular cell structure or material.
  • Haematoxylin binds to the nuclear material colouring it dark blue.
  • Orange G is an indicator of keratin protein content.
  • Eosin Y stains nucleoli, red blood cells and mature squamous epithelial cells.
  • Light Green SF Yellowish stains metabolically active epithelial cells.
  • Bismarck Brown stains vegetable material and cellulose.
  • differential staining characteristics are only the means to the end in the solution to the problem of segmentation. Of equal importance is the procedure for handling the information provided by the spectral character of the cellular objects when making a decision concerning identity.
  • the present invention provides a Neural-Network Assisted Multi-Spectral Segmentation (also referred to as the NNA-MSS) method and system.
  • the first stage according to the present invention comprises the acquisition of three images of the same micrographic scene. Each image is obtained using a different narrow band-pass optical filter which has the effect of selecting a narrow band of optical wavelengths associated with distinguishing absorption peaks in the stain spectra. The choice of optical wavelength bands is guided by the degree of separation afforded by these peaks when used to distinguish the different types of cellular material on the slide surface.
  • the second stage according to the invention comprises a neural-network (trained on an extensive set of typical examples) to make decisions on the identity of material already deemed to be cellular in origin. The neural network decides whether or not a picture element in the digitized image is nuclear or not nuclear in character.
  • the system can continue on applying a standard range of image processing techniques to refine the segmentation.
  • the relationship between the cellular components and the transmission intensity of the light images in each of the three spectral bands is a complex and non-linear one.
  • Papanicolaou Stain is a combination of several stains or dyes together with a specific protocol designed to emphasize and delineate cellular structures of importance to pathological analysis.
  • the stains or dyes included in the Papanicolaou Stain are Haematoxylin, Orange G and Eosin Azure (a mixture of two acid dyes, Eosin Y and Light Green SF Yellowish, together with Bismarck Brown).
  • Each stain component is sensitive to or binds selectively to a particular cellular structure or material.
  • Haematoxylin binds to the nuclear material colouring it dark blue; Orange G is an indicator of keratin protein content; Eosin Y stains nucleoli, red blood cells and mature squamous epithelial cells; Light Green SF Yellowish stains metabolically active epithelial cells; Bismarck Brown stains vegetable material and cellulose.
  • three optical wavelength bands are used in a complex procedure to segment Papanicolaou-stained epithelial cells in digitized images.
  • the procedure utilizes standard segmentation operations (erosion, dilation, etc.) together with the neural-network to identify the location of nuclear components in areas already determined to be cellular material.
  • the purpose of the segmentation is to extract the cellular objects, i.e. to distinguish the nucleus of the cell from the cytoplasm.
  • the multi-spectral images are divided into two classes: cytoplasm objects and nuclear objects, which are separated by a multi-dimensional threshold t which comprises a 3-dimensional space.
  • the neural network according to the invention comprises a Probability Projection Neural Network (PPNN).
  • the PPNN according to the present invention features fast training for a large volume of data, processing of multi-modal non-Gaussian data distribution, good generalization simultaneously with high sensitivity to small clusters of patterns representing the useful subclasses of cells.
  • the PPNN is implemented as a hardware-encoded algorithm.
  • the present invention provides a method for identifying nuclear and cytoplasmic objects in a biological specimen, said method comprising the steps of: (a) acquiring a plurality of images of said biological specimen; (b) identifying cellular material from said images and creating a cellular material map; (c) applying a neural network to said cellular material map and classifying nuclear and cytoplasmic objects from said images.
  • the present invention provides a system for identifying nuclear and cytoplasmic objects in a biological specimen, said system comprising: (a) image acquisition means for acquiring a plurality of images of said biological specimen; (b) processing means for processing said images and generating a cellular material map identifying cellular material; (c) neural processor means for processing said cellular material map and including means for classifying nuclear and cytoplasmic objects from said images.
  • the present invention provides a hardware-encoded neural processor for classifying input data, said hardware-encoded neural processor comprising: (a) a memory having a plurality of addressable storage locations; (b) said addressable storage locations containing classification information associated with the input data; (c) address generation means for generating an address from said input data for accessing the classification information stored in said memory for selected input data.
  • Fig. 1 shows in flow chart form a neural network assisted multi-spectral segmentation method according to the present invention
  • Fig. 2 shows in diagrammatic form a processing element for the neural network
  • Fig. 3 shows in diagrammatic form a neural network comprising the processing elements of Fig. 2;
  • Fig. 4 shows in diagrammatic form a training step for the neural network
  • Fig. 5 shows in flow chart form a clustering algorithm for the neural network according to the present invention.
  • Fig. 6 shows a hardware implementation for the neural network according to the present invention.
  • the present invention provides a Neural Network Assisted Multi-Spectral Segmentation (also referred to as NNA-MSS) system and method.
  • the multi-spectral segmentation method is related to that described and claimed in co-pending International Patent Application No. CA96/00477 filed July 18, 1996 and in the name of the applicant.
  • the NNA-MSS according to the present invention is particularly suited to Papanicolaou-stained gynaecological smears and will be described in this context. It is however to be understood that the present invention has wider applicability to applications outside of Papanicolaou-stained smears.
  • Fig. 1 shows in flow chart form a Neural Network Assisted Multi-Spectral Segmentation (NNA-MSS) method 1 according to the present invention.
  • the first step 10 involves inputting three digitized images, i.e. micrographic scenes, of a cellular specimen.
  • the images are taken in each of the three narrow optical bands: 540 ± 5 nm; 577 ± 5 nm and 630 ± 5 nm.
  • the images are generated by an imaging system (not shown) as will be understood by one skilled in the art, and thus need not be described in detail here.
  • the images are next processed by the multi-segmentation method 1 and neural network as will be described.
  • the levelling operation 12 involves removing the spatial variations in the illumination intensity from the images.
  • the levelling operation is implemented as a simple mathematical routine using known image processing techniques.
  • the result of the levelling operation is a set of 8-bit digitized images with uniform illumination across their fields.
  • the 8-bit digitized images first undergo a series of processing steps to identify cellular material in the digitized images.
  • the digitized images are then processed by the neural network to segment the nuclear objects from the cytoplasm objects.
  • the next operation comprises a threshold procedure block 14.
  • the threshold procedure involves analyzing the levelled images in a search for material of cellular origin.
  • the threshold procedure 14 is applied to the 530 nm and 630 nm optical wavelength bands and comprises identifying material in the image of cellular origin as regions of the digitized image that fall within a range of specific digital values.
  • the threshold procedure 14 produces a single binary "map" of the image where the single binary bit identifies regions that are, or are not, cellular material.
  • the threshold operation 14 is followed by a dilation operation (block 16).
  • the dilation operation 16 is a conventional image processing operation which modifies the binary map of cellular material generated in block 14.
  • the dilation operation allows the regions of cellular material to grow or dilate by one pixel in order to fill small voids in large regions.
  • the dilation operation 16 is modified with the condition that the dilation does not allow two separate regions of cellular material to join to make a single region, i.e. a "no-join" condition. This condition allows the accuracy of the binary map to be preserved through dilation operation 16.
  • the dilation operation is applied twice to ensure a proper filling of voids.
  • the result of the dilation operations 16 is a modified binary map of cellular material.
  • the dilation operation 16 is followed by an erosion operation (block 18).
  • the erosion operation 18 brings the modified binary map of cellular material (a result of the dilation operation 16) back to its original boundaries.
  • the erosion operation 18 is implemented using conventional image processing techniques.
  • the erosion operation 18 allows the cellular boundaries in the binary image to shrink or erode but will not affect the filled voids.
  • the erosion operation 18 has the additional effect of eliminating small regions of cellular material that are not important to the later diagnostic analysis.
  • the result of the erosion operation 18 is a final binary map of the regions in the digitized image that are cytoplasm.
  • the next stage according to the invention is the operation of the neural network at block 20.
  • the neural network 20 is applied to the 8-bit digitized images, with attention restricted to those regions that lie within the cytoplasm as determined by the final binary cytoplasm map generated as a result of the previous operations.
  • the neural network 20 makes decisions concerning the identity of individual picture elements (or "pixels") in the binary image as either being part of a nucleus or not part of a nucleus.
  • the result of the operation of the neural network is a digital map of the regions within the cytoplasm that are considered to be nuclear material.
  • the nuclear material map is then subjected to further processing.
  • the neural network 20 according to the present invention is described in detail below.
  • the resulting nuclear material map is subjected to an erosion operation (block 22).
  • the erosion operation 22 eliminates regions of the nuclear material map that are too small to be of diagnostic significance.
  • the result is a modified binary map of nuclear regions.
  • the modified binary map resulting from the erosion operation 22 is then subjected to a dilation operation (block 24).
  • the dilation operation 24 is subject to a no-join condition, such that, the dilation operation does not allow two separate regions of nuclear material to join to make a single region. In this way the accuracy of the binary map is preserved notwithstanding the dilation operation.
  • the dilation operation 24 is preferably applied twice to ensure a proper filling of voids.
  • the result of these dilation operations is a modified binary map of nuclear material.
  • an erosion operation is applied (block 26).
  • Double application of the erosion operation 26 eliminates regions of the nuclear material in the binary map that are too small to be of diagnostic significance. The result is a modified binary map of nuclear regions.
  • the remaining operations involve constructing a binary map comprising high gradients, i.e. boundaries, of pixel intensity, in order to sever nuclear regions that share high gradient boundaries.
  • the first step in severing the high-gradient boundaries in the nuclear map is to construct a binary map of these high gradient boundaries using a threshold operation (block 28) applied to a Sobel map.
  • the Sobel map is generated by applying the Sobel gradient operator to the 577 nm 8-bit digitized image to determine regions of that image that contain high gradients of pixel intensity (block 29).
  • the 8-bit digitized image for the 577 nm band was obtained from the levelling operation in block 12.
  • the result of the Sobel operation in block 29 is an 8-bit map of gradient intensity.
  • a logical NOT operation is performed (block 30).
  • the logical NOT operation 30 determines the coincidence of the two states, high-gradients and nuclei, and reverses the pixel value of the nuclear map at the point of the coincidence in order to eliminate it from regions that are presumed to be nuclear material.
  • the result of this logical operation is a modified nuclear map.
  • the modified nuclear map is next subjected to an erosion operation (block 32).
  • the erosion operation 32 eliminates regions in the modified nuclear map that are too small to be of diagnostic significance.
  • the result is a modified binary map of nuclear regions.
  • the binary map of nuclear regions is dramatically altered.
  • the dilation operation 34 includes the condition that no two nuclear regions will become joined as they dilate and that no nuclear region will be allowed to grow outside its old boundary as defined by the binary map that existed before the Sobel procedure was applied.
  • the dilation operation 34 is preferably applied four times. The result is a modified binary map of nuclear material.
  • the operation at block 20 in Fig. 1 comprises neural network processing of the digitized images.
  • the neural network 20 is a highly parallel, distributed, information processing system that has the topology of a directed graph.
  • the network comprises a set of "nodes" and a series of "connexions" between the nodes.
  • the nodes comprise processing elements and the connexions between the nodes represent the transfer of information from one node to another.
  • FIG. 2 shows a node or processing element 100a for a backpropagation neural network 20.
  • Each of the nodes 100a accepts one or more inputs 102 shown individually as $a_1, a_2, a_3, \ldots, a_n$ in Fig. 2.
  • the inputs 102 are taken into the node 100a and each input 102 is multiplied by its own mathematical weighting factor before being summed together with the threshold factor for the processing element 100a.
  • the processing element 100a then generates a single output 104 (i.e. $b_j$) according to the "transfer function" being used in the network 20.
  • the output 104 is then available as an input to other nodes or processing elements, for example processing elements 100b, 100c, 100d, 100e and 100f as depicted in Fig. 1.
  • the transfer function may be any suitable mathematical function but it is usual to employ a "sigmoid" function.
  • the relationship between the inputs 102 into the node 100 and the output 104 is given by expression (1) as follows:
  • $b_j$ is the output 104 of the node 100
  • $a_i$ is the value of the input 102 to the node labelled "i"
  • $W_{ji}$ is the weighting given to that input 102
  • $\theta_j$ is the threshold value for the node 100.
  • the transfer function is modelled after a sigmoid function.
  • the nodes or processing elements for the neural network are arranged in a series of layers denoted by 106, 108 and 110 as shown in Fig. 3.
  • the first layer 106 comprises nodes or processing elements 112 shown individually as 112a, 112b, 112c, 112d and 112e.
  • the first layer 106 is an input layer and accepts the information required for a decision.
  • the second layer 108 in the neural network 20 is known as the hidden layer and comprises processing elements 114 shown individually as 114a, 114b, 114c, 114d and 114e. All of the nodes 112 in the input layer 106 are connected to all of the nodes 114 in the hidden layer 108. It will be understood that there may be more than one hidden layer, with each node in the successive layer connected to each node of the previous layer. For convenience only one hidden layer 108 is shown in Fig. 3.
  • the (last) hidden layer 108 leads to the output layer 110.
  • the output layer 110 comprises processing elements 116 shown individually as 116a, 116b, 116c, 116d and 116e in Fig. 3.
  • Each node 114 of the (last) hidden layer 108 (Fig. 3) is connected to each node 116 of the output layer 110.
  • the output layer 110 renders the decision to be interpreted by subsequent computing machinery.
  • the strength of the neural network architecture is its ability to generalize based on previous training of particular examples.
  • the neural network is presented a series of examples of the type of objects that it is destined to classify.
  • the backpropagation neural network organizes itself by altering the multiplicity of its connexion weights and thresholds according to its success in rendering a correct decision. This is called supervised learning wherein the operator provides the network with the information regarding its success in classification.
  • the network relies on a standard general rule for modifying its connexion weights and thresholds based on the success of its performance, i.e. back-propagation.
  • the multi-spectral images are divided into two classes: $C_0$ (cytoplasm) and $C_1$ (nuclear), separated by the multi-dimensional threshold t which comprises a 3-dimensional space.
  • the distribution of the pixels for the nuclear and cytoplasm objects is complex and the 3-D space comprises numerous clusters and non-overlapped regions. It has been found that the optimal threshold has a complex non-linear surface in the 3-D space, and the neural network according to the present invention provides the means for finding the complex threshold surface in the 3-D space in order to segment the nuclear and cytoplasmic objects.
  • the neural network 20 comprises an input layer 106, a single hidden layer 108, and an output layer 110.
  • the input layer 106 comprises three nodes or processing elements 112 (Fig. 3) for each of the three 8-bit digitized values for the particular pixel being examined. (The three digitized values arise from the three levelled images collected in each of the three optical bands, as described above with reference to Fig. 1.)
  • the output layer 110 comprises a single processing element 116 (Fig. 3) which indicates whether the pixel under examination is or is not part of the nucleus.
  • Before the neural network 20 can be successfully operated for decision-making it must first be "trained" in order to establish the proper combination of weights and thresholds. The training is performed outside of the segmentation procedure on a large set of examples. Errors made in the classification of pixels in the examples are "back-propagated" as corrections to the connexion weights and the threshold values in each of the processing units. Once the classification error is acceptable the network is "frozen" at these weight and threshold values and it is integrated as a simple algebraic operation into the segmentation procedure as shown at block 20 in Fig. 1.
  • the neural network 20 comprises a Probability Projection Neural Network which will also be referred to as a PPNN.
  • the PPNN according to the present invention features fast training for a large volume of data, processing of multi-modal non-Gaussian data distribution, good generalization simultaneously with high sensitivity to small clusters of patterns representing the useful subclasses of cells.
  • the PPNN is well-suited to a hardware-encoded implementation.
  • the PPNN utilizes a Probability Density Function (PDF) estimator.
  • the PPNN is suitable for use as a Probability Density Function estimator or as a general classifier in pattern recognition.
  • the PPNN uses the training data to create an N-dimensional PDF array which in turn is used to estimate the likelihood of a feature vector being within the given classes as will now be described.
  • the input space is partitioned into $m \times m \times \ldots \times m$ discrete nodes (if the discrete input space is known, then m is usually selected less than the range). For example, for a 3-D PDF array creating a $2^6 \times 2^6 \times 2^6$ grid is sufficient.
  • the next step involves mapping or projecting the influence of each training pattern to the neighbour nodes. This is accomplished according to expression (2) as shown below:
  • $P_j[x_0, x_1, \ldots, x_{n-1}]$ is the current value of the $(x_0, x_1, \ldots, x_{n-1})$ node after the j'th iteration;
  • $d_j[x_0, x_1, \ldots, x_{n-1}]$ represents the influence of the j'th input pattern on the $(x_0, x_1, \ldots, x_{n-1})$ node;
  • $r_k$ is the distance from the pattern to the k'th node;
  • $r_0$ is the minimum distance between two neighbour nodes; and
  • n is the dimension of the space.
  • Fig. 5 shows in flow chart form an embodiment of a clustering algorithm 200 according to the present invention.
  • All training patterns, i.e. samples, in block 202 and a given number (i.e. "K") of clusters in block 204 are applied to a K-means clustering operation block 206.
  • the clustering operation 206 clusters the input data and generates clusters 1 through K (block 208).
  • all the training data which belongs to the i-th cluster is extracted into a separate sub-class.
  • the final operation in the clustering algorithm comprises joining all of the K PPNNs together and normalizing the resulting PPNN by dividing all nodes by the number of clusters (block 212).
  • the operation performed in block 212 may be expressed as follows:
  • the clustering algorithm 200 may be applied to each class separately before creating the final classifier according to expression (6) above, as follows.
  • the optimal number of clusters for each of the two classes may be found from final PPNN performance analysis (expression (6) above).
  • the two optimal networks $PPN_1^{opt}$ and $PPN_2^{opt}$ are combined together according to expression (6).
  • BP Backpropagation
  • EBF Elliptic Basis Functions
  • LVQ Learning Vector Quantization
  • the PPNN is preferred. The performance results of the Probability Projection Neural Network have been found to exceed those achieved by conventional networks.
  • the neural network assisted multi-spectral segmentation process is implemented as a hardware-encoded procedure embedded in conventional FPGA (Field Programmable Gate Array) logic as part of a special-purpose computer.
  • the hardware implementation of this network is found in the form of a look-up table contained in a portion of hardware memory (Fig. 6).
  • the neural network 20 comprises three input nodes and a single, binary output node.
  • the structure of the neural network 20 according to the present invention also simplifies the hardware implementation of the network.
  • the three input nodes correspond to three optical bands 301, 302, 303 used in gathering the images.
  • the images taken in the 530 nm and 630 nm bands have 7 bits of useful resolution while the 577 nm band retains all 8 bits. (The 577 nm band is centered on the nucleus.)
  • the performance of the neural network 20 is then determined for all possible combinations of these three inputs. Since there are 22 bits in total, there are $2^{22}$, or approximately 4.2 million, possible combinations.
  • To create the look-up table all input pixels in the space ($2^7 \times 2^7 \times 2^8$ variants for the three images in the present embodiment) are scanned and the look-up table is filled with the PPNN decision, i.e. 1 - pixel belongs to nuclear; 0 - pixel doesn't belong to nuclear, for each of these pixel combinations.
  • the coding of the results (i.e. outputs) of the neural network comprises assigning each possible combination of inputs a unique address 304 in a look-up table 305 stored in memory.
  • the address 304 in the table 305 is formed by joining together the binary values of the three channel values indicated by 306, 307, 308, respectively in Fig. 6.
  • the pixel for the image from the first channel 301 i.e. 530 nm
  • the pixel for image from the second channel 302 i.e.
  • the address 304 points to a location in the look-up table 305 (i.e. memory) which stores a single binary value 309 that represents the response of the neural network to this combination of inputs, e.g. the logic 0 at memory location 0101011010101100101011 signifies that the pixel in question does not belong to the nucleus.
  • The hardware-encoding of NNA-MSS advantageously allows the process to execute at a high speed while making a complex decision. Secondly, as experimental data is further tabulated and evaluated more complex decision spaces can be utilized to improve segmentation accuracy. Thus, an algorithm according to the present invention can be optimized further by the adjustment of a table of coefficients that describe the neural-network connection weights without the necessity of altering the system architecture.
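
The look-up-table encoding described above can be illustrated with a minimal software sketch. The bit widths (7, 7 and 8 bits, 22 in total) come from the text. Several details are assumptions made only for illustration and are not stated in the source: the order in which the three channels are joined into the address, dropping the least significant bit to obtain the 7-bit values, and the function `toy_classifier`, which is a hypothetical stand-in for the frozen network decision.

```python
import numpy as np

# Useful bits per optical band, as stated in the text (7 + 7 + 8 = 22).
BITS_530, BITS_630, BITS_577 = 7, 7, 8
ADDRESS_BITS = BITS_530 + BITS_630 + BITS_577


def pack_address(v530, v630, v577):
    """Join three 8-bit pixel values into one 22-bit table address.

    Assumptions (not from the source): the 7-bit channels are obtained by
    dropping the least significant bit, and the channels are joined in the
    order 530 nm | 630 nm | 577 nm.
    """
    a = np.asarray(v530, dtype=np.uint32) >> 1
    b = np.asarray(v630, dtype=np.uint32) >> 1
    c = np.asarray(v577, dtype=np.uint32)
    return (a << (BITS_630 + BITS_577)) | (b << BITS_577) | c


def build_lut(classifier):
    """Scan every possible input combination and store the 0/1 decision."""
    addresses = np.arange(1 << ADDRESS_BITS, dtype=np.uint32)
    c = addresses & 0xFF                              # 8-bit 577 nm value
    b = (addresses >> BITS_577) & 0x7F                # 7-bit 630 nm value
    a = (addresses >> (BITS_577 + BITS_630)) & 0x7F   # 7-bit 530 nm value
    return classifier(a, b, c).astype(np.uint8)


def toy_classifier(a, b, c):
    """Hypothetical decision rule standing in for the trained network."""
    return c < 64  # "nuclear" when the 577 nm value is low (illustrative only)


lut = build_lut(toy_classifier)

# Classifying a pixel, or a whole image, is then a single memory look-up.
is_nuclear = lut[pack_address(10, 20, 30)]
```

Because `pack_address` accepts arrays, passing three levelled images of equal shape yields a per-pixel nuclear map in one indexing operation, which mirrors the point made above that a complex decision reduces to one memory access.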

Abstract

A neural network assisted multi-spectral segmentation method and system. According to the invention, three images having different optical bands are acquired for the same micrographic scene of a biological sample. The images are processed and a cellular material map is generated identifying cellular material. The cellular material map is then applied to a neural network. The neural network classifies the cellular material map into nuclear objects and cytoplasmic objects by determining a threshold surface in the 3-dimensional space separating the cytoplasmic and nuclear regions. In another aspect, the neural network comprises a hardware-encoded algorithm in the form of a look-up table.
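
As a rough illustration of how the cellular material map mentioned in the abstract might be produced, the sketch below thresholds two bands into a binary map, dilates it twice to fill small voids, and erodes it back. It is a simplified sketch under stated assumptions, not the patented procedure: the "no-join" condition on dilation is omitted, the 3x3 neighbourhood, the use of two erosions, the combination of the two bands with a logical AND, and the threshold limits `lo` and `hi` are all hypothetical choices.

```python
import numpy as np


def dilate(mask):
    """Grow True regions by one pixel (3x3 neighbourhood, assumed)."""
    h, w = mask.shape
    padded = np.pad(mask, 1, constant_values=False)
    out = np.zeros_like(mask)
    for dy in range(3):
        for dx in range(3):
            out |= padded[dy:dy + h, dx:dx + w]
    return out


def erode(mask):
    """Shrink True regions by one pixel; outside the image counts as background."""
    h, w = mask.shape
    padded = np.pad(mask, 1, constant_values=False)
    out = np.ones_like(mask)
    for dy in range(3):
        for dx in range(3):
            out &= padded[dy:dy + h, dx:dx + w]
    return out


def cellular_map(img530, img630, lo, hi):
    """Binary map of pixels whose values fall in [lo, hi] in both bands.

    lo and hi are hypothetical limits; the source only says the cellular
    material falls within a range of specific digital values.
    """
    mask = (img530 >= lo) & (img530 <= hi) & (img630 >= lo) & (img630 <= hi)
    for _ in range(2):   # dilation applied twice, as in the text
        mask = dilate(mask)
    for _ in range(2):   # two erosions assumed, to return to the original boundary
        mask = erode(mask)
    return mask


# A 5x5 "cell" with a one-pixel void on a bright background.
img = np.full((9, 9), 250, dtype=np.uint8)
img[2:7, 2:7] = 100
img[4, 4] = 250
mask = cellular_map(img, img, 50, 150)
```

In this toy case the void at the centre is filled by the dilations and the erosions restore the outer boundary, so the map covers the full 5x5 region.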

Description

A NEURAL NETWORK ASSISTED MULTI-SPECTRAL SEGMENTATION SYSTEM
FIELD OF THE INVENTION
The present invention relates to automated diagnostic techniques in medicine and biology, and more particularly to a neural network for multi-spectral segmentation of nuclear and cytoplasmic objects.
BACKGROUND OF THE INVENTION
Automated diagnostic systems in medicine and biology often rely on the visual inspection of microscopic images. Known systems attempt to mimic or imitate the procedures employed by humans. An appropriate example of this type of system is an automated instrument designed to assist a cyto-technologist in the review or diagnosis of Pap smears. In its usual operation such a system will rapidly acquire microscopic images of the cellular content of the Pap smears and then subject them to a battery of image analysis procedures. The goal of these procedures is the identification of images that are likely to contain unusual or potentially abnormal cervical cells.
The image analysis techniques utilized by these automated instruments are similar to the procedures consciously, and often unconsciously, performed by the human cyto-technologist. There are three distinct operations that must follow each other for this type of evaluation: (1) segmentation; (2) feature extraction; and (3) classification.
The segmentation is the delineation of the objects of interest within the micrographic image. In addition to the cervical cells required for an analysis there is a wide range of "background" material, debris and contamination that interferes with the identification of the cervical cells and therefore must be delineated. Also, for each cervical cell, it is necessary to delineate the nucleus within the cytoplasm.
The Feature Extraction operation is performed after the completion of the segmentation operation. Feature extraction comprises characterizing the segmented regions as a series of descriptors based on the morphological, textural, densitometric and colorimetric attributes of these regions.
The Classification step is the final step in the image analysis. The features extracted in the previous stage are used in some type of discriminant-based classification procedure. The results of this classification are then translated into a "diagnosis" of the cells in the image.
Of the three stages outlined above, segmentation is the most crucial and the most difficult. This is particularly true for the types of images typically encountered in medical or biological specimens.
In the case of a Pap smear, the goal of segmentation is to accurately delineate the cervical cells and their nuclei. The situation is complicated not only by the variety of cells found in the smear, but also by the alterations in morphology produced by the sample preparation technique and by the quantity of debris associated with these specimens. Furthermore, during preparation it is difficult to control the way cervical cells are deposited on the surface of the slide which as a result leads to a large amount of cell overlap and distortion.
Under these circumstances a segmentation operation is difficult. One known way to improve the accuracy and speed of segmentation for these types of images involves exploiting the differential staining procedure associated with all Pap smears. According to the Papanicolaou protocol the nuclei are stained dark blue while the cytoplasm is stained anything from a blue-green to an orange-pink. The Papanicolaou Stain is a combination of several stains or dyes together with a specific protocol designed to emphasize and delineate cellular structures of importance for pathological analysis. The stains or dyes included in the Papanicolaou Stain are Haematoxylin, Orange G and Eosin Azure (a mixture of two acid dyes, Eosin Y and Light Green SF Yellowish, together with Bismarck Brown). Each stain component is sensitive to or binds selectively to a particular cell structure or material. Haematoxylin binds to the nuclear material colouring it dark blue. Orange G is an indicator of keratin protein content. Eosin Y stains nucleoli, red blood cells and mature squamous epithelial cells. Light Green SF Yellowish acid stains metabolically active epithelial cells. Bismarck Brown stains vegetable material and cellulose.
The combination of these stains and their diagnostic interpretation has evolved into a stable medical protocol which predates the advent of computer-aided imaging instruments. Consequently, the dyes present a complex pattern of spectral properties to standard image analysis procedures. Specifically, a simple spectral decomposition based on the optical behaviour of the dyes is not sufficient on its own to reliably distinguish the cellular components within an image. The overlap of the spectral response of the dyes is too large for this type of straight-forward segmentation.
The use of differential staining characteristics is only the means to the end in the solution to the problem of segmentation. Of equal importance is the procedure for handling the information provided by the spectral character of the cellular objects when making a decision concerning identity.
In the art, attempts have been made to automate diagnostic procedures; however, there remains a need for a system for performing the segmentation process.
BRIEF SUMMARY OF THE INVENTION
The present invention provides a Neural-Network Assisted Multi-Spectral Segmentation (also referred to as the NNA-MSS) method and system.
The first stage according to the present invention comprises the acquisition of three images of the same micrographic scene. Each image is obtained using a different narrow band-pass optical filter which has the effect of selecting a narrow band of optical wavelengths associated with distinguishing absorption peaks in the stain spectra. The choice of optical wavelength bands is guided by the degree of separation afforded by these peaks when used to distinguish the different types of cellular material on the slide surface. The second stage according to the invention comprises a neural network (trained on an extensive set of typical examples) to make decisions on the identity of material already deemed to be cellular in origin. The neural network decides whether a picture element in the digitized image is nuclear or non-nuclear in character. With the completion of this step the system can continue on, applying a standard range of image processing techniques to refine the segmentation. The relationship between the cellular components and the transmission intensity of the light images in each of the three spectral bands is a complex and non-linear one. By using a neural network to combine the information from these three images it is possible to achieve a high degree of success in separating the cervical cell from the background and the nuclei from the cytoplasm, a success that would not be possible with a set of linear operations alone.
The diagnosis and evaluation of Pap smears is aided by the introduction of a differential staining procedure called the Papanicolaou Stain. The Papanicolaou Stain is a combination of several stains or dyes together with a specific protocol designed to emphasize and delineate cellular structures of importance to pathological analysis. The stains or dyes included in the Papanicolaou Stain are Haematoxylin, Orange G and Eosin Azure (a mixture of two acid dyes, Eosin Y and Light Green SF Yellowish, together with Bismarck Brown). Each stain component is sensitive to or binds selectively to a particular cellular structure or material. Haematoxylin binds to the nuclear material colouring it dark blue; Orange G is an indicator of keratin protein content; Eosin Y stains nucleoli, red blood cells and mature squamous epithelial cells; Light Green SF Yellowish stains metabolically active epithelial cells; Bismarck Brown stains vegetable material and cellulose.
According to another aspect of the invention, three optical wavelength bands are used in a complex procedure to segment Papanicolaou-stained epithelial cells in digitized images. The procedure utilizes standard segmentation operations (erosion, dilation, etc.) together with the neural-network to identify the location of nuclear components in areas already determined to be cellular material.
The purpose of the segmentation is to extract the cellular objects, i.e. to distinguish the nucleus of the cell from the cytoplasm. According to this segmentation the multi-spectral images are divided into two classes: cytoplasm objects and nuclear objects, which are separated by a multi-dimensional threshold t which comprises a 3-dimensional space.
The neural network according to the invention comprises a Probability Projection Neural Network (PPNN). The PPNN according to the present invention features fast training for a large volume of data, processing of multi-modal non-Gaussian data distribution, good generalization simultaneously with high sensitivity to small clusters of patterns representing the useful subclasses of cells. In another aspect, the PPNN is implemented as a hardware-encoded algorithm.
In one aspect, the present invention provides a method for identifying nuclear and cytoplasmic objects in a biological specimen, said method comprising the steps of: (a) acquiring a plurality of images of said biological specimen; (b) identifying cellular material from said images and creating a cellular material map; (c) applying a neural network to said cellular material map and classifying nuclear and cytoplasmic objects from said images.
In a second aspect, the present invention provides a system for identifying nuclear and cytoplasmic objects in a biological specimen, said system comprising: (a) image acquisition means for acquiring a plurality of images of said biological specimen; (b) processing means for processing said images and generating a cellular material map identifying cellular material; (c) neural processor means for processing said cellular material map and including means for classifying nuclear and cytoplasmic objects from said images.
In a third aspect, the present invention provides a hardware-encoded neural processor for classifying input data, said hardware-encoded neural processor comprising: (a) a memory having a plurality of addressable storage locations; (b) said addressable storage locations containing classification information associated with the input data; (c) address generation means for generating an address from said input data for accessing the classification information stored in said memory for selected input data.
A preferred embodiment of the present invention will now be described, by way of example, with reference to the following specification, claims, and drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
Fig. 1 shows in flow chart form a neural network assisted multi-spectral segmentation method according to the present invention;
Fig. 2 shows in diagrammatic form a processing element for the neural network;
Fig. 3 shows in diagrammatic form a neural network comprising the processing elements of Fig. 2;
Fig. 4 shows in diagrammatic form a training step for the neural network;
Fig. 5 shows in flow chart form a clustering algorithm for the neural network according to the present invention; and
Fig. 6 shows a hardware implementation for the neural network according to the present invention.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
The present invention provides a Neural Network Assisted Multi-Spectral Segmentation (also referred to as NNA-MSS) system and method. The multi-spectral segmentation method is related to that described and claimed in co-pending International Patent Application No. CA96/00477 filed July 18, 1996 and in the name of the applicant.
The NNA-MSS according to the present invention is particularly suited to Papanicolaou-stained gynaecological smears and will be described in this context. It is however to be understood that the present invention has wider applicability to applications outside of Papanicolaou-stained smears.
Reference is first made to Fig. 1 which shows in flow chart form a Neural Network Assisted Multi-Spectral Segmentation (NNA-MSS) method 1 according to the present invention.
The first step 10 involves inputting three digitized images, i.e. micrographic scenes, of a cellular specimen. The images are taken in each of the three narrow optical bands: 540 ± 5 nm; 577 ± 5 nm and 630 ± 5 nm. (The images are generated by an imaging system (not shown) as will be understood by one skilled in the art, and thus need not be described in detail here.) The images are next processed by the multi-spectral segmentation method 1 and neural network as will be described.
As shown in Fig. 1, the images are subjected to a levelling operation (block 12). The levelling operation 12 involves removing the spatial variations in the illumination intensity from the images. The levelling operation is implemented as a simple mathematical routine using known image processing techniques. The result of the levelling operation is a set of 8-bit digitized images with uniform illumination across their fields.
The 8-bit digitized images first undergo a series of processing steps to identify cellular material in the digitized images. The digitized images are then processed by the neural network to segment the nuclear objects from the cytoplasm objects.
Referring to Fig. 1, following the levelling operation 12 the next operation comprises a threshold procedure (block 14). The threshold procedure involves analyzing the levelled images in a search for material of cellular origin. The threshold procedure 14 is applied to the 530 nm and 630 nm optical wavelength bands and comprises identifying material in the image of cellular origin as regions of the digitized image that fall within a range of specific digital values. The threshold procedure 14 produces a single binary "map" of the image where the single binary bit identifies regions that are, or are not, cellular material. The threshold operation 14 is followed by a dilation operation (block 16). The dilation operation 16 is a conventional image processing operation which modifies the binary map of cellular material generated in block 14. The dilation operation allows the regions of cellular material to grow or dilate by one pixel in order to fill small voids in large regions. Preferably, the dilation operation 16 is modified with the condition that the dilation does not allow two separate regions of cellular material to join to make a single region, i.e. a "no-join" condition. This condition allows the accuracy of the binary map to be preserved through the dilation operation 16. Preferably, the dilation operation is applied twice to ensure a proper filling of voids. The result of the dilation operations 16 is a modified binary map of cellular material.
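The threshold procedure and the "no-join" dilation described above can be sketched as follows. This is a minimal illustration and not the implementation of the invention: the function names, the rule that both bands must fall inside the same range, the use of 4-connectivity, and the sequential update used to enforce the no-join condition are all assumptions made for the example.

```python
from collections import deque

NEIGHBOURS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # 4-connectivity (assumed)

def threshold_map(band_a, band_b, lo, hi):
    """Binary map: 1 where both band values lie inside [lo, hi] (assumed rule)."""
    return [[1 if lo <= a <= hi and lo <= b <= hi else 0
             for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(band_a, band_b)]

def label_regions(mask):
    """Give every 4-connected region of 1-pixels its own positive label."""
    h, w = len(mask), len(mask[0])
    labels = [[0] * w for _ in range(h)]
    next_label = 0
    for y in range(h):
        for x in range(w):
            if mask[y][x] and not labels[y][x]:
                next_label += 1
                labels[y][x] = next_label
                queue = deque([(y, x)])
                while queue:  # breadth-first flood fill of one region
                    cy, cx = queue.popleft()
                    for dy, dx in NEIGHBOURS:
                        ny, nx = cy + dy, cx + dx
                        if (0 <= ny < h and 0 <= nx < w
                                and mask[ny][nx] and not labels[ny][nx]):
                            labels[ny][nx] = next_label
                            queue.append((ny, nx))
    return labels

def dilate_no_join(mask):
    """Grow each region by one pixel, skipping pixels that would bridge two regions."""
    h, w = len(mask), len(mask[0])
    labels = label_regions(mask)
    # Candidates are background pixels that touch the original map.
    candidates = [(y, x) for y in range(h) for x in range(w)
                  if not mask[y][x] and any(
                      0 <= y + dy < h and 0 <= x + dx < w and mask[y + dy][x + dx]
                      for dy, dx in NEIGHBOURS)]
    for y, x in candidates:
        # Labels are updated in place, so pixels grown earlier in this pass count too.
        touching = {labels[y + dy][x + dx] for dy, dx in NEIGHBOURS
                    if 0 <= y + dy < h and 0 <= x + dx < w and labels[y + dy][x + dx]}
        if len(touching) == 1:  # exactly one region nearby: safe to grow
            labels[y][x] = touching.pop()
    return [[1 if v else 0 for v in row] for row in labels]
```

Calling `dilate_no_join` twice on the output of `threshold_map` corresponds to the preferred double application; a void enclosed by a single region is filled, while a gap between two different regions is never closed completely.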
As shown in Fig. 1, the dilation operation 16 is followed by an erosion operation (block 18). The erosion operation 18 brings the modified binary map of cellular material (a result of the dilation operation 16) back to its original boundaries. The erosion operation 18 is implemented using conventional image processing techniques. The erosion operation 18 allows the cellular boundaries in the binary image to shrink or erode but will not affect the filled voids. Advantageously, the erosion operation 18 has the additional effect of eliminating small regions of cellular material that are not important to the later diagnostic analysis. The result of the erosion operation 18 is a final binary map of the regions in the digitized image that are cytoplasm.
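The way a dilation followed by an erosion restores the outer boundary while leaving a filled void intact can be illustrated with a small sketch. The code below is an assumption-laden example only: it uses plain (unconditioned) dilation and erosion with a 4-neighbour structuring element, one pass of each, and treats pixels outside the image as background.

```python
NEIGHBOURS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # 4-neighbour structuring element (assumed)

def _pixel(mask, y, x):
    """Value at (y, x); anything outside the image counts as background (assumed)."""
    if 0 <= y < len(mask) and 0 <= x < len(mask[0]):
        return mask[y][x]
    return 0

def dilate(mask):
    """A pixel becomes 1 if it, or any of its 4-neighbours, is 1."""
    return [[1 if mask[y][x] or any(_pixel(mask, y + dy, x + dx)
                                    for dy, dx in NEIGHBOURS) else 0
             for x in range(len(mask[0]))]
            for y in range(len(mask))]

def erode(mask):
    """A pixel stays 1 only if it and all of its 4-neighbours are 1."""
    return [[1 if mask[y][x] and all(_pixel(mask, y + dy, x + dx)
                                     for dy, dx in NEIGHBOURS) else 0
             for x in range(len(mask[0]))]
            for y in range(len(mask))]

# A 3x3 region of cellular material with a one-pixel void in its centre.
cell_with_void = [
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 0, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
closed = erode(dilate(cell_with_void))
```

In this example `closed` has the same outer boundary as `cell_with_void`, but the central void is now filled.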
The next stage according to the invention is the operation of the neural network at block 20. The neural network 20 is applied to the 8-bit digitized images, with attention restricted to those regions that lie within the cytoplasm as determined by the final binary cytoplasm map generated as a result of the previous operations. The neural network 20 makes decisions concerning the identity of individual picture elements (or "pixels") in the binary image as either being part of a nucleus or not part of a nucleus. The result of the operation of the neural network is a digital map of the regions within the cytoplasm that are considered to be nuclear material. The nuclear material map is then subjected to further processing. The neural network 20 according to the present invention is described in detail below.
Following the application of the neural network 20, the resulting nuclear material map is subjected to an erosion operation (block 22). The erosion operation 22 eliminates regions of the nuclear material map that are too small to be of diagnostic significance. The result is a modified binary map of nuclear regions.
The modified binary map resulting from the erosion operation 22 is then subjected to a dilation operation (block 24). The dilation operation 24 is subject to a no-join condition, such that the dilation operation does not allow two separate regions of nuclear material to join to make a single region. In this way the accuracy of the binary map is preserved notwithstanding the dilation operation. The dilation operation 24 is preferably applied twice to ensure a proper filling of voids. The result of these dilation operations is a modified binary map of nuclear material.
Following the dilation operation 24, an erosion operation is applied (block 26). Double application of the erosion operation 26 eliminates regions of the nuclear material in the binary map that are too small to be of diagnostic significance. The result is a modified binary map of nuclear regions.
The remaining operations involve constructing a binary map comprising high gradients, i.e. boundaries, of pixel intensity, in order to sever nuclear regions that share high gradient boundaries. The presence of these high gradient boundaries is evidence of two closely spaced but separate nuclei.
The first step in severing the high-gradient boundaries in the nuclear map is to construct a binary map of these high gradient boundaries using a threshold operation (block 28) applied to a Sobel map. The Sobel map is generated by applying the Sobel gradient operator to the 577 nm 8-bit digitized image to determine regions of that image that contain high gradients of pixel intensity (block 29). (The 8-bit digitized image for the 577 nm band was obtained from the levelling operation in block 12.) The result of the Sobel operation in block 29 is an 8-bit map of gradient intensity.
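A minimal sketch of the Sobel map and its thresholding is given below. It is illustrative only: the gradient magnitude is approximated as |Gx| + |Gy| and clipped to 8 bits, border pixels are simply left at zero, and the threshold value passed by the caller is hypothetical, none of which is specified in the description.

```python
def sobel_magnitude(img):
    """8-bit gradient map using the 3x3 Sobel kernels; |Gx| + |Gy| is an assumed
    approximation of the magnitude, and border pixels are left at 0."""
    h, w = len(img), len(img[0])
    out = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = (img[y - 1][x + 1] + 2 * img[y][x + 1] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y][x - 1] - img[y + 1][x - 1])
            gy = (img[y + 1][x - 1] + 2 * img[y + 1][x] + img[y + 1][x + 1]
                  - img[y - 1][x - 1] - 2 * img[y - 1][x] - img[y - 1][x + 1])
            out[y][x] = min(255, abs(gx) + abs(gy))  # clip to the 8-bit range
    return out

def high_gradient_map(img, threshold):
    """Binary map: 1 where the Sobel gradient is at or above the threshold."""
    return [[1 if g >= threshold else 0 for g in row]
            for row in sobel_magnitude(img)]
```

For an image containing a sharp vertical edge, the pixels on either side of the edge receive a high gradient value and are set in the binary map, while flat areas remain zero.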
Following the threshold Sobel operation 28, a logical NOT operation is performed (block 30). The logical NOT operation 30 determines the coincidence of the two states, high-gradients and nuclei, and reverses the pixel value of the nuclear map at the point of the coincidence in order to eliminate it from regions that are presumed to be nuclear material. The result of this logical operation is a modified nuclear map.
The modified nuclear map is next subjected to an erosion operation (block 32). The erosion operation 32 eliminates regions in the modified nuclear map that are too small to be of diagnostic significance. The result is a modified binary map of nuclear regions.
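The logical operation of block 30 amounts to clearing every nuclear pixel that coincides with a high-gradient pixel. A short sketch follows; the function name and the list-of-lists representation of the binary maps are assumptions of the example.

```python
def sever_at_gradients(nuclear_map, gradient_map):
    """Keep a nuclear pixel only where no high gradient coincides with it
    (nuclear AND NOT gradient)."""
    return [[1 if n and not g else 0 for n, g in zip(n_row, g_row)]
            for n_row, g_row in zip(nuclear_map, gradient_map)]

# One elongated "nucleus" crossed by a high-gradient boundary in the middle.
nuclear = [[1, 1, 1, 1, 1]]
gradient = [[0, 0, 1, 0, 0]]
severed = sever_at_gradients(nuclear, gradient)
```

Here `severed` contains two separate nuclear regions, one on each side of the boundary pixel.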
After the application of the gradient technique for severing close nuclear boundaries (blocks 28 and 30) and the erosion operation (block 32) for clearing the image of insignificant regions, the binary map of nuclear regions is dramatically altered. To restore the map to its original boundaries while preserving the newly-formed separations, the process applies a dilation operation at block 34. The dilation operation 34 includes the condition that no two nuclear regions will become joined as they dilate and that no nuclear region will be allowed to grow outside its old boundary as defined by the binary map that existed before the Sobel procedure was applied. The dilation operation 34 is preferably applied four times. The result is a modified binary map of nuclear material.
With the application of the dilation operation 34, the nuclear segmentation procedure according to the multi-spectral segmentation process 1 is complete and the resulting binary nuclear map is labelled in block 36, and if required further image processing is applied.
As described above, the operation at block 20 in Fig. 1 comprises neural network processing of the digitized images. In general, the neural network 20 is a highly parallel, distributed, information processing system that has the topology of a directed graph. The network comprises a set of "nodes" and a series of "connexions" between the nodes. The nodes comprise processing elements and the connexions between the nodes represent the transfer of information from one node to another.
Reference is made to Fig. 2 which shows a node or processing element 100a for a backpropagation neural network 20. Each of the nodes 100a accepts one or more inputs 102 shown individually as a1, a2, a3 ... an in Fig. 2. The inputs 102 are taken into the node 100a and each input 102 is multiplied by its own mathematical weighting factor before being summed together with the threshold factor for the processing element 100a. The processing element 100a then generates a single output 104 (i.e. bj) according to the "transfer function" being used in the network 20. The output 104 is then available as an input to other nodes or processing elements, for example processing elements 100b, 100c, 100d, 100e and 100f as depicted in Fig. 1.
The transfer function may be any suitable mathematical function but it is usual to employ a "sigmoid" function. The relationship between the inputs 102 into the node 100 and the output 104 is given by expression (1) as follows:
$$ b_j = \Bigl\{ \sum_i w_{ji}\, a_i - \theta_j \Bigr\}^{-1} \qquad (1) $$

where $b_j$ is the output 104 of the node 100, $a_i$ is the value of the input 102 to the node labelled "i", $w_{ji}$ is the weighting given to that input 102, and $\theta_j$ is the threshold value for the node 100. In the present application, the transfer function is modelled after a sigmoid function.
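The behaviour of a single processing element can be sketched as follows. The sketch assumes the common logistic form of the sigmoid, applied to the weighted sum of the inputs less the node threshold; the exact transfer function of expression (1) may differ in detail, and the function name is hypothetical.

```python
import math

def node_output(inputs, weights, threshold):
    """Weighted sum of the inputs minus the node threshold, passed through a
    logistic sigmoid (assumed form of the transfer function)."""
    activation = sum(w * a for w, a in zip(weights, inputs)) - threshold
    return 1.0 / (1.0 + math.exp(-activation))

# When the weighted sum exactly equals the threshold the output sits at 0.5.
b = node_output([1.0, 1.0], [0.5, 0.5], 1.0)
```

Outputs above 0.5 indicate that the weighted sum exceeds the threshold, and outputs below 0.5 indicate the opposite.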
In its general form, the nodes or processing elements for the neural network are arranged in a series of layers denoted by 106, 108 and 110 as shown in Fig. 3. The first layer 106 comprises nodes or processing elements 112 shown individually as 112a, 112b, 112c, 112d and 112e. The first layer 106 is an input layer and accepts the information required for a decision.
The second layer 108 in the neural network 20 is known as the hidden layer and comprises processing elements 114 shown individually as 114a, 114b, 114c, 114d and 114e. All of the nodes 112 in the input layer 106 are connected to all of the nodes 114 in the hidden layer 108. It will be understood that there may be more than one hidden layer, with each node in the successive layer connected to each node of the previous layer. For convenience only one hidden layer 108 is shown in Fig. 3.
The (last) hidden layer 108 leads to the output layer 110. The output layer 110 comprises processing elements 116 shown individually as 116a, 116b, 116c, 116d and 116e in Fig. 3. Each node 114 of the (last) hidden layer 108 (Fig. 3) is connected to each node 116 of the output layer 110. The output layer 110 renders the decision to be interpreted by subsequent computing machinery.
The strength of the neural network architecture is its ability to generalize based on previous training of particular examples. In order to take advantage of this, the neural network is presented a series of examples of the type of objects that it is destined to classify. The backpropagation neural network organizes itself by altering the multiplicity of its connexion weights and thresholds according to its success in rendering a correct decision. This is called supervised learning wherein the operator provides the network with the information regarding its success in classification. The network relies on a standard general rule for modifying its connexion weights and thresholds based on the success of its performance, i.e. back-propagation.
In the context of the multi-spectral segmentation process, the multi-spectral images are divided into two classes: C0 - cytoplasm and C1 - nuclear, separated by the multi-dimensional threshold t which comprises a 3-dimensional space. The distribution of the pixels for the nuclear and cytoplasm objects is complex and the 3-D space comprises numerous clusters and non-overlapped regions. It has been found that the optimal threshold has a complex non-linear surface in the 3-D space, and the neural network according to the present invention provides the means for finding the complex threshold surface in the 3-D space in order to segment the nuclear and cytoplasmic objects.
According to this aspect of the invention, the neural network 20 comprises an input layer 106, a single hidden layer 108, and an output layer 110. The input layer 106 comprises three nodes or processing elements 112 (Fig. 3) for each of the three 8-bit digitized values for the particular pixel being examined. (The three digitized values arise from the three levelled images collected in each of the three optical bands, as described above with reference to Fig. 1.) The output layer 110 comprises a single processing element 116 (Fig. 3) which indicates whether the pixel under examination is or is not part of the nucleus.
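A feed-forward pass through a network of this shape (three inputs, one hidden layer, one output) might look like the sketch below. Everything numeric in it is hypothetical: the weights and thresholds are hand-picked for illustration rather than trained, the inputs are scaled to the range 0 to 1, the transfer function is assumed to be a logistic sigmoid, and an output of 0.5 or more is read as "nuclear".

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def classify_pixel(pixel, hidden_weights, hidden_thresholds,
                   output_weights, output_threshold):
    """Return 1 (nuclear) or 0 (not nuclear) for one pixel given as three
    8-bit values, one per optical band."""
    inputs = [v / 255.0 for v in pixel]  # scale to [0, 1] (assumed)
    hidden = [sigmoid(sum(w * a for w, a in zip(ws, inputs)) - t)
              for ws, t in zip(hidden_weights, hidden_thresholds)]
    out = sigmoid(sum(w * h for w, h in zip(output_weights, hidden))
                  - output_threshold)
    return 1 if out >= 0.5 else 0

# Hypothetical, hand-picked parameters: dark pixels in all bands read as nuclear.
HIDDEN_WEIGHTS = [[-8.0, -8.0, -8.0], [0.0, 0.0, 0.0]]
HIDDEN_THRESHOLDS = [-12.0, 0.0]
OUTPUT_WEIGHTS = [10.0, 0.0]
OUTPUT_THRESHOLD = 5.0

dark = classify_pixel((30, 30, 30), HIDDEN_WEIGHTS, HIDDEN_THRESHOLDS,
                      OUTPUT_WEIGHTS, OUTPUT_THRESHOLD)
bright = classify_pixel((200, 200, 200), HIDDEN_WEIGHTS, HIDDEN_THRESHOLDS,
                        OUTPUT_WEIGHTS, OUTPUT_THRESHOLD)
```

With these illustrative parameters the dark pixel is labelled 1 and the bright pixel 0; in practice the weights and thresholds would come from the training procedure.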
Before the neural network 20 can be successfully operated for decision-making it must first be "trained" in order to establish the proper combination of weights and thresholds. The training is performed outside of the segmentation procedure on a large set of examples. Errors made in the classification of pixels in the examples are "back-propagated" as corrections to the connexion weights and the threshold values in each of the processing units. Once the classification error is acceptable the network is "frozen" at these weight and threshold values and it is integrated as a simple algebraic operation into the segmentation procedure as shown at block 20 in Fig. 1.
In a preferred embodiment, the neural network 20 according to the invention comprises a Probability Projection Neural Network which will also be referred to as a PPNN. The PPNN according to the present invention features fast training for a large volume of data, processing of multi-modal non-Gaussian data distribution, good generalization simultaneously with high sensitivity to small clusters of patterns representing the useful subclasses of cells. In another aspect, the PPNN is well-suited to a hardware-encoded implementation.
The PPNN according to the invention utilizes a Probability Density Function (PDF) estimator. As a result, the PPNN is suitable for use as a Probability Density Function estimator or as a general classifier in pattern recognition. The PPNN uses the training data to create an N-dimensional PDF array which in turn is used to estimate the likelihood of a feature vector being within the given classes as will now be described.
To create and train the PPNN, the input space is partitioned into m x m x ... x m discrete nodes (if the discrete input space is known, then m is usually selected to be less than the range). For example, for a 3-D PDF array, creating a 2^6 x 2^6 x 2^6 grid is sufficient.
As shown in Fig. 4, the next step involves mapping or projecting the influence of each training pattern onto the neighbouring nodes. This is accomplished according to expression (2) as shown below:

$$ P_j[x_0, x_1, \ldots, x_{n-1}] = P_{j-1}[x_0, x_1, \ldots, x_{n-1}] + d_j[x_0, x_1, \ldots, x_{n-1}] $$

$$ d_j[x_0, x_1, \ldots, x_{n-1}] = \begin{cases} 1, & \text{if } r_k = 0 \\ 0, & \text{if } r_k \ge r_0 \\ \dfrac{1 - r_k}{\sum_{i=0}^{2^n - 1} (1 - r_i)}, & \text{if } r_k < r_0 \end{cases} \qquad (2) $$

where $P_j[x_0, x_1, \ldots, x_{n-1}]$ is the current value of the $(x_0, x_1, \ldots, x_{n-1})$ node after the j'th iteration; $d_j[x_0, x_1, \ldots, x_{n-1}]$ represents the influence of the j'th input pattern on the $(x_0, x_1, \ldots, x_{n-1})$ node; $r_k$ is the distance from the pattern to the k'th node; $r_0$ is the minimum distance between two neighbour nodes; and n is the dimension of the space.
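From expression (2), it will be appreciated that $\forall j:\ \sum_{k=1}^{2^n} d_j^{(k)} = 1$, where $d_j^{(k)}$ denotes the influence of the j'th pattern on its k'th neighbouring node; that is, the $d_j$ represent normalized values.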
Once the accumulation of $P_N[x_0, x_1, \ldots, x_{n-1}]$ (where j = N, the number of training patterns) is completed, a normalization operation is performed to obtain a total energy value for the PPNN of $E_{PPN} = 1$. The normalized values (i.e. $P^*$) for the PPNN are calculated according to expression (3) as follows:

$$ P^*_N[x_0, x_1, \ldots, x_{n-1}] = P_N[x_0, x_1, \ldots, x_{n-1}] / N \qquad (3) $$
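The projection and normalization steps of expressions (2) and (3) can be sketched in a few lines. The sketch rests on several assumptions that are not taken from the description: patterns are given in grid units so that r0 = 1, distances are Euclidean, the denominator of expression (2) is summed only over the neighbouring nodes closer than r0 (so that the influences of each pattern sum to one), and the N-dimensional array is stored sparsely as a dictionary.

```python
import itertools
import math

def neighbour_weights(pattern):
    """Influence d_j of one pattern on the 2^n grid nodes surrounding it.
    Coordinates are in grid units, so neighbouring nodes are r0 = 1 apart."""
    r0 = 1.0
    pattern = tuple(pattern)
    base = [math.floor(c) for c in pattern]
    nodes = [tuple(b + o for b, o in zip(base, offsets))
             for offsets in itertools.product((0, 1), repeat=len(pattern))]
    dist = {node: math.dist(node, pattern) for node in nodes}
    for node, r in dist.items():
        if r == 0.0:                      # pattern sits exactly on a node
            return {node: 1.0}
    near = {node: 1.0 - r for node, r in dist.items() if r < r0}
    total = sum(near.values())
    if total == 0.0:                      # no node within r0 (cannot occur for n <= 3)
        return {}
    return {node: v / total for node, v in near.items()}

def train_ppnn(patterns):
    """Accumulate P_j = P_{j-1} + d_j over all patterns, then divide by N."""
    grid = {}
    for p in patterns:
        for node, d in neighbour_weights(p).items():
            grid[node] = grid.get(node, 0.0) + d
    n_patterns = len(patterns)
    return {node: v / n_patterns for node, v in grid.items()}

grid = train_ppnn([(1.0, 2.0, 3.0), (1.5, 2.0, 3.0), (4.25, 4.5, 4.75)])
```

Because every pattern contributes a total influence of one and the accumulated values are divided by the number of patterns, the values stored in `grid` sum to one, which corresponds to the unit total energy described above.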
For feed-forward calculations the trained and normalized nodes $P^*_N[x_0, x_1, \ldots, x_{n-1}]$ and the reverse mapping are utilized according to expression (4) given below, the sum being taken over the $2^n$ nodes neighbouring the pattern:

$$ h_j[x_0, x_1, \ldots, x_{n-1}] = \sum_{i=0}^{2^n - 1} P^*_N[x_0, x_1, \ldots, x_{n-1}]\; d_j[x_0, x_1, \ldots, x_{n-1}] \qquad (4) $$

where the $d_j[x_0, x_1, \ldots, x_{n-1}]$ are calculated according to expression (2) above.
To solve a two-class (i.e. C0 - cytoplasm and C1 - nuclear) application using the PPNN according to the present invention, two networks must be trained, one for each class separately, that is, $P_{C_0}[x_0, x_1, \ldots, x_{n-1}]$ and $P_{C_1}[x_0, x_1, \ldots, x_{n-1}]$. Because both PPNNs are normalized, they can be joined together according to expression (5) below as follows:

$$ P_{C_0/C_1}[x_0, x_1, \ldots, x_{n-1}] = P_{C_0}[x_0, x_1, \ldots, x_{n-1}] - P_{C_1}[x_0, x_1, \ldots, x_{n-1}] \qquad (5) $$
The final decision from expressions (4) and (5) is given by

$$ \text{Pattern}_j \in \begin{cases} C_0, & \text{if } h_j > 0 \\ C_1, & \text{if } h_j \le 0 \end{cases} \qquad (6) $$
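The joining of two class networks and the final decision can be sketched as follows. As before, this is an illustration under assumptions rather than a definitive implementation: grids are sparse dictionaries of normalized node values, patterns are in grid units with r0 = 1, distances are Euclidean, and the pattern influences are normalized over the nodes closer than r0. The two single-node grids at the end are hypothetical stand-ins for trained networks.

```python
import itertools
import math

def neighbour_weights(pattern):
    """Influence d_j of a pattern on its surrounding grid nodes (r0 = 1 assumed)."""
    pattern = tuple(pattern)
    base = [math.floor(c) for c in pattern]
    nodes = [tuple(b + o for b, o in zip(base, offsets))
             for offsets in itertools.product((0, 1), repeat=len(pattern))]
    dist = {node: math.dist(node, pattern) for node in nodes}
    for node, r in dist.items():
        if r == 0.0:
            return {node: 1.0}
    near = {node: 1.0 - r for node, r in dist.items() if r < 1.0}
    total = sum(near.values())
    return {node: v / total for node, v in near.items()} if total else {}

def join(grid_c0, grid_c1):
    """Node-wise difference of the two normalized class grids, as in expression (5)."""
    return {node: grid_c0.get(node, 0.0) - grid_c1.get(node, 0.0)
            for node in set(grid_c0) | set(grid_c1)}

def respond(grid, pattern):
    """Feed-forward response h: node values weighted by the pattern's influences."""
    return sum(grid.get(node, 0.0) * d
               for node, d in neighbour_weights(pattern).items())

def classify(joined, pattern):
    """Decision rule of expression (6): C0 if h > 0, otherwise C1."""
    return "C0" if respond(joined, pattern) > 0 else "C1"

# Hypothetical stand-ins for two trained, normalized class networks.
cytoplasm_grid = {(0, 0, 0): 1.0}
nuclear_grid = {(3, 3, 3): 1.0}
joined = join(cytoplasm_grid, nuclear_grid)
```

A pattern close to the cytoplasm node, such as `(0.2, 0.0, 0.0)`, yields a positive response and is assigned to C0, whereas a pattern on the nuclear node `(3.0, 3.0, 3.0)` yields a negative response and is assigned to C1. A pattern far from all trained nodes gives a response of zero and therefore also falls into C1 under the rule as written.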
While the PPNN according to the present invention is particularly suited to handle multi-modal data distributions, in many practical situations there will be an unbalanced data set. This means that some clusters will contain fewer data samples than other clusters and as a result some natural clusters which were represented with a small number of patterns could be lost after PPNN joining. To solve this problem there is provided an algorithm which equalizes all natural clusters according to another aspect of the invention.
Reference is next made to Fig. 5, which shows in flow chart form an embodiment of a clustering algorithm 200 according to the present invention. All training patterns, i.e. N samples, in block 202 and a given number (i.e. "K") of clusters in block 204 are applied to a K-mean clustering operation block 206. The clustering operation 206 clusters the input data and generates clusters 1 through K (block 208). Next, all the training data which belongs to an i-th cluster is extracted into a separate sub-class. For each sub-class of training data, a normalized PPNN, i.e. Ei = 1, is created (block 210). The final operation in the clustering algorithm comprises joining all of the K PPNNs together and normalizing the resulting PPNN by dividing all nodes by the number of clusters (block 212). The operation performed in block 212 may be expressed as follows:
$$ E = (E_1 + \ldots + E_K) / K = 1 $$
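The effect of the joining operation in block 212 can be illustrated with a short sketch. It assumes that the K-mean clustering has already assigned the patterns to clusters, and it uses a simplified nearest-node histogram as a stand-in for a trained, normalized PPNN; both function names are hypothetical.

```python
def normalized_grid(patterns):
    """Stand-in for a normalized PPNN of one cluster: a nearest-node histogram
    scaled so that its total energy is 1 (a simplification for illustration)."""
    grid = {}
    for p in patterns:
        node = tuple(round(c) for c in p)
        grid[node] = grid.get(node, 0.0) + 1.0 / len(patterns)
    return grid

def join_equalized(cluster_grids):
    """E = (E_1 + ... + E_K) / K: sum the K normalized grids node by node and
    divide every node by the number of clusters."""
    k = len(cluster_grids)
    joined = {}
    for grid in cluster_grids:
        for node, value in grid.items():
            joined[node] = joined.get(node, 0.0) + value / k
    return joined

# An unbalanced data set: 99 patterns in one cluster, a single pattern in the other.
large_cluster = [(1.0, 1.0, 1.0)] * 99
small_cluster = [(5.0, 5.0, 5.0)]
joined = join_equalized([normalized_grid(large_cluster),
                         normalized_grid(small_cluster)])
```

After joining, each cluster carries half of the total energy regardless of how many patterns it contains, so the small cluster is not swamped by the large one.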
It will also be understood that the clustering algorithm 200 may be applied to each class separately before creating the final classifier according to expression (6) above, as follows. The optimal number of clusters for each of the two classes may be found from an analysis of the final PPNN performance (expression (6) above). First, the number of clusters for PPN2 is fixed at 1 and the optimal number of clusters for PPN1 is found. Next, the reverse variant is modelled: PPN1 = 1 ∧ PPN2 = opt. Lastly, the two optimal networks, PPN1 opt ∧ PPN2 opt, are combined together according to expression (6).
While the neural network assisted multi-spectral segmentation process is described with a Probability Projection Neural Network according to the present invention, it will be understood that other conventional neural networks are suitable, including for example, Backpropagation (BP) networks, Elliptic Basis Function (EBF) networks, and Learning Vector Quantization (LVQ) networks. However, the PPNN is preferred. The performance results of the Probability Projection Neural Net have been found to exceed those achieved by conventional networks.
According to another aspect of the present invention, the neural network assisted multi-spectral segmentation process is implemented as a hardware-encoded procedure embedded in conventional FPGA (Field Programmable Gate Array) logic as part of a special-purpose computer.
The hardware implementation of this network is found in the form of a look-up table contained in a portion of hardware memory (Fig. 6). As described above, the neural network 20 comprises three input nodes and a single, binary output node. The structure of the neural network 20 according to the present invention also simplifies the hardware implementation of the network.
As shown in Fig. 6, the three input nodes correspond to three optical bands 301, 302, 303 used in gathering the images. The images taken in the 530 nm and 630 nm bands have 7 bits of useful resolution while the 577 nm band retains all 8 bits. (The 577 nm band is centered on the nucleus.) The performance of the neural network 20 is then determined for all possible combinations of these three inputs. Since there are 22 bits in total, there are 2^22, or approximately 4.2 million, possible combinations. To create the look-up table, all input pixels in the space (2^7 x 2^7 x 2^8 variants for the three images in the present embodiment) are scanned and the look-up table is filled with the PPNN decision (i.e. 1: pixel belongs to a nucleus; 0: pixel does not belong to a nucleus) for each of these pixel combinations.
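Filling the look-up table by exhaustive scanning may be illustrated by the following sketch. The `classify` callback stands in for the trained network's binary decision and is hypothetical; the bit widths default to the 7, 7 and 8 bits of the present embodiment, which gives 2^22 = 4,194,304 entries.

```python
def build_lut(classify, bits=(7, 7, 8)):
    """Scan every input combination and store the 0/1 decision for each."""
    b1, b2, b3 = bits
    table = bytearray(1 << (b1 + b2 + b3))  # one entry per combination
    address = 0
    for v1 in range(1 << b1):          # first band, e.g. 530 nm
        for v2 in range(1 << b2):      # second band, e.g. 630 nm
            for v3 in range(1 << b3):  # third band, e.g. 577 nm
                # Nested iteration visits addresses in the same order as
                # concatenating the three binary values v1, v2, v3.
                table[address] = 1 if classify(v1, v2, v3) else 0
                address += 1
    return table
```

Once the table is built, classifying a pixel costs a single memory read regardless of how complex the decision surface of the network is. This sketch stores one byte per entry for simplicity; a hardware table would need only one bit per entry.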
The coding of the results (i.e. outputs) of the neural network comprises assigning each possible combination of inputs a unique address 304 in a look-up table 305 stored in memory. The address 304 in the table 305 is formed by joining together the binary values of the three channels, indicated by 306, 307, 308, respectively, in Fig. 6. For example, as shown in Fig. 6, the pixel for the image from the first channel 301 (i.e. 530 nm) is binary 0101011, the pixel for the image from the second channel 302 (i.e. 630 nm) is binary 0101011, and the pixel for the image from the third channel 303 (i.e. 577 nm) is binary 00101011. Concatenated together, the binary representations 306, 307, 308 form the address 304, which is binary 0101011010101100101011. The address 304 points to a location in the look-up table 305 (i.e. memory) which stores a single binary value 309 that represents the response of the neural network to this combination of inputs, e.g. a logic 0 at memory location 0101011010101100101011 signifies that the pixel in question does not belong to the nucleus.
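The address formation may be illustrated with the worked example of Fig. 6. The function name and the range check are illustrative assumptions; the bit layout (7 bits, then 7 bits, then 8 bits, most significant first) follows the concatenation order given above.

```python
def lut_address(p530, p630, p577):
    """Concatenate a 7-bit, a 7-bit and an 8-bit pixel value into a 22-bit address."""
    if not (0 <= p530 < 128 and 0 <= p630 < 128 and 0 <= p577 < 256):
        raise ValueError("pixel value outside its channel's bit width")
    # Shift each value past the bits of the channels that follow it.
    return (p530 << 15) | (p630 << 8) | p577


# Worked example from Fig. 6.
address = lut_address(0b0101011, 0b0101011, 0b00101011)
print(format(address, "022b"))  # prints 0101011010101100101011
```

A classification then amounts to reading the stored binary value at that address in the look-up table.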
The hardware-encoding of NNA-MSS offers two advantages. First, it allows the process to execute at a high speed while making a complex decision. Second, as experimental data is further tabulated and evaluated, more complex decision spaces can be utilized to improve segmentation accuracy. Thus, an algorithm according to the present invention can be optimized further by the adjustment of a table of coefficients that describe the neural-network connection weights without the necessity of altering the system architecture.
The present invention may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. Therefore, the presently discussed embodiments are considered to be illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein.

Claims

WHAT IS CLAIMED IS:
1. A method for identifying nuclear and cytoplasmic objects in a biological specimen, said method comprising the steps of:
(a) acquiring a plurality of images of said biological specimen;
(b) identifying cellular material from said images and creating a cellular material map;
(c) applying a neural network to said cellular material map and classifying nuclear and cytoplasmic objects from said images.
2. The method as claimed in claim 1, wherein said step of acquiring a plurality of images comprises capturing three digitized images of a micrographic scene for said biological specimen.
3. The method as claimed in claim 2, wherein said step of creating a cellular material map comprises a threshold operation for identifying regions in said images containing cellular material.
4. The method as claimed in claim 3, further including the application of dilation and erosion operations to said cellular material map.
5. The method as claimed in claim 1, wherein said step of applying a neural network comprises training said neural network with examples of types of nuclear and cytoplasmic objects to be classified, and said training step including backpropagating errors in classification of said examples.
6. The method as claimed in claim 1, wherein said step of classifying nuclear and cytoplasmic objects comprises determining a threshold surface in three-dimensional space, and said nuclear and cytoplasmic objects being separated by said three-dimensional space.
7. The method as claimed in claim 6, wherein said neural network comprises a probability projection neural network.
8. The method as claimed in claim 7, wherein said probability projection neural network utilizes a probability density function estimator to estimate a feature vector being within given classes.
9. The method as claimed in claim 8, further including the step of equalizing clusters of data appearing in said images.
10. A system for identifying nuclear and cytoplasmic objects in a biological specimen, said system comprising:
(a) image acquisition means for acquiring a plurality of images of said biological specimen;
(b) processing means for processing said images and generating a cellular material map identifying cellular material;
(c) neural processor means for processing said cellular material map and including means for classifying nuclear and cytoplasmic objects from said images.
11. The system as claimed in claim 10, wherein said neural processor means comprises a look-up table stored in memory having decision outputs stored in addressable locations of said memory, and including addressing means for generating an address to said memory for reading said decision output corresponding to a combination of said image inputs.
12. The system as claimed in claim 11, wherein said addressing means comprises means for combining binary values corresponding to said images and forming an address for accessing said memory from said combined binary values.
13. The system as claimed in claim 10, wherein said neural processor means comprises a probability projection neural network and includes a probability density function estimator to estimate a feature vector being within given classes.
14. The system as claimed in claim 13, wherein said neural processor means includes equalization means for equalizing clusters of data in said images.
15. The system as claimed in claim 10, wherein said neural processor means includes means for determining a threshold surface in three-dimensional space, said nuclear and cytoplasmic objects being separated by said three-dimensional space.
16. A hardware-encoded neural processor for classifying input data, said hardware-encoded neural processor comprising:
(a) a memory having a plurality of addressable storage locations;
(b) said addressable storage locations containing classification information associated with the input data;
(c) address generation means for generating an address from said input data for accessing the classification information stored in said memory for selected input data.
17. The device as claimed in claim 16, wherein said input data comprises image pixels in a digitized image of cellular material.
18. The device as claimed in claim 17, wherein said classification information comprises a binary digit stored in each of said addressable locations of said memory, one state of said binary digit indicating that said input data belongs to a predetermined class, and the other state of said binary digit indicating that the input data is outside said class.
PCT/CA1996/000619 1995-09-19 1996-09-18 A neural network assisted multi-spectral segmentation system WO1997011350A2 (en)

Priority Applications (5)

Application Number Priority Date Filing Date Title
JP9512263A JPH11515097A (en) 1995-09-19 1996-09-18 Neural network assisted multispectral segmentation system
AU69214/96A AU726049B2 (en) 1995-09-19 1996-09-18 A neural network assisted multi-spectral segmentation system
EP96929994A EP0850405A2 (en) 1995-09-19 1996-09-18 A neural network assisted multi-spectral segmentation system
CA002232164A CA2232164A1 (en) 1995-09-19 1996-09-18 A neural network assisted multi-spectral segmentation system
US09/040,378 US6463425B2 (en) 1995-09-19 1998-03-18 Neural network assisted multi-spectral segmentation system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US396495P 1995-09-19 1995-09-19
US60/003,964 1995-09-19

Related Child Applications (3)

Application Number Title Priority Date Filing Date
US09/040,378 A-371-Of-International US6463425B2 (en) 1995-09-19 1998-03-18 Neural network assisted multi-spectral segmentation system
US09/040,378 Continuation US6463425B2 (en) 1995-09-19 1998-03-18 Neural network assisted multi-spectral segmentation system
US09/970,610 Continuation US20020123977A1 (en) 1995-09-19 2001-10-04 Neural network assisted multi-spectral segmentation system

Publications (2)

Publication Number Publication Date
WO1997011350A2 true WO1997011350A2 (en) 1997-03-27
WO1997011350A3 WO1997011350A3 (en) 1997-05-22

Family

ID=21708431

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CA1996/000619 WO1997011350A2 (en) 1995-09-19 1996-09-18 A neural network assisted multi-spectral segmentation system

Country Status (6)

Country Link
US (2) US6463425B2 (en)
EP (1) EP0850405A2 (en)
JP (1) JPH11515097A (en)
AU (1) AU726049B2 (en)
CA (1) CA2232164A1 (en)
WO (1) WO1997011350A2 (en)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE50000887D1 (en) * 1999-05-12 2003-01-16 Siemens Ag ADDRESS READING METHOD
JP2005331394A (en) * 2004-05-20 2005-12-02 Olympus Corp Image processor
US20060020563A1 (en) * 2004-07-26 2006-01-26 Coleman Christopher R Supervised neural network for encoding continuous curves
US20070036467A1 (en) * 2004-07-26 2007-02-15 Coleman Christopher R System and method for creating a high resolution material image
US20060017740A1 (en) * 2004-07-26 2006-01-26 Coleman Christopher R Diurnal variation of geo-specific terrain temperatures in real-time infrared sensor simulation
US7709796B2 (en) * 2005-02-25 2010-05-04 Iscon Video Imaging, Inc. Methods and systems for detecting presence of materials
US7627540B2 (en) 2005-06-28 2009-12-01 Neurosciences Research Foundation, Inc. Addressing scheme for neural modeling and brain-based devices using special purpose processor
US7533071B2 (en) * 2005-06-28 2009-05-12 Neurosciences Research Foundation, Inc. Neural modeling and brain-based devices using special purpose processor
US7765029B2 (en) * 2005-09-13 2010-07-27 Neurosciences Research Foundation, Inc. Hybrid control device
US8117137B2 (en) 2007-04-19 2012-02-14 Microsoft Corporation Field-programmable gate array based accelerator system
US8341100B2 (en) * 2008-07-03 2012-12-25 Nec Laboratories America, Inc. Epithelial layer detector and related methods
JPWO2010021043A1 (en) * 2008-08-21 2012-01-26 グローリー株式会社 Cash management system
US8301638B2 (en) * 2008-09-25 2012-10-30 Microsoft Corporation Automated feature selection based on rankboost for ranking
US8131659B2 (en) * 2008-09-25 2012-03-06 Microsoft Corporation Field-programmable gate array based accelerator system
US9551700B2 (en) * 2010-12-20 2017-01-24 Milagen, Inc. Device and methods for the detection of cervical disease
JP5333570B2 (en) * 2011-12-21 2013-11-06 富士ゼロックス株式会社 Image processing apparatus, program, and image processing system
US9053429B2 (en) 2012-12-21 2015-06-09 International Business Machines Corporation Mapping neural dynamics of a neural model on to a coarsely grained look-up table
US9087301B2 (en) 2012-12-21 2015-07-21 International Business Machines Corporation Hardware architecture for simulating a neural network of neurons
US9373059B1 (en) * 2014-05-05 2016-06-21 Atomwise Inc. Systems and methods for applying a convolutional network to spatial data
US9971966B2 (en) 2016-02-26 2018-05-15 Google Llc Processing cell images using neural networks
US10546237B2 (en) 2017-03-30 2020-01-28 Atomwise Inc. Systems and methods for correcting error in a first classifier by evaluating classifier output in parallel
GB201705876D0 (en) 2017-04-11 2017-05-24 Kheiron Medical Tech Ltd Recist
GB201705911D0 (en) * 2017-04-12 2017-05-24 Kheiron Medical Tech Ltd Abstracts
US10902577B2 (en) 2017-06-19 2021-01-26 Apeel Technology, Inc. System and method for hyperspectral image processing to identify object
US10902581B2 (en) 2017-06-19 2021-01-26 Apeel Technology, Inc. System and method for hyperspectral image processing to identify foreign object
WO2019028004A1 (en) * 2017-07-31 2019-02-07 Smiths Detection Inc. System for determining the presence of a substance of interest in a sample
HUE058907T2 (en) 2018-06-14 2022-09-28 Kheiron Medical Tech Ltd Second reader suggestion
US11151356B2 (en) * 2019-02-27 2021-10-19 Fei Company Using convolution neural networks for on-the-fly single particle reconstruction

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1991020048A1 (en) * 1990-06-21 1991-12-26 Applied Electronic Vision, Inc. Cellular analysis utilizing video processing and neural network
WO1992013308A1 (en) * 1991-01-29 1992-08-06 Neuromedical Systems, Inc. Morphological classification system and method
EP0525964A2 (en) * 1991-06-25 1993-02-03 Scitex Corporation Ltd. Apparatus and method for color calibration
US5276771A (en) * 1991-12-27 1994-01-04 R & D Associates Rapidly converging projective neural network
US5276772A (en) * 1991-01-31 1994-01-04 Ail Systems, Inc. Real time adaptive probabilistic neural network system and method for data sorting
EP0587093A2 (en) * 1992-09-08 1994-03-16 Hitachi, Ltd. Information processing apparatus using inference and adaptive learning
US5331550A (en) * 1991-03-05 1994-07-19 E. I. Du Pont De Nemours And Company Application of neural networks as an aid in medical diagnosis and general anomaly detection
EP0710004A2 (en) * 1994-10-27 1996-05-01 Sharp Kabushiki Kaisha Image processing apparatus

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4998284A (en) * 1987-11-17 1991-03-05 Cell Analysis Systems, Inc. Dual color camera microscope and methodology for cell staining and analysis
US4839807A (en) * 1987-08-03 1989-06-13 University Of Chicago Method and system for automated classification of distinction between normal lungs and abnormal lungs with interstitial disease in digital chest radiographs
US5544650A (en) * 1988-04-08 1996-08-13 Neuromedical Systems, Inc. Automated specimen classification system and method
US4965725B1 (en) * 1988-04-08 1996-05-07 Neuromedical Systems Inc Neural network based automated cytological specimen classification system and method
US5734022A (en) * 1990-08-01 1998-03-31 The Johns Hopkins University Antibodies to a novel mammalian protein associated with uncontrolled cell division
US5784162A (en) * 1993-08-18 1998-07-21 Applied Spectral Imaging Ltd. Spectral bio-imaging methods for biological research, medical diagnostics and therapy
US6690817B1 (en) * 1993-08-18 2004-02-10 Applied Spectral Imaging Ltd. Spectral bio-imaging data for cell classification using internal reference
US6665060B1 (en) * 1999-10-29 2003-12-16 Cytyc Corporation Cytological imaging system and method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
IEEE 1994 National Aerospace & Electronics Conference (NAECON) Dayton, US, 23-27 May, volume 2, pages 1090-1097 XP000647232 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP3598195A1 (en) 2018-07-20 2020-01-22 Olympus Soft Imaging Solutions GmbH Method for microscopic assessment
EP3598194A1 (en) * 2018-07-20 2020-01-22 Olympus Soft Imaging Solutions GmbH Method for microscopic assessment
US11199689B2 (en) 2018-07-20 2021-12-14 Olympus Soft Imaging Solutions Gmbh Method for microscopic analysis
CN113515798A (en) * 2021-07-05 2021-10-19 中山大学 Urban three-dimensional space expansion simulation method and device
CN113515798B (en) * 2021-07-05 2022-08-12 中山大学 Urban three-dimensional space expansion simulation method and device

Also Published As

Publication number Publication date
JPH11515097A (en) 1999-12-21
AU6921496A (en) 1997-04-09
US20020123977A1 (en) 2002-09-05
WO1997011350A3 (en) 1997-05-22
EP0850405A2 (en) 1998-07-01
US6463425B2 (en) 2002-10-08
US20020042785A1 (en) 2002-04-11
AU726049B2 (en) 2000-10-26
CA2232164A1 (en) 1997-03-27


Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AL AM AT AU AZ BB BG BR BY CA CH CN CZ DE DK EE ES FI GB GE HU IS JP KE KG KP KR KZ LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK TJ TM TR TT UA UG US UZ VN AM AZ BY KG KZ MD RU TJ TM

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): KE LS MW SD SZ UG AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN

AK Designated states

Kind code of ref document: A3

Designated state(s): AL AM AT AU AZ BB BG BR BY CA CH CN CZ DE DK EE ES FI GB GE HU IS JP KE KG KP KR KZ LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK TJ TM TR TT UA UG US UZ VN AM AZ BY KG KZ MD RU TJ TM

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): KE LS MW SD SZ UG AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
ENP Entry into the national phase

Ref document number: 2232164

Country of ref document: CA

Kind code of ref document: A

Ref document number: 2232164

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 09040378

Country of ref document: US

ENP Entry into the national phase

Ref document number: 1997 512263

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 1996929994

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 1996929994

Country of ref document: EP

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

WWW Wipo information: withdrawn in national office

Ref document number: 1996929994

Country of ref document: EP