WO2000041182A1 - Memory array organization - Google Patents

Publication number
WO2000041182A1
Authority
WO
WIPO (PCT)
Prior art keywords
data path
narrow
narrow data
rdram
memory
Application number
PCT/US1999/028177
Other languages
French (fr)
Inventor
Raymond S. Tetrick
Original Assignee
Intel Corporation
Application filed by Intel Corporation
Priority to EP99959103A (EP1153395A4)
Priority to AU16346/00A (AU1634600A)
Publication of WO2000041182A1
Priority to HK02103244.6A (HK1042374A1)

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00: Error detection; Error correction; Monitoring
    • G06F11/07: Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/08: Error detection or correction by redundancy in data representation, e.g. by using checking codes
    • G06F11/10: Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's
    • G06F11/1008: Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's in individual solid state devices
    • G06F11/1044: Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's in individual solid state devices with specific ECC/EDC distribution
    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00: Error detection; Error correction; Monitoring
    • G06F11/07: Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/08: Error detection or correction by redundancy in data representation, e.g. by using checking codes
    • G06F11/10: Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's
    • G06F11/1008: Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's in individual solid state devices
    • G06F11/1012: Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's in individual solid state devices using codes or arrangements adapted for a specific type of error
    • G06F11/1028: Adjacent errors, e.g. error in n-bit (n>1) wide storage units, i.e. package error

Abstract

The memory array of a server device organizes conventional desktop memory so as to be able to perform error correction. Each one of several Rambus Direct Random Access Memory ("RDRAM™") devices (101-1 to 101-4) transfers one group of bits of a data word across a corresponding channel (203-1 to 203-4). An additional RDRAM™ device (101-5) transfers data used for performing error correction, including chip kill, for the data stored in the RDRAM™ devices.

Description

MEMORY ARRAY ORGANIZATION
Technical Field
This invention relates generally to memory arrays. In particular, the present invention relates to the organization of a memory array with error correction.
Background Art
Server devices in a network typically have more stringent memory processing requirements than desktop personal computers. In particular, it is desirable that servers reliably store a great deal of information and quickly distribute that information in response to requests from other devices in the network. There are numerous transactions, such as for bank accounts, in which it is desirable that the data be correctly stored and that an error be covered or corrected as software is running. For example, if the data is a credit card account number, corruption of the data could result in the wrong account being charged. Consequently, server devices should be able to either correct erroneous data stored in their memory and continue processing or, if the data cannot be corrected, stop the transaction and provide an error notice.
Corruption sometimes occurs in the storage medium of the memory itself. Therefore, servers typically have error correction capability support for the stored data in the memory interface. This error correction may include, for example, Single Bit Correct/Double Bit Detect ("SBCDBD") and Double Bit Correct/Triple Bit Detect ("DBCTBD"). Some server memory arrays also have a "chip kill" feature - the ability to detect the complete or substantial failure of a single memory device in the array.
Some dynamic random access memory ("DRAM") arrays in servers are specifically designed to use error correction codes ("ECC"), which are additional memory bits stored along with the data, to detect and correct errors of the data stored in the memory. Full error correction codes employ at least one or two extra bits for each 8-bit byte of data. ECC memories in server devices storing 16-bit data frequently have 3 additional bits used for error correction.
Some memory arrays specifically designed for these servers use 72 bit data words (to provide eight additional bits for error correction) instead of the 64 bit data path width of the standard memory and memory interface used in desktop systems without any error detection circuitry. But such specialized memory arrays are less available and higher in cost than the standard desktop memory.
Furthermore, some memory devices such as RDRAM™ brand dynamic random access memory (available from Rambus, Inc., of Mountain View, Calif.) transfer data over a narrow data path having fewer bits than the data words transferred into and out of the processor. These narrow data path memory devices are more expensive and do not easily accommodate the full error correction desired in some environments such as in servers. Consequently, it is desirable to devise a manner of organizing such narrow data path memory devices to accommodate full error correction.
Disclosure of the Invention
The present invention is directed to a method of organizing memory devices into a memory array having error correction. In a first aspect, a memory array has Rambus Direct Random Access Memories (RDRAM™s) coupled to respective first RDRAM™ channels, at least one of which stores and transfers a respective mutually exclusive group of the bits of a data word over its respective first RDRAM™ channel in parallel with the other first RDRAM™s. There is also a second RDRAM™ coupled to a respective second RDRAM™ channel, the second RDRAM™ storing and transferring error correction data used in detecting and correcting errors in the data stored in the first RDRAM™s.
Brief Description of the Drawings
Fig. 1 is a generalized block diagram providing a basic illustration of RDRAM™ installation in a computer device.
Fig. 2 is a block diagram illustrating the memory array arrangement of a server according to a first example embodiment of the invention.
Fig. 3 is a block diagram illustrating the memory array arrangement of a server according to a second example embodiment of the invention.
Fig. 4 is a block diagram illustrating the memory array arrangement of a server according to a third example embodiment of the invention.
Best Mode for Carrying Out the Invention
An example application of the invention is in the memory array of a server device storing and transferring error sensitive data. In particular, the example embodiments of the invention seek to utilize and organize widely available desktop memory devices into a memory array in such a manner so as to facilitate error correction and thereby make such devices suitable for use as the building blocks of a memory array in a server. The example embodiment of the invention is implemented with RDRAM™ memory from Rambus, Inc. of Mountain View, California. However, the invention may of course receive application in memory intensive devices other than servers.
Fig. 1 illustrates an example installation of RDRAM™ memories. As shown, they may be mounted on a number of Rambus In-line Memory Module ("RIMM") packaging units 101-1, 101-2, etc., which are electrically coupled in daisy chain fashion to memory interface 104 via Rambus channel 103. RIMMs 101 are substantially similar to DIMMs except, of course, that they have RDRAM™ memories 102-1, 102-2, etc., rather than DRAMs, and also have different sizes. Each RIMM 101 has two connectors instead of one, so that they can be coupled in sequence in daisy chain fashion as shown in Fig. 1. There are electrical performance advantages to such an arrangement. Another advantage is that the size of the memory array can be easily changed by just adding or removing a memory component.
While the architecture shown in Fig. 1 makes it easy to expand capacity by adding another RDRAM™ memory to the daisy chain, the Rambus channel for each daisy chain in this embodiment is restricted to an 8-bit or 16-bit data word (9 or 18 bits if a parity bit is used) - making it unsuitable for server devices which employ a larger data word to accommodate strong error correction. The parity bits in 9-bit and 18-bit RDRAM™ memories are insufficient to support error correction codes sometimes desired for servers. With a Double Bit Correct/Triple Bit Detect ("DBCTBD") code, if any two parity bits fail or any two bits fail, the data is corrected before it is transferred out of the memory device. Also, if one of the memory devices fails entirely, error correction with chip kill can detect the failure and sometimes continue running the memory device in some degraded mode. Potentially, error correction can also be performed upon the detection of three or more failed bits ("triple bit correct"). Error correction frequently involves review of successive parity bits. But of course, the error code supported may be any presently available or later developed error code.
The example embodiments arrange the basic units of RDRAM™ memory to allow for strong error correcting capabilities. Such ECC support for 16 bit data words employs three additional bits. Such ECC support for 64 bits employs 8 additional bits. This strong error correction and data reliability cannot be accomplished using the limited data path widths of RDRAM™ memories and Rambus channels in a conventional desktop memory configuration.
A block diagram of a first example embodiment of a memory array in a server according to the invention is shown in Fig. 2. The example array has four RDRAM™ memories 201-1 to 201-4 (each RDRAM™ memory 201-1 to 201-4 may be made up of several devices as shown in Fig. 1 but is referred to in the singular for convenience) on respective RIMMs 101-1 to 101-4, each RDRAM™ having a 16-bit data path and coupled to memory interface chipset 206 via respective Rambus channels 203-1 to 203-4 and channel interfaces 206-1 to 206-4. The RDRAM™ memories 201-1 to 201-4 and RIMMs 101-1 to 101-4 need not have 16-bit data paths. The data path widths could be 8 bits, for example. RIMMs 101-1 to 101-4 all have the same capacity in this embodiment and are driven by the same clock and control signals such that RDRAM™ memories 201-1 to 201-4 each transfer a respective 16 bits of a 64-bit data word in parallel. Collectively, RDRAM™ memories 201-1 to 201-4 transfer 64 bits of a data word in parallel.
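The parallel transfer described above can be pictured with a short Python sketch. It is only an illustration: the text does not say which 16 bits travel on which channel, so the choice here that lane 0 carries the least significant bits is an assumption, and the names `split_word` and `join_word` are hypothetical.

```python
LANES = 4        # data channels 203-1 to 203-4
LANE_BITS = 16   # data path width of each channel in the example


def split_word(word: int) -> list[int]:
    """Split a 64-bit word into four mutually exclusive 16-bit groups.

    Assumption: lane 0 carries bits 0-15, lane 1 bits 16-31, and so on.
    """
    if not 0 <= word < 1 << (LANES * LANE_BITS):
        raise ValueError("word does not fit in 64 bits")
    mask = (1 << LANE_BITS) - 1
    return [(word >> (i * LANE_BITS)) & mask for i in range(LANES)]


def join_word(groups: list[int]) -> int:
    """Reassemble the 64-bit word from the four per-lane groups."""
    word = 0
    for i, group in enumerate(groups):
        word |= group << (i * LANE_BITS)
    return word
```

Because the groups are mutually exclusive, `join_word(split_word(w))` returns `w` for any 64-bit value.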
In addition, 8 more bits are stored and transferred by RDRAM™ 201-5 on RIMM 101-5' via channel interface 206-5 in memory interface chipset 206 and Rambus channel 203-5 for a total of 72 bits. Memory interface chipset 206 has five separate respective channel interfaces 206-1 to 206-5, one for each Rambus channel 203-1 to 203-5. The channel interface 206-5 for Rambus channel 203-5 is coupled to ECC logic 205 and provides internal ECC support. As part of each Rambus channel 203-1 to 203-5, identification (ID) bits indicate the data path width and size of RDRAM™s 201-1 to 201-5. The data path width of RDRAM™ 201-5, RIMM 101-5' and Rambus channel 203-5 is one half of the width of the data path for RDRAM™s 201-1 to 201-4, RIMMs 101-1 to 101-4 and Rambus channels 203-1 to 203-4. Correspondingly, the capacity of RDRAM™ 201-5 is one half the capacity of each of RDRAM™s 201-1 to 201-4. For purposes of illustrating the relationship only, RDRAM™ memories 201-1 to 201-4 are each shown as having a 2 megabyte (2M) capacity and RDRAM™ memory 201-5 is shown as having a 1 megabyte (1M) capacity. The memories may, of course, have any capacity so long as, for this embodiment, the relationship is maintained between RDRAM™ memory 201-5 and RDRAM™ memories 201-1 to 201-4.
The 8 additional bits from RDRAM™ 201-5 are dedicated to error detection and correction. The memory interface chipset 206 decides and controls what data is read, written and corrected with the memory array as known in the art. It also generates the proper error correction code data to be stored in ECC RDRAM™ 201-5.
Each pair of ECC bits stored and transferred by ECC RDRAM™ 201-5 corresponds to sixteen of the bits in the 64-bit data word stored and transferred by RDRAM™s 201-1 to 201-4. However, the sixteen bits corresponding to the pair of ECC bits are not stored and transferred by any single one of RIMMs 101-1 to 101-4. Instead, the sixteen bits are spread out among RIMMs 101-1 to 101-4.
Alternatively, in a second example embodiment shown in Fig. 3, ECC RDRAM™ 301-5, RIMM 101-5 and Rambus channel 303-5 could each have a sixteen bit data path and ECC RDRAM™ 301-5 could have a capacity which is one-half that of each of RDRAM™s 201-1 to 201-4. For example, if the capacity of each of RDRAM™s 201-1 to 201-4 is 1 megabyte, then the capacity of RDRAM™ 301-5 is 1/2 megabyte. Such a relationship of RDRAM™ capacity could easily be accomplished by the appropriate selection of the number of memory devices on each respective RIMM as discussed above with respect to Fig. 1.
In the second example embodiment, either one of the upper eight bits or lower eight bits of the 16 bits stored and transferred in ECC RDRAM™ 301-5 is arbitrarily accessed by different addressing from memory interface chipset 206 via address signal lines 304-5. This address shifting of one bit provides an advantage similar to that of the first example embodiment. However, it allows the server memory to be made up entirely of RDRAM™, RIMM and Rambus channel components of one single data path width so that a server memory may be built with only one data path width in inventory. As in the first example embodiment, in this embodiment, a server memory array with strong error correction is organized from RDRAM™s not intended to be able to support such strong error correction.
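The one-bit address shift can be sketched as follows. The exact mapping is not given in the text, so this Python sketch assumes the least significant bit of the data-word address selects the upper or lower byte and the remaining bits address the 16-bit ECC location; the function names are hypothetical. Under that assumption two consecutive data words share one ECC location, which is consistent with the ECC RDRAM™ having half the capacity of each data RDRAM™.

```python
def ecc_location(data_address: int) -> tuple[int, int]:
    """Map a data-word address to (ECC device address, byte select).

    Assumption: the address is shifted right by one bit, and the bit that
    is shifted out selects the lower (0) or upper (1) eight ECC bits.
    """
    return data_address >> 1, data_address & 1


def read_ecc_byte(ecc_memory: list[int], data_address: int) -> int:
    """Fetch the 8 ECC bits belonging to one 64-bit data word."""
    ecc_address, upper = ecc_location(data_address)
    return (ecc_memory[ecc_address] >> (8 * upper)) & 0xFF


def write_ecc_byte(ecc_memory: list[int], data_address: int, ecc_byte: int) -> None:
    """Store 8 ECC bits without disturbing the other half of the location."""
    ecc_address, upper = ecc_location(data_address)
    shift = 8 * upper
    kept = ecc_memory[ecc_address] & ~(0xFF << shift) & 0xFFFF
    ecc_memory[ecc_address] = kept | ((ecc_byte & 0xFF) << shift)
```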
A third example embodiment is shown in Fig. 4. This embodiment extends the error detection and correction capability of the first two example embodiments to include DBCTBD and chip kill.
The third example embodiment differs from the first example embodiment insofar as RDRAM™ 401-5, RIMM 101-5 and Rambus channel 403-5 have the same 16-bit data path width as RDRAM™s 201-1 to 201-4, RIMMs 101-1 to 101-4 and Rambus channels 203-1 to 203-4. RDRAM™ 401-5 has the same capacity as each of RDRAM™s 201-1 to 201-4. RDRAM™ 401-5 stores and transfers 16 bits of error correction data. It supports double bit correct and triple bit detect chip kill, with proper encoding, in conjunction with memory interface chipset 206.
Of course, the third example embodiment also is not limited to 16-bit data path widths and RIMMs 101-1 to 101-5 may contain any amount of memory capacity. Other error correction schemes which access memory devices and save some portion of the memories for ECC data may increase the amount of memory that is employed.
One of the advantages of the example embodiments is that one RIMM can be designated and reserved for error correction regardless of the capacity of the memory array. Another advantage is that, since all three example embodiments of the invention utilize a fifth Rambus channel, the same memory interface chipset 206 could be used to support error correction in all three of the example embodiments as well as other embodiments. In this way, either simple single bit error correction or complicated, more expensive, multi-bit correction could be supported with the same or a similar memory interface chipset and memory organization.
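For comparison, the three example embodiments can be tabulated in a short Python sketch. The figures are taken from the description above (four 16-bit data channels plus one ECC channel); the class and field names are illustrative assumptions.

```python
from dataclasses import dataclass

DATA_LANES = 4        # RDRAMs 201-1 to 201-4
DATA_PATH_BITS = 16   # data path width of each data channel in the examples


@dataclass(frozen=True)
class Embodiment:
    name: str
    ecc_path_bits: int         # data path width of the fifth (ECC) channel
    ecc_bits_per_word: int     # ECC bits stored per 64-bit data word
    ecc_capacity_ratio: float  # ECC RDRAM capacity relative to one data RDRAM


EMBODIMENTS = [
    Embodiment("first (Fig. 2)", 8, 8, 0.5),
    Embodiment("second (Fig. 3)", 16, 8, 0.5),
    Embodiment("third (Fig. 4)", 16, 16, 1.0),
]


def stored_bits_per_word(e: Embodiment) -> int:
    """Total bits stored for each data word, including ECC bits."""
    return DATA_LANES * DATA_PATH_BITS + e.ecc_bits_per_word


def ecc_overhead(e: Embodiment) -> float:
    """ECC storage as a fraction of the data storage in the array."""
    return e.ecc_capacity_ratio / DATA_LANES
```

Under these figures the first two embodiments store 72 bits per word with 12.5% storage overhead, and the third stores 80 bits per word with 25% overhead.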
An advantage of the example embodiment comes from the use of RDRAM™ memory devices. While these devices have high performance, the Rambus channel interconnect technology for the devices can only have 8, 9, 16 or 18 bits. The example embodiments according to the invention allow a memory array to utilize Rambus devices of 8-bit widths or 16-bit widths to obtain the high performance characteristics thereof and still support strong error correction. Of course, the example embodiments of the invention are not limited to personal computers. Indeed, the invention is particularly useful for any computing device employing the high memory performance of Rambus and strong error correction. The invention may be used in any device in which a high degree of data storage reliability is desired.
Other features of the invention may be apparent to those skilled in the art from the detailed description of the exemplary embodiments and claims when read in connection with the accompanying drawings. While the foregoing and following written and illustrated disclosure focuses on disclosing exemplary embodiments of the invention, it should be understood that the same is by way of illustration and example only, is not to be taken by way of limitation and may be modified in learned practice of the invention. While the foregoing has described what are considered to be exemplary embodiments of the invention, it is understood that various modifications may be made therein and that the invention may be implemented in various forms and embodiments, and that it may be applied in numerous applications, only some of which have been described herein. It is intended by the following claims to claim all such modifications and variations.

Claims

1. A memory array comprising: a plurality of narrow data path memory devices coupled to respective first narrow data path channels, at least one of the plurality of first narrow data path memory devices storing and transferring a respective mutually exclusive group of the bits of a data word over its respective first narrow data path channel in parallel with the other first narrow data path memory devices; and a second narrow data path memory device coupled to a respective second narrow data path channel, said second narrow data path memory device storing and transferring error correction data used in detecting and correcting errors in the data stored in said plurality of first narrow data path memory devices.
2. The memory array recited in claim 1, wherein each of said first narrow data path memory devices is adapted to store and transfer a mutually exclusive group of bits.
3. The memory array recited in claim 2, wherein said plurality of first narrow data path memory devices comprises four narrow data path memory devices and the width of said data word is four times that of each mutually exclusive group of bits.
4. The memory array recited in claim 1, wherein the narrow data path memory devices comprise a plurality of Rambus Direct Random Access Memories (RDRAMs).
5. The memory array recited in claim 4, wherein the second RDRAM is one-half the size of the first RDRAMs.
6. The memory array recited in claim 5, wherein the data path width of the second RDRAM is one-half the data path width of the first RDRAM.
7. The memory array recited in claim 5, wherein the data path width of the second RDRAM is the same as the data path width of the first RDRAM.
8. The memory array recited in claim 7, wherein the addressing of the second RDRAM is adapted to be shifted by one bit to arbitrarily transfer either the upper bits or the lower bits of the error correction data.
9. A server device comprising: a memory array having: a plurality of first narrow data path memory devices coupled to respective first narrow data path channels, at least one of the plurality of first narrow data path memory devices storing and transferring a respective mutually exclusive group of the bits of a data word over its respective first narrow data path channel in parallel with the other first narrow data path memory devices; and a second narrow data path memory device coupled to a respective second narrow data path channel, said second narrow data path memory device storing and transferring error correction data used in detecting and correcting errors in the data stored in said plurality of first narrow data path memory devices; and a memory interface chipset, said memory interface chipset controlling the transfer of data to and from said first narrow data path memory devices and the transfer of error correction data to and from said second narrow data path memory device.
10. A server device as recited in claim 9, said memory interface chipset having channel interfaces respectively coupled to said first narrow data path memory devices and an ECC channel interface coupled to said second narrow data path memory device.
11. A server device as recited in claim 10, said memory interface chipset having ECC logic coupled to said ECC channel interface.
12. The memory array recited in claim 9, wherein each of said first narrow data path memory devices is adapted to store and transfer a mutually exclusive group of bits.
13. The memory array recited in claim 12, wherein said plurality of first narrow data path memory devices comprises four narrow data path memory devices and said data word is comprised of four times that of each mutually exclusive group of bits.
14. The memory array recited in claim 9, wherein the narrow data path memory devices comprise Rambus In-line Memory Modules (RIMMs) having a plurality of Rambus Direct Random Access Memories (RDRAMs).
15. The memory array recited in claim 14, wherein the second RDRAM is one-half the size of the first RDRAMs.
16. The memory array recited in claim 15, wherein the data path width of the second RDRAM is one-half the data path width of the first RDRAM.
17. The memory array recited in claim 15, wherein the data path width of the second RDRAM is the same as the data path width of the first RDRAM.
18. The memory array recited in claim 17, wherein the addressing of the second RDRAM is shifted by one bit to arbitrarily transfer either the upper bits or the lower bits of the data word.
19. A method of performing error correction in a memory, said method comprising: transferring data words of a predetermined width to and from a plurality of narrow data path memory devices through respective first narrow data path channels, a mutually exclusive group of the bits of the data words being transferred and stored in a respectively corresponding one of the plurality of first narrow data path memory devices in parallel with the other first narrow data path memory devices; transferring error correction data to a second narrow data path memory device through a respective second narrow data path channel; and detecting and correcting errors in the data stored in said plurality of first narrow data path memory devices using said error correction data stored in said second narrow data path memory device.
20. The method recited in claim 19, wherein the width of the error correction data transferred to the second narrow data path memory device is one-half the width of the data words transferred to the first narrow data path memory device.
21. The method recited in claim 19, wherein the width of the error correction data transferred to the second narrow data path memory device is the same as the width of the data words transferred to the first narrow data path memory devices.
22. The method recited in claim 21, comprising further shifting the addressing of the second narrow data path memory device by one bit to arbitrarily transfer either the upper bits or the lower bits of the error correction data.
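The one-bit address shift recited in claims 8 and 22 can be read in more than one way. The Python sketch below models one possible reading, offered as an assumption rather than as the addressing logic of the specification: the error correction device has the same 16-bit data path as a data device but half as many locations, so each location holds the error correction data for two consecutive data words. The class name `EccDevice`, the 8-bit error correction width, and the use of the lowest address bit to pick the lower or upper bits are all hypothetical choices for this sketch.

```python
ECC_BITS = 8  # assumed width of the error correction data per data word
ECC_MASK = (1 << ECC_BITS) - 1


class EccDevice:
    """Toy model: 16-bit-wide device with half as many rows as a data device."""

    def __init__(self, data_rows: int):
        # Two 8-bit error correction values share each 16-bit row.
        self.rows = [0] * ((data_rows + 1) // 2)

    def write(self, data_addr: int, ecc: int) -> None:
        row = data_addr >> 1                # address shifted by one bit
        shift = ECC_BITS * (data_addr & 1)  # lowest bit selects lower or upper bits
        cleared = self.rows[row] & ~(ECC_MASK << shift)
        self.rows[row] = cleared | ((ecc & ECC_MASK) << shift)

    def read(self, data_addr: int) -> int:
        row = data_addr >> 1
        shift = ECC_BITS * (data_addr & 1)
        return (self.rows[row] >> shift) & ECC_MASK
```

Under these assumptions, writing error correction data for data addresses 0 and 1 fills the lower and upper halves of row 0 of the error correction device, which is why that device needs only half the capacity of a data device.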
PCT/US1999/028177 1998-12-30 1999-11-30 Memory array organization WO2000041182A1 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
EP99959103A EP1153395A4 (en) 1998-12-30 1999-11-30 Memory array organization
AU16346/00A AU1634600A (en) 1998-12-30 1999-11-30 Memory array organization
HK02103244.6A HK1042374A1 (en) 1998-12-30 2002-04-30 Memory array organization

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US22318498A 1998-12-30 1998-12-30
US09/223,184 1998-12-30

Publications (1)

Publication Number Publication Date
WO2000041182A1 (en) 2000-07-13

Family

ID=22835413

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1999/028177 WO2000041182A1 (en) 1998-12-30 1999-11-30 Memory array organization

Country Status (5)

Country Link
US (1) US6598199B2 (en)
EP (1) EP1153395A4 (en)
AU (1) AU1634600A (en)
HK (1) HK1042374A1 (en)
WO (1) WO2000041182A1 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5424989A (en) * 1992-08-21 1995-06-13 Kabushiki Kaisha Toshiba Semiconductor memory device
US5490248A (en) * 1993-02-16 1996-02-06 International Business Machines Corporation Disk array system having special parity groups for data blocks with high update activity

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4796222A (en) * 1985-10-28 1989-01-03 International Business Machines Corporation Memory structure for nonsequential storage of block bytes in multi-bit chips
JP2583547B2 (en) * 1988-01-13 1997-02-19 株式会社日立製作所 Semiconductor memory
US4993028A (en) * 1988-09-07 1991-02-12 Thinking Machines Corporation Error detection and correction coding
US5584034A (en) * 1990-06-29 1996-12-10 Casio Computer Co., Ltd. Apparatus for executing respective portions of a process by main and sub CPUS
US5457703A (en) * 1990-11-21 1995-10-10 Hitachi, Ltd. Array disk system and control method thereof
US5345565A (en) * 1991-03-13 1994-09-06 Ncr Corporation Multiple configuration data path architecture for a disk array controller
US5463643A (en) * 1994-03-07 1995-10-31 Dell Usa, L.P. Redundant memory channel array configuration with data striping and error correction capabilities
DE69421925T2 (en) * 1994-09-30 2000-03-16 St Microelectronics Srl Storage device with error detection and correction and method for writing and erasing the storage device
US5539875A (en) * 1994-12-30 1996-07-23 International Business Machines Corporation Error windowing for storage subsystem recovery
US5745671A (en) * 1995-02-28 1998-04-28 International Business Machines Corporation Data storage system with localized XOR function
US5680539A (en) * 1995-07-11 1997-10-21 Dell Usa, L.P. Disk array system which performs data reconstruction with dynamic load balancing and user-specified disk array bandwidth for reconstruction operation to maintain predictable degradation
US6018778A (en) * 1996-05-03 2000-01-25 Netcell Corporation Disk array controller for reading/writing striped data using a single address counter for synchronously transferring data between data ports and buffer memory
US5799324A (en) * 1996-05-10 1998-08-25 International Business Machines Corporation System and method for management of persistent data in a log-structured disk array
DE69827949T2 (en) * 1997-07-28 2005-10-27 Intergraph Hardware Technologies Co., Las Vegas DEVICE AND METHOD FOR DETECTING AND REPORTING MEMORY ERRORS
US6119248A (en) * 1998-01-26 2000-09-12 Dell Usa L.P. Operating system notification of correctable error in computer information
US6003121A (en) 1998-05-18 1999-12-14 Intel Corporation Single and multiple channel memory detection and sizing
US6141747A (en) * 1998-09-22 2000-10-31 Advanced Micro Devices, Inc. System for store to load forwarding of individual bytes from separate store buffer entries to form a single load word
US6061263A (en) 1998-12-29 2000-05-09 Intel Corporation Small outline rambus in-line memory module

Also Published As

Publication number Publication date
US6598199B2 (en) 2003-07-22
US20010001158A1 (en) 2001-05-10
HK1042374A1 (en) 2002-08-09
AU1634600A (en) 2000-07-24
EP1153395A4 (en) 2002-04-17
EP1153395A1 (en) 2001-11-14

Legal Events

Date Code Title Description
ENP Entry into the national phase (Ref country code: AU; Ref document number: 2000 16346; Kind code of ref document: A; Format of ref document f/p: F)
AK Designated states (Kind code of ref document: A1; Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZA ZW)
AL Designated countries for regional patents (Kind code of ref document: A1; Designated state(s): GH GM KE LS MW SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG)
121 EP: the EPO has been informed by WIPO that EP was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101)
WWE WIPO information: entry into national phase (Ref document number: 1999959103; Country of ref document: EP)
REG Reference to national code (Ref country code: DE; Ref legal event code: 8642)
WWP WIPO information: published in national office (Ref document number: 1999959103; Country of ref document: EP)