US20060069947A1 - Apparatus, method and program for the control of storage - Google Patents

Apparatus, method and program for the control of storage

Info

Publication number
US20060069947A1
Authority
US
United States
Prior art keywords
parity
creation
failure monitoring
instruction
storage devices
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US11/008,143
Other versions
US7395451B2 (en)
Inventor
Hideo Takahashi
Tsukasa Makino
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujitsu Ltd
Original Assignee
Fujitsu Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujitsu Ltd
Assigned to FUJITSU LIMITED. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MAKINO, TSUKASA; TAKAHASHI, HIDEO
Publication of US20060069947A1
Application granted
Publication of US7395451B2
Legal status: Expired - Fee Related
Adjusted expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F 3/06 Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 11/00 Error detection; Error correction; Monitoring
    • G06F 11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F 11/08 Error detection or correction by redundancy in data representation, e.g. by using checking codes
    • G06F 11/10 Adding special bits or symbols to the coded information, e.g. parity check, casting out 9's or 11's
    • G06F 11/1076 Parity data used in redundant arrays of independent storages, e.g. in RAID systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 2211/00 Indexing scheme relating to details of data-processing equipment not covered by groups G06F 3/00 - G06F 13/00
    • G06F 2211/10 Indexing scheme relating to G06F 11/10
    • G06F 2211/1002 Indexing scheme relating to G06F 11/1076
    • G06F 2211/1059 Parity-single bit-RAID5, i.e. RAID5 implementations


Abstract

A storage control apparatus is provided that comprises a failure monitoring unit arranged to add points in proportion to detected abnormality to find statistically added points for each of N disk devices, the failure monitoring unit issuing an instruction to re-create parity when the statistically added points come closer to a predefined failure determination point, the failure monitoring unit issuing an instruction to disconnect the failed disk device when the statistically added points exceed the failure determination point; and a parity re-creation unit arranged, when receiving the instruction to re-create parity from the failure monitoring unit, to read the plural pieces of user data from (N-1) disk devices to re-calculate parity data for write into the remaining one (1) disk device for each of all addresses of the plurality of disk devices.

Description

  • CROSS-REFERENCE TO RELATED APPLICATION
  • This application claims the benefit of priority to prior application No. JP 2004-263833, filed Sep. 10, 2004 in Japan.
  • BACKGROUND OF THE INVENTION
  • 1. Field of the Invention
  • The present invention relates generally to a storage control apparatus, method and program for a disk array, etc., securing data redundancy through RAID configuration and, more particularly, to a storage control apparatus and control method and program assuring data recovery based on the redundancy configuration upon failure disconnection of a storage device.
  • 2. Description of the Related Art
  • In a conventional disk array system, data redundancy has been enhanced by configuring a RAID (Redundant Array of Independent Disks) composed of a plurality of disk devices arranged in a disk array, to respond to I/O requests from a host. Though various RAID levels exist, RAID5 has ordinarily been used because it is suited to the I/O requests a host issues in transaction processing. RAID5 writes user data into (N-1) of the N disk devices configuring the disk array, at the same logical block address making up a stripe, and writes into the remaining one a parity generated by EXORing the user data; the disk device into which the parity is written differs from stripe to stripe so that the parities are distributed. In a disk array system having such a RAID5 redundant configuration, when a disk device fails and is degenerated, the user data of the failed disk device can be recovered by implementing the EXOR operation of the user data and the parity read from the other disk devices forming the RAID group together with the failed disk device.
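  • For concreteness, the following Python sketch (not from the patent; the helper name xor_blocks and the block contents are illustrative) shows the two operations just described: the parity of a stripe is the EXOR of its user blocks, and the block of a failed device is recovered by EXORing the surviving blocks with the parity.

```python
from functools import reduce

def xor_blocks(blocks):
    """EXOR a list of equal-length byte blocks (RAID5 parity arithmetic)."""
    return reduce(lambda a, b: bytes(x ^ y for x, y in zip(a, b)), blocks)

# Stripe on a 4-disk array: user data D1..D3 on three disks, parity on the fourth.
D1, D2, D3 = b"\x11" * 4, b"\x22" * 4, b"\x33" * 4
P1 = xor_blocks([D1, D2, D3])        # parity written to the remaining disk

# Degraded read: if the disk holding D2 fails, D2 is rebuilt from the survivors.
assert xor_blocks([D1, D3, P1]) == D2
```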
  • Such a conventional RAID5 disk array system, however, may face the worst possible situation: the data cannot be recovered and is lost because the EXOR operation becomes impossible when, upon a disk device failing and being degenerated, data cannot be read from two or more disk devices including the failed one. The following may be considered causes that render data recovery infeasible upon the occurrence of a failure in a disk device. First is a case where parity consistency has broken down as a result of the parity going abnormal for some reason, such as a design error in the firmware of the disk device. Another is a case where an abnormality has occurred in the medium of a disk device other than the failed one. Furthermore, a portion left unread and unwritten for a long while may exist even at the same logical block address on the same stripe, and such a portion may fail to undergo data recovery because of a medium abnormality occurring there.
  • SUMMARY OF THE INVENTION
  • It is therefore an object of the present invention to provide a storage control apparatus, control method and program that prevent data from becoming unreadable from two or more storage devices contained in a RAID group and prevent the read parity from becoming inconsistent when a storage device has failed, thereby securely obviating the occurrence of the situation where data of the failed storage device is lost without being recovered.
  • The present invention is characterized by a storage control apparatus configured to write plural pieces of user data into (n-1) storage devices of n storage devices and to write parity data calculated from the plural pieces of user data into the remaining one (1) storage device, the storage control apparatus comprising a failure monitoring unit arranged to add points in proportion to detected abnormality to find statistically added points for each of the storage devices, the failure monitoring unit issuing an instruction to re-create parity when the statistically added points come closer to a predefined failure determination point; and a parity re-creation unit arranged, when receiving the instruction to re-create parity from the failure monitoring unit, to read the plural pieces of user data from the (n-1) storage devices to re-calculate parity data for write into the remaining one (1) storage device. The failure monitoring unit issues an instruction to initiate re-creation of parity when the statistically added points reach a given threshold value obtained by multiplying the failure determination point by a coefficient less than 1, for example a coefficient in the range of 0.7 to 0.9. The failure monitoring unit also initiates re-creation of parity when a self-diagnostic abnormality based on the SMART (Self-Monitoring, Analysis and Reporting Technology) feature is output from one of the plurality of storage devices. SMART collects in advance the error rates of the read and write operations of a hard disk, determines from those error rates the time at which the disk itself will become inoperable, and notifies the user thereof so as to urge the user to perform a data backup before operation terminates. SMART is incorporated in the ATA/ATAPI standard. The storage control apparatus of the present invention further comprises a channel adapter connecting to a host, a device adapter connecting the plurality of storage devices to one another, and a central processing module interposed between the channel adapter and the device adapter, the failure monitoring unit being disposed in the central processing module, with the parity re-creation unit disposed in the device adapter, the central processing module instructing the device adapter on re-creation of parity for execution.
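  • As a worked instance of the threshold rule above (the numeric values are those of the embodiment described later; the variable names are illustrative):

```python
FAILURE_POINT = 255      # statistically added points at which the storage
                         # device is judged failed and disconnected
COEFFICIENT = 0.8        # any value in the stated 0.7-0.9 range

RECREATE_THRESHOLD = int(FAILURE_POINT * COEFFICIENT)   # 204 points
# The embodiment below simply uses 200 points, i.e. a coefficient of about 0.78.
```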
  • The present invention provides a storage control method. In the storage control method of the present invention writing plural pieces of user data into (n-1) storage devices of n storage devices and writing parity data calculated from the plural pieces of user data into remaining one (1) storage device, it comprises a failure monitoring step of issuing an instruction to re-create parity depending on the degree of detected abnormality for each of the storage devices; and a parity re-creation step, when receiving the instruction to re-create parity from the failure monitoring step, of reading the plural pieces of user data from the (n-1) storage devices to re-calculate parity data for write into the remaining one (1) storage device. The present invention provides a storage control program. The program of the present invention is operable to drive a computer of a storage control apparatus writing plural pieces of user data into (n-1) storage devices of n storage devices and writing parity data calculated from the plural pieces of user data into remaining one (1) storage device to execute a failure monitoring step of adding points in proportion to detected abnormality to find statistically added points for each of the storage devices, and issuing an instruction to re-create parity when the statistically added points come closer to a predefined failure determination point; and a parity re-creation step, when receiving the instruction to re-create parity from the failure monitoring step, of reading the plural pieces of user data from the (n-1) storage devices to re-calculate parity data for write into the remaining one (1) storage device. Details of the storage control method and program in accordance with the present invention are basically the same as those of the storage control apparatus.
  • According to the present invention, immediately before a storage device contained in the RAID group goes down, the user data are read out and the parity is re-created and written into the storage device for parity, whereby the consistency and reliability of the parity can be secured by the time a specific storage device fails, thereby obviating the worst possible situation where data is lost without correct data being recovered because of inconsistent parity upon the occurrence of a failure of the storage device. Since user data are read out from all the storage devices upon re-creation of the parity, with the re-created parity being written into the storage device for parity, a medium abnormality on any of the storage devices can be detected, and execution of the replacement processing for the medium abnormality securely prevents the worst possible situation where, when the failure of the storage device has occurred, data cannot be read, because of medium abnormalities, from two or more storage devices including the failed storage device and is lost. The above and other objects, features, and advantages of the present invention will become more apparent from the following detailed description with reference to the drawings.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a block diagram of the function configuration of a disk array system employing the present invention;
  • FIGS. 2A and 2B are explanatory views of statistically added points and parity re-creation based on the detection of a failure in the present invention;
  • FIG. 3 is a flowchart of failure monitoring processing effected by a central processing module of FIG. 1;
  • FIG. 4 is a flowchart of other failure monitoring processing effected by the central processing module of FIG. 1;
  • FIG. 5 is a flowchart of parity re-calculation processing effected by a device adapter of FIG. 1; and
  • FIGS. 6A and 6B are flowcharts of other parity re-calculation processing effected by the device adapter of FIG. 1.
  • DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
  • FIG. 1 is a block diagram of the function configuration of a disk array system to which the present invention is applied. In FIG. 1, the disk array system is constituted of a disk array control apparatus 10 acting as a storage controller, and a disk array 14. The disk array control apparatus 10 is provided with a channel adapter 16, central processing modules (PM) 18-1 and 18-2, and device adapters 20-1 and 20-2. The channel adapter 16 is coupled to a host 12 to process interface control for I/O requests from the host 12. When the channel adapter 16 receives a data write operation request or a data read operation request from the host 12, the channel adapter 16 notifies either the central processing module 18-1 or 18-2 of the operation request, and makes direct access to a cache memory disposed in the central processing modules 18-1 and 18-2 to effect data transfer between the channel adapter 16 and the host 12. The central processing modules 18-1 and 18-2 are core modules of the disk array control apparatus 10 and each execute three processings, i.e., resource management, cache memory management and service. The resource management includes management of function module resources and effective control management. The cache memory management includes management of assignment to memory areas disposed in the central processing modules 18-1 and 18-2 and entire cache control. The service provides various services by maintenance tools. The device adapters 20-1 and 20-2 are connected via a fiber channel interface to the disk devices 22-1 to 22-4 acting as storage devices and a stand-by disk device 24 that are disposed in the disk array 14, to provide control of the fiber channel interface, I/O control of the disk devices, RAID control, etc., in this embodiment, RAID control of RAID5. The four disk devices 22-1 to 22-4 disposed in the disk array 14 are arranged to accommodate control having RAID5 redundant configuration provided by the device adapters 20-1 and 20-2, with the additional stand-by disk device 24 acting as a hot standby which replaces any failed disk device. In such a disk array system of the present invention, the central processing modules 18-1 and 18-2 of the disk array control apparatus 10 are provided with failure monitoring units 26-1 and 26-2, whilst the device adapters 20-1 and 20-2 are provided with parity re-creation units 28-1 and 28-2. The failure monitoring units 26-1 and 26-2 disposed in the central processing modules 18-1 and 18-2 accumulate points in proportion to abnormality detected, to obtain statistically added points for each of the four disk devices 22-1 to 22-4 having the RAID5 redundant configuration disposed in the disk array 14. When the statistically added points come closer to a predefined failure determination point, the failure monitoring units 26-1 and 26-2 issue an instruction on re-creation of parity. When the statistically added points exceed the failure determination point, the failure monitoring units 26-1 and 26-2 make a failure determination to issue an instruction on disconnection of the failed disk device. 
When receiving the instruction on parity re-creation from the failure monitoring unit 26-1 or 26-2, the parity re-creation units 28-1 and 28-2 disposed in the device adapters 20-1 and 20-2 execute parity re-creation processing for the four disk devices 22-1 to 22-4 where user data are read out from three disk devices for each of logical block addresses in volumes, i.e., logical areas configured on disks of the disk devices 22-1 to 22-4 to re-calculate the parity data through exclusive OR processing for write into the remaining one disk device. It is to be noted that the disk array control apparatus 10 of the disk array system of FIG. 1 is shown having its minimal system configuration and may further be provided with, as necessary, additional channel adapters, central processing modules, device adapters and disk devices of the disk array 14. The central processing modules 18-1 and 18-2 and the device adapters 20-1 and 20-2 have respective dual configurations such that for I/O requests from the host 12, the central processing module 18-1 and the device adapter 20-1 may act as primary side, for example, with the central processing module 18-2 and the device adapter 20-2 as the secondary side and such that for I/O requests from the host 12, the primary side may be enabled with the secondary side providing backup upon the occurrence of a failure.
  • FIGS. 2A and 2B are explanatory views of the statistically added points and the parity re-creation based on the detection of a failure of the disk devices in the disk array system of the present invention. FIG. 2A shows the disk array control apparatus 10 and the four disk devices 22-1 to 22-4 disposed in the disk array 14 associated therewith. Due to its RAID5 redundant configuration, the disk devices 22-1 to 22-4 each have stripes separated by the logical block addresses A0, A1, A2, A3, . . . in the volumes, i.e., logical areas configured on the disk devices, such that the four disk devices 22-1 to 22-4 are subjected to concurrent execution of data I/O for the stripes at the same logical block address. In the description which follows, the stripes are designated A0, A1, A2 and A3. In the RAID5 redundant configuration, user data is stored in three of the four disk devices 22-1 to 22-4 and parity data is stored in the remaining one disk device. When the stripe at the logical block address A0 is viewed, for example, user data D1, D2 and D3 are stored in the three disk devices 22-1 to 22-3, with the remaining disk device 22-4 storing parity P1 calculated from the EXOR operation of D1, D2 and D3. In this manner, although it is common to all the stripes that the stripes at the same logical block address of the four disk devices 22-1 to 22-4 store three pieces of user data and one piece of parity data, different disk devices store the parity data on a stripe-by-stripe basis so that the parity data is distributed. That is, on the stripe A1 the disk device 22-3 stores parity data P2, on the stripe A2 the disk device 22-2 stores parity data P3, and on the stripe A3 the disk device 22-1 stores parity data P4. In the disk array system having such a RAID5 redundant configuration, when user data D2 on the stripe A0 is to be read out while the disk device 22-2 has failed, for example, user data D1, D3 and parity data P1 are read out from the normal disk devices 22-1, 22-3 and 22-4, and the three pieces of data are EXORed to recover the user data D2 of the failed disk device 22-2 for response to the host. Write of the user data D2 from the host to the failed disk device 22-2 proceeds as follows. Let the user data D1, D2, D3 and parity data P1 prior to the write be old data D1old, D2old, D3old and old parity P1old, respectively. Let the write data from the host be new data D2new. First, the old data D1old, D3old and old parity P1old are read out from the disk devices 22-1, 22-3 and 22-4, respectively, to obtain the old data D2old of the failed disk device 22-2 as
    D2old = D1old (+) D3old (+) P1old
    where (+) represents exclusive OR.
    The new parity P1new is then obtained from the old data D2old, new data D2new and old parity P1old as
    new parity = old parity (+) old data (+) new data, i.e.,
    P1new = P1old (+) D2old (+) D2new
    The new parity P1new thus obtained is written into the disk device 22-4 so that thereafter a read response becomes possible, with the user data D2 recovered by EXORing the user data D1, D3 and parity P1 read in response to a read request for the user data D2 directed to the failed disk device 22-2. When receiving notification of an error via the device adapter 20-1 shown in FIG. 1, for example, the failure monitoring unit 26-1 of the disk array control apparatus 10 accumulates points in proportion to the content of the notified error to obtain the statistically added points for each of the disk devices 22-1 to 22-4 having such a RAID5 redundant configuration. Once the statistically added points exceed a failure determination point, e.g., 255 points, for judging the occurrence of predefined failures of a disk device, the disk device is regarded as having failed and is disconnected from the disk array system and disabled, the RAID5 redundant configuration then handling the processings for read requests and write requests. The errors counted in the statistically added points can be, e.g., abnormality of the medium upon read, abnormality of actuator control, abnormality of a read-related command, abnormality of the power saving capability, and lowering of read performance. Points in proportion to the severity of the errors are predefined for accumulation. In addition to this processing of the statistically added points responsive to error notifications for the disk devices 22-1 to 22-4, the present invention determines when the statistically added points have come closer to the failure determination point of 255 points, at which the disk device would be regarded as having failed, and instructs the device adapter 20-1 on the parity re-creation processing. Below the failure determination point of 255 points, 200 points for example are available as a threshold value at which to issue the instruction on parity re-creation, upon determining that the failure is imminent. Also available as another threshold value are points obtained by multiplying the 255-point failure determination point by, e.g., a coefficient less than 1, such as 0.7 to 0.9.
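  • The degraded-write sequence given by the two equations above can be condensed as follows; this is a sketch under the same assumptions as the earlier example (xor_blocks as defined there, disk I/O elided).

```python
def degraded_write_new_parity(d1_old, d3_old, p1_old, d2_new):
    """New parity when the disk holding D2 has failed and D2new arrives."""
    d2_old = xor_blocks([d1_old, d3_old, p1_old])  # D2old = D1old (+) D3old (+) P1old
    return xor_blocks([p1_old, d2_old, d2_new])    # P1new = P1old (+) D2old (+) D2new
```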
  • FIG. 2A shows the status where, e.g., three medium errors have occurred in the read operation of user data D2 of the disk device 22-2 on the stripe A0 specified by the disk array control apparatus 10. The instruction on parity re-creation is issued when the statistically added points of the disk device 22-2 exceed 200 points as a result of the occurrence of the medium errors. FIG. 2B shows the processing operations for parity re-creation in case the statistically added points of the disk device 22-2 exceed the threshold value. The parity re-creation processing is effected for the four disk devices 22-1 to 22-4 making up the RAID5 group in the disk array 14, while specifying the stripes A0, A1, A2, A3, etc., in sequence. As to the stripe A0 for example, user data D1, D2 and D3 are read out through user data read-out processing 30 from those disk devices 22-1 to 22-3 of the four disk devices 22-1 to 22-4 which store the user data D1, D2 and D3. The three pieces of user data D1, D2 and D3 are then subjected to parity calculation processing 32 to effect the EXOR operation
    P1 = D1 (+) D2 (+) D3
    to calculate the parity. The calculated parity is then written into the disk device 22-4 on the stripe A0 through parity write-in processing 34, to thereby complete the parity re-creation of the stripe A0. Similar parity re-creation processing is repeated for each of the logical block addresses indicated by the stripes A1, A2, A3, etc. By virtue of such parity re-creation processing effected immediately before the occurrence of a failure of a disk device, in which user data are read out from three disk devices of all the disk devices 22-1 to 22-4 in the disk array 14 with parity data being written into the remaining one, the read-out operation of the user data and the write-in operation of the parity data are effected without exception for all the stripes (all the logical block addresses) of all of the disk devices.
  • As a result, all the stripes (all the logical block addresses) of the disk devices 22-1 to 22-4 are checked. Thus, in case a medium failure occurs, a replacement area is secured by the replacement processing, into which the data or parity recovered by RAID5 is written, whereby the check for medium failures can be done over all the areas of all the disk devices 22-1 to 22-4 prior to the occurrence of a failure of the disk device 22-2. For all the stripes (all the logical block addresses) of all the disk devices 22-1 to 22-4, the distributedly stored parity data are calculated and written upon read-out of the user data on the same stripe, thus assuring the consistency and effectiveness of the parity data. In consequence, if the statistically added points exceed 255 points, the failure determination point, as a result of error notifications after the parity re-creation, so that the disk device 22-2 is disconnected as a failed disk device, then the recovery and the assurance of parity consistency based on the replacement processing against medium abnormality have already been completed by the parity re-creation operation effected immediately before the occurrence of the failure. Thus, from the remaining three normal disk devices 22-1, 22-3 and 22-4, the recovery of read data in conformity with the RAID5 redundant configuration and the rewrite of the parity data corresponding to the write-in of write data can be conducted normally, thereby securely obviating the occurrence of the worst possible situation where RAID5-based data recovery becomes impossible, leading to lost data, as a result of failing to read out the data from two or more disk devices including the failed disk device or of lacking parity data consistency.
  • FIG. 3 is a flowchart of failure monitoring processing effected by the failure monitoring units 26-1 and 26-2 disposed in the central processing modules 18-1 and 18-2 of FIG. 1. In FIG. 3, at step S1 points are added to the statistically added points depending on the content of the error notified from the device adapter, and thereafter at step S2 it is checked whether the statistically added points have exceeded 200 points, the threshold value for determining parity re-creation. If the statistically added points have exceeded 200 points, then at step S3 it is checked whether an instruction on parity re-creation has already been given to the device adapter. If negative, then at step S4 the disk device is determined as being immediately before the occurrence of a failure, and the device adapter is instructed on the parity re-creation. Then, at step S5 it is checked whether the statistically added points have exceeded 255 points, the failure determination point, and since this is negative in this case, the processing comes to an end. In case the failure monitoring processing of FIG. 3 is executed in response to an error notification from the device adapter after the instruction on the parity re-creation has been given to the device adapter at step S4, the instruction is found at step S3 to have already been issued, so the procedure goes to step S5, where it is checked whether the statistically added points have exceeded 255 points. If affirmative, then at step S6 the disk device is determined as having failed, and an instruction on the disconnection of the failed disk device is issued.
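  • A compact sketch of this FIG. 3 flow is given below; the class and the device adapter methods are assumed interfaces for illustration, not the patent's.

```python
RECREATE_THRESHOLD = 200   # threshold for instructing parity re-creation
FAILURE_POINT = 255        # failure determination point

class FailureMonitor:
    def __init__(self, device_adapter):
        self.adapter = device_adapter
        self.points = {}                # statistically added points per disk
        self.recreation_issued = set()  # disks already instructed (step S3)

    def on_error(self, disk_id, error_points):
        # S1: add points in proportion to the content of the notified error.
        self.points[disk_id] = self.points.get(disk_id, 0) + error_points
        score = self.points[disk_id]
        # S2-S4: immediately before failure, instruct parity re-creation once.
        if score > RECREATE_THRESHOLD and disk_id not in self.recreation_issued:
            self.adapter.recreate_parity(disk_id)
            self.recreation_issued.add(disk_id)
        # S5-S6: failure determination, disconnect the failed disk.
        if score > FAILURE_POINT:
            self.adapter.disconnect(disk_id)
```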
  • FIG. 4 is a flowchart of other failure monitoring processing effected by the central processing modules. This embodiment is characterized in that if the error notification from the device adapter is a SMART abnormality notification, the disk device is determined as being about to fail, with the result that an instruction on the parity re-creation is issued. That is, in the processing of FIG. 4, at step S1 it is checked whether the error notification from the device adapter is a SMART abnormality notification or not. Since the disk devices 22-1 to 22-4 are ordinarily provided with a SMART system, the disk device is determined as being immediately before the occurrence of a failure when the notification is a SMART abnormality notification, and at step S5 an instruction on the parity re-creation is given to the device adapter. Similarly to the failure monitoring processing of FIG. 3, the processings of steps S2 to S7 include adding points to the statistically added points depending on the content of the error in response to the error notification from the device adapter, and issuing an instruction on the parity re-creation when the statistically added points exceed 200 points. It is to be noted that although in FIG. 4 the instruction on the parity re-creation is issued at the earlier of the point of time where the statistically added points exceed 200 points and the point of time where the SMART abnormality notification is received, the SMART abnormality notification alone may trigger the instruction on the parity re-creation without using the statistically added points.
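  • The FIG. 4 variant changes only the entry point: a SMART abnormality notification triggers the parity re-creation instruction at once, the point accumulation remaining as before. A sketch continuing the assumed FailureMonitor above, with a hypothetical notification format:

```python
def on_notification(monitor, disk_id, notification):
    # Hypothetical format: {"smart_alert": bool, "points": int}.
    if notification.get("smart_alert") and disk_id not in monitor.recreation_issued:
        monitor.adapter.recreate_parity(disk_id)             # step S5 of FIG. 4
        monitor.recreation_issued.add(disk_id)
    else:
        monitor.on_error(disk_id, notification["points"])    # steps S2 to S7
```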
  • FIG. 5 is a flowchart of parity re-creation processing effected by the device adapter of FIG. 1. In FIG. 5, the parity re-calculation processing is started up based on a parity re-creation instruction from the central processing modules. First, at step S1, a logical block address is set as the stripe, and at step S2 user data are read from the disk devices other than the disk device for parity. Then at step S3 it is checked whether the data read has succeeded or not. In case of having failed in the data read, at step S9 error termination processing results owing to the incapability of calculating the parity. In case the data read has succeeded at step S3, then at step S4 the parity is re-calculated through the EXOR operation of the user data. Then at step S5 the re-calculated parity is written into the corresponding disk device. If at step S6 the parity write-in has succeeded, then at step S7 it is checked whether the final logical block address has been reached or not. If the final logical block address has not been reached, the procedure goes back to step S1 to set the next logical block address for repetition of the same processings. If the final logical block address has been reached, the series of processings comes to an end. In case of having failed in the parity write-in at step S6, if the cause of the failure is a medium abnormality, then at step S8 the replacement processing is executed and the parity is again written into the replaced area.
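  • The FIG. 5 loop might look as follows; the array and disk interfaces (layout, read, write, reassign) and the MediumError type are assumptions standing in for the device adapter's internals, and xor_blocks is the helper defined earlier.

```python
class MediumError(IOError):
    """Hypothetical: raised by a disk stub on a medium defect during write."""

def recreate_parity_sweep(array, logical_block_addresses):
    for lba in logical_block_addresses:                    # S1: next stripe / LBA
        data_disks, parity_disk = array.layout(lba)        # parity disk rotates per stripe
        try:
            blocks = [d.read(lba) for d in data_disks]     # S2: read user data
        except IOError:
            raise RuntimeError("parity cannot be calculated")  # S3 -> S9: error end
        parity = xor_blocks(blocks)                        # S4: EXOR of the user data
        try:
            parity_disk.write(lba, parity)                 # S5: write re-created parity
        except MediumError:
            parity_disk.reassign(lba)                      # S8: replacement processing
            parity_disk.write(lba, parity)                 # rewrite into the replaced area
```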
  • FIGS. 6A and 6B are flowcharts of other parity re-creation processing effected by the device adapter of FIG. 1. This parity re-calculation processing is characterized in that, in case of having failed in the read-out of a single piece of user data, the data having failed in the read-out is recovered and the parity re-creation is made in accordance with the RAID5 redundant configuration. In FIGS. 6A and 6B, at step S1 a logical block address is set as the stripe, and then at step S2 user data are read from the disk devices other than the disk device for parity. Then, at step S3 it is checked whether the data read-out has succeeded or not. In case of having failed in the data read-out, it is checked at step S4 whether only a single piece of user data has failed in the read-out. If a single piece of user data has failed in the read-out, then at step S5 the parity is read out to recover the user data having failed in the read-out through the EXOR operation of that parity and the normally read-out remaining two pieces of user data. Then at step S6 the parity is re-calculated by EXORing the user data. In case two or more pieces of user data have failed in the read-out at step S4, error ending processing results at step S11 owing to the incapability of recovery. After the re-calculation of the parity at step S6, the parity is written into the corresponding disk device at step S7, and if the parity write-in has succeeded at step S8, it is checked at step S9 whether the final logical block address has been reached or not. Subsequently, the processings from step S1 are repeated until the final logical block address is reached. Alternatively, in case of having failed in the parity write-in at step S8, if a medium abnormality is determined at step S10, then the replacement processing is executed to recover from the medium abnormality, after which the parity is again written into the replaced area. Although the above embodiments have been directed to magnetic disk devices as the disk devices by way of example, any other proper storage devices could be employed. The present invention is not intended to be restricted to the above embodiments, but encompasses any proper variants without impairing the objects and advantages thereof. Further, the present invention is not limited by the numerical values indicated in the above embodiments.
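  • The refinement of FIGS. 6A and 6B replaces the plain read of step S2 with a recovering read: if exactly one user-data block fails, it is rebuilt from the parity and the surviving blocks before the parity is re-calculated. A sketch under the same assumed interfaces:

```python
def read_stripe_with_recovery(data_disks, parity_disk, lba):
    blocks, failed = [], 0
    for d in data_disks:
        try:
            blocks.append(d.read(lba))        # S2: read user data
        except IOError:
            failed += 1                       # note the failed read
    if failed >= 2:
        raise RuntimeError("unrecoverable: two or more reads failed")  # S4 -> S11
    if failed == 1:
        parity = parity_disk.read(lba)        # S5: read the old parity
        blocks.append(xor_blocks(blocks + [parity]))  # rebuild the missing block
    return blocks                             # S6: new parity = EXOR of these blocks
```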

Claims (17)

1. A storage control apparatus configured to write plural pieces of user data into (n-1) storage devices of n storage devices and to write parity data calculated from the plural pieces of user data into remaining one (1) storage device, the storage control apparatus comprising:
a failure monitoring unit arranged to add points in proportion to detected abnormality to find statistically added points for each of the storage devices, the failure monitoring unit issuing an instruction to re-create parity when the statistically added points come closer to a predefined failure determination point; and
a parity re-creation unit arranged, when receiving the instruction to re-create parity from the failure monitoring unit, to read the plural pieces of user data from the (n-1) storage devices to re-calculate parity data for write into the remaining one (1) storage device.
2. The storage control apparatus of claim 1, wherein the failure monitoring unit issues an instruction to initiate re-creation of parity when the statistically added points reach a given threshold value obtained by multiplying the failure determination point by a coefficient less than 1.
3. The storage control apparatus of claim 1, wherein the failure monitoring unit issues an instruction to initiate re-creation of parity when the statistically added points reach a given threshold value obtained by multiplying the failure determination point by a coefficient in the range of 0.7 to 0.9.
4. The storage control apparatus of claim 1, wherein the failure monitoring unit issues an instruction to initiate re-creation of parity when a self-diagnostic abnormality based on the SMART feature is output from one of the plurality of storage devices.
5. The storage control apparatus of claim 1, wherein the plurality of storage devices have a redundant configuration of RAID5.
6. The storage control apparatus of claim 1, further comprising a channel adapter connecting to a host, a device adapter connecting the plurality of storage devices to one another, and a central processing module interposed between the channel adapter and the device adapter, wherein
the failure monitoring unit is disposed in the central processing module, wherein
the parity re-creation unit is disposed in the device adapter, and wherein
the central processing module instructs the device adapter on re-creation of parity for execution.
7. A storage control method writing plural pieces of user data into (n-1) storage devices of n storage devices and writing parity data calculated from the plural pieces of user data into remaining one (1) storage device, the storage control method comprising:
a failure monitoring step of issuing an instruction to re-create parity depending on the degree of detected abnormality for each of the storage devices; and
a parity re-creation step, when receiving the instruction to re-create parity from the failure monitoring step, of reading the plural pieces of user data from the (n-1) storage devices to re-calculate parity data for write into the remaining one (1) storage device.
8. The storage control method of claim 7, wherein the failure monitoring step includes issuing an instruction to initiate re-creation of parity when the statistically added points reach a given threshold value obtained by multiplying the failure determination point by a coefficient less than 1.
9. The storage control method of claim 7, wherein the failure monitoring step includes issuing an instruction to initiate re-creation of parity when the statistically added points reach a given threshold value obtained by multiplying the failure determination point by a coefficient in the range of 0.7 to 0.9.
10. The storage control method of claim 7, wherein the failure monitoring step includes issuing an instruction to initiate re-creation of parity when a self-diagnostic abnormality based on the SMART feature is output from one of the plurality of storage devices.
11. The storage control method of claim 7, wherein the plurality of storage devices have a redundant configuration of RAID5.
12. The storage control method of claim 7, in which are disposed a channel adapter connecting to a host, a device adapter connecting the plurality of storage devices to one another, and a central processing module interposed between the channel adapter and the device adapter, wherein
the failure monitoring step is processed by the central processing module, wherein
the parity re-creation step is processed by the device adapter, and wherein
the central processing module instructs the device adapter on re-creation of parity for execution.
13. A program operable to drive a computer of a storage control apparatus writing plural pieces of user data into (n-1) storage devices of n storage devices and writing parity data calculated from the plural pieces of user data into remaining one (1) storage device to execute:
a failure monitoring step of adding points in proportion to detected abnormality to find statistically added points for each of the storage devices, and issuing an instruction to re-create parity when the statistically added points come closer to a predefined failure determination point; and
a parity re-creation step, when receiving the instruction to re-create parity from the failure monitoring step, of reading the plural pieces of user data from the (n-1) storage devices to re-calculate parity data for write into the remaining one (1) storage device.
14. The program of claim 13, wherein the failure monitoring step includes issuing an instruction to initiate re-creation of parity when the statistically added points reach a given threshold value obtained by multiplying the failure determination point by a coefficient less than 1.
15. The program of claim 13, wherein the failure monitoring step includes issuing an instruction to initiate re-creation of parity when the statistically added points reach a given threshold value obtained by multiplying the failure determination point by a coefficient in the range of 0.7 to 0.9.
16. The program of claim 13, wherein the failure monitoring step includes issuing an instruction to initiate re-creation of parity when a self-diagnostic abnormality based on the SMART feature is output from one of the plurality of storage devices.
17. The program of claim 13, wherein the plurality of storage devices have a redundant configuration of RAID5.
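
Purely as an illustrative sketch of the monitoring scheme recited in claims 1 through 4, the following Python class accumulates penalty points per device and triggers parity re-creation at a threshold below the failure determination point. The failure_point value of 255 and the coefficient of 0.8 are assumed figures, the latter merely falling within the 0.7 to 0.9 range of claim 3; instruct_parity_recreation is a hypothetical stand-in for the instruction sent to the parity re-creation unit.

    class FailureMonitor:
        """Add points per detected abnormality and instruct parity re-creation early."""

        def __init__(self, failure_point=255, coefficient=0.8):
            self.failure_point = failure_point               # point of failure disconnection (assumed value)
            self.threshold = failure_point * coefficient     # trigger below the failure determination point
            self.points = {}                                 # statistically added points per device

        def report_abnormality(self, device_id, weight):
            """Add points in proportion to the detected abnormality (claim 1)."""
            self.points[device_id] = self.points.get(device_id, 0) + weight
            if self.points[device_id] >= self.threshold:
                self.instruct_parity_recreation(device_id)

        def report_smart_alarm(self, device_id):
            """A SMART self-diagnostic alarm triggers re-creation directly (claim 4)."""
            self.instruct_parity_recreation(device_id)

        def instruct_parity_recreation(self, device_id):
            """Stand-in for the instruction issued to the parity re-creation unit."""
            print(f"re-create parity: device {device_id} nearing failure determination point")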
US11/008,143 2004-09-10 2004-12-10 Apparatus, method and program for the control of storage Expired - Fee Related US7395451B2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2004-263833 2004-09-10
JP2004263833A JP2006079418A (en) 2004-09-10 2004-09-10 Storage control apparatus, control method and program

Publications (2)

Publication Number Publication Date
US20060069947A1 true US20060069947A1 (en) 2006-03-30
US7395451B2 US7395451B2 (en) 2008-07-01

Family

ID=36100606

Family Applications (1)

Application Number Title Priority Date Filing Date
US11/008,143 Expired - Fee Related US7395451B2 (en) 2004-09-10 2004-12-10 Apparatus, method and program for the control of storage

Country Status (4)

Country Link
US (1) US7395451B2 (en)
JP (1) JP2006079418A (en)
KR (1) KR100711165B1 (en)
CN (1) CN100353328C (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2007179297A (en) * 2005-12-28 2007-07-12 Fujitsu Ltd Method, device and program for reinforcing parity check of raid 5
KR100809300B1 (en) * 2006-08-07 2008-03-04 삼성전자주식회사 Method for organizing RAID system having changeable storage
JP4402711B2 (en) * 2007-11-05 2010-01-20 富士通株式会社 Disk array device, disk array device control method, disk array device control program, and disk array control device
JP4862847B2 (en) * 2008-03-07 2012-01-25 日本電気株式会社 Disk array data recovery method, disk array system, and control program
JP2010128773A (en) * 2008-11-27 2010-06-10 Nec Fielding Ltd Disk array device, disk control method therefor, and disk control program therefor
JP2010238124A (en) * 2009-03-31 2010-10-21 Fujitsu Ltd Data management program, data management device and data managing method
CN103019618A (en) * 2012-11-29 2013-04-03 浪潮电子信息产业股份有限公司 Overall hot backup method for multiple controllers
US9535612B2 (en) 2013-10-23 2017-01-03 International Business Machines Corporation Selecting a primary storage device
US10929226B1 (en) 2017-11-21 2021-02-23 Pure Storage, Inc. Providing for increased flexibility for large scale parity
CN110442298B (en) * 2018-05-02 2021-01-12 杭州海康威视系统技术有限公司 Storage equipment abnormality detection method and device and distributed storage system
CN108986869B (en) * 2018-07-26 2021-04-30 南京群顶科技有限公司 Disk fault detection method using multi-model prediction
CN109358809B (en) * 2018-09-28 2020-07-24 方一信息科技(上海)有限公司 RAID data storage system and method

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1035573A (en) 1988-10-07 1989-09-13 赏宝荣 Gas alarm
JP2618078B2 (en) 1990-07-09 1997-06-11 富士通株式会社 Array disk controller
JP3384016B2 (en) 1993-02-19 2003-03-10 富士ゼロックス株式会社 Document editing management device
JP3053153B2 (en) 1993-09-20 2000-06-19 株式会社日立製作所 How to start application of document management system
US5623595A (en) * 1994-09-26 1997-04-22 Oracle Corporation Method and apparatus for transparent, real time reconstruction of corrupted data in a redundant array data storage system
JPH0962658A (en) 1995-08-21 1997-03-07 Hitachi Inf Syst Ltd Inter-document link processing system
JPH09167120A (en) 1995-12-15 1997-06-24 Denso Corp Error correction device for storage device
KR100208801B1 (en) * 1996-09-16 1999-07-15 윤종용 Storage device system for improving data input/output performance and data recovery information cache method
JPH10247134A (en) 1997-03-05 1998-09-14 Nec Corp Fault processing circuit for disk array device for direct connection bus
JP3063666B2 (en) 1997-03-31 2000-07-12 日本電気株式会社 Array disk controller
JPH10283122A (en) 1997-04-02 1998-10-23 Sony Corp Disk type data recording and reproducing device
JPH11345095A (en) 1998-06-02 1999-12-14 Toshiba Corp Disk array device and control method therefor
US6240429B1 (en) 1998-08-31 2001-05-29 Xerox Corporation Using attached properties to provide document services
JP2000339206A (en) 1999-05-27 2000-12-08 Cadix Inc Electronic file managing method and computer readable recording medium storing program to manage electronic files
US6993701B2 (en) 2001-12-28 2006-01-31 Network Appliance, Inc. Row-diagonal parity technique for enabling efficient recovery from double failures in a storage array
JP2004022136A (en) 2002-06-19 2004-01-22 Sony Corp Data recording and reproducing device and method, and digital camera
US7024586B2 (en) * 2002-06-24 2006-04-04 Network Appliance, Inc. Using file system information in raid data reconstruction and migration
JP4286634B2 (en) 2002-11-20 2009-07-01 パナソニック株式会社 Memory failure relief circuit
KR20040066638A (en) * 2003-01-20 2004-07-27 삼성전자주식회사 Parity Storing Method And Error block recovering Method In External Storage Sub-system

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5892780A (en) * 1995-01-26 1999-04-06 International Business Machines Corporation Data storage system and parity generation method for data storage system
US6195760B1 (en) * 1998-07-20 2001-02-27 Lucent Technologies Inc Method and apparatus for providing failure detection and recovery with predetermined degree of replication for distributed applications in a network
US6553511B1 (en) * 2000-05-17 2003-04-22 Lsi Logic Corporation Mass storage data integrity-assuring technique utilizing sequence and revision number metadata
US7043663B1 (en) * 2001-11-15 2006-05-09 Xiotech Corporation System and method to monitor and isolate faults in a storage area network
US20030105767A1 (en) * 2001-11-22 2003-06-05 Koji Sonoda Storage system and control method
US20040260967A1 (en) * 2003-06-05 2004-12-23 Copan Systems, Inc. Method and apparatus for efficient fault-tolerant disk drive replacement in raid storage systems
US20040250161A1 (en) * 2003-06-09 2004-12-09 Brian Patterson Method and apparatus for data reconstruction
US7058762B2 (en) * 2003-06-09 2006-06-06 Hewlett-Packard Development Company, L.P. Method and apparatus for selecting among multiple data reconstruction techniques

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7913108B1 (en) * 2006-03-28 2011-03-22 Emc Corporation System and method for improving disk drive performance during high frequency vibration conditions
US20100115143A1 (en) * 2006-07-10 2010-05-06 Akio Nakajima Storage control system, control method for storage control system, port selector, and controller
US7761657B2 (en) * 2006-07-10 2010-07-20 Hitachi, Ltd. Storage control system, control method for storage control system, port selector, and controller
US7831767B2 (en) 2006-07-10 2010-11-09 Hitachi, Ltd. Storage control system, control method for storage control system, port selector, and controller
US20080022041A1 (en) * 2006-07-10 2008-01-24 Akio Nakajima Storage control system, control method for storage control system, port selector, and controller
US20080183987A1 (en) * 2007-01-25 2008-07-31 Fujitsu Limited Storage system, storage control method, and storage control program
US9251016B2 (en) 2007-01-25 2016-02-02 Fujitsu Limited Storage system, storage control method, and storage control program
US8489976B2 (en) * 2007-02-21 2013-07-16 Fujitsu Limited Storage controlling device and storage controlling method
US20080201630A1 (en) * 2007-02-21 2008-08-21 Fujitsu Limited Storage controlling device and storage controlling method
US20090013213A1 (en) * 2007-07-03 2009-01-08 Adaptec, Inc. Systems and methods for intelligent disk rebuild and logical grouping of san storage zones
US7900083B2 (en) * 2008-02-27 2011-03-01 Fujitsu Limited Disk array apparatus, disk array control method and disk array controller
US20090217086A1 (en) * 2008-02-27 2009-08-27 Fujitsu Limited Disk array apparatus, disk array control method and disk array controller
US20130024730A1 (en) * 2011-07-22 2013-01-24 Fujitsu Limited Disk control apparatus, method of detecting failure of disk apparatus, and recording medium for disk diagnosis program
US8977892B2 (en) * 2011-07-22 2015-03-10 Fujitsu Limited Disk control apparatus, method of detecting failure of disk apparatus, and recording medium for disk diagnosis program
US9395938B2 (en) 2013-09-09 2016-07-19 Fujitsu Limited Storage control device and method for controlling storage devices
US10001947B1 (en) * 2015-05-08 2018-06-19 American Megatrends, Inc. Systems, methods and devices for performing efficient patrol read operations in a storage system
US11422888B2 (en) * 2020-10-14 2022-08-23 Western Digital Technologies, Inc. Data integrity check for writing data in memory

Also Published As

Publication number Publication date
CN1746854A (en) 2006-03-15
JP2006079418A (en) 2006-03-23
CN100353328C (en) 2007-12-05
US7395451B2 (en) 2008-07-01
KR100711165B1 (en) 2007-04-24
KR20060023933A (en) 2006-03-15

Similar Documents

Publication Publication Date Title
US7395451B2 (en) Apparatus, method and program for the control of storage
US6442711B1 (en) System and method for avoiding storage failures in a storage array system
JP5887757B2 (en) Storage system, storage control device, and storage control method
US7979635B2 (en) Apparatus and method to allocate resources in a data storage library
US7809979B2 (en) Storage control apparatus and method
US5790773A (en) Method and apparatus for generating snapshot copies for data backup in a raid subsystem
US7525749B2 (en) Disk array apparatus and disk-array control method
US8448047B2 (en) 2013-05-21 Storage device, storage control device, data transfer integrated circuit, and storage control method
JP4303187B2 (en) Program, storage control method, and storage device
US6243827B1 (en) Multiple-channel failure detection in raid systems
US8402210B2 (en) Disk array system
US6892276B2 (en) Increased data availability in raid arrays using smart drives
US7779202B2 (en) Apparatus and method for controlling disk array with redundancy and error counting
US6438647B1 (en) Method and apparatus for providing battery-backed immediate write back cache for an array of disk drives in a computer system
US8312315B2 (en) Storage control device and RAID group extension method
US20090313617A1 (en) Method for Updating Control Program of Physical Storage Device in Storage Virtualization System and Storage Virtualization Controller and System Thereof
US8225136B2 (en) Control method and storage device
US7363532B2 (en) System and method for recovering from a drive failure in a storage array
JPH11338648A (en) Disk array device, its error control method, and recording medium where control program thereof is recorded
US10338844B2 (en) Storage control apparatus, control method, and non-transitory computer-readable storage medium
US20070101188A1 (en) Method for establishing stable storage mechanism
US8782465B1 (en) Managing drive problems in data storage systems by tracking overall retry time
US7594051B2 (en) Storage apparatus
US20070036055A1 (en) Device, method and program for recovering from media error in disk array device
JPH09269871A (en) Data re-redundancy making system in disk array device

Legal Events

Date Code Title Description
AS Assignment

Owner name: FUJITSU LIMITED, JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:TAKAHASHI, HIDEO;MAKINO, TSUKASA;REEL/FRAME:016083/0392

Effective date: 20041130

AS Assignment

Owner name: FUJITSU LIMITED, JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:TAKAHASHI, HIDEO;MAKINO, TSUKASA;REEL/FRAME:016728/0253

Effective date: 20041130

STCF Information on status: patent grant

Free format text: PATENTED CASE

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 4

FPAY Fee payment

Year of fee payment: 8

FEPP Fee payment procedure

Free format text: MAINTENANCE FEE REMINDER MAILED (ORIGINAL EVENT CODE: REM.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

LAPS Lapse for failure to pay maintenance fees

Free format text: PATENT EXPIRED FOR FAILURE TO PAY MAINTENANCE FEES (ORIGINAL EVENT CODE: EXP.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

STCH Information on status: patent discontinuation

Free format text: PATENT EXPIRED DUE TO NONPAYMENT OF MAINTENANCE FEES UNDER 37 CFR 1.362

FP Lapsed due to failure to pay maintenance fee

Effective date: 20200701