CN100389383C - Method for solving writing cavity in magnetic disk redundancy array - Google Patents

Method for solving writing cavity in magnetic disk redundancy array Download PDF

Info

Publication number
CN100389383C
CN100389383C CNB2006100836926A CN200610083692A CN100389383C CN 100389383 C CN100389383 C CN 100389383C CN B2006100836926 A CNB2006100836926 A CN B2006100836926A CN 200610083692 A CN200610083692 A CN 200610083692A CN 100389383 C CN100389383 C CN 100389383C
Authority
CN
China
Prior art keywords
write
master disk
disk controller
controller
daily record
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CNB2006100836926A
Other languages
Chinese (zh)
Other versions
CN1862478A (en
Inventor
陈庆议
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
New H3C Technologies Co Ltd
Original Assignee
Hangzhou H3C Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hangzhou H3C Technologies Co Ltd filed Critical Hangzhou H3C Technologies Co Ltd
Priority to CNB2006100836926A priority Critical patent/CN100389383C/en
Publication of CN1862478A publication Critical patent/CN1862478A/en
Application granted granted Critical
Publication of CN100389383C publication Critical patent/CN100389383C/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The present invention discloses a method for solving the problem of writing cavities in redundant arrays of disks, which comprises the steps that a host computer sends a write command to a master disk controller, the master disk controller receives the write command and generates a checking diary, and then, the master disk controller writes the checking diary in a memory and executes write operation; if the master disk controller breaks down in the process of the write operation, a backup disk controller reads the checking diary from the memory to reestablish check data subblocks. The method of the present invention effectively stores the checking diary, rapidly reestablishes the check data subblocks, and effectively solves the problem of writing cavities and the bottleneck of the system performance in redundant arrays of disks. The method of the present invention avoids the defects that a great deal of time is wasted when the problem of writing cavities in redundant arrays of inexpensive disks is solved in the prior art and checking data sub blocks are reestablished, and a great deal of time is also wasted for waiting for information transfer between the master disk controller and the backup disk during write operation.

Description

A kind of method that solves writing cavity in the redundant arrays of inexpensive disks
Technical field
The present invention relates to computing machine read-write control field, be meant the method for writing cavity in a kind of solution redundant arrays of inexpensive disks (Write Hole) especially.
Background technology
At the initial stage of computing machine development, the price of " high capacity " hard disk is also quite high, the main method that solves the data storage safety issue is to use equipment such as magnetic tape station to back up, though this method can guarantee the safety of data, consults with back-up job all quite loaded down with trivial details.1987, three slip-stick artist's co-presents of University of California Berkeley be entitled as the paper of " A Case ofRedundant Array of Inexpensive Disks (Redundant Array of Inexpensive Disc scheme) ", its basic thought is exactly that many less, the relatively inexpensive hard disk drives of capacity are carried out organic assembling, form a kind of in manageability, the virtual disk that capacity, performance, reliabilty and availability are all higher makes its performance surpass the big hard disk of a costliness.This design philosophy is accepted very soon, and from then on raid-array (RAID, Redundant Array OfIndependent Disks) technology has obtained widespread use, and data storage has entered quicker, safer, the more cheap New Times.
The RAID technology is divided into several different grades.Mainly comprise several standards of RAID0~RAID7, can provide different speed respectively, security and cost performance, RAID5 is one of them.The RAID5 technology mainly is presented as on all disks access data and parity information across in read-write control on the disk.As shown in Figure 1, main frame 101 links to each other with the virtual memory of being made up of 5 disks 104 by master disk controller 102, backup diskette controller 103, and master disk controller 102 links to each other by high availability path 10 5 with backup diskette controller 103.Deblocking and checking data piecemeal cross-distribution are on each disk.In per 5 pieces 1 checking data piecemeal is arranged all, corresponding to checking data piecemeal 4, deblocking 6,7,8,9 is corresponding to checking data piecemeal 5 as data piecemeal 0,1,2,3.Generating the checking data piecemeal has two kinds of algorithms, and the checking data piecemeal that adopts two kinds of algorithms that identical deblocking is operated generation is identical.First kind of algorithm be when the deblocking that will write more for a long time, for example write 0,1,2 three data piecemeal,
Figure C20061008369200051
The read-write of input and output for reading legacy data piecemeal 3, is write new data piecemeal 0 in proper order like this, writes new data piecemeal 1, writes new data piecemeal 2, write check deblocking 4.Second kind of algorithm be when the deblocking that will write more after a little while, for example only write 0 deblocking,
Figure C20061008369200052
Figure C20061008369200053
The read-write that needs input and output is like this read old checking data piecemeal 4 in proper order for reading legacy data piecemeal 0, writes new data piecemeal 0, writes new checking data piecemeal 4.When main frame 101 is write 0,1,2 deblockings of virtual disk by master disk controller 102, read, the calculation check process all finishes.If but after having write 0,1 deblocking, write in the process of the 2nd deblocking master disk controller 102 machine of delaying, and the checking data piecemeal of this moment is not also write on the virtual disk, and this checking data piecemeal still is old checking data piecemeal.Backup diskette controller 103 can start carries out write operation, and backup diskette controller 103 may use second kind of algorithm, calculates new checking data piecemeal with old checking data block data.It is wrong obtaining new checking data piecemeal like this.After this carry out data again and write fashionablely, it also is wrong generating new checking data piecemeal with wrong checking data piecemeal.If after this disk failure takes place, will cause data failure, RAID5 writing cavity effect appears.For solving the writing cavity problem, must become the verification daily record to each key element record of uncompleted write operation and preserve, rebuild the checking data piecemeal after being used for.Head it off has following two kinds of methods in the prior art.
A kind of method is to set up high availability path 10 5 between master disk controller 102 and the backup diskette controller 103, before beginning for each write operation, master disk controller 102 is sent to backup diskette controller 103 with the verification daily record by path 10 5, after master disk controller 102 waited for that backup diskette controller 103 is confirmed to receive the verification daily record, master disk controller 102 began to carry out write operation.If master disk controller 102 machine of delaying in writing process, backup diskette controller 103 is rebuild the checking data piecemeal according to the verification daily record.
Another kind method is after master disk controller 102 breaks down, and when main frame is switched to backup diskette controller 103, thinks that all checking data piecemeals are all unreliable.Checking data piecemeals all on the virtual disk is reinitialized one time.
First method need be provided at active and standby with the path 10 5 of setting up a high availability between the Magnetic Disk Controller in the prior art, and itself can not break down this passage at work, otherwise the verification daily record that the master disk controller generates can't be sent to the backup diskette controller.Master disk controller 102 all will stop write operation earlier when each write operation in the preservation process of verification daily record, wait receives after backup diskette controller 103 receives the affirmation information of verification daily record and just carries out write operation, and the efficient of write operation has been produced bigger influence.Also influenced the handling property of master disk controller in the message exchange of being carried out with the backup diskette controller.Second method is owing to will rebuild all checking data piecemeals, and for the bigger situation of data in magnetic disk amount, the time of reconstruction can be longer, to the bigger influence of normal use generation of system.
Summary of the invention
In view of this, the technical matters of the present invention's solution is to provide a kind of method that solves writing cavity in the redundant arrays of inexpensive disks.This method has not only solved the writing cavity problem, and the wait when having avoided the checking data piecemeal all to rebuild a large amount of time of waste and write operation at interval.
For addressing the above problem, the invention provides a kind of method that solves writing cavity in the redundant arrays of inexpensive disks, comprising:
A, main frame send write order to the master disk controller;
B, master disk controller are received write order, generate the verification daily record;
C, master disk controller are with in the verification daily record write store and carry out write operation;
If quit work verification daily record in the backup diskette controller readout memory in the d master disk controller write operation process;
E, backup diskette controller are rebuild the checking data piecemeal according to the verification daily record;
F, main frame overwriting data piecemeal.
Wherein, among the described step c in write store and the described steps d readout memory be in a block RAM, to write, read.
Wherein, among the described step c in write store and the described steps d readout memory be in two block RAMs, to write, read in the block RAM therein.
Described two block RAMs are independently-powered respectively.
Wherein, described step c further comprises: if the master disk controller completes successfully write operation, then send the information of writing successfully to main frame.
The present invention also provides a kind of method that solves writing cavity in the redundant arrays of inexpensive disks, it is characterized in that, comprising:
A, main frame send write order to storage control module;
B, storage control module are sent to the master disk controller with write order, and the verification daily record that self generates is sent to the backup diskette controller;
C, master disk controller are carried out write operation according to write order, and the backup diskette controller is written to the verification daily record in the internal memory of self;
If d master disk controller quits work in the process of write operation, the backup diskette controller is rebuild the checking data piecemeal according to the verification daily record, and the verification deblocking is write on the disk again.
Wherein, described step c further comprises:
C71, master disk controller are carried out write operation success back and are sent the information of writing successfully to storage control module, and backup diskette controller write check daily record success back sends the information of writing successfully to storage control module;
If c72, storage control module send the information of writing successfully to main frame after receiving the information of writing successfully of master disk controller and backup diskette controller, write failure information otherwise send to main frame.
Wherein, described storage control module is software module in the main frame.
Wherein, described storage control module is independent equipment.
The present invention compares with prior art, preserves the verification daily record effectively, and rebuilds the checking data piecemeal apace, solves writing cavity problem and system performance bottleneck in the redundant arrays of inexpensive disks efficiently.The stand-by period that information is transmitted during at write operation between plenty of time of having wasted when having avoided prior art to rebuild the checking data piecemeal when solving redundant arrays of inexpensive disks writing cavity problem and the primary, spare Magnetic Disk Controller.
Description of drawings
Fig. 1 is the system construction drawing that prior art solves the RAID5 writing cavity;
Fig. 2 is the process flow diagram of the inventive method;
Fig. 3 is the process flow diagram of the inventive method embodiment one;
Fig. 4 is the process flow diagram of the inventive method embodiment two.
Embodiment
Method of the present invention can be preserved the verification daily record effectively, rebuilds the checking data piecemeal by the verification daily record and solves the writing cavity problem, improves the service efficiency of system efficiently.Avoid the wait interval of read/writing control method when all rebuilding a large amount of time of checking data piecemeal waste and write operation of prior art.
Referring to Fig. 2, method of the present invention comprises:
Step S201: main frame sends write request;
Step S202: master disk controller write check daily record is to shared drive and carry out write operation;
Step S203: the master disk controller is delayed behind the machine, the reader check daily record and rebuild the checking data piecemeal according to the verification daily record from shared drive of backup diskette controller.
Describe method workflow of the present invention in detail below in conjunction with Fig. 3.Suppose that each itemize has 5 piecemeals in the present RAID5 system, the branch block size is 30k, data itemize 0={ deblocking 0, deblocking 1, deblocking 2, deblocking 3, checking data piecemeal 4}={1,1,1,1,0};
Step S301: main frame is initiated { start address=0, size=30k, the write request of deblocking 0=0} to the master disk controller.
Step S302: the master disk controller generates verification daily record, { start address=0, size=30k}
Step S303: the master disk controller writes the verification daily record among the RAM;
Step S304: the master disk controller is carried out write operation.
Master disk controller calculation check deblocking,
Figure C20061008369200081
Figure C20061008369200082
According to write request computational data itemize.Because the checking data piecemeal is invisible to the user, data itemize size=minute block size 30k * (5-1)=120k.Data itemize=start address 0/ itemize size=0/120=0 then.
The master disk controller finds data itemize 0, carry out write operation and write that { at this moment the piecemeal situation { 0 for deblocking 0=0, checking data piecemeal=1}, 1,1,1,1}, behind the intact write operation of master disk controller correct execution, execution in step S305, machine, then execution in step S306 if the master disk controller is delayed in carrying out the write operation process.
Step S305: the master disk controller returns the information write successfully to main frame.
Step S306: the backup diskette controller is read the verification daily record among the RAM.
Suppose that the master disk controller write the machine of delaying behind the deblocking 0, this moment, the checking data piecemeal was not also write on the disk, this moment, the piecemeal situation was: { 0,1,1,1,0}, this moment, the checking data piecemeal was wrong, so the master disk controller can not send the information write successfully to main frame, and the backup diskette controller is the reader check daily record from shared drive.
Step S307: the backup diskette controller is rebuild the checking data piecemeal according to the verification daily record.
The backup diskette controller recomputates the verification itemize according to the verification daily record, finds the checking data piecemeal, recomputates the checking data piecemeal,
Figure C20061008369200091
Figure C20061008369200092
Figure C20061008369200093
At this moment the piecemeal situation 0,1,1,1,1}, this moment the checking data piecemeal be again correct.
Step S308: main frame overwriting data piecemeal.
Because main frame is not received the information write successfully, can retransmit write request, the backup diskette controller can be according to write request overwriting data piecemeal 0 and checking data piecemeal 4.
For RAM storer among this embodiment can be one or two, when adopting two independently-powered RAM to work, the master disk controller is written to the verification daily record in two block RAMs, and after the master disk controller quit work, the backup diskette controller is the reader check daily record from a block RAM.If in the middle of this process, when a block RAM broke down cisco unity malfunction, an other block RAM can be protected the verification daily record effectively, and the backup diskette controller still can the reader check daily record be rebuild the checking data piecemeal from an other block RAM.
The present invention is a kind of incessantly in the method that solves writing cavity by preservation verification daily record, also can realize by adding the memory controller module, describes another embodiment of method of the present invention in detail below in conjunction with accompanying drawing 4.Suppose that each itemize has 5 piecemeals in the present RAID5 system, the branch block size is 30k, data itemize 0={ deblocking 0, deblocking 1, deblocking 2, deblocking 3, checking data piecemeal 4}={1,1,1,1,0};
Step S401: main frame sends { start address=0, size=30k, deblocking 0=0} write request to storage control module;
Step S402: { start address=0, size=30k} were sent to the master disk controller with write order, simultaneously the verification daily record are sent to the backup diskette controller in generation verification daily record after storage control module was received write order;
Step S403: master disk control receives that write order carries out write operation, and the backup diskette controller receives that the verification daily record writes in the internal memory simultaneously.
Master disk controller calculation check deblocking,
Figure C20061008369200102
According to write request computational data itemize.Because the checking data piecemeal is invisible to the user, data itemize size=minute block size 30k * (5-1)=120k.Data itemize=start address 0/ itemize size=0/120k=0 then.
The master disk controller finds data itemize 0, carries out write operation, will deblocking 0=0, checking data piecemeal 4=1} are written in the disk array, and at this moment the piecemeal situation be 0,1,1,1,1}.
{ deblocking 0=0} is saved in the local internal memory backup diskette controller for start address=0, size=30k with the verification daily record.
After master disk controller and backup diskette controller are received the information of storage control module transmission, concurrent working simultaneously, order in no particular order.
Step S404: the master disk controller machine of delaying.
Step S405: the backup diskette controller is rebuild the checking data piecemeal.
The machine if the master disk controller is delayed in the process of work, then the backup diskette controller is rebuild the checking data piecemeal according to the verification daily record of receiving.
The backup diskette controller recomputates the verification itemize according to the verification daily record, finds the checking data piecemeal, recomputates the checking data piecemeal.
Figure C20061008369200103
Figure C20061008369200104
Figure C20061008369200105
At this moment the piecemeal situation 0,1,1,1,1}, this moment the checking data piecemeal be again correct.
Step S406: master disk control is returned and is write success, and the backup diskette controller returns writes success.
If primary, spare Magnetic Disk Controller has all completed successfully write operation, then primary, spare Magnetic Disk Controller returns the information write successfully to storage control module;
Step S407: have only storage control module to receive the information of writing successfully of master disk controller and backup diskette controller, just return the information write successfully to main frame; Otherwise return to main frame and to write failure information, write request can take place in main frame again.
For method of the present invention, the form of realization is diversified.Within the spirit and principles in the present invention all, any modification of being done, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (9)

1. a method that solves writing cavity in the redundant arrays of inexpensive disks is characterized in that, comprising:
A, main frame send write order to the master disk controller;
B, master disk controller are received write order, generate the verification daily record;
C, master disk controller are with in the verification daily record write store and carry out write operation;
If quit work verification daily record in the backup diskette controller readout memory in the d master disk controller write operation process;
E, backup diskette controller are rebuild the checking data piecemeal according to the verification daily record;
F, main frame overwriting data piecemeal.
2. method according to claim 1 is characterized in that, among the described step c in write store and the described steps d readout memory be in a block RAM, to write, read.
3. method according to claim 1 is characterized in that, among the described step c in write store and the described steps d readout memory be in two block RAMs, to write, read in the block RAM therein.
4. according to the described method of claim 3, it is characterized in that described two block RAMs are independently-powered respectively.
5. method according to claim 1 is characterized in that, described step c further comprises: if the master disk controller completes successfully write operation, then send the information of writing successfully to main frame.
6. a method that solves writing cavity in the redundant arrays of inexpensive disks is characterized in that, comprising:
A, main frame send write order to storage control module;
B, storage control module are sent to the master disk controller with write order, and the verification daily record that self generates is sent to the backup diskette controller;
C, master disk controller are carried out write operation according to write order, and the backup diskette controller is written to the verification daily record in the internal memory of self;
If d master disk controller quits work in the process of write operation, the backup diskette controller is rebuild the checking data piecemeal according to the verification daily record, and the verification deblocking is write on the disk again.
7. method according to claim 6 is characterized in that, described step c further comprises:
C71, master disk controller are carried out write operation success back and are sent the information of writing successfully to storage control module, and backup diskette controller write check daily record success back sends the information of writing successfully to storage control module;
If c72, storage control module send the information of writing successfully to main frame after receiving the information of writing successfully of master disk controller and backup diskette controller, write failure information otherwise send to main frame.
8. method according to claim 6 is characterized in that, described storage control module is software module in the main frame.
9. method according to claim 6 is characterized in that, described storage control module is independent equipment.
CNB2006100836926A 2006-06-02 2006-06-02 Method for solving writing cavity in magnetic disk redundancy array Active CN100389383C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2006100836926A CN100389383C (en) 2006-06-02 2006-06-02 Method for solving writing cavity in magnetic disk redundancy array

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2006100836926A CN100389383C (en) 2006-06-02 2006-06-02 Method for solving writing cavity in magnetic disk redundancy array

Publications (2)

Publication Number Publication Date
CN1862478A CN1862478A (en) 2006-11-15
CN100389383C true CN100389383C (en) 2008-05-21

Family

ID=37389922

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2006100836926A Active CN100389383C (en) 2006-06-02 2006-06-02 Method for solving writing cavity in magnetic disk redundancy array

Country Status (1)

Country Link
CN (1) CN100389383C (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5029513B2 (en) * 2008-06-30 2012-09-19 ソニー株式会社 Information processing apparatus, information processing apparatus control method, and program
CN101488077B (en) * 2009-02-24 2011-08-24 浪潮电子信息产业股份有限公司 Redundancy magnetic disk control server system
CN102541471B (en) * 2011-12-28 2014-12-10 创新科软件技术(深圳)有限公司 Storage system with multiple controllers

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5604902A (en) * 1995-02-16 1997-02-18 Hewlett-Packard Company Hole plugging garbage collection for a data storage system
US5819109A (en) * 1992-12-07 1998-10-06 Digital Equipment Corporation System for storing pending parity update log entries, calculating new parity, updating the parity block, and removing each entry from the log when update is complete
US5860091A (en) * 1996-06-28 1999-01-12 Symbios, Inc. Method and apparatus for efficient management of non-aligned I/O write request in high bandwidth raid applications
US6523087B2 (en) * 2001-03-06 2003-02-18 Chaparral Network Storage, Inc. Utilizing parity caching and parity logging while closing the RAID5 write hole
US6629211B2 (en) * 2001-04-20 2003-09-30 International Business Machines Corporation Method and system for improving raid controller performance through adaptive write back/write through caching
US20050066124A1 (en) * 2003-09-24 2005-03-24 Horn Robert L. Method of RAID 5 write hole prevention
CN1704910A (en) * 2004-06-03 2005-12-07 华为技术有限公司 Write handling method for disc array arrangement

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5819109A (en) * 1992-12-07 1998-10-06 Digital Equipment Corporation System for storing pending parity update log entries, calculating new parity, updating the parity block, and removing each entry from the log when update is complete
US5604902A (en) * 1995-02-16 1997-02-18 Hewlett-Packard Company Hole plugging garbage collection for a data storage system
US5860091A (en) * 1996-06-28 1999-01-12 Symbios, Inc. Method and apparatus for efficient management of non-aligned I/O write request in high bandwidth raid applications
US6523087B2 (en) * 2001-03-06 2003-02-18 Chaparral Network Storage, Inc. Utilizing parity caching and parity logging while closing the RAID5 write hole
US6629211B2 (en) * 2001-04-20 2003-09-30 International Business Machines Corporation Method and system for improving raid controller performance through adaptive write back/write through caching
US20050066124A1 (en) * 2003-09-24 2005-03-24 Horn Robert L. Method of RAID 5 write hole prevention
CN1704910A (en) * 2004-06-03 2005-12-07 华为技术有限公司 Write handling method for disc array arrangement

Also Published As

Publication number Publication date
CN1862478A (en) 2006-11-15

Similar Documents

Publication Publication Date Title
US7809979B2 (en) Storage control apparatus and method
JP3742494B2 (en) Mass storage device
CN103136074B (en) Data storage method and data storage system of multiple disk array systems
CN100407121C (en) Information processing system, primary storage device, and computer readable recording medium recorded thereon logical volume restoring program
US20090271659A1 (en) Raid rebuild using file system and block list
CN101351760A (en) Method and system for accessing auxiliary data in power-efficient high-capacity scalable storage
CN102207895B (en) Data reconstruction method and device of redundant array of independent disk (RAID)
CN100349127C (en) Embedded system data back-up device and method
CN101567211A (en) Method for improving usability of disk and disk array controller
CN104035830A (en) Method and device for recovering data
CN1746854A (en) The device, method and the program that are used for control store
CN101436149A (en) Method for rebuilding data of magnetic disk array
US20070101188A1 (en) Method for establishing stable storage mechanism
CN102177496A (en) System and method for transferring data between different RAID data storage types for current data and replay data
CN103049400A (en) Disk reconfiguration method and disk reconfiguration device
CN103049220A (en) Storage control method, storage control device and solid-state storage system
CN103019893A (en) Multi-disk fault-tolerant two-dimensional hybrid disk RAID4 system architecture and read-write method thereof
US7260739B2 (en) Method, apparatus and program storage device for allowing continuous availability of data during volume set failures in a mirrored environment
CA2694819C (en) Redundancy in array storage system based on minimum distances of donor/recipient arrays
CN100389383C (en) Method for solving writing cavity in magnetic disk redundancy array
JP2003316525A (en) Disk array controller
JP4698710B2 (en) Storage system and power saving method thereof
JPH1069357A (en) Storage network control method
CN101794246A (en) Backup storage system, backup system, data backup method and recovery method
CN102541472A (en) Method and device for reconstructing RAID (Redundant Array of Independent Disks)

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: 310052 Binjiang District Changhe Road, Zhejiang, China, No. 466, No.

Patentee after: Xinhua three Technology Co., Ltd.

Address before: 310053 Hangzhou hi tech Industrial Development Zone, Zhejiang province science and Technology Industrial Park, No. 310 and No. six road, HUAWEI, Hangzhou production base

Patentee before: Huasan Communication Technology Co., Ltd.