WO2006019744A3 - Deterministically active-active failover of redundant servers in a network storage appliance

Info

Publication number
WO2006019744A3
Authority
WO
WIPO (PCT)
Prior art keywords
server
active
heartbeat
backplane
network storage
Prior art date
Application number
PCT/US2005/024710
Other languages
French (fr)
Other versions
WO2006019744A2 (en)
Inventor
Ian Davies
Original Assignee
Dot Hill Systems Corp
Ian Davies
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Dot Hill Systems Corp and Ian Davies
Publication of WO2006019744A2
Publication of WO2006019744A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/20 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2046 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant where the redundant components share persistent storage
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/20 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2023 Failover techniques
    • G06F11/2028 Failover techniques eliminating a faulty processor or activating a spare
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/20 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2035 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant without idle spare hardware
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/20 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2089 Redundant storage control functionality
    • G06F11/2092 Techniques of failing over between control units
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/30 Monitoring
    • G06F11/3003 Monitoring arrangements specially adapted to the computing system or computing system component being monitored
    • G06F11/3006 Monitoring arrangements specially adapted to the computing system or computing system component being monitored where the computing system is distributed, e.g. networked systems, clusters, multiprocessor systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/30 Monitoring
    • G06F11/3055 Monitoring arrangements for monitoring the status of the computing system or of the computing system component, e.g. monitoring if the computing system is on, off, available, not available
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/20 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2002 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where interconnections or communication control functionality are redundant
    • G06F11/2005 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where interconnections or communication control functionality are redundant using redundant communication controllers
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/20 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2002 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where interconnections or communication control functionality are redundant
    • G06F11/2007 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where interconnections or communication control functionality are redundant using redundant communication media
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/20 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/202 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant
    • G06F11/2038 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where processing functionality is redundant with a single idle spare processing component
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/20 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2094 Redundant storage or storage space
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00 Error detection; Error correction; Monitoring
    • G06F11/07 Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16 Error detection or correction of the data by redundancy in hardware
    • G06F11/20 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2097 Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements maintaining the standby controller/processing unit updated
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00 Arrangements for monitoring or testing data switching networks
    • H04L43/10 Active monitoring, e.g. heartbeat, ping or trace-route

Abstract

Deterministic active-active failover of redundant server blades that are hot-pluggable into the backplane of a network storage appliance chassis is disclosed. Each server monitors the other's heartbeat on a respective path in the backplane. Other backplane paths between the two servers enable one server to reliably kill the other and take over its identity on the network when it detects that the other server's heartbeat has stopped. The apparatus is superior to a conventional heartbeat link between servers in separate chassis, such as an Ethernet cable, for two reasons. First, it is not prone to user removal or damage, because the backplane cannot be removed by a user while the appliance is operational. Second, it lets each server know that a true heartbeat failure has occurred, whereas failure of a conventional external heartbeat link causes each server to think the other has failed.
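
The failover behaviour described in the abstract can be illustrated with a small software simulation. This is only a sketch under stated assumptions: the abstract describes hardware paths on a backplane, whereas the `Server` class, the `check_peer` function, the timeout value, and the example addresses below are hypothetical names chosen for illustration and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class Server:
    """Hypothetical model of one of the two redundant server blades."""
    name: str
    identities: set          # network identities this blade answers for
    alive: bool = True       # becomes False once the peer has killed this blade
    last_beat: float = 0.0   # simulated time of the most recent heartbeat

    def beat(self, now: float) -> None:
        # A running blade refreshes its heartbeat; a killed blade cannot.
        if self.alive:
            self.last_beat = now


def check_peer(me: Server, peer: Server, now: float, timeout: float) -> bool:
    """Kill the peer and adopt its identities if its heartbeat has stopped.

    Returns True when a failover was performed. The timeout is an assumed
    parameter; the abstract does not specify how a stopped heartbeat is judged.
    """
    if not me.alive or not peer.alive:
        return False                      # a killed blade never takes over
    if now - peer.last_beat <= timeout:
        return False                      # heartbeat still fresh
    peer.alive = False                    # kill first, so both never serve at once
    me.identities |= peer.identities      # then take over the peer's identities
    peer.identities = set()
    return True


if __name__ == "__main__":
    a = Server("A", {"192.0.2.1"})
    b = Server("B", {"192.0.2.2"})
    for t in (1.0, 2.0):                  # both blades healthy
        a.beat(t)
        b.beat(t)
    for t in (3.0, 4.0, 5.0):             # blade B stops beating after t=2.0
        a.beat(t)
    print(check_peer(a, b, now=5.0, timeout=2.5))
    print(sorted(a.identities))
```

Killing the peer before adopting its identities is the ordering the sketch uses to keep the outcome deterministic: once one blade has been killed, its own `check_peer` call returns False, so the two blades cannot both claim the same identity.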

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/893,718 2004-07-16
US10/893,718 US7627780B2 (en) 2003-04-23 2004-07-16 Apparatus and method for deterministically performing active-active failover of redundant servers in a network storage appliance

Publications (2)

Publication Number Publication Date
WO2006019744A2 WO2006019744A2 (en) 2006-02-23
WO2006019744A3 WO2006019744A3 (en) 2006-09-14

Family

ID=35907862

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2005/024710 WO2006019744A2 (en) 2004-07-16 2005-07-11 Deterministically active-active failover of redundant servers in a network storage appliance

Country Status (2)

Country Link
US (1) US7627780B2 (en)
WO (1) WO2006019744A2 (en)

Families Citing this family (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7565566B2 (en) * 2003-04-23 2009-07-21 Dot Hill Systems Corporation Network storage appliance with an integrated switch
US7334064B2 (en) * 2003-04-23 2008-02-19 Dot Hill Systems Corporation Application server blade for embedded storage appliance
US7644135B2 (en) * 2004-10-25 2010-01-05 Texas Instruments Incorporated Method of improving communications data throughput on embedded systems and reducing the load on the operating system and central processing unit
US20060142874A1 (en) * 2004-12-23 2006-06-29 Pettigrew David D Jr System for reducing electrical wiring in complex apparatus, such as vehicles and aircraft
US7486526B1 (en) * 2005-03-29 2009-02-03 Emc Corporation Techniques for redundant computing from within a 1U form factor with improved serviceability and high availability characteristics
US7809993B2 (en) * 2006-01-17 2010-10-05 International Business Machines Corporation Apparatus, system, and method for receiving digital instructions at devices sharing an identity
US7930529B2 (en) * 2006-12-27 2011-04-19 International Business Machines Corporation Failover of computing devices assigned to storage-area network (SAN) storage volumes
US7861113B2 (en) * 2007-03-16 2010-12-28 Dot Hill Systems Corporation Method and apparatus for operating storage controller system in elevated temperature environment
US20080307005A1 (en) * 2007-06-09 2008-12-11 Pettigrew Jr David D System for reducing electrical wiring in complex apparatus, such as vehicles and aircraft
US8260891B2 (en) * 2007-10-30 2012-09-04 Dell Products L.P. System and method for the provision of secure network boot services
US8306652B2 (en) * 2008-03-14 2012-11-06 International Business Machines Corporation Dual-band communication of management traffic in a blade server system
US8615606B2 (en) * 2008-12-10 2013-12-24 At&T Intellectual Property I, L.P. Methods and apparatus to manipulate services in a distributed business intelligence computing environment
US8037364B2 (en) * 2009-01-09 2011-10-11 International Business Machines Corporation Forced management module failover by BMC impeachment consensus
US8176150B2 (en) * 2009-08-12 2012-05-08 Dell Products L.P. Automated services procurement through multi-stage process
US9182874B2 (en) * 2011-01-31 2015-11-10 Dell Products, Lp System and method for out-of-band communication between a remote user and a local user of a server
US8681606B2 (en) * 2011-08-30 2014-03-25 International Business Machines Corporation Implementing redundancy on infiniband (IB) networks
GB2499822B (en) 2012-02-29 2020-01-08 Metaswitch Networks Ltd Failover processing
US9137141B2 (en) 2012-06-12 2015-09-15 International Business Machines Corporation Synchronization of load-balancing switches
US8938521B2 (en) 2012-08-29 2015-01-20 Lenovo Enterprise Solutions (Singapore) Pte. Ltd. Bi-directional synchronization enabling active-active redundancy for load-balancing switches
US8954780B1 (en) * 2012-10-11 2015-02-10 Symantec Corporation Systems and methods for transferring input/output operations within computer clusters
US9311173B2 (en) 2013-03-12 2016-04-12 Honeywell International Inc. Systems and methods for increasing robustness of a system with a remote server
WO2015023192A1 (en) * 2013-08-14 2015-02-19 Intel Corporation Manageability redundancy for micro server and clustered system-on-a-chip deployments
US10402217B2 (en) * 2014-05-15 2019-09-03 Vmware, Inc. Automatic reconfiguration of a pre-configured hyper-converged computing device
KR102387973B1 (en) * 2015-12-01 2022-04-19 삼성전자주식회사 Duplicated storage device, server system having the same, and operation method thereof
JP6409812B2 (en) * 2016-04-01 2018-10-24 横河電機株式会社 Redundancy apparatus, redundancy system, and redundancy method
US10467100B2 (en) * 2016-08-15 2019-11-05 Western Digital Technologies, Inc. High availability state machine and recovery
WO2018094686A1 (en) * 2016-11-25 2018-05-31 华为技术有限公司 Smb service failure handling method, and storage device
JP2018116477A (en) * 2017-01-18 2018-07-26 富士通株式会社 Information processing apparatus and information processing system
US11300604B2 (en) * 2018-04-06 2022-04-12 Bently Nevada, Llc Monitoring system with serial data lane transmission network
US11009864B2 (en) 2018-04-06 2021-05-18 Bently Nevada, Llc Gated asynchronous multipoint network interface monitoring system
CN110830817A (en) * 2018-08-08 2020-02-21 视联动力信息技术股份有限公司 Video transcoding capacity adjusting method and video transcoding server
CN110069381A (en) * 2019-03-20 2019-07-30 山东超越数控电子股份有限公司 A method of Domestic Platform heartbeat detection is realized by CPLD
US10762773B1 (en) 2019-08-19 2020-09-01 Ademco Inc. Systems and methods for building and using a false alarm predicting model to determine whether to alert a user and/or relevant authorities about an alarm signal from a security system
US11327858B2 (en) 2020-08-11 2022-05-10 Seagate Technology Llc Preserving data integrity during controller failure
CN113691306B (en) * 2021-07-08 2022-09-23 曙光网络科技有限公司 Method, system and storage medium for protecting optical fiber circuit

Citations (4)

Publication number Priority date Publication date Assignee Title
WO2002101573A2 (en) * 2001-06-13 2002-12-19 Intel Corporation Modular server architecture
US20030018927A1 (en) * 2001-07-23 2003-01-23 Gadir Omar M.A. High-availability cluster virtual server system
US6609213B1 (en) * 2000-08-10 2003-08-19 Dell Products, L.P. Cluster-based system and method of recovery from server failures
US20040111559A1 (en) * 2002-12-10 2004-06-10 Thomas Heil Apparatus and method for sharing boot volume among server blades

Family Cites Families (105)

Publication number Priority date Publication date Assignee Title
US4159516A (en) 1976-03-23 1979-06-26 Texas Instruments Incorporated Input/output controller having selectable timing and maskable interrupt generation
US4245344A (en) 1979-04-02 1981-01-13 Rockwell International Corporation Processing system with dual buses
US4245355A (en) * 1979-08-08 1981-01-13 Eaton Corporation Microwave frequency converter
BE887134A (en) 1979-12-14 1981-05-14 Gte Automatic Electric Lab Inc INTERRUPTION EXPANSION CIRCUIT
US5175849A (en) 1988-07-28 1992-12-29 Amdahl Corporation Capturing data of a database system
US5140592A (en) * 1990-03-02 1992-08-18 Sf2 Corporation Disk array system
US5124987A (en) 1990-04-16 1992-06-23 Storage Technology Corporation Logical track write scheduling system for a parallel disk drive array data storage subsystem
GB2273180A (en) 1992-12-02 1994-06-08 Ibm Database backup and recovery.
US5590381A (en) 1994-06-30 1996-12-31 Lucent Technologies Inc. Method and apparatus for buffered video playback of video content distributed on a plurality of disks
US5546272A (en) 1995-01-18 1996-08-13 Dell Usa, L.P. Serial fan cooling subsystem for computer systems
US5790775A (en) 1995-10-23 1998-08-04 Digital Equipment Corporation Host transparent storage controller failover/failback of SCSI targets and associated units
JP3628777B2 (en) 1995-10-30 2005-03-16 株式会社日立製作所 External storage device
US5884098A (en) 1996-04-18 1999-03-16 Emc Corporation RAID controller system utilizing front end and back end caching systems including communication path connecting two caching systems and synchronizing allocation of blocks in caching systems
US5852724A (en) * 1996-06-18 1998-12-22 Veritas Software Corp. System and method for "N" primary servers to fail over to "1" secondary server
US5835694A (en) 1996-12-06 1998-11-10 International Business Machines Corporation Raid-configured disk drive array wherein array control occurs at the disk drive level
US6144887A (en) * 1996-12-09 2000-11-07 Denso Corporation Electronic control unit with reset blocking during loading
US6073209A (en) 1997-03-31 2000-06-06 Ark Research Corporation Data storage controller providing multiple hosts with access to multiple storage subsystems
US6134673A (en) * 1997-05-13 2000-10-17 Micron Electronics, Inc. Method for clustering software applications
US5986880A (en) 1997-06-16 1999-11-16 Compaq Computer Corporation Electronic apparatus having I/O board with cable-free redundant adapter cards thereon
US6085333A (en) 1997-12-19 2000-07-04 Lsi Logic Corporation Method and apparatus for synchronization of code in redundant controllers in a swappable environment
US5996024A (en) 1998-01-14 1999-11-30 Emc Corporation Method and apparatus for a SCSI applications server which extracts SCSI commands and data from message and encapsulates SCSI responses to provide transparent operation
US6728781B1 (en) 1998-05-12 2004-04-27 Cornell Research Foundation, Inc. Heartbeat failure detector method and apparatus
US5964886A (en) 1998-05-12 1999-10-12 Sun Microsystems, Inc. Highly available cluster virtual disk system
US6272591B2 (en) 1998-10-19 2001-08-07 Intel Corporation Raid striping using multiple virtual channels
US6330687B1 (en) 1998-11-13 2001-12-11 Digi-Data Corporation System and method to maintain performance among N single raid systems during non-fault conditions while sharing multiple storage devices during conditions of a faulty host computer or faulty storage array controller
US6260079B1 (en) 1998-11-15 2001-07-10 Hewlett-Packard Company Method and system for enhancing fibre channel loop resiliency for a mass storage enclosure by increasing component redundancy and using shunt elements and intelligent bypass management
JP2000172624A (en) 1998-12-07 2000-06-23 Hitachi Ltd Input/output processing system
JP4132322B2 (en) 1998-12-16 2008-08-13 株式会社日立製作所 Storage control device and control method thereof
EP1026575A3 (en) 1999-02-08 2006-09-06 Hitachi, Ltd. Disk array system and method of changing its configuration
US6289376B1 (en) 1999-03-31 2001-09-11 Diva Systems Corp. Tightly-coupled disk-to-CPU storage server
US6389432B1 (en) 1999-04-05 2002-05-14 Auspex Systems, Inc. Intelligent virtual volume access
WO2000060472A1 (en) 1999-04-06 2000-10-12 Lipstream Networks, Inc. Facilitating real-time, multi-point communications over the internet
US6401170B1 (en) 1999-08-18 2002-06-04 Digi-Data Corporation RAID systems during non-fault and faulty conditions on a fiber channel arbitrated loop, SCSI bus or switch fabric configuration
US7103647B2 (en) 1999-08-23 2006-09-05 Terraspring, Inc. Symbolic definition of a computer system
US6346489B1 (en) 1999-09-02 2002-02-12 Applied Materials, Inc. Precleaning process for metal plug that minimizes damage to low-κ dielectric
US6526477B1 (en) 1999-09-03 2003-02-25 Adaptec, Inc. Host-memory based raid system, device, and method
JP4462697B2 (en) 2000-01-31 2010-05-12 株式会社日立製作所 Storage controller
US20030099254A1 (en) 2000-03-03 2003-05-29 Richter Roger K. Systems and methods for interfacing asynchronous and non-asynchronous data media
US6654831B1 (en) 2000-03-07 2003-11-25 International Business Machine Corporation Using multiple controllers together to create data spans
US6898727B1 (en) 2000-03-22 2005-05-24 Emc Corporation Method and apparatus for providing host resources for an electronic commerce site
US7162542B2 (en) 2000-04-13 2007-01-09 Intel Corporation Cascading network apparatus for scalability
WO2001084313A2 (en) 2000-05-02 2001-11-08 Sun Microsystems, Inc. Method and system for achieving high availability in a networked computer system
US6658504B1 (en) 2000-05-16 2003-12-02 Eurologic Systems Storage apparatus
US6971016B1 (en) 2000-05-31 2005-11-29 International Business Machines Corporation Authenticated access to storage area network
AU2001270104A1 (en) 2000-06-23 2002-01-08 Comsonics Inc. Diving mask with embedded computer system
DE10030329C1 (en) 2000-06-27 2002-01-24 Siemens Ag Redundant control system as well as control computer and peripheral unit for such a control system
IES20010400A2 (en) 2000-07-06 2002-02-06 Richmount Computers Ltd Data gathering device for a rack enclosure
WO2002013458A2 (en) 2000-08-07 2002-02-14 Inrange Technologies Corporation Fibre channel switch
KR100340686B1 (en) 2000-09-19 2002-06-15 오길록 The Apparatus for Redundant Interconnection between Multiple Hosts and RAID
JP4734484B2 (en) 2000-10-19 2011-07-27 新世代株式会社 Information processing apparatus and memory cartridge system
US6785678B2 (en) 2000-12-21 2004-08-31 Emc Corporation Method of improving the availability of a computer clustering system through the use of a network medium link state function
US6934875B2 (en) 2000-12-29 2005-08-23 International Business Machines Corporation Connection cache for highly available TCP systems with fail over connections
EP1356359A4 (en) 2000-12-29 2006-08-30 Ming Qiu Server array hardware architecture and system
US6990547B2 (en) 2001-01-29 2006-01-24 Adaptec, Inc. Replacing file system processors by hot swapping
US6715098B2 (en) * 2001-02-23 2004-03-30 Falconstor, Inc. System and method for fibrechannel fail-over through port spoofing
IES20010610A2 (en) * 2001-03-08 2002-09-18 Richmount Computers Ltd Reset facility for redundant processor using a fibre channel loop
IES20010783A2 (en) 2001-04-26 2002-09-18 Richmount Computers Ltd Data storage apparatus
US6691184B2 (en) 2001-04-30 2004-02-10 Lsi Logic Corporation System and method employing a dynamic logical identifier
US6792515B2 (en) 2001-06-21 2004-09-14 International Business Machines Corporation System for addressing processors connected to a peripheral bus
US6757177B2 (en) 2001-07-05 2004-06-29 Tropic Networks Inc. Stacked backplane assembly
US6874100B2 (en) 2001-07-12 2005-03-29 Digi-Data Corporation Raid system with multiple controllers and proof against any single point of failure
US6785744B2 (en) 2001-08-08 2004-08-31 International Business Machines Corporation Mapping SCSI medium changer commands to mainframe-compatible perform library function commands
US20030033463A1 (en) 2001-08-10 2003-02-13 Garnett Paul J. Computer system storage
US7437493B2 (en) 2001-09-28 2008-10-14 Dot Hill Systems Corp. Modular architecture for a network storage controller
US7062591B2 (en) 2001-09-28 2006-06-13 Dot Hill Systems Corp. Controller data sharing using a modular DMA architecture
US6839788B2 (en) 2001-09-28 2005-01-04 Dot Hill Systems Corp. Bus zoning in a channel independent storage controller architecture
US6895467B2 (en) 2001-10-22 2005-05-17 Hewlett-Packard Development Company, L.P. System and method for atomizing storage
GB2381713A (en) * 2001-11-01 2003-05-07 3Com Corp Failover mechanism involving blocking of access of a malfunctioning server and continuing monitoring to enable unblocking of access if server recovers
US6732243B2 (en) 2001-11-08 2004-05-04 Chaparral Network Storage, Inc. Data mirroring using shared buses
US6874103B2 (en) 2001-11-13 2005-03-29 Hewlett-Packard Development Company, L.P. Adapter-based recovery server option
US7127633B1 (en) 2001-11-15 2006-10-24 Xiotech Corporation System and method to failover storage area network targets from one interface to another
US6883065B1 (en) 2001-11-15 2005-04-19 Xiotech Corporation System and method for a redundant communication channel via storage area network back-end
US6904482B2 (en) 2001-11-20 2005-06-07 Intel Corporation Common boot environment for a modular server system
US6983397B2 (en) 2001-11-29 2006-01-03 International Business Machines Corporation Method, system, and program for error handling in a dual adaptor system where one adaptor is a master
US6782450B2 (en) 2001-12-06 2004-08-24 Raidcore, Inc. File mode RAID subsystem
US7138733B2 (en) * 2001-12-13 2006-11-21 Hewlett-Packard Development Company, L.P. Redundant data and power infrastructure for modular server components in a rack
US7111084B2 (en) 2001-12-28 2006-09-19 Hewlett-Packard Development Company, L.P. Data storage network with host transparent failover controlled by host bus adapter
US7076555B1 (en) 2002-01-23 2006-07-11 Novell, Inc. System and method for transparent takeover of TCP connections between servers
US6983396B2 (en) 2002-02-15 2006-01-03 International Business Machines Corporation Apparatus for reducing the overhead of cache coherency processing on each primary controller and increasing the overall throughput of the system
US7152185B2 (en) 2002-02-22 2006-12-19 Bea Systems, Inc. Method for event triggered monitoring of managed server health
US20030177224A1 (en) * 2002-03-15 2003-09-18 Nguyen Minh Q. Clustered/fail-over remote hardware management system
US7073022B2 (en) 2002-05-23 2006-07-04 International Business Machines Corporation Serial interface for a data storage array
US7986618B2 (en) 2002-06-12 2011-07-26 Cisco Technology, Inc. Distinguishing between link and node failure to facilitate fast reroute
JP3932994B2 (en) * 2002-06-25 2007-06-20 株式会社日立製作所 Server handover system and method
PL203170B1 (en) * 2002-07-01 2009-09-30 Advanced Digital Broadcast Ltd System designed to detect actuation of microprocessor operation monitoring internal system and method of zeroing microprocessor system equipped with a system detecting actuation of internal monitoring system
US7548971B2 (en) 2002-08-12 2009-06-16 Hewlett-Packard Development Company, L.P. System and method for managing the operating frequency of blades in a bladed-system
US6970054B2 (en) 2002-10-02 2005-11-29 Hewlett-Packard Development Company, L.P. Apparatus for terminating transmission lines to reduce electromagnetic interference in an electronic system
US7739485B2 (en) 2002-10-11 2010-06-15 Hewlett-Packard Development Company, L.P. Cached field replaceable unit EEPROM data
US7752294B2 (en) 2002-10-28 2010-07-06 Netapp, Inc. Method and system for dynamic expansion and contraction of nodes in a storage area network
US20040117522A1 (en) * 2002-12-11 2004-06-17 Dell Products L.P. System and method for addressing protocol translation in a storage environment
JP2004220216A (en) 2003-01-14 2004-08-05 Hitachi Ltd San/nas integrated storage device
JP2004234558A (en) 2003-01-31 2004-08-19 Hitachi Ltd Storage device controller and program
US20040168008A1 (en) 2003-02-18 2004-08-26 Hewlett-Packard Development Company, L.P. High speed multiple ported bus interface port state identification system
US7236987B1 (en) 2003-02-28 2007-06-26 Sun Microsystems Inc. Systems and methods for providing a storage virtualization environment
US7290168B1 (en) 2003-02-28 2007-10-30 Sun Microsystems, Inc. Systems and methods for providing a multi-path network switch system
US7134046B2 (en) 2003-03-19 2006-11-07 Lucent Technologies Inc. Method and apparatus for high availability distributed processing across independent networked computer fault groups
US7334064B2 (en) 2003-04-23 2008-02-19 Dot Hill Systems Corporation Application server blade for embedded storage appliance
US7565566B2 (en) * 2003-04-23 2009-07-21 Dot Hill Systems Corporation Network storage appliance with an integrated switch
US7251745B2 (en) 2003-06-11 2007-07-31 Availigent, Inc. Transparent TCP connection failover
US7475134B2 (en) * 2003-10-14 2009-01-06 International Business Machines Corporation Remote activity monitoring
US7225356B2 (en) 2003-11-06 2007-05-29 Siemens Medical Solutions Health Services Corporation System for managing operational failure occurrences in processing devices
US7246256B2 (en) 2004-01-20 2007-07-17 International Business Machines Corporation Managing failover of J2EE compliant middleware in a high availability system
US7249277B2 (en) 2004-03-11 2007-07-24 Hitachi, Ltd. Disk array including plural exchangeable magnetic disk unit
US7137042B2 (en) * 2004-03-17 2006-11-14 Hitachi, Ltd. Heartbeat apparatus via remote mirroring link on multi-site and method of using same
JP2005301442A (en) 2004-04-07 2005-10-27 Hitachi Ltd Storage device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6609213B1 (en) * 2000-08-10 2003-08-19 Dell Products, L.P. Cluster-based system and method of recovery from server failures
WO2002101573A2 (en) * 2001-06-13 2002-12-19 Intel Corporation Modular server architecture
US20030018927A1 (en) * 2001-07-23 2003-01-23 Gadir Omar M.A. High-availability cluster virtual server system
US20040111559A1 (en) * 2002-12-10 2004-06-10 Thomas Heil Apparatus and method for sharing boot volume among server blades

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
J. REGULA: "Using PCI Express and Non-transparent Bridging in Blade Servers", BLADE LETTER, March 2004 (2004-03-01), pages 12, XP002374828, Retrieved from the Internet <URL:http://www.bladesystems.org/elearning_center/publications/BladeLetter_Q104.pdf> [retrieved on 20060329] *
T. OKANO: "Future Trends of BladeServer: Virtualization and Optimization", NEC JOURNAL OF ADVANCED TECHNOLOGY, vol. 1, no. 2, 20 June 2004 (2004-06-20), pages 119 - 124, XP002374827, Retrieved from the Internet <URL:http://www.nec.co.jp/techrep/en/r_and_d/a04/a04-no2/a119.pdf> [retrieved on 20060327] *

Also Published As

Publication number Publication date
US20050207105A1 (en) 2005-09-22
WO2006019744A2 (en) 2006-02-23
US7627780B2 (en) 2009-12-01

Similar Documents

Publication Publication Date Title
WO2006019744A3 (en) Deterministically active-active failover of redundant servers in a network storage appliance
US10078564B2 (en) Preventing split-brain scenario in a high-availability cluster
CN105406980B (en) A kind of multinode backup method and device
CN103346903B (en) Dual-machine backup method and device
JP5102901B2 (en) Method and system for maintaining data integrity between multiple data servers across a data center
WO2006121990A3 (en) Fault tolerant computer system
CN104102559B (en) A kind of double controller storage system restarting link based on redundancy heart beating link and opposite end
WO2010068237A3 (en) Deterministic session load-balancing and redundancy of access servers in a computer network
US20070162783A1 (en) System and method for virtual router failover in a network routing system
WO2004095276A3 (en) Corrective actions for servers with shared resources
JP2004532442A5 (en)
AU2003260526A1 (en) High availability software based contact centre
DE69614623D1 (en) Fault-tolerant multiple network server
CN103795572B (en) The switching method and monitoring server of principal and subordinate's server
WO2008093310A3 (en) Redundant far-end context related pseudo-wire connectivity using inter-chassis backup connection
JP2008542858A5 (en)
CN109600264A (en) CloudStack cloud platform
CN103560907A (en) Management method and device for double machine load balancing devices
CN110413225B (en) High-reliability cluster storage dual-active configuration method, system, terminal and storage medium
WO2006033694A3 (en) Health monitor for a geographically distributed voice messaging system
JP2007274153A (en) Disconnection method of faulty apparatus in fiber channel switch
US10237122B2 (en) Methods, systems, and computer readable media for providing high availability support at a bypass switch
Wang et al. Virtual machines of high availability using hardware-assisted failure detection
CN103731292A (en) Fault-tolerant method enabling server to realize fault seamless recovery
WO2001082078A3 (en) Method and apparatus for maintaining the integrity of configuration data in redundant, fault tolerant network appliances

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

NENP Non-entry into the national phase

Ref country code: DE

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase