US20040042145A1 - Distributed diode fault check - Google Patents

Distributed diode fault check Download PDF

Info

Publication number
US20040042145A1
US20040042145A1 US10/455,753 US45575303A US2004042145A1 US 20040042145 A1 US20040042145 A1 US 20040042145A1 US 45575303 A US45575303 A US 45575303A US 2004042145 A1 US2004042145 A1 US 2004042145A1
Authority
US
United States
Prior art keywords
power supply
isolation devices
voltage
isolation
fault
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
US10/455,753
Other versions
US6967487B2 (en
Inventor
Paul Garnett
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Oracle America Inc
Original Assignee
Sun Microsystems Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sun Microsystems Inc filed Critical Sun Microsystems Inc
Assigned to SUN MICROSYSTEMS, INC. reassignment SUN MICROSYSTEMS, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: GARNETT, PAUL JEFFREY, SUN MICROSYSTEMS LIMITED
Publication of US20040042145A1 publication Critical patent/US20040042145A1/en
Application granted granted Critical
Publication of US6967487B2 publication Critical patent/US6967487B2/en
Assigned to Oracle America, Inc. reassignment Oracle America, Inc. MERGER AND CHANGE OF NAME (SEE DOCUMENT FOR DETAILS). Assignors: Oracle America, Inc., ORACLE USA, INC., SUN MICROSYSTEMS, INC.
Adjusted expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H02GENERATION; CONVERSION OR DISTRIBUTION OF ELECTRIC POWER
    • H02JCIRCUIT ARRANGEMENTS OR SYSTEMS FOR SUPPLYING OR DISTRIBUTING ELECTRIC POWER; SYSTEMS FOR STORING ELECTRIC ENERGY
    • H02J1/00Circuit arrangements for dc mains or dc distribution networks
    • H02J1/10Parallel operation of dc sources
    • H02J1/108Parallel operation of dc sources using diodes blocking reverse current flow
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01RMEASURING ELECTRIC VARIABLES; MEASURING MAGNETIC VARIABLES
    • G01R31/00Arrangements for testing electric properties; Arrangements for locating electric faults; Arrangements for electrical testing characterised by what is being tested not provided for elsewhere
    • G01R31/40Testing power supplies
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F1/00Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
    • G06F1/26Power supply means, e.g. regulation thereof
    • G06F1/28Supervision thereof, e.g. detecting power-supply failure by out of limits supervision
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01RMEASURING ELECTRIC VARIABLES; MEASURING MAGNETIC VARIABLES
    • G01R31/00Arrangements for testing electric properties; Arrangements for locating electric faults; Arrangements for electrical testing characterised by what is being tested not provided for elsewhere
    • G01R31/28Testing of electronic circuits, e.g. by signal tracer
    • G01R31/30Marginal testing, e.g. by varying supply voltage
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2015Redundant power supplies

Definitions

  • the present invention relates to redundant power supply systems.
  • the invention relates to a method and apparatus for testing for faults in the isolation circuits of systems powered by redundant power supplies.
  • isolation devices due to the nature of their operation, it is inherently the case that the presence of certain types of faults in isolation devices (such as a short circuit) only become apparent when the power supply which it serves to isolate becomes inoperative. A latent fault in an isolation device may therefore develop while the associated power supply is in the operative state and remain undetected until that power supply becomes inoperative.
  • each power supply relies upon its associated isolation devices for protection from the backflow of current. Should it remain undetected, the development of a latent fault in an isolation device has potentially disastrous consequences, which may lead to catastrophic failure in its associated power supply or in other power supplies in the system.
  • An aspect of the present invention can provide a fault test method for use in a system comprising one or more system units.
  • Each system unit may comprise at least two isolation devices and be powered by a plurality of redundant power supplies.
  • Each isolation device requires a forward biasing voltage at least equal to a respective forward bias voltage value to be operable in a normal conductive state.
  • Each isolation device can be electrically connected with a respective one of the power supplies such that each power supply has associated with it a set of one or more isolation devices.
  • An output of each isolation device can be connected to a common load of the corresponding system unit.
  • the fault test method can be performed by a testing system and include:
  • the system controller is thereby able to check for fault conditions in each isolation device in a selected set of isolation devices, the set being associated with a corresponding power supply.
  • the isolation devices of a power supply system may thus be tested for latent faults during normal operation, and while each power supply associated with those isolation devices is in the operative state.
  • Each system unit may include at least two isolation devices and measuring circuitry for measuring the current through those isolation devices.
  • Each isolation device requires a forward biasing voltage at least equal to a respective forward bias voltage value to be operable in a normal conductive state.
  • Each isolation device can have an output connected to a common load of the respective system unit.
  • the system can further include a plurality of redundant power supplies.
  • Each power supply can supply power for at least one system unit via an isolation device of the respective system unit.
  • Each power supply thereby can have associated with it a set of one or more isolation devices.
  • Each power supply can also comprise control circuitry operable to control an output voltage of that power supply.
  • the system controller can be operable to:
  • At least one power supply to marginally vary an output voltage thereof such that a differential voltage is applied between an output of the power supply associated with the selected set of isolation devices and outputs of the remaining power supplies for selectively changing the conductive state of at least one isolation device by reducing the forward voltage thereof to less than said respective forward bias voltage, and
  • FIG. 1 shows a system comprising two power supplies, a system controller and N system units in accordance with an embodiment of the present invention
  • FIG. 2 shows a system comprising two power supplies, a system controller and two system units in accordance with an embodiment of the present invention
  • FIG. 3 shows a system comprising three power supplies, a system controller and two system units in accordance with an embodiment of the present invention
  • FIG. 4 shows an isolation circuit which may be provided within each of the system units in accordance with embodiments of the present invention wherein each system unit receives power from two power supplies;
  • FIG. 5 shows an isolation circuit which may be provided within each of the system units in accordance with embodiments of the present invention wherein each system unit receives power from three power supplies;
  • FIG. 6 shows an output level control circuit that may be provided within each power supply in the system in accordance with an embodiment of the present invention.
  • FIG. 1 illustrates a power supply system 510 in which a number of system units 552 , 554 , 556 can be powered by two power supply units 512 , 514 . Power can be supplied to each system unit via common power rails 558 and 560 . According to this arrangement, each power supply can be provided with a respective power rail whereby the number of power rails is equal to the number of power supplies.
  • the power supply unit 512 can comprise control logic 516 , which receives control signals from system controller 550 . These control signals can be indicative of a desired output voltage for the power supply, as determined by the system controller 550 .
  • Control logic 516 can be operable to supply a logic level input circuit 515 , referred to hereinafter as a margin arm 515 , with logic signals ‘up’ and ‘down’ in accordance with the desired output voltage.
  • the margin arm 515 can be operable to incrementally increase or decrease the output voltage presented by the power supply 512 in accordance with the logic signals received from the control logic 516 .
  • the power supply unit 514 can comprise components corresponding to those described in relation to the power supply unit 512 . In the case where more than two power supplies are present, it is anticipated that those further power supplies could also comprise such components.
  • the system unit 520 can be connected to a power rail 558 and a power rail 560 whereby power from both of the power supplies 512 , 514 can be received.
  • the power supply unit 552 comprises two isolation devices 520 and 522 , both of which only conduct current in one direction.
  • the isolation device 520 thereby isolates the power rail 558 from a backflow of current from the system unit 552 (and thereby protects each power supply or system unit connected to that rail).
  • the isolation device 522 isolates the power rail 560 .
  • each isolation device can comprise a diode, although any arrangement of components which can imposes the condition of unidirectional current flow could be employed.
  • Sensing circuits 532 and 534 can be connected in parallel with isolation devices 520 and 522 respectively and are operable to determine the magnitude and direction of current flowing through those isolation devices.
  • the sensing circuits can be provided within the system unit 552 , although in alternative implementations of the invention, they may be provided externally. In either case, the sensing circuits 532 and 534 can be provided either ‘upstream’ or (as indicated in FIG. 1) ‘downstream’ of the respective isolation devices 520 and 522 .
  • the currents flowing through the isolation devices 520 and 522 both supply a common load presented by the electrical components of the system unit.
  • An analogue to digital converter (ADC) unit 544 can be operable to receive analogue signals from each of the sensing circuits 532 and 534 . These signals can be indicative of the currents flowing into (or indeed out of) system unit 552 from respective power rails 558 and 560 . The signals can be converted into a digital signal S 1 that can be supplied to the system controller 550 . The system controller is thereby able to monitor the currents supplying the system unit 552 .
  • ADC analogue to digital converter
  • Each additional system unit, from the 2 nd system unit 554 up to and including the Nth system unit 556 can comprises components corresponding to those described in relation to system unit 552 and can be arranged in an analogous fashion with respect to the power rails 558 and 560 . Accordingly each power rail can be supplied by a single power supply and can supply power to N system units. Each system unit can receive a portion of its total power supply from each power rail.
  • the system controller 550 can be operable to perform a test cycle that tests for latent faults in the isolation devices of one or more of the system units.
  • the system unit 550 may be connected to an alarm system for reporting the detection of such latent faults.
  • this test cycle can comprise marginally varying the output voltages of one or more of the power supplies so as to observe the current flowing through each isolation device in the presence of various voltage conditions.
  • an isolation device will, at least, in the absence of faults, only conduct current in a single direction.
  • This unidirectional current flow behavior is associated with a ‘switch-on’ or biasing voltage V B , which is a characteristic of the device.
  • V B biasing voltage
  • a typical switch-on voltage for a semiconductor diode is V B ⁇ +0.6V.
  • This system 510 is shown in FIG. 2.
  • the same reference symbols are used to denote integers analogous to those of the generalized system 510 shown in FIG. 1.
  • the first step in the test cycle comprises selecting the set of isolation devices associated with a given one of the power supplies 512 , 514 and corresponding power rail 558 , 560 .
  • the set of isolation devices associated with the power supply 512 and the power rail 558 are selected (namely isolation devices 520 and 524 ). It is envisaged however that no particular order of selection need be imposed. Having selected a set of isolation devices, phase one of the test cycle may proceed.
  • the selected set of isolation devices is tested for short circuits. Should a short circuit be present in an isolation device, the output characteristics set out above would alter such that current is able to flow in the direction opposite to I F . As described previously, this has potentially disastrous consequences since under certain conditions (such as those described below) current may flow back through a shorted device and onto the common power rail.
  • This common power rail is shared by a power supply unit and a number of other system units (one, in the system of FIG. 2) and a backflow of current may damage those devices.
  • each power supply outputs a supply voltage substantially equal to V S .
  • the system controller can control the power supply unit 512 to reduce its supply voltage such that the voltage on the power rail 558 decreases from usual operational value V S to value V S1 with V S1 ⁇ V S .
  • the system controller 550 can also control the remaining power supplies in the system 510 (in this case these comprise a single unit, namely power supply unit 514 ) to increase their supply voltages such that the voltage appearing on the remaining power rails ( 560 ) increases from V S to V S2 with V S2 >V S .
  • the values of V S1 and V S2 are such that the absolute decrease in voltage on the power rail 558 is equal to the absolute increase in voltage on the power rail 514 :
  • V S2 ⁇ V S1 V D .
  • each system unit 552 , 554 is supplied primarily by power supply 514 whose supply voltage is higher than that of the power supply 512 . Consequently, the common load voltage V L in each system unit 552 , 554 is expected to increase. Accordingly, while the input voltage at each of the selected set of isolation devices is reduced to V S1 , the voltage at the outputs of those devices is increased and therefore the respective bias voltages are decreased. In accordance with the test procedure, incremental change V D is in fact chosen such the bias voltage V across each of the selected isolation devices falls to a value lower than V B . Accordingly, it is expected that the current flowing through each of the selected set of isolation devices (in the absence of latent faults) should fall to zero.
  • a negative current (where a positive current refers to a flow of current from a power supply to a system unit) will be expected to flow (in the absence of faults in the remaining power supplies and isolation devices) through that (those) faulty device(s) since the load voltage V L exceeds V S1 . Should such negative currents appear, they can be sensed by a respective sensing circuit.
  • the system controller 550 can determine from signals S 1 and S 2 whether any of the isolation devices in the selected set are short circuited.
  • the test procedure then enters a second phase in which each of the selected set of isolation devices are tested for open circuit faults.
  • the system controller 550 can control the power supply 512 associated with the selected set of isolation devices 520 , 524 to increase its supply voltage to a level higher than the normal operational supply voltage V S .
  • the controller 550 can also control the remaining power supplies 514 to decrease their supply voltage below V S .
  • each of the isolation devices 522 , 526 other than the selected set of isolation devices 520 , 524 should (in the absence faults) fall to zero. Consequently, the common load voltage V L in each isolation device is expected to be supplied only by the power supply 512 . However, should an open circuit fault exist in any of the selected set of isolation devices 520 , 524 , then it is expected that no current will flow through that (those) faulty device(s). A lack of current flow of this kind can be detected by a respective sensing circuit and communicated to the system controller 550 via a respective ADC unit 544 , 546 .
  • the system controller 550 is operable to report to an alarm system (not shown).
  • the system controller 550 may be operable to provide the alarm system with details of which system unit(s) 552 , 554 contain faults and more particularly which isolation devices 520 , 522 , 524 , 526 are involved.
  • the system controller can also be operable to indicate the type of fault that has been detected (e.g. short circuit, open circuit).
  • the alarm system can be operable to provide some form of indication to a user of the detected fault(s).
  • System controller 550 may be operable to perform a further test cycle after the completion of some form of delay interval.
  • the length of the delay interval may be determined in a number of different ways.
  • system controller may comprise some form of internal clock and means for storing a delay interval indicator.
  • a mechanism may also be provided by which a user is able to select the indicator whereby the user can decide upon a suitable delay interval.
  • the system controller 550 may be operable to receive an external control signal indicating that a test cycle should be commenced. Such a control signal may for example originate in one of the system units. Means may also be proved by which a user is able to cause the system controller 550 to initiate a test cycle immediately.
  • the amplitude of the differential voltage VD can be selected by considering the characteristic bias voltages V B of the isolation devices 520 , 522 , 524 , 526 in the system.
  • similar isolation devices can be used in each of the system units 544 , 554 . Accordingly, there is a substantially uniform value of V B for each of the isolation devices in the system 510 .
  • the bias voltage across each of the selected set of isolation devices 520 , 524 can be reduced to a value lower than the characteristic bias voltage V B as represented in Table 4 below: TABLE 4 V ⁇ V B .
  • V S is the normal (unmodified) supply voltage on the power rails 558 and 560
  • 1 ⁇ 2V D is the drop in that supply voltage in accordance with that phase of the test procedure
  • V B is the characteristic voltage drop across the isolation device while that device is forward biased (and thereby allowing current to flow).
  • V min is determined according to a ‘worst case scenario’ where the selected isolation device is the only operational (i.e. passing current) isolation device in that system unit.
  • V D is given by the relationship in Table 6 below: TABLE 6 V D ⁇ 2(V S ⁇ V min ⁇ V B ).
  • V D should also take account of resistive losses in the power rails, statistical variations in the values of the bias currents V B for the isolation devices in the system 510 and other system tolerances such as deviations in the supply voltages from the nominal set value. Since these variables will be dependent on a given system, the optimum voltage may for instance be determined empirically using a dummy load.
  • the differential voltage V D is created by varying the supply voltage of the power supply associated with the selected set of isolation devices and varying the supply voltage of the remaining power supplies in the opposite sense.
  • V D may be created by varying only the power supply associated with the selected set of isolation devices while keeping the supply voltages of the remaining power supplies constant.
  • the supply voltage of the power supply associated with the selected set of isolation devices is kept constant while the supply voltages of the remaining power supplies are varied.
  • system 510 comprises two power supply units 512 ; 514 .
  • any number of power supply units may be present.
  • An example of this is illustrated by the embodiment shown in FIG. 3.
  • the system 510 comprises three power supply units 512 , 514 , 590 .
  • the additional power supply unit 590 comprises components analogous to those present in the other two power supply units. Accordingly, power supply unit 590 can comprise control logic 592 and a margin arm 591 .
  • the system shown in FIG. 3 can comprises an additional power rail 596 which provides power to each of the system units 552 ; 554 .
  • each system unit can receive power from each of the power supply units 512 , 514 , 590 via respective power rails 558 , 560 and 596 .
  • Each system unit can comprise three isolation devices, each for isolating a respective one of the power rails from a backflow of current and the currents passing through those isolation devices may be sensed by respective sensing circuits.
  • a full test cycle could comprise:
  • a shortened test procedure for the system 510 shown in FIG. 3 may comprise:
  • test cycle open circuit tests may be performed on all of the isolation devices in the system in a single step. This step would not require the altering of any of the supply voltages if it can be assumed that each power supply is providing the same normal operational voltage V S (whereby each isolation device should be forward biased). The lack of a forward current I F under such voltage conditions should be indicative of an open circuit fault. Having performed this step, the test cycle may then continue by checking each individual set of isolation devices for short circuits in the manner described above.
  • each isolation device 520 , 522 , 523 comprises a diode.
  • the output of each isolation device in each system unit supplies a common load V L of that system unit.
  • Each sensing circuit comprises a resistor arranged in parallel with a differential amplifier (see for example the resistor 560 and the differential amplifier 562 in FIG. 4).
  • Each differential amplifier measures the voltage across a respective resistor and outputs an analogue signal indicative of that voltage. In this manner, the current flowing through each isolation device is measured.
  • the output of each differential amplifier is supplied to the analogue to a digital converter (ADC) unit of the respective system unit.
  • ADC digital converter
  • the ADC unit 544 comprises two separate ADCs, each for performing an analogue to digital conversion of the output signal from a respective differential amplifier.
  • the converted signals, S 1A and S 1B are then provided to the system controller whereby the system controller is able to monitor the currents flowing through each of the isolation devices in the system unit.
  • the ADC unit 544 can comprises three separate ADCs (one for each differential amplifier) and thereby can produces three digital signals S 1A , S 1B and S 1C .
  • the ADC unit could be provided within the system controller 550 , whereby the system controller would receive analogue signals from each of the system units.
  • FIG. 6 there is shown an embodiment of the margin arm (for example margin arm 515 in FIG. 2), which may be used in each power supply unit.
  • the margin arm controls the output voltage of the power supply with respect to a reference voltage V REF in response to control signals received from control logic ( 516 ).
  • a power train 580 has an output voltage V OUT which, during normal operation, is equal out voltage value V S as described above.
  • the value of V OUT is controlled by control voltage V C derived from the reference voltage V REF via a differential amplifier 582 .
  • a potential divider circuit provided by shunt resistance R 1 and R 2 provides a feedback input V FB to the differential amplifier 582 by dividing the supply voltage V OUT in a predetermined manner.
  • a switch SW 1 is operable to connect a shunt resistance R 3 in parallel with resistance R 2 when it is required to upwardly bias V FB to increase the supply voltage V S1 by a margin.
  • a switch SW 2 is operable to connect a shunt resistance R 4 in parallel with R 1 when it is required to reduce the supply voltage V OUT by a margin.
  • control logic for example control logic 516 in FIG. 2 is operable to operate switches SW 1 and SW 2 in response to logic signals received from the system control in order to margin up or margin down output voltage V OUT .
  • the system controller is able to instruct each power supply unit to vary its output voltage to a desired level during a test cycle.
  • the function of the system controller 550 may be implemented in hardware or software, the software implementation requiring a processor provided with a computer program defining instructions for controlling the operation of the processor.
  • the system controller may be operable to store values of voltage (such as the differential voltage V D or the respective bias voltages V B of the isolation devices) as well as current (including values of current passing through each isolation device under certain voltage conditions) in a look up table for comparison with values measured during the test cycle.
  • the system controller itself may comprise one of the system units. The system controller would, in such embodiments, receive power from each of the power supply units in the system in the manner described above and would be operable to perform latent fault checks on its own isolation devices.
  • each of the isolation devices has been described as being essentially uniform (in terms of bias voltage and output characteristics) the invention could also be implemented in systems where isolation devices with different output characteristics are present. This would require the use of differential voltages V D in the test cycle for each isolation device. As stated above, these values could be stored by the system controller 550 in a look up table.
  • Each system unit can comprise electrical equipment such as a computer or different portions of a computer system.
  • each system unit may comprise a field replaceable unit (FRU) which may be readily removed and replaced should a fault (such as those tested for in the methods described above) develop therein. Should redundant isolation devices be included in those FRUs, the FRUs could be swapped between systems comprising different numbers of power supply units. Further, should a power supply unit develop a fault, that unit (which may also comprise a FRU) may be removed without adversely affecting the test method.
  • the system controller may be operable to detect the number of power supply units present in the system.
  • a system in accordance with the present invention is implemented as a blade server system that includes a rack mountable shelf that supports a plurality of power supply units and a plurality of system units in the form of server blades, the power supply units being connectable to the server blades via a midplane within the rack mountable shelf.
  • a server blade system is described in published US application US-2003-0032335-A1 (U.S. Ser. No. 10/171,908), the whole content of which is incorporated herein by reference.

Abstract

A method and apparatus for testing for latent faults in the isolation devices of a system comprising redundant power supplies which supply power to one or more system units. A system controller is operable to perform a test cycle to perform the fault checks, which may comprise checks for short circuits and/or open circuits. The checks may be performed within a test cycle in which each of the isolation devices in the system is tested. The test cycle may be performed at regular intervals, the interval between each cycle being determinable by user input. In the event that a fault in one of the isolation devices is detected, the system controller may be operable to report the fault to an alarm system.

Description

    BACKGROUND OF THE INVENTION
  • The present invention relates to redundant power supply systems. In particular, the invention relates to a method and apparatus for testing for faults in the isolation circuits of systems powered by redundant power supplies. [0001]
  • It is customary in redundant power supply systems to provide a number of isolation devices, which serve to isolate inoperative power supplies from any unwanted backflow of current. The inoperative state in one (or more) of the power supplies may be intentional or alternatively may occur due to a malfunction. [0002]
  • Due to the nature of their operation, it is inherently the case that the presence of certain types of faults in isolation devices (such as a short circuit) only become apparent when the power supply which it serves to isolate becomes inoperative. A latent fault in an isolation device may therefore develop while the associated power supply is in the operative state and remain undetected until that power supply becomes inoperative. [0003]
  • However, it is in the inoperative state that each power supply relies upon its associated isolation devices for protection from the backflow of current. Should it remain undetected, the development of a latent fault in an isolation device has potentially disastrous consequences, which may lead to catastrophic failure in its associated power supply or in other power supplies in the system. [0004]
  • Therefore, it is desirable to be able to test the integrity of each isolation device ‘before the event’, while each associated power supply is in the operative state, thereby ensuring the protection of each power supply from unwanted backflow currents in the inoperative state. [0005]
  • SUMMARY OF THE INVENTION
  • An aspect of the present invention can provide a fault test method for use in a system comprising one or more system units. Each system unit may comprise at least two isolation devices and be powered by a plurality of redundant power supplies. Each isolation device requires a forward biasing voltage at least equal to a respective forward bias voltage value to be operable in a normal conductive state. Each isolation device can be electrically connected with a respective one of the power supplies such that each power supply has associated with it a set of one or more isolation devices. An output of each isolation device can be connected to a common load of the corresponding system unit. The fault test method can be performed by a testing system and include: [0006]
  • selecting as a subject for a test procedure, the set of isolation devices associated with a corresponding one of the power supplies; [0007]
  • controlling the power supplies during the test procedure to marginally vary an output voltage of at least one power supply such that a differential voltage is applied between an output of the power supply associated with the selected set of isolation devices and outputs of the remaining power supplies for selectively changing the conductive state of at least one isolation device by reducing the forward voltage thereof to less than said respective forward bias voltage; [0008]
  • measuring the current through each isolation device in said selected set of isolation devices; and [0009]
  • determining whether each measured current is consistent with the existence of a fault condition in the respective isolation device. [0010]
  • By employing the method described here, the system controller is thereby able to check for fault conditions in each isolation device in a selected set of isolation devices, the set being associated with a corresponding power supply. The isolation devices of a power supply system may thus be tested for latent faults during normal operation, and while each power supply associated with those isolation devices is in the operative state. [0011]
  • Another aspect of the present invention can provide a system that comprises one or more system units. Each system unit may include at least two isolation devices and measuring circuitry for measuring the current through those isolation devices. Each isolation device requires a forward biasing voltage at least equal to a respective forward bias voltage value to be operable in a normal conductive state. Each isolation device can have an output connected to a common load of the respective system unit. The system can further include a plurality of redundant power supplies. Each power supply can supply power for at least one system unit via an isolation device of the respective system unit. Each power supply thereby can have associated with it a set of one or more isolation devices. Each power supply can also comprise control circuitry operable to control an output voltage of that power supply. The system controller can be operable to: [0012]
  • select as a subject for a test procedure, the set of isolation devices associated with a corresponding one of said power supplies; [0013]
  • cause at least one power supply to marginally vary an output voltage thereof such that a differential voltage is applied between an output of the power supply associated with the selected set of isolation devices and outputs of the remaining power supplies for selectively changing the conductive state of at least one isolation device by reducing the forward voltage thereof to less than said respective forward bias voltage, and [0014]
  • determine whether the current measured by said measuring circuitry through each isolation device in said selected set of isolations devices is consistent with the existence of a fault condition in the respective isolation device. [0015]
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • For a better understanding of the invention and to show how the same may be carried into effect reference is now made by way of example to the accompanying drawings in which: [0016]
  • FIG. 1 shows a system comprising two power supplies, a system controller and N system units in accordance with an embodiment of the present invention; [0017]
  • FIG. 2 shows a system comprising two power supplies, a system controller and two system units in accordance with an embodiment of the present invention; [0018]
  • FIG. 3 shows a system comprising three power supplies, a system controller and two system units in accordance with an embodiment of the present invention; [0019]
  • FIG. 4 shows an isolation circuit which may be provided within each of the system units in accordance with embodiments of the present invention wherein each system unit receives power from two power supplies; [0020]
  • FIG. 5 shows an isolation circuit which may be provided within each of the system units in accordance with embodiments of the present invention wherein each system unit receives power from three power supplies; [0021]
  • FIG. 6 shows an output level control circuit that may be provided within each power supply in the system in accordance with an embodiment of the present invention.[0022]
  • While the invention is susceptible to various modifications and alternative forms, specific embodiments are shown by way of example in the drawings and are herein described in detail. It should be understood, however, that drawings and detailed description thereto are not intended to limit the invention to the particular form disclosed, but on the contrary, the invention is to cover all modifications, equivalents and alternatives falling within the spirit and scope of the present invention as defined by the appended claims. In this regard, it is envisaged that combinations of features from the independent claims with features of dependent claims other than as presented by the dependencies of the claims, and also with features from the description, is also envisaged. [0023]
  • DESCRIPTION OF PARTICULAR EMBODIMENTS
  • FIG. 1 illustrates a [0024] power supply system 510 in which a number of system units 552, 554, 556 can be powered by two power supply units 512, 514. Power can be supplied to each system unit via common power rails 558 and 560. According to this arrangement, each power supply can be provided with a respective power rail whereby the number of power rails is equal to the number of power supplies.
  • The [0025] power supply unit 512 can comprise control logic 516, which receives control signals from system controller 550. These control signals can be indicative of a desired output voltage for the power supply, as determined by the system controller 550. Control logic 516 can be operable to supply a logic level input circuit 515, referred to hereinafter as a margin arm 515, with logic signals ‘up’ and ‘down’ in accordance with the desired output voltage. The margin arm 515 can be operable to incrementally increase or decrease the output voltage presented by the power supply 512 in accordance with the logic signals received from the control logic 516.
  • The [0026] power supply unit 514 can comprise components corresponding to those described in relation to the power supply unit 512. In the case where more than two power supplies are present, it is anticipated that those further power supplies could also comprise such components.
  • In operation, the [0027] system unit 520 can be connected to a power rail 558 and a power rail 560 whereby power from both of the power supplies 512, 514 can be received. The power supply unit 552 comprises two isolation devices 520 and 522, both of which only conduct current in one direction. The isolation device 520 thereby isolates the power rail 558 from a backflow of current from the system unit 552 (and thereby protects each power supply or system unit connected to that rail). Similarly, the isolation device 522 isolates the power rail 560. In the embodiments described herein, each isolation device can comprise a diode, although any arrangement of components which can imposes the condition of unidirectional current flow could be employed.
  • [0028] Sensing circuits 532 and 534 can be connected in parallel with isolation devices 520 and 522 respectively and are operable to determine the magnitude and direction of current flowing through those isolation devices. The sensing circuits can be provided within the system unit 552, although in alternative implementations of the invention, they may be provided externally. In either case, the sensing circuits 532 and 534 can be provided either ‘upstream’ or (as indicated in FIG. 1) ‘downstream’ of the respective isolation devices 520 and 522.
  • In the arrangement shown in FIG. 1, the currents flowing through the [0029] isolation devices 520 and 522 both supply a common load presented by the electrical components of the system unit.
  • An analogue to digital converter (ADC) [0030] unit 544 can be operable to receive analogue signals from each of the sensing circuits 532 and 534. These signals can be indicative of the currents flowing into (or indeed out of) system unit 552 from respective power rails 558 and 560. The signals can be converted into a digital signal S1 that can be supplied to the system controller 550. The system controller is thereby able to monitor the currents supplying the system unit 552.
  • Each additional system unit, from the 2[0031] nd system unit 554 up to and including the Nth system unit 556 can comprises components corresponding to those described in relation to system unit 552 and can be arranged in an analogous fashion with respect to the power rails 558 and 560. Accordingly each power rail can be supplied by a single power supply and can supply power to N system units. Each system unit can receive a portion of its total power supply from each power rail.
  • The [0032] system controller 550 can be operable to perform a test cycle that tests for latent faults in the isolation devices of one or more of the system units. The system unit 550 may be connected to an alarm system for reporting the detection of such latent faults. Broadly speaking, this test cycle can comprise marginally varying the output voltages of one or more of the power supplies so as to observe the current flowing through each isolation device in the presence of various voltage conditions.
  • Before describing the test cycle in more detail, it will be useful to consider the characteristics of semiconductor isolation devices (such as a diode) used in the context of this invention. [0033]
  • As described above, an isolation device will, at least, in the absence of faults, only conduct current in a single direction. This unidirectional current flow behavior is associated with a ‘switch-on’ or biasing voltage V[0034] B, which is a characteristic of the device. For positive forward voltages VF≧VB applied across the input and output of an isolation device, a positive current I=IF passes through the device. Another characteristic of semiconductor isolation devices is that they present minimal resistance to current flow when forward biased. Consequently, when forward biased, the value of VF is substantially equal to VB for essentially all useful values of forward current I=IF. A typical switch-on voltage for a semiconductor diode is VB≈+0.6V.
  • For values of V[0035] F lower than VB, the resistance of an isolation device increases abruptly to an essentially semi-infinite value. Consequently, the current flowing through the device falls to zero. The output characteristics of a typical isolation device can be as indicated in Table 1 below:
    TABLE 1
    V ≧ VB
    Figure US20040042145A1-20040304-P00801
    I = IF > 0 & V = VF ≈ VB
    V < VB
    Figure US20040042145A1-20040304-P00801
    I = 0.
  • The test cycle performed by the [0036] system controller 550 will now be described in relation to a system comprising two power supply units 512, 514 and two system units 552, 554 (i.e. N=2). This system 510 is shown in FIG. 2. In FIG. 2, the same reference symbols are used to denote integers analogous to those of the generalized system 510 shown in FIG. 1.
  • The first step in the test cycle comprises selecting the set of isolation devices associated with a given one of the power supplies [0037] 512, 514 and corresponding power rail 558, 560. In the present example, the set of isolation devices associated with the power supply 512 and the power rail 558 are selected (namely isolation devices 520 and 524). It is envisaged however that no particular order of selection need be imposed. Having selected a set of isolation devices, phase one of the test cycle may proceed.
  • In the first phase of the test cycle, the selected set of isolation devices is tested for short circuits. Should a short circuit be present in an isolation device, the output characteristics set out above would alter such that current is able to flow in the direction opposite to I[0038] F. As described previously, this has potentially disastrous consequences since under certain conditions (such as those described below) current may flow back through a shorted device and onto the common power rail. This common power rail is shared by a power supply unit and a number of other system units (one, in the system of FIG. 2) and a backflow of current may damage those devices.
  • During normal operation, it is assumed that each power supply outputs a supply voltage substantially equal to V[0039] S. To test the selected set of isolation devices, the system controller can control the power supply unit 512 to reduce its supply voltage such that the voltage on the power rail 558 decreases from usual operational value VS to value VS1 with VS1<VS. The system controller 550 can also control the remaining power supplies in the system 510 (in this case these comprise a single unit, namely power supply unit 514) to increase their supply voltages such that the voltage appearing on the remaining power rails (560) increases from VS to VS2 with VS2>VS. The values of VS1 and VS2 are such that the absolute decrease in voltage on the power rail 558 is equal to the absolute increase in voltage on the power rail 514:
  • This is represented in Table 2 below: [0040]
    TABLE 2
    Power Rail 558: V → VS1 = VS − VD/2
    Power Rail 560: V → VS2 = VS + VD/2.
  • The difference in voltage between [0041] power rail 558 and power rail 560 is therefore equal to VD, i.e. VS2−VS1=VD.
  • Under these conditions, the common load of each [0042] system unit 552, 554 is supplied primarily by power supply 514 whose supply voltage is higher than that of the power supply 512. Consequently, the common load voltage VL in each system unit 552, 554 is expected to increase. Accordingly, while the input voltage at each of the selected set of isolation devices is reduced to VS1, the voltage at the outputs of those devices is increased and therefore the respective bias voltages are decreased. In accordance with the test procedure, incremental change VD is in fact chosen such the bias voltage V across each of the selected isolation devices falls to a value lower than VB. Accordingly, it is expected that the current flowing through each of the selected set of isolation devices (in the absence of latent faults) should fall to zero.
  • If, however a short circuit fault exists in any of the selected set of isolation devices, then a negative current (where a positive current refers to a flow of current from a power supply to a system unit) will be expected to flow (in the absence of faults in the remaining power supplies and isolation devices) through that (those) faulty device(s) since the load voltage V[0043] L exceeds VS1. Should such negative currents appear, they can be sensed by a respective sensing circuit.
  • Accordingly, having controlled the power supplies in the system to vary their supply voltages in the manner described above, the [0044] system controller 550 can determine from signals S1 and S2 whether any of the isolation devices in the selected set are short circuited.
  • The test procedure then enters a second phase in which each of the selected set of isolation devices are tested for open circuit faults. In this phase, the [0045] system controller 550 can control the power supply 512 associated with the selected set of isolation devices 520, 524 to increase its supply voltage to a level higher than the normal operational supply voltage VS. The controller 550 can also control the remaining power supplies 514 to decrease their supply voltage below VS. During this second phase, the voltage conditions can be as set out in Table 3 below:
    TABLE 3
    Power Rail 558: V → VS1 = VS + VD/2
    Power Rail 560: V → VS2 = VS − VD/2.
  • The difference in voltage between the [0046] power rail 558 and the power rail 560 is therefore equal to −VD, i.e. VS2−VS1=−VD.
  • Under these conditions, it is expected that the currents passing through each of the [0047] isolation devices 522, 526 other than the selected set of isolation devices 520, 524 should (in the absence faults) fall to zero. Consequently, the common load voltage VL in each isolation device is expected to be supplied only by the power supply 512. However, should an open circuit fault exist in any of the selected set of isolation devices 520, 524, then it is expected that no current will flow through that (those) faulty device(s). A lack of current flow of this kind can be detected by a respective sensing circuit and communicated to the system controller 550 via a respective ADC unit 544, 546.
  • In the case that one or more faults are detected, the [0048] system controller 550 is operable to report to an alarm system (not shown). The system controller 550 may be operable to provide the alarm system with details of which system unit(s) 552, 554 contain faults and more particularly which isolation devices 520,522,524,526 are involved. The system controller can also be operable to indicate the type of fault that has been detected (e.g. short circuit, open circuit). The alarm system can be operable to provide some form of indication to a user of the detected fault(s).
  • The test procedure described above can be repeated for each set of isolation devices in the [0049] system 510, whereby the test cycle is completed. System controller 550 may be operable to perform a further test cycle after the completion of some form of delay interval. The length of the delay interval may be determined in a number of different ways. For example, system controller may comprise some form of internal clock and means for storing a delay interval indicator. A mechanism may also be provided by which a user is able to select the indicator whereby the user can decide upon a suitable delay interval. Alternatively the system controller 550 may be operable to receive an external control signal indicating that a test cycle should be commenced. Such a control signal may for example originate in one of the system units. Means may also be proved by which a user is able to cause the system controller 550 to initiate a test cycle immediately.
  • The amplitude of the differential voltage VD can be selected by considering the characteristic bias voltages V[0050] B of the isolation devices 520, 522, 524, 526 in the system. In the system 510, similar isolation devices can be used in each of the system units 544, 554. Accordingly, there is a substantially uniform value of VB for each of the isolation devices in the system 510.
  • During the first phase of the test procedure, the bias voltage across each of the selected set of [0051] isolation devices 520, 524 can be reduced to a value lower than the characteristic bias voltage VB as represented in Table 4 below:
    TABLE 4
    V < VB.
  • In the case where one of the selected set of isolation devices is the only functioning isolation device in the respective system unit (the remaining isolation devices in the system unit may for example have open circuit faults), this may give rise to an unacceptable drop in common load voltage V[0052] L in that system unit. Such a drop may, for example, mean that insufficient current is supplied to the components of the system unit. To avoid such an occurrence, it is required that the common load voltage remains above some minimum acceptable voltage Vmin. This requires a relationship as represented in Table 5 below:
    TABLE 5
    VS − ½VD − VB ≧ Vmin
  • In Table 5, V[0053] S is the normal (unmodified) supply voltage on the power rails 558 and 560, ½VD is the drop in that supply voltage in accordance with that phase of the test procedure, and VB is the characteristic voltage drop across the isolation device while that device is forward biased (and thereby allowing current to flow). Note that the value of Vmin is determined according to a ‘worst case scenario’ where the selected isolation device is the only operational (i.e. passing current) isolation device in that system unit.
  • Accordingly, the value of V[0054] D is given by the relationship in Table 6 below:
    TABLE 6
    VD ≦ 2(VS − Vmin − VB).
  • The calculation of V[0055] D should also take account of resistive losses in the power rails, statistical variations in the values of the bias currents VB for the isolation devices in the system 510 and other system tolerances such as deviations in the supply voltages from the nominal set value. Since these variables will be dependent on a given system, the optimum voltage may for instance be determined empirically using a dummy load.
  • In the example test procedure described above, the differential voltage V[0056] D is created by varying the supply voltage of the power supply associated with the selected set of isolation devices and varying the supply voltage of the remaining power supplies in the opposite sense. Alternatively, VD may be created by varying only the power supply associated with the selected set of isolation devices while keeping the supply voltages of the remaining power supplies constant. In a further alternative the supply voltage of the power supply associated with the selected set of isolation devices is kept constant while the supply voltages of the remaining power supplies are varied.
  • In the embodiments shown in FIGS. 1 and 2, [0057] system 510 comprises two power supply units 512;514. However, it is also envisaged that any number of power supply units may be present. An example of this is illustrated by the embodiment shown in FIG. 3. In this embodiment, the system 510 comprises three power supply units 512, 514, 590. As described above, the additional power supply unit 590 comprises components analogous to those present in the other two power supply units. Accordingly, power supply unit 590 can comprise control logic 592 and a margin arm 591.
  • In accordance with the topology described in relation to the generalized system shown in FIG. 1, the system shown in FIG. 3 can comprises an [0058] additional power rail 596 which provides power to each of the system units 552;554. In this embodiment, each system unit can receive power from each of the power supply units 512, 514, 590 via respective power rails 558, 560 and 596. Each system unit can comprise three isolation devices, each for isolating a respective one of the power rails from a backflow of current and the currents passing through those isolation devices may be sensed by respective sensing circuits.
  • The addition of power supply units in this type of system involves the introduction of additional sets of isolation devices. Accordingly, for each additional power supply included, two additional test phases (namely one additional short circuit test and one additional open circuit test) are employed for a full test cycle. [0059]
  • Considerations for choosing between the variety of possible test cycle procedures available for a given system topology will now be discussed. While in the test cycle described above in relation to the system in FIG. 2, each set of isolation devices were tested separately, it is in fact possible to simplify the test procedure. Examples of suitable simplifications, which can be of particular benefit to systems comprising multiple power supply units, are described below. [0060]
  • In a system comprising two power supply units, it is in fact possible to halve the number to test phases required by performing short circuit and open circuit tests simultaneously. Referring once more to FIG. 2 for instance, while the [0061] system controller 550 is performing a short circuit test on the a first set of isolation devices (for example 520, 524), it may simultaneously perform open circuit tests on the other set of isolation devices (522, 526). Similarly, when the first set of isolation devices (520, 524) are being tested for open circuits, the other set of isolation devices (522, 526) may be tested for short circuits. This is possible because the voltage conditions which are applied during a short circuit test for on set of isolation devices (namely the lowering/raising of the input voltages applied by an amount VD/2) are the same voltage conditions which are required for performing open circuits test for the other isolation devices. Accordingly, in the system 510 shown in FIG. 2, a full test cycle could comprise:
  • lowering the supply voltage of the [0062] power supply 512 by an amount VD/2 while increasing the supply voltage of power supply 514 by an amount VD/2;
  • performing short circuit tests on the [0063] isolation devices 520 and 524 while also performing open circuit tests on the isolation devices 522 and 526;
  • reversing the voltage conditions such that the output voltages of the power supplies [0064] 512 and 514 are increased/decreased to a value VD/2 above/below its normal operational value (Vs) respectively;
  • performing short circuit tests on the [0065] isolation devices 522 and 526 while also performing open circuit tests on the isolation devices 520 and 524;
  • reporting to an alarm system if a fault is detected. [0066]
  • Similar considerations can apply to systems comprising three or more power supply units (for example that shown in FIG. 3). By way of example, a shortened test procedure for the [0067] system 510 shown in FIG. 3 may comprise:
  • lowering the supply voltage of the [0068] power supply 512 by an amount VD/2 while increasing the supply voltages of the power supplies 514 and 590 by an amount VD/2;
  • performing short circuit tests on the [0069] isolation devices 520 and 524 while also performing open circuit tests on the isolation devices 522, 526 and 593, 594;
  • altering the voltage conditions such that the output voltage of the [0070] power supply 512 is increased to a value VD/2 above its normal operational value (VS) while the output voltages of the power supply units 514 and 590 are decreased to a value VD/2 below their normal operational value (VS);
  • performing open circuit tests on the [0071] isolation devices 520 and 524 while performing short circuit tests on the isolation devices 522, 526 and 593, 594;
  • reporting to an alarm system if a fault is detected. [0072]
  • Since it is possible to test more than a single device at once, the increase in the number of power supplies (and therefore isolation devices) over the system of FIG. 2 does not impact upon the number of steps required to perform a complete test cycle. [0073]
  • In an alternative test cycle, open circuit tests may be performed on all of the isolation devices in the system in a single step. This step would not require the altering of any of the supply voltages if it can be assumed that each power supply is providing the same normal operational voltage V[0074] S (whereby each isolation device should be forward biased). The lack of a forward current IF under such voltage conditions should be indicative of an open circuit fault. Having performed this step, the test cycle may then continue by checking each individual set of isolation devices for short circuits in the manner described above.
  • Turning now to FIGS. 4 and 5, there are shown two examples of the isolation device and sensing circuit arrangement for a given system unit. In each of these embodiments, each [0075] isolation device 520, 522, 523 comprises a diode. The output of each isolation device in each system unit supplies a common load VL of that system unit. Each sensing circuit comprises a resistor arranged in parallel with a differential amplifier (see for example the resistor 560 and the differential amplifier 562 in FIG. 4). Each differential amplifier measures the voltage across a respective resistor and outputs an analogue signal indicative of that voltage. In this manner, the current flowing through each isolation device is measured. The output of each differential amplifier is supplied to the analogue to a digital converter (ADC) unit of the respective system unit.
  • In the embodiment shown in FIG. 4, the [0076] ADC unit 544 comprises two separate ADCs, each for performing an analogue to digital conversion of the output signal from a respective differential amplifier. The converted signals, S1A and S1B are then provided to the system controller whereby the system controller is able to monitor the currents flowing through each of the isolation devices in the system unit. In the embodiment shown in FIG. 5, the ADC unit 544 can comprises three separate ADCs (one for each differential amplifier) and thereby can produces three digital signals S1A, S1B and S1C. In alternative embodiments, the ADC unit could be provided within the system controller 550, whereby the system controller would receive analogue signals from each of the system units.
  • In alternative embodiments, it may be desirable to provide one of more sets of redundant isolation devices with respective sensing circuits and ADCs. This would allow the number of power supplies used in the [0077] system 510 to be altered according to need. For example, a system unit taken from the embodiment shown in FIG. 3 could be readily used in the system 510 shown in FIG. 2. Furthermore, should one isolation device or its respective sensing circuit or ADC develop a fault, an alternative isolation device could be configured to receive power.
  • In FIG. 6, there is shown an embodiment of the margin arm (for [0078] example margin arm 515 in FIG. 2), which may be used in each power supply unit. The margin arm controls the output voltage of the power supply with respect to a reference voltage VREF in response to control signals received from control logic (516). A power train 580 has an output voltage VOUT which, during normal operation, is equal out voltage value VS as described above. The value of VOUT is controlled by control voltage VC derived from the reference voltage VREF via a differential amplifier 582.
  • A potential divider circuit provided by shunt resistance R[0079] 1 and R2 provides a feedback input VFB to the differential amplifier 582 by dividing the supply voltage VOUT in a predetermined manner. A switch SW1 is operable to connect a shunt resistance R3 in parallel with resistance R2 when it is required to upwardly bias VFB to increase the supply voltage VS1 by a margin. Similarly, a switch SW2 is operable to connect a shunt resistance R4 in parallel with R1 when it is required to reduce the supply voltage VOUT by a margin.
  • The control logic (for [0080] example control logic 516 in FIG. 2) is operable to operate switches SW1 and SW2 in response to logic signals received from the system control in order to margin up or margin down output voltage VOUT. In this manner, the system controller is able to instruct each power supply unit to vary its output voltage to a desired level during a test cycle.
  • The function of the [0081] system controller 550 may be implemented in hardware or software, the software implementation requiring a processor provided with a computer program defining instructions for controlling the operation of the processor. The system controller may be operable to store values of voltage (such as the differential voltage VD or the respective bias voltages VB of the isolation devices) as well as current (including values of current passing through each isolation device under certain voltage conditions) in a look up table for comparison with values measured during the test cycle. In some implementations of the invention, the system controller itself may comprise one of the system units. The system controller would, in such embodiments, receive power from each of the power supply units in the system in the manner described above and would be operable to perform latent fault checks on its own isolation devices.
  • While in the embodiments described above, each of the isolation devices has been described as being essentially uniform (in terms of bias voltage and output characteristics) the invention could also be implemented in systems where isolation devices with different output characteristics are present. This would require the use of differential voltages V[0082] D in the test cycle for each isolation device. As stated above, these values could be stored by the system controller 550 in a look up table.
  • Each system unit can comprise electrical equipment such as a computer or different portions of a computer system. In particular, each system unit may comprise a field replaceable unit (FRU) which may be readily removed and replaced should a fault (such as those tested for in the methods described above) develop therein. Should redundant isolation devices be included in those FRUs, the FRUs could be swapped between systems comprising different numbers of power supply units. Further, should a power supply unit develop a fault, that unit (which may also comprise a FRU) may be removed without adversely affecting the test method. In such systems, the system controller may be operable to detect the number of power supply units present in the system. [0083]
  • In one embodiment, a system in accordance with the present invention is implemented as a blade server system that includes a rack mountable shelf that supports a plurality of power supply units and a plurality of system units in the form of server blades, the power supply units being connectable to the server blades via a midplane within the rack mountable shelf. An example of such a server blade system is described in published US application US-2003-0032335-A1 (U.S. Ser. No. 10/171,908), the whole content of which is incorporated herein by reference. [0084]
  • Although the embodiments above have been described in considerable detail, numerous variations and modifications will become apparent to those skilled in the art once the above disclosure is fully appreciated. It is intended that the following claims be interpreted to embrace all such variations and modifications. [0085]

Claims (31)

What is claimed is:
1. A fault test method for use in a system comprising one or more system units, wherein each system unit comprises at least two isolation devices and is powered by a plurality of redundant power supplies, each isolation device requires a forward biasing voltage at least equal to a respective forward bias voltage value to be operable in a normal conductive state and is electrically connected with a respective one of said power supplies such that each power supply has associated with it a set of one or more isolation devices, and an output of each isolation device is connected to a common load of the corresponding system unit, the method being performed by a testing system and comprising:
selecting as a subject for a test procedure, the set of isolation devices associated with a corresponding one of said power supplies;
controlling the power supplies during the test procedure to marginally vary an output voltage of at least one power supply such that a differential voltage is applied between an output of the power supply associated with the selected set of isolation devices and outputs of the remaining power supplies for selectively changing the conductive state of at least one isolation device by reducing the forward voltage thereof to less than said respective forward bias voltage;
measuring the current through each isolation device in said selected set of isolation devices; and
determining whether each measured current is consistent with the existence of a fault condition in the respective isolation device.
2. The fault test method of claim 1, wherein said test procedure includes a short circuit detection test in which each of the power supplies other than the power supply associated with the selected set of isolation devices has a first output voltage greater than a second output voltage of the power supply associated with the selected set of isolation devices such that the differential voltage is applied for changing the conductive states of each isolation device of the selected set of isolation devices to a non-conductive state in the absence of a fault condition and wherein, if the measured current through any isolation devices of the selected set of isolation devices is non-zero and in a direction towards the corresponding power supply, the measured current is determined to be consistent with a short circuit fault condition in that isolation device.
3. The fault test method of claim 2, wherein said test procedure includes an open circuit detection test in which the power supply associated with the selected set of isolation devices has a first output voltage greater than a second output voltage of the remaining power supplies such that the differential voltage is applied for changing the conductive states of each of the isolation devices in sets other than the selected set of isolation devices in the absence of a fault condition and wherein, if the current measured through any of the selected isolation devices is substantially zero, the measured current is determined to be consistent with an open circuit fault condition in that isolation device.
4. The fault test method of claim 3, wherein the method comprises performing a test cycle comprising a sequence of test procedures such that in a completed test cycle respective short circuit tests and open circuit tests are performed for each of the isolation devices.
5. The fault test method as claimed in claim 4, wherein a system controller initiates a test cycle at predetermined intervals.
6. The fault test method as claimed in claim 5, including the system controller receiving and storing input data representative of the predetermined intervals and initiating the test cycle at intervals determined by the data.
7. The fault test method as claimed in claim 5, comprising the system controller generating an alarm signal in response to detection of a fault condition.
8. The fault test method of claim 5, wherein one of said system units comprises said system controller.
9. The fault test method of claim 8, wherein the common load of the system unit comprising said system controller comprises the load presented by said system controller.
10. The fault test method of claim 1, wherein the differential voltage is applied such that, during the test procedure and in the absence of fault conditions, the common load of each system unit is supplied by at least one of the power supplies operating at a first output voltage and being associated with a respective set of isolation devices each in a normal conductive state, and such that each remaining power supply operates at a second output voltage by the differential amount and in respect of which power supply the resulting forward voltage across each of its respective set of isolation devices is less than the forward bias voltage level of that isolation device so that conduction is inhibited.
11. The fault test method of claim 10, wherein said differential voltage is applied such that during the test procedure and in the presence of a fault condition requiring each common load to be supplied by one or more of the remaining power supplies operating at the second voltage, each resulting common load voltage is greater than a minimum operational limit required for operation of that common load.
12. The fault test method of claim 5, comprising the system controller outputting command signals for controlling the power supplies, each isolation device having a respective sensing circuit transmitting signals representative of the measurement of current to the system controller; and the system controller determining whether the measured currents are consistent with the existence of a fault condition.
13. The fault test method of claim 1, wherein each power supply comprises a respective margin arm input and wherein the controlling step comprises inputting a respective logic level signal to the margin arm input of the each power supply.
13. The fault test method of claim 1, wherein each isolation device comprises a respective diode.
14. The fault test method of claim 1, wherein said system comprises a computer system.
16. The fault test method of claim 1, wherein at least one system unit comprises a field replaceable unit.
17. The fault test method of claim 1, wherein each set of isolation devices is powered by the corresponding power supply via a common power bus.
18. A carrier medium carrying processor implementable instructions for instructing a processor of a testing system to perform a fault test method for use in a system comprising one or more system units, wherein each system unit comprises at least two isolation devices and is powered by a plurality of redundant power supplies, each isolation device requires a forward biasing voltage at least equal to a respective forward bias voltage value to be operable in a normal conductive state and is electrically connected with a respective one of said power supplies such that each power supply has associated with it a set of one or more isolation devices, and an output of each isolation device is connected to a common load of the corresponding system unit, the method comprising:
selecting as a subject for a test procedure, the set of isolation devices associated with a corresponding one of said power supplies;
controlling the power supplies during the test procedure to marginally vary an output voltage of at least one power supply such that a differential voltage is applied between an output of the power supply associated with the selected set of isolation devices and outputs of the remaining power supplies for selectively changing the conductive state of at least one isolation device by reducing the forward voltage thereof to less than said respective forward bias voltage;
measuring the current through each isolation device in said selected set of isolation devices; and
determining whether each measured current is consistent with the existence of a fault condition in the respective isolation device.
19. A system comprising:
one or more system units, each system unit comprising at least two isolation devices, and measuring circuitry for measuring the current through isolation devices, each isolation device requiring a forward biasing voltage at least equal to a respective forward bias voltage value to be operable in a normal conductive state and having an output connected to a common load of the respective system unit;
a plurality of redundant power supplies, each supplying power for at least one system unit via an isolation device of the respective system unit such that each power supply has associated with it a set of one or more isolation devices, each power supply comprising control circuitry operable to control an output voltage of that power supply; and
a system controller, operable:
to select as a subject for a test procedure, the set of isolation devices being associated with a corresponding one of said power supplies; to cause at least one power supply to marginally vary an output voltage thereof such that a differential voltage is applied between an output of the power supply associated with a selected set of isolation devices and outputs of the remaining power supplies for selectively changing the conductive state of at least one isolation device by reducing the forward voltage thereof to less than said respective forward bias voltage, and
to determine whether the current measured by said measuring circuitry through each isolation device in said selected set of isolation devices is consistent with the existence of a fault condition in the respective isolation device.
20. The system of claim 19, wherein one of said system units comprises said system controller.
21. The system of claim 20, wherein the common load of the system unit comprising said system controller comprises the operational load of said system controller.
22. The system of claim 19, wherein the system controller is operable to initiate a test cycle at predetermined intervals.
23. The system of claim 22, wherein the system controller is operable to receive and store input data representative of the predetermined intervals and to initiate the test cycle at intervals determined by the data.
24. The system of claim 22, wherein the system controller is operable to generate an alarm signal in response to detection of a fault condition.
25. The system of claim 22, wherein the system controller is operable to output command signals for controlling the power supplies, each isolation device having a respective sensing circuit operable to transmit signals representative of the measurement of current to the system controller; whereby the system controller is further operable to determine whether the measured currents are consistent with the existence of a fault condition.
26. The system of claim 19, wherein each power supply comprises a respective margin arm input and wherein the output voltage of that power supply may be varied by inputting a respective logic level signal to that margin arm input.
27. The system of claim 19, wherein said system comprises a computer system.
28. The system of claim 19, wherein at least one system unit comprises a field replaceable unit.
29. The system of claim 19, wherein each isolation device comprises a respective diode.
30. The system of claim 19, wherein each set of isolation devices is powered by the corresponding power supply via a common power bus.
31. The system of claim 19, comprising a rack mountable shelf, the rack mountable shelf supporting the plurality of power supply units and a plurality of said system units in the form of server blades.
US10/455,753 2002-06-06 2003-06-04 Distributed diode fault check Expired - Lifetime US6967487B2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB0212994A GB2389471B (en) 2002-06-06 2002-06-06 Latent fault detection in redundant power supply systems
GB0212994.8 2002-06-06

Publications (2)

Publication Number Publication Date
US20040042145A1 true US20040042145A1 (en) 2004-03-04
US6967487B2 US6967487B2 (en) 2005-11-22

Family

ID=9938085

Family Applications (2)

Application Number Title Priority Date Filing Date
US10/455,753 Expired - Lifetime US6967487B2 (en) 2002-06-06 2003-06-04 Distributed diode fault check
US10/455,754 Expired - Lifetime US6947864B2 (en) 2002-06-06 2003-06-04 Latent fault detection in redundant power supply systems

Family Applications After (1)

Application Number Title Priority Date Filing Date
US10/455,754 Expired - Lifetime US6947864B2 (en) 2002-06-06 2003-06-04 Latent fault detection in redundant power supply systems

Country Status (2)

Country Link
US (2) US6967487B2 (en)
GB (1) GB2389471B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120286820A1 (en) * 2011-05-13 2012-11-15 General Electric Company Methods to detect a single point of failure
TWI383164B (en) * 2004-11-22 2013-01-21 Inventio Ag Uninterruptible power supply and method for testing selective isolation means used in the uninterruptible power supply
CN104049156A (en) * 2013-03-13 2014-09-17 力博特公司 Method For Automatic Mapping Of AC Phase Conductors And Identification Of AC Buses In A Multi-Bus Power System
US20140281649A1 (en) * 2013-03-13 2014-09-18 Liebert Corporation Method For Automatic Mapping Of AC Phase Conductors And Identification Of AC Buses In A Multi-Bus Power System
US20160274195A1 (en) * 2015-03-19 2016-09-22 Hamilton Sundstrand Corporation Method for electronically testing integrity of ideal diode components used in or'd voltage bus
US11210930B2 (en) * 2018-05-23 2021-12-28 Minimax Gmbh Device, method, and control module for monitoring a two-wire line
WO2022093833A1 (en) * 2020-10-27 2022-05-05 Wisk Aero Llc Electrically powered propulsion systems for aircraft
US20220163599A1 (en) * 2020-11-24 2022-05-26 OneStep Power Solutions Inc. System, apparatus, and method for testing of an electrical system

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2389471B (en) 2002-06-06 2005-09-21 Sun Microsystems Inc Latent fault detection in redundant power supply systems
US7529642B2 (en) * 2004-08-20 2009-05-05 Pdf Solutions S.A. Method for evaluating the quality of data collection in a manufacturing environment
EP1659415B1 (en) * 2004-11-22 2019-07-24 Inventio AG Integrity testing of isolation means in an uninterruptible power supply
US7321948B1 (en) * 2005-03-30 2008-01-22 Emc Corporation Highly available system test mechanism
US7209365B2 (en) * 2005-04-06 2007-04-24 Inventec Corporation Card retention device
DE102005028513B4 (en) * 2005-06-17 2020-01-16 Sma Solar Technology Ag Method for determining an interconnection type of at least two electrical devices and system with several electrical devices
US7298167B1 (en) * 2005-06-24 2007-11-20 Emc Corporation Power supply system
DE102006010385B4 (en) * 2006-03-03 2010-11-25 Zippy Technology Corp., Hsin-Tien Method for identifying power supply modules
JP4836694B2 (en) * 2006-07-11 2011-12-14 株式会社オートネットワーク技術研究所 Power supply control device
US7560913B2 (en) * 2006-09-22 2009-07-14 International Business Machines Corporation Input-tracking, automatic output-margining method and system
JP4833164B2 (en) * 2007-07-24 2011-12-07 株式会社日立製作所 Storage subsystem and power supply system
US7868637B2 (en) 2007-10-15 2011-01-11 Hamilton Sundstrand Corporation System and method for automated detection of singular faults in diode or'd power bus circuits
US8446285B2 (en) * 2008-02-08 2013-05-21 Siemens Industry, Inc. Methods and apparatus for controlling and testing a notification appliance circuit
US8340933B1 (en) * 2009-09-30 2012-12-25 Emc Corporation Method and apparatus for performing power supply self-diagnostics in redundant power architectures
CN102053690B (en) * 2009-11-09 2013-01-23 英业达股份有限公司 Server
EP2357484B1 (en) * 2010-01-25 2013-03-13 Siemens Aktiengesellschaft Method for diagnosing an electrical connection and output assembly
JP5904663B2 (en) * 2012-03-27 2016-04-13 株式会社Isowa Corrugated cardboard machine automatic inspection device and corrugated cardboard machine having automatic inspection function
US9342414B1 (en) * 2014-02-12 2016-05-17 Amazon Technologies, Inc. Reserve power bus ring for data center
US9453870B1 (en) * 2014-04-15 2016-09-27 Xilinx, Inc. Testing for shorts between internal nodes of a power distribution grid
US10054646B2 (en) * 2014-08-19 2018-08-21 Astronics Advanced Electronic Systems Corp. High reliability power supply configuration and testing
US10082856B1 (en) * 2016-09-29 2018-09-25 Juniper Networks, Inc. Performing a health check on power supply modules that operate in a current sharing mode
CN111308268B (en) * 2020-02-06 2022-04-12 云南电网有限责任公司电力科学研究院 Method and system for evaluating latent fault of power distribution network equipment
PL3916518T3 (en) * 2020-05-28 2023-07-17 Ovh Power supply combination having heterogeneous power supplies
US11392183B2 (en) * 2020-05-28 2022-07-19 Ovh Power supply combination for delivering power at multiple voltages
PL3916517T3 (en) * 2020-05-28 2023-01-30 Ovh Power supply combination for delivering power at multiple voltages
CN112557938B (en) * 2020-12-21 2022-09-23 宁波拓普集团股份有限公司 Diagnosis method for power supply circuit of intelligent brake system
US11682896B2 (en) 2021-06-08 2023-06-20 International Business Machines Corporation Fault detection in redundant power supplies

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3679970A (en) * 1970-04-22 1972-07-25 Automatic Elect Lab Selection matrix including a diode test circuit
US5325062A (en) * 1992-07-10 1994-06-28 Sequoia Systems, Inc. Apparatus and method for detection of latent faults in redundant systems
US6157308A (en) * 1999-06-10 2000-12-05 Lucent Technologies Inc. Detecting hidden faults in reliable power systems
US6342791B1 (en) * 1999-02-10 2002-01-29 Nas-Toa Co., Ltd. Diode defect detecting device
US6462667B1 (en) * 1999-12-06 2002-10-08 Westinghouse Electric Company Llc Energize to actuate engineered safety features actuation system and testing method therefor

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4156189A (en) * 1977-09-12 1979-05-22 Honeywell Information Systems Inc. Power supply fault analyzer
US5072175A (en) * 1990-09-10 1991-12-10 Compaq Computer Corporation Integrated circuit having improved continuity testability and a system incorporating the same
CA2151684A1 (en) * 1994-06-30 1995-12-31 David L. Aldridge System for switching power and scrubbing for faults
US5521513A (en) * 1994-10-25 1996-05-28 Teradyne Inc Manufacturing defect analyzer
JP3113159B2 (en) * 1994-11-14 2000-11-27 富士通株式会社 Power supply
US5519335A (en) * 1995-03-13 1996-05-21 Unisys Corporation Electronic tester for testing Iddq in an integrated circuit chip
US6301133B1 (en) * 1999-04-07 2001-10-09 Astec International Limited Power supply system with ORing element and control circuit
US6661119B2 (en) * 2001-12-17 2003-12-09 Cloudshield Technologies, Inc. System and method for distributed power supply supporting high currents with redundancy
GB2389471B (en) 2002-06-06 2005-09-21 Sun Microsystems Inc Latent fault detection in redundant power supply systems

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3679970A (en) * 1970-04-22 1972-07-25 Automatic Elect Lab Selection matrix including a diode test circuit
US5325062A (en) * 1992-07-10 1994-06-28 Sequoia Systems, Inc. Apparatus and method for detection of latent faults in redundant systems
US6342791B1 (en) * 1999-02-10 2002-01-29 Nas-Toa Co., Ltd. Diode defect detecting device
US6157308A (en) * 1999-06-10 2000-12-05 Lucent Technologies Inc. Detecting hidden faults in reliable power systems
US6462667B1 (en) * 1999-12-06 2002-10-08 Westinghouse Electric Company Llc Energize to actuate engineered safety features actuation system and testing method therefor

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI383164B (en) * 2004-11-22 2013-01-21 Inventio Ag Uninterruptible power supply and method for testing selective isolation means used in the uninterruptible power supply
US20120286820A1 (en) * 2011-05-13 2012-11-15 General Electric Company Methods to detect a single point of failure
CN104049156A (en) * 2013-03-13 2014-09-17 力博特公司 Method For Automatic Mapping Of AC Phase Conductors And Identification Of AC Buses In A Multi-Bus Power System
US20140281649A1 (en) * 2013-03-13 2014-09-18 Liebert Corporation Method For Automatic Mapping Of AC Phase Conductors And Identification Of AC Buses In A Multi-Bus Power System
US9519560B2 (en) * 2013-03-13 2016-12-13 Liebert Corporation Method for automatic mapping of AC phase conductors and identification of AC buses in a multi-bus power system
US20160274195A1 (en) * 2015-03-19 2016-09-22 Hamilton Sundstrand Corporation Method for electronically testing integrity of ideal diode components used in or'd voltage bus
US9766299B2 (en) * 2015-03-19 2017-09-19 Hamilton Sundstrand Corporation Method for electronically testing integrity of ideal diode components used in OR'd voltage bus
US11210930B2 (en) * 2018-05-23 2021-12-28 Minimax Gmbh Device, method, and control module for monitoring a two-wire line
WO2022093833A1 (en) * 2020-10-27 2022-05-05 Wisk Aero Llc Electrically powered propulsion systems for aircraft
US20220163599A1 (en) * 2020-11-24 2022-05-26 OneStep Power Solutions Inc. System, apparatus, and method for testing of an electrical system
WO2022115329A1 (en) * 2020-11-24 2022-06-02 OneStep Power Solutions Inc. System, apparatus, and method for testing of an electrical system
US11874335B2 (en) * 2020-11-24 2024-01-16 OneStep Power Solutions Inc. System, apparatus, and method for testing of an electrical system

Also Published As

Publication number Publication date
GB2389471A (en) 2003-12-10
US20040039536A1 (en) 2004-02-26
GB0212994D0 (en) 2002-07-17
US6967487B2 (en) 2005-11-22
US6947864B2 (en) 2005-09-20
GB2389471B (en) 2005-09-21

Similar Documents

Publication Publication Date Title
US6967487B2 (en) Distributed diode fault check
US6121693A (en) Distributed power system
US6744151B2 (en) Multi-channel power supply selector
US6504266B1 (en) Method and apparatus for powering up an electronic system after AC power has been removed
US20100264743A1 (en) Static transfer switch device, power supply apparatus using the switch device and switching method thereof
JP5198062B2 (en) System and method for detecting power failure conditions
US8952671B2 (en) Power supply apparatus for testing apparatus
JP2011232171A (en) Failure detection method and battery monitoring device for battery management system of secondary battery
JP4418197B2 (en) Overvoltage protection circuit and power supply, power supply system and electronic device using the same
US8717032B2 (en) Electronic circuit breaker and a method of providing protection switching
US7280333B2 (en) Method and device for short circuit or open load detection
JP2008524597A (en) Circuit and method for detecting voltage in a device under test using a parametric measurement unit
JP4401973B2 (en) Information processing apparatus, failure monitoring method, failure monitoring program
JP4024874B2 (en) High pressure conversion station monitoring device
US7148695B2 (en) Fault detection system
CN109696940B (en) Method for identifying faults at the output of a device and system thereof
US6157308A (en) Detecting hidden faults in reliable power systems
US10173627B2 (en) Method and device for maintaining an actuator for an airbag control device
CN109696615B (en) Method for identifying faults at the output of a device and system therefor
CN108780118B (en) Device with at least two redundant analog input units for measuring current
JP6094070B2 (en) System and method for identifying abnormal points
JP7196700B2 (en) power system
CN113535618A (en) System and method for fault detection and reporting through a serial interface transceiver
CN112666379A (en) On-line power supply current monitoring
KR100703927B1 (en) Apparatus for monitoring obstacle of dual blocking diode

Legal Events

Date Code Title Description
AS Assignment

Owner name: SUN MICROSYSTEMS, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:GARNETT, PAUL JEFFREY;SUN MICROSYSTEMS LIMITED;REEL/FRAME:014564/0822

Effective date: 20021218

STCF Information on status: patent grant

Free format text: PATENTED CASE

FPAY Fee payment

Year of fee payment: 4

FPAY Fee payment

Year of fee payment: 8

AS Assignment

Owner name: ORACLE AMERICA, INC., CALIFORNIA

Free format text: MERGER AND CHANGE OF NAME;ASSIGNORS:ORACLE USA, INC.;SUN MICROSYSTEMS, INC.;ORACLE AMERICA, INC.;REEL/FRAME:037280/0159

Effective date: 20100212

FPAY Fee payment

Year of fee payment: 12