WO2002078250A2 - High availability packet forwarding apparatus and method - Google Patents


Publication number
WO2002078250A2
Authority
WO
WIPO (PCT)
Prior art keywords
service
control processor
fib
control
control processors
Prior art date
Application number
PCT/CA2002/000424
Other languages
French (fr)
Other versions
WO2002078250A3 (en)
Inventor
Scott S. Pegrum
Matthew M. Yuen
Nabila Ould-Brahim
Original Assignee
Nortel Networks Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nortel Networks Limited
Priority to EP02716569A priority Critical patent/EP1374500A2/en
Priority to CA002441470A priority patent/CA2441470A1/en
Publication of WO2002078250A2 publication Critical patent/WO2002078250A2/en
Publication of WO2002078250A3 publication Critical patent/WO2002078250A3/en

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00 Routing or path finding of packets in data switching networks
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00 Data switching networks
    • H04L12/28 Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/46 Interconnection of networks
    • H04L12/4604 LAN interconnection over a backbone network, e.g. Internet, Frame Relay
    • H04L12/462 LAN interconnection over a bridge based backbone
    • H04L12/4625 Single bridge functionality, e.g. connection of two networks over a single bridge
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00 Routing or path finding of packets in data switching networks
    • H04L45/24 Multipath
    • H04L45/247 Multipath using M:N active or standby paths
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00 Routing or path finding of packets in data switching networks
    • H04L45/50 Routing or path finding of packets in data switching networks using label swapping, e.g. multi-protocol label switch [MPLS]
    • H04L45/502 Frame based
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L45/00 Routing or path finding of packets in data switching networks
    • H04L45/58 Association of routers
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00 Packet switching elements
    • H04L49/60 Software-defined switches
    • H04L49/602 Multilayer or multiprotocol switching, e.g. IP switching
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L49/00 Packet switching elements
    • H04L49/20 Support for services
    • H04L49/201 Multicast operation; Broadcast operation

Abstract

A high availability packet forwarding router (102) for an internet protocol (IP) network includes two control processors (104, 106), one or more service termination cards (STCs) (112) with forwarding information bases (FIBs) (108, 110), and a packet forwarding engine (214). The two processors run asynchronously in a master/standby relationship. The integrity of processes running on the control processors is monitored, and the forwarding engine forwards packets according to a FIB maintained by an in-service one of the control processors. Hitless failover and hitless software upgrades are supported.

Description

HIGH AVAILABILITY PACKET FORWARDING APPARATUS AND METHOD
TECHNICAL FIELD
The present invention relates in general to routers and packet forwarding engines and, in particular, to an apparatus and method that provides high packet forwarding availability.
BACKGROUND OF THE INVENTION
Existing router architectures and routing protocols lack certain desirable features. For the purposes of this discussion, router architectures and routing protocols include bridging spanning tree protocols (STPs) as well as routing protocols such as Open Shortest Path First (OSPF) and BGP-4 (Border Gateway Protocol version 4).
OSPF is a link-state routing protocol. It is designed to be run internal to a single Autonomous System (AS). Each OSPF router maintains an identical database describing the AS's topology. From this database, a routing table is calculated by constructing a shortest-path tree. OSPF recalculates routes quickly in response to topological changes, utilizing a minimum of routing protocol traffic. OSPF provides support for equal-cost multipath. An area routing capability is also provided, enabling an additional level of routing protection and a reduction in routing protocol traffic. In addition, all OSPF routing protocol exchanges are authenticated.
BGP-4 is an inter-Autonomous System routing protocol. The primary function of a BGP-4 enabled system is to exchange network reachability information with other BGP-4 systems. The network reachability information includes information about a list of ASs that reachability information traverses. The reachability information is sufficient to construct a graph of AS connectivity from which routing loops may be pruned and certain policy decisions at the AS level may be enforced. BGP-4 also provides a new set of mechanisms for supporting classless inter-domain routing. These mechanisms include support for advertising an Internet Protocol (IP) prefix and eliminate the concept of network class within BGP. BGP-4 also introduces mechanisms that allow aggregation of routes, including aggregation of AS paths. To characterize the set of policy decisions that can be enforced using BGP, one must focus on the rule that a BGP-4 speaker advertises to its peers (other BGP-4 speakers with which it communicates) in neighboring ASs only those routes that it uses itself. This rule reflects the "hop-by-hop" routing paradigm generally used throughout the current Internet.
It should be noted that some policies cannot be enforced by the "hop-by-hop" routing paradigm, and thus require methods such as source routing. For example, BGP-4 does not enable one AS to send traffic to a neighboring AS with the intention that the traffic take a different route from that taken by traffic originating in the neighboring AS. On the other hand, BGP-4 can support any policy conforming to the "hop-by-hop" routing paradigm. Since the current Internet only uses the "hop-by-hop" routing paradigm, and since BGP-4 can support any policy that conforms to that paradigm, BGP-4 is highly applicable as an inter-AS routing protocol for the current Internet.
L3 (layer 3 of the open system interconnection model) routing and bridging protocols were not designed to easily permit dual or synchronous standby architectures within routing switches to provide high availability. Typically, high availability for packet forwarding equipment is achieved through physical duplication of switches. Physical duplication has a high cost due to increased footprint, ongoing management, and cabling costs. It is therefore advantageous to be able to provide a highly reliable and available solution that minimizes these costs. Furthermore, physical duplication generally fails to address the most common point of failure in modern packet forwarding equipment, namely software crashes caused by errors in program code. Due to the increasing complexity and feature support in modern packet forwarding software, it is difficult to provide software loads that are completely error free. Current packet forwarding systems, however, fail to adequately address detection and failover when a software fault occurs.
High availability for packet forwarding requires a number of features, including: 1) the ability to perform hitless software upgrades; 2) the ability to provide hitless control path failover due to either software or hardware faults; 3) the ability to provide hitless line card failover; 4) the ability to provide hitless path failover; and 5) other features, including synchronization of Routing/Bridging states using database synchronization, which is difficult to provide due to the large amount of state information required to maintain synchronization. Currently, packet forwarding technology does not support hitless software upgrade or failover, much less the other desirable features listed above.
It therefore remains highly desirable to provide a means of achieving a high level of packet forwarding availability at a reasonable cost and complexity.
OBJECTS OF THE INVENTION
It is therefore an object of the invention to provide an apparatus and method for high availability packet forwarding that permits hitless software upgrades and software or hardware failover.
It is a further object of the invention to provide a means of achieving a high level of packet forwarding availability at a reasonable cost and complexity.
It is yet a further object of the invention to provide a packet forwarding engine having first and second control processors that respectively and asynchronously run a plurality of packet receiving and forwarding processes.
It is yet a further object of the invention to provide first and second forwarding information bases (FIBs) that respectively store forwarding information maintained by the respective first and second control processors.
It is yet another object of the invention to provide a packet forwarding engine which includes service termination cards that forward packets in accordance with one of the first and second FIBs, depending on an integrity of the processes running on the respective first and second control processors.
It is yet another object of the invention to provide a method of monitoring selected processes executed by the first and second control processors to ensure that both hardware and software faults are detected as they occur, and that failover processes are initiated in a timely manner.
SUMMARY OF THE INVENTION
Accordingly, the invention provides an apparatus (102) for providing high availability packet forwarding, comprising a service termination card (112) having a packet forwarding engine (214) for receiving and forwarding packets in accordance with a forwarding information base (FIB); a first control processor (104) running a plurality of processes (404) and communicatively coupled to the service termination card; a first forwarding information base on the service termination card having forwarding information maintained by the first control processor, CHARACTERIZED by:
a second control processor (106) running a plurality of processes asynchronously with respect to the first control processor, the second control processor being communicatively coupled to the service termination card;
a second FIB (110) on the service termination card having forwarding information maintained by the second control processor; and
means (230) for permitting the packet forwarding engine to forward packets in accordance with one of the first and second forwarding information bases, depending on an integrity of the processes running on the respective first and second control processors.
According to an aspect of the invention there is provided a method of ensuring high availability in a packet forwarding process, CHARACTERIZED by operating first and second control processors independently and asynchronously to generate first and second FIBs. The FIBs are respectively provided to a service termination card. The service termination card operates to forward packets using information from one of the FIBs depending on an integrity of selected processes running on the respective first and second control processors.
In accordance with an embodiment of the invention, the integrity of the processes run by the first and second control processors is determined by monitoring selected processes executed by the first and second control processors. This ensures that both hardware and software faults are detected as they occur, and that failover processes are initiated in a timely manner.
In accordance with an aspect of the invention, full bandwidth utilization is ensured during failover by a bandwidth manager, which releases bandwidth reserved by a failed control processor. The released bandwidth can then be utilized by the operating control processor.
In accordance with another aspect of the invention, line protection is provided for core MPLS traffic by diversely set up label switched paths through the two control processors. A FIB manager controls MPLS FIBs so that primary label switched paths (LSPs) and secondary LSPs are generated by different control processors in each of the first and second FIBs.
In accordance with a further aspect of the invention, there is provided
BRIEF DESCRIPTION OF THE DRAWINGS
Further features and advantages of the present invention will become apparent from the following detailed description, taken in combination with the appended drawings, in which:
FIG. 1 is a schematic diagram of a computer network including an apparatus in accordance with the invention;
FIG. 2 is a block diagram of a service termination card shown in FIG. 1;
FIG. 3 is a block diagram of a heartbeat monitor shown in FIG. 2; and
FIG. 4 is a block diagram of a control processor shown in FIG. 1.
It will be noted that throughout the appended drawings, like features are identified by like reference numerals.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
The invention provides an apparatus and method for ensuring high availability for packet forwarding in a packet network. The apparatus has dual control processors that operate asynchronously in parallel to compute separate forwarding information bases (FIBs) selectively used by service termination cards (STCs) for packet forwarding. During normal operation, the STCs use master control processor FIBs for packet forwarding. If integrity of the master control processor is lost, the STCs switch to the FIBs of the alternate control processor. Control processor integrity is determined by the STCs, which send heartbeat query messages to selected software processes running on each control processor. This ensures rapid detection of software and hardware faults in a control processor to improve availability.
FIG. 1 is a schematic diagram of a computer network 100 that includes a router 102 in accordance with the invention. The router 102 includes a first control processor (CP0) 104 and a second control processor (CP1) 106. Each control processor 104,106 can function as a master or a standby control processor. Each control processor 104,106 creates and maintains a respective forwarding information base (FIB0, FIB1) 108,110. Each control processor 104,106 is communicatively connected to one or more service termination cards (STCs) 112 by communications busses 109 and 111, respectively. Each of the STCs 112 is connected by links 114 via an NNI (network to network interface) 115 to a network core 116. The STCs 112 are also connected at 118 to respective I/O interfaces 120, each of which has a respective Ethernet connection 122 available. The control processors 104,106 are both communicatively connected to an operations, administration and management (OAM) workstation 124.
FIG. 2 is a block diagram 200 of an STC 112 shown in FIG. 1. The STC 112 includes a lookup memory 204 that stores the FIB0 108, which includes an internet protocol forwarding information base (IP FIB0) 206 and a first multi-protocol label switching (MPLS) FIB of primary and backup label switched paths (LSPs) (MPLS FIB0) 210. The lookup memory 204 also stores the FIB1 110, which includes an IP FIB1 208 and an MPLS FIB1 212. The STC 112 further includes a packet forwarding engine 214 that is communicatively coupled at 217A to the FIB0 108 and coupled at 217B to the FIB1 110. The STC 112 has a heartbeat monitor 220 communicatively coupled at 222 to the CP0 104 and CP1 106 shown in FIG. 1. The function of the heartbeat monitor 220 will be described below with reference to FIGs. 3 and 4.
Each control processor 104,106 runs all relevant routing protocol applications independently and produces the relevant tables required for packet forwarding. The forwarding information bases (FIB0 108 and FIB1 110) derived from the respective control processors 104,106 are distributed to all STCs 112 regardless of their interface association. It should be noted that the respective FIBs are not necessarily identical, but are expected to contain the same reachability information. The next hop information will most likely not be the same, however. The packet forwarding engine 214 selects a set of FIBs to use at run-time, typically based on control processor association information.
In accordance with an embodiment of the invention, a FIB manager 230 receives FIB information from the respective control processors 104,106 via busses 109,111 and writes the FIB information in the respective FIB0 108 and FIB1 110. The FIB manager 230 is preferably programmed to write the MPLS FIBs so that the primary LSPs of FIB0 are created and maintained by control processor 104, while the backup LSPs of FIB0 are created and maintained by control processor 106. In FIB1, the primary LSPs are created and maintained by control processor 106, while the backup LSPs are written by control processor 104.
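The cross-written arrangement of primary and backup LSPs described above can be illustrated with a minimal sketch. It is not the FIB manager 230 itself: the class names `MplsFib` and `FibManager`, the `write_lsp` method, the label value and the next-hop strings are all hypothetical, and control processors 104 and 106 are represented simply as indices 0 and 1.

```python
from dataclasses import dataclass, field


@dataclass
class MplsFib:
    """One MPLS FIB with primary and backup LSP entries keyed by label (illustrative)."""
    primary: dict = field(default_factory=dict)
    backup: dict = field(default_factory=dict)


class FibManager:
    """Hypothetical model of a FIB manager writing MPLS FIB0 and MPLS FIB1."""

    def __init__(self) -> None:
        self.fibs = {0: MplsFib(), 1: MplsFib()}

    def write_lsp(self, source_cp: int, label: int, next_hop: str) -> None:
        # An LSP produced by control processor n is the primary entry in FIBn ...
        self.fibs[source_cp].primary[label] = next_hop
        # ... and the backup entry in the other FIB.
        self.fibs[1 - source_cp].backup[label] = next_hop


mgr = FibManager()
mgr.write_lsp(0, 100, "nni-a")  # LSP computed by CP0 (hypothetical values)
mgr.write_lsp(1, 100, "nni-b")  # LSP computed by CP1 (hypothetical values)
```

With both control processors contributing, each FIB holds a primary path from its own control processor and a backup path from the other, so either FIB alone carries a diversely set up pair.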
Consequently, on transit core network traffic, diversely set up LSPs through the control processors 104,106 permit both line and equipment protection to be achieved in a single router 102 in accordance with the invention.
During a control processor reset, or a software upgrade that causes a control processor 104,106 to go out-of-service, the STCs 112 are informed in a timely manner and switch to use the set of FIBs of the remaining active control processor. Multicast IP packet services, multi-protocol label switching (MPLS), bridging, and various media related protocols use a hot-standby control processor model.
In accordance with one embodiment of the invention, full bandwidth utilization is ensured by a bandwidth manager 240. The bandwidth manager 240 accepts bandwidth allocation requests from the respective control processors 104,106 via busses 109,111. The bandwidth manager allocates bandwidth to the respective control processors 104,106 as required and updates the appropriate FIB information using bus 244 to write to lookup memory 204. However, if one of the control processors is out-of-service, the bandwidth manager is advised of the control processor's condition by the heartbeat monitor 220, which sends an appropriate signal over connection 242. On being advised that a control processor is out-of-service, the bandwidth manager releases all bandwidth allocated to the out-of-service control processor, so that it is available to the in-service control processor, which can allocate the bandwidth as required. This permits more efficient failover engineering in the core networks.
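The release of reserved bandwidth on failover can be sketched as follows. This is an illustrative model only: the class `BandwidthManager`, its method names, the capacity figure and the use of Mb/s as a unit are assumptions and do not come from the description, which does not specify how allocations are accounted.

```python
class BandwidthManager:
    """Hypothetical accounting model for a bandwidth manager shared by two control processors."""

    def __init__(self, capacity_mbps: int) -> None:
        self.capacity = capacity_mbps
        self.allocated = {0: 0, 1: 0}  # Mb/s reserved per control processor index

    def available(self) -> int:
        return self.capacity - sum(self.allocated.values())

    def request(self, cp: int, mbps: int) -> bool:
        # Grant the request only if unreserved capacity remains.
        if mbps > self.available():
            return False
        self.allocated[cp] += mbps
        return True

    def on_out_of_service(self, cp: int) -> int:
        # Called when the heartbeat monitor reports the control processor out-of-service:
        # everything it had reserved becomes available to the surviving control processor.
        released = self.allocated[cp]
        self.allocated[cp] = 0
        return released


bm = BandwidthManager(capacity_mbps=1000)
bm.request(0, 600)
refused = bm.request(1, 600)          # exceeds what is left while CP0 holds 600
released = bm.on_out_of_service(0)    # CP0 declared out-of-service
granted = bm.request(1, 600)          # now fits
```

In this sketch the second request fails while both control processors hold reservations and succeeds once the failed processor's share is released, which is the behaviour the paragraph above attributes to the bandwidth manager.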
The monitoring of a control processor to determine whether it is in-service is performed by monitoring critical software processes that it runs. The monitoring of an integrity of critical processes running on a control processor 104,106 is described with reference to FIGs. 3 and 4, and is performed by a heartbeat monitor 220. Integrity of a control processor is defined as being "in-service" or "out-of-service". The heartbeat monitor 220 includes tables of critical processes 304 that contain lists of selected processes 404 that run on the respective control processors 104,106. The tables are referenced by a heartbeat inquiry generator 306, which generates heartbeat inquiries 306A, 306B, ... 306C for respectively monitoring the integrity of processes 404A, 404B, ... 404C running on the control processors 104,106. The heartbeat monitor 220 sequentially generates and transmits the heartbeat inquiries 306A, 306B, 306C to corresponding processes 404A, 404B, 404C (FIG. 4). If each process 404A, 404B, 404C returns a heartbeat response 308A, 308B, ... 308C within a predetermined period of time, the integrity of each process is declared "in-service". If any of the processes 404A, 404B, 404C fails to return a heartbeat response 308A, 308B, 308C within the predetermined period of time, the integrity of that process 404A, 404B, 404C and the processor 104,106 that runs it is declared to be "out-of-service".
The invention supports a hitless control processor reset and also supports hitless software upgrades, as long as the packet forwarding engine 214 on STCs 112 does not require reset.
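A minimal sketch of the heartbeat check described above is given below. It assumes each monitored process is reachable through a probe callable that returns the response time in seconds, or `None` when no response arrives; the `HeartbeatMonitor` class, the `Probe` type, the process names and the timeout value are hypothetical and stand in for the inquiry and response messages.

```python
from typing import Callable, Dict, Optional

# A probe sends one heartbeat inquiry and returns the response time in seconds,
# or None if the process never answered (hypothetical interface).
Probe = Callable[[], Optional[float]]


class HeartbeatMonitor:
    """Illustrative model: one table of critical processes per control processor."""

    def __init__(self, tables: Dict[int, Dict[str, Probe]], timeout_s: float) -> None:
        self.tables = tables
        self.timeout_s = timeout_s

    def check(self, cp: int) -> str:
        # Inquiries are sent sequentially; a single late or missing response
        # is enough to declare the whole control processor out-of-service.
        for _name, probe in self.tables[cp].items():
            elapsed = probe()
            if elapsed is None or elapsed > self.timeout_s:
                return "out-of-service"
        return "in-service"


monitor = HeartbeatMonitor(
    tables={
        0: {"ospf": lambda: 0.01, "bgp": lambda: 0.02},   # all processes answer in time
        1: {"ospf": lambda: 0.01, "bgp": lambda: None},   # one process is silent
    },
    timeout_s=0.5,
)
states = {cp: monitor.check(cp) for cp in (0, 1)}
```

Probing individual processes rather than the processor as a whole is what lets this style of check catch a software fault on otherwise healthy hardware.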
It should further be understood that the assignment of routed interfaces is done at the physical or logical interface level. Therefore, given an STC 112, it is not necessary for all interfaces 224 to be mastered by the same control processor. It should also be noted that a mix of routed and stub routed interfaces to the STCs 112 is permissible.
All packet services run on both control processors 104,106 at a steady state. This includes both unicast and multicast IP protocols, and all MPLS signaling protocols such as RSVP (Resource Reservation Protocol) and LDP (Label Distribution Protocol). In general, the control processors are unaware of each other. There are exceptions to this rule, however. First, all local interface (host) routes and subnet routes are injected into the IP forwarding table of both control processors. This exception applies to both UNI 119 and NNI 115 interfaces. Second, for services other than IP unicast, MPLS and RSVP (i.e. services that must be kept synchronized), software on each control processor 104,106 must be aware of its current state, being a master or a standby, and behave accordingly.
At steady state, core-to-core traffic is routed between routed interfaces of the same control processor. Customer-to-core traffic may be distributed between routed interfaces of both control processors 104,106. Core-to-customer traffic is distributed on ingress between routed interfaces of both control processors. During a control processor reset or software upgrade, the STC 112 is notified that one of the control processors has become unavailable. The STC 112 updates all the logical interfaces to associate with the remaining control processor as the master, which effectively instructs the packet forwarding engine 214 to forward all traffic related to the forwarding table of the remaining active control processor. Note that logic of the packet forwarding engine 214 does not change, and theoretically there should be no packet loss as a result of control processor reset or software upgrade, as long as the packet forwarding engine 214 is not reset. When one control processor is unavailable, core-to-core traffic is routed from any routed interface to the routed interface of the remaining control processor. Customer-to-core traffic is routed towards a routed interface of the remaining control processor.
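The re-association of logical interfaces with the remaining control processor can be sketched as a small table update. The class `ServiceTerminationCard`, its methods and the interface names are hypothetical; the sketch only shows that the forwarding lookup itself is unchanged and that failover amounts to changing which control processor each interface points at.

```python
class ServiceTerminationCard:
    """Illustrative model of per-interface control processor mastership on an STC."""

    def __init__(self, interface_masters: dict) -> None:
        # interface name -> index of the control processor that masters it
        self.configured = dict(interface_masters)  # original assignment, kept for later restore
        self.active = dict(interface_masters)      # assignment currently used for forwarding

    def cp_unavailable(self, failed_cp: int) -> None:
        survivor = 1 - failed_cp
        # Re-associate every interface mastered by the failed control processor.
        for ifname, cp in self.active.items():
            if cp == failed_cp:
                self.active[ifname] = survivor

    def fib_for(self, ifname: str) -> str:
        # The forwarding engine logic does not change; only the association does.
        return f"FIB{self.active[ifname]}"


stc = ServiceTerminationCard({"uni-1": 0, "nni-1": 1})  # hypothetical interfaces
before = stc.fib_for("uni-1")
stc.cp_unavailable(0)
after = stc.fib_for("uni-1")
```

Keeping the configured assignment separate from the active one is a design choice of this sketch; it makes it straightforward to return to the original configuration once the failed control processor has recovered.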
IP multicast forwarding tables are downloaded from the master control processor to the STCs 112. As there is only a single copy of the IP multicast forwarding table on the STC, no decision is required to select which table to use, as is the case with unicast traffic. During a control processor reset or software upgrade, the STCs 112 are notified that a control processor has become unavailable. Initially, the STC 112 will do nothing, but when one of the following two conditions is met, the FIB manager 230 on the STC 112 will erase all the original content of the forwarding information base. The first condition is met if a timeout has expired. The second condition is met if the new master control processor has re-learnt and distributed the same routes that the FIB manager 230 has already installed. IP multicast therefore continues to use the original routes after the master control processor becomes unavailable, either for a period of time or until the same set of routes is re-learnt by the new master control processor. A network topology change during control processor switch-over potentially causes packet loss.
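The two conditions for erasing the original multicast forwarding content can be expressed as a single predicate. The function name, the representation of routes as (source, group) tuples and the timeout value are assumptions; in particular, this sketch interprets "the same routes" as the new master having re-learnt every route that is currently installed.

```python
def should_flush_multicast_fib(installed_routes, relearnt_routes, elapsed_s, timeout_s):
    """Return True when the original multicast entries may be erased (illustrative)."""
    # Condition 1: the hold timer has expired.
    timed_out = elapsed_s >= timeout_s
    # Condition 2 (assumed interpretation): every installed route has been
    # re-learnt and distributed by the new master control processor.
    relearnt = set(installed_routes) <= set(relearnt_routes)
    return timed_out or relearnt


installed = [("10.0.0.1", "239.1.1.1"), ("10.0.0.2", "239.1.1.2")]  # hypothetical routes
partial = installed[:1]

keep_old = should_flush_multicast_fib(installed, partial, elapsed_s=5, timeout_s=60)
flush_on_relearn = should_flush_multicast_fib(installed, installed, elapsed_s=5, timeout_s=60)
flush_on_timeout = should_flush_multicast_fib(installed, partial, elapsed_s=60, timeout_s=60)
```

Until one of the two conditions holds, the predicate is false and forwarding continues on the original entries.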
When a control processor 104 (for the purpose of the following discussion designated CPx) becomes unavailable due to a software crash, hardware fault, or for any other reason except a software upgrade (which is discussed below in some detail), all STCs 112 and the other control processor (designated CPy in this example) are notified. The packet forwarding engine 214 adjusts the control processor mastership of all affected interfaces to associate with the remaining control processor CPy. The packet forwarding engine 214 adjusts all UNI Layer 1 and Layer 2 physical and logical port records to associate with the remaining control processor. The packet forwarding engine 214 also adjusts a control processor selection algorithm for any Layer 3 static (stub) interface (either NNI or UNI). CPy operates as if nothing has happened. CPy still advertises its own routes, along with all the local routes of CPx.
Routing peers of CPx stop hearing from CPx for the duration of the reset. Taking OSPF (Open Shortest Path First) as an example, it will take three times a single hello time (30 seconds in total) for a peer to detect that CPx is unavailable. Normally, CPx will recover in much less than 30 seconds during a warm restart. Hence, any immediate network wide route flap is minimized. Of course, in the event of a persistent hardware or software failure, routing peers will eventually detect the problem and route around all of the routed interfaces of CPx. After CPx recovers, all routing protocol stacks on CPx restart. CPx then re-establishes peering with its original routing peers. Route flap will occur during this stage for a short period of time. CPx then continues to converge and update its forwarding information bases on STCs 112.
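The detection-window reasoning in the paragraph above can be checked with a few lines of arithmetic. The function name and the 10-second hello interval are assumptions chosen so that three missed hellos give the 30-second figure quoted in the text; the recovery times are illustrative values, not measurements.

```python
def peers_notice_reset(hello_interval_s: float, missed_hellos: int, recovery_s: float) -> bool:
    """Return True if routing peers would declare CPx down before it recovers (illustrative)."""
    detection_s = hello_interval_s * missed_hellos
    return recovery_s >= detection_s


# Assumed 10 s hello interval, three missed hellos: a 30 s detection window.
warm_restart_noticed = peers_notice_reset(10, 3, recovery_s=8)
persistent_failure_noticed = peers_notice_reset(10, 3, recovery_s=120)
```

A warm restart that completes inside the window goes unnoticed by the peers, whereas a persistent failure is eventually detected and routed around.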
After a predefined period of time, STCs 112 are notified that the CPx forwarding tables are now ready to be used. The packet forwarding engine 214, control processor mastership of all applicable interfaces, and control processor selection algorithm are reset to their original configuration. The delayed notification of CPx availability to STC 112 is intended to minimize routing loops in the carrier core network while CPx is converging.
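The delayed notification can be modelled as a hold-down timer on the recovered control processor. The function `active_master`, its parameters and the 60-second hold-down are hypothetical; the description only states that the period is predefined.

```python
from typing import Optional


def active_master(configured_cp: int, survivor_cp: int,
                  recovered_at_s: Optional[float], now_s: float,
                  hold_down_s: float) -> int:
    """Return the control processor whose FIB an interface should use (illustrative)."""
    # Stay on the surviving control processor until the recovered one has had
    # hold_down_s seconds to converge, then return to the original mastership.
    if recovered_at_s is not None and now_s - recovered_at_s >= hold_down_s:
        return configured_cp
    return survivor_cp


still_down = active_master(0, 1, recovered_at_s=None, now_s=50.0, hold_down_s=60.0)
converging = active_master(0, 1, recovered_at_s=10.0, now_s=50.0, hold_down_s=60.0)
restored = active_master(0, 1, recovered_at_s=10.0, now_s=70.0, hold_down_s=60.0)
```

Holding back the switch while the recovered processor is still converging is what keeps partially built forwarding tables from being used for traffic.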
Assumptions relevant to the control processor reset discussion above include: 1) The STCs 112 rely on at least one control processor being operational; and 2) Logical interface configuration and operational status are propagated to both control processors 104,106, regardless of control processor mastership of the related interface.
A discussion of how software upgrades are conducted is also relevant to the operation and maintenance of the apparatus in accordance with the invention. One way of upgrading a general purpose CPU software load requires an upgrade of software on the CP 104,106 as well as on STC 112. Note that other options for performing software upgrades may exist without departing from the scope of the present invention. Further, although network processor software upgrades may be impacted, they are not described.
The invention provides a method of performing hitless software upgrades. For example, if a control processor 104 (CPx) is to be upgraded, the CPx is taken out-of-service, but CPy 106 and STC 112 behave exactly as they do in the CP reset scenario described above. The CPx is then reloaded with the new software load and starts running after a reboot. From the perspective of the CPx, all interfaces are down, even though they are still forwarding ingress traffic using CPy's forwarding information bases.
Each STC 112 is respectively reloaded with the new software version, and restarted. The packet forwarding engine 214 is still forwarding traffic based on CPy's forwarding table. While the STC CPUs are restarting, local services such as loopback detection and MAC address learning are unavailable.
Following reboot after the new software load, CPx enables its interfaces and establishes peering with neighboring routers. The CPx then downloads its forwarding information bases to the STCs 112. After a predefined period of time, the STCs 112 switch back to using forwarding information bases of the CPx. CPy is subsequently reloaded with the new software version and reboots to start running. CPy then establishes peering with neighboring routers. CPy downloads its forwarding information bases to STCs 112. After running protocol convergence and sanity checks, the STCs 112 switch to using the FIB of CPx, and the software upgrade is complete.
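The upgrade sequence described above can be walked through as a small simulation that checks, at every step, that traffic is being forwarded from the FIB of an in-service control processor. The function `rolling_upgrade`, the step labels and the version strings are hypothetical, and the sketch compresses peering, convergence and the predefined waiting periods into single steps.

```python
def rolling_upgrade(new_version: str):
    """Simulate the CPx, STC, CPy upgrade order and record which FIB carries traffic."""
    cps = {name: {"version": "old", "in_service": True} for name in ("CPx", "CPy")}
    stc = {"version": "old", "active_fib": "CPx"}
    trace = []

    def record(step: str) -> None:
        # Hitless invariant: forwarding always uses the FIB of an in-service control processor.
        assert cps[stc["active_fib"]]["in_service"], f"traffic hit at: {step}"
        trace.append((step, stc["active_fib"]))

    stc["active_fib"] = "CPy"            # STCs fall back to CPy's FIBs
    cps["CPx"]["in_service"] = False
    record("CPx taken out-of-service")

    cps["CPx"]["version"] = new_version  # CPx reloaded and rebooted
    record("CPx reloaded")

    stc["version"] = new_version         # STC CPUs reloaded; forwarding engine not reset
    record("STC CPUs reloaded")

    cps["CPx"]["in_service"] = True      # CPx has peered and downloaded its FIBs
    stc["active_fib"] = "CPx"
    record("STCs switch back to CPx")

    cps["CPy"]["in_service"] = False
    cps["CPy"]["version"] = new_version
    record("CPy reloaded")

    cps["CPy"]["in_service"] = True      # CPy has converged; upgrade complete
    record("upgrade complete")

    versions = {stc["version"], cps["CPx"]["version"], cps["CPy"]["version"]}
    return trace, versions


trace, versions = rolling_upgrade("v2")
```

The internal assertion would fail if any step left the STCs pointing at an out-of-service control processor, which is the property that the staggered ordering is meant to preserve.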
The invention therefore provides an apparatus and method for high availability packet processing that permits hitless software upgrades and hitless software and hardware fault failover.
While the preferred embodiments of the invention were described in specific terms, it should be noted that alternative network structures can be similarly utilized within the inventive concepts without straying from the intended scope of the invention. Persons skilled in the art will appreciate that there are other alternative implementations and modifications for implementing the present invention, and that the above implementation is only an illustration of one embodiment of the invention. Accordingly, the scope of the invention is intended only to be limited by the claims included herein.

Claims

CLAIMS:
1. An apparatus (102) for providing high availability packet forwarding, comprising a service termination card (112) having a packet forwarding engine (214) for receiving and forwarding packets in accordance with a forwarding information base (FIB); a first control processor (104) running a plurality of processes (404) and communicatively coupled to the service termination card; a first forwarding information base on the service termination card having forwarding information maintained by the first control processor, CHARACTERIZED by: a second control processor (106) running a plurality of processes asynchronously with respect to the first control processor, the second control processor being communicatively coupled to the service termination card; a second FIB (110) on the service termination card having forwarding information maintained by the second control processor; and means (230) for permitting the packet forwarding engine to forward packets in accordance with one of the first and second forwarding information bases, depending on an integrity of the processes running on the respective first and second control processors.
2. An apparatus as claimed in claim 1 wherein the service termination card (112) further comprises a heartbeat monitor (220) for determining the integrity of the processes (304a-c) running on the first and second control processors (104,106).
3. An apparatus as claimed in claim 2 wherein the heartbeat monitor (220) comprises a table (Table 1) listing the selected processes (404) running on the first control processor (104) and a table (Table 2) listing the selected processes running on the second control processor (106).
4. An apparatus as claimed in claim 3 wherein the heartbeat monitor (220) is adapted to send heartbeat inquiry messages (306A-C) to the processes (404) listed in the respective tables (Tables 1,2), and further adapted to conditionally receive heartbeat response messages (308A-C) from the processes, in accordance with the integrity of the respective processes.
5. An apparatus as claimed in any preceding claim further comprising an input/output interface through which the first and second control processors (104,106) receive protocol data units (PDUs) providing information for maintaining the respective first and second forwarding information bases (108,110).
6. An apparatus as claimed in any preceding claim further comprising an operations and management workstation (124) connected to the respective first and second control processors (104,106).
7. An apparatus as claimed in any preceding claim wherein the first and second forwarding information bases (108,110) on the service termination card (112) respectively comprise an Internet protocol forwarding information base.
8. An apparatus as claimed in any preceding claim wherein the first and second forwarding information bases (108,110) on the service termination card (112) respectively comprise a multi-protocol label switching forwarding information base.
9. An apparatus as claimed in any preceding claim further comprising a forwarding information base (FIB) manager (230) that receives FIB information from the first and second control processors (104,106) and stores the FIB information in a memory (204) of the service termination card.
10. An apparatus as claimed in claim 9 wherein the FIB manager stores primary and backup label switched paths (LSPs) (210,212) in each of the first and second FIBs (206,208) so that the primary LSPs in the first FIB are created and maintained by the first control processor (104) and the backup LSPs in the first FIB are created and maintained by the second control processor (106), while the FIB manager stores the primary and secondary LSPs in a reverse order in the second FIB, to provide line protection for label switched paths.
11. An apparatus as claimed in any preceding claim further comprising a bandwidth manager (240) for controlling reservation of local input/output bandwidth between the first and second control processors (104,106).
12. An apparatus as claimed in claim 11 wherein the bandwidth manager (240) is communicatively connected to a heartbeat monitor (220) that monitors an integrity of processes (404) running on the first and second control processors (104,106), and informs the bandwidth manager if one of the control processors is declared out-of-service.
13. An apparatus as claimed in claim 12 wherein the bandwidth manager (240) is adapted to release bandwidth allocated to the out-of-service control processor so that the bandwidth can be utilized by the in-service control processor.
14. An apparatus as claimed in any preceding claim wherein the first and second control processors (104,106) are respectively adapted to advertise all local interfaces, so that reachability is maintained in a core network in an event that one of the control processors becomes out-of-service.
15. A method of providing high availability in a packet forwarding process, CHARACTERIZED by: operating first and second control processors (104,106) independently and asynchronously to generate and maintain first and second forwarding information bases (FIBs) (108,110) respectively provided to a service termination card (112); and operating the service termination card to forward packets using information from one of the FIBs depending on an integrity of selected processes (404) running on the respective first and second control processors.
16. A method as claimed in claim 15 further comprising a step of dynamically determining an integrity of the selected processes (404) running on the respective first and second control processors (104,106).
17. A method as claimed in claim 16 wherein the step of dynamically determining comprises a step of sending heartbeat inquiry messages (306A-C) to each of the selected processes (404) on the respective first and second control processors (104,106).
18. A method as claimed in claim 17 wherein the step of sending further comprises a step of sending the heartbeat inquiry messages (306A-C) from a heartbeat monitor (220) that is operative on the service termination card (112).
19. A method as claimed in claim 18 further comprising a step of receiving heartbeat response messages (308A-C) from the respective selected processes (404) run by the respective first and second control processors (104,106).
20. A method as claimed in claim 19 further comprising a step of declaring a one of the control processors (104,106) out-of-service if a one of the processes (404) running on the one of the control processors fails to return a heartbeat response message (308A-C) within a predetermined period of time.
21. A method as claimed in claim 18 further comprising a step of switching to a second FIB (110) if information in the first FIB (108) is being maintained by the control processor (104,106) declared out-of-service.
22. A method as claimed in claim 21 further comprising a step of switching back to the first FIB (108) if the control processor (104) that maintains the forwarding information in the first FIB is declared to be in-service.
23. A method as claimed in claim 22 wherein the step of switching back is delayed for a predefined period of time to minimize routing loops in a carrier core network while the first control processor (104) is converging.
24. A method as claimed in any one of claims 15-23 further comprising a step of installing a new software load on one of the control processors (104,106) by taking the control processor out-of-service, and permitting the other control processor to continue to maintain forwarding information bases (108,110) used by the service termination card to forward packets.
25. A method as claimed in claim 24 further comprising a step of installing a new software load on the service termination card (112) while a packet forwarding engine (214) continues to forward packets using one of the forwarding information bases (108,110).
26. A method as claimed in claim 25 further comprising a step of returning the one of the control processors (104,106) to in-service so that it rebuilds and maintains forwarding information bases (108,110) to be used by the service termination card (112).
27. A method as claimed in claim 26 further comprising a step of taking the other control processor (104,106) out-of-service, and permitting the one of the control processors to continue to maintain forwarding information bases (108,110) used by the service termination card (112) to forward packets.
28. A method as claimed in any one of claims 15-27 wherein when a service termination card (112) is advised that one of the control processors (104,106) is out-of-service, the method further comprising a step of erasing all original content of the FIB (108,110) when one of a timeout has expired, and a remaining in-service control processor (104,106) has relearned and distributed the same routes that a FIB manager (230) of the service termination card (112) has already installed.
29. A method as claimed in claim 28 further comprising a step of continuing to use original routes for IP multicast when the control processor (104,106) becomes out-of-service for a period of time, or after a same set of routes are relearned by the in-service control processor.
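
The failover behaviour recited in claims 15-23 can be illustrated with a minimal sketch. It is not the claimed implementation: the class names HeartbeatMonitor and FibSelector, the labels "cp1" and "cp2", the process names, the 30-second hold-down value, and the rule that a failed backup forces an immediate switch-back are all assumptions made for illustration. A probe callable stands in for one heartbeat inquiry/response exchange, and the caller supplies the current time so the delayed switch-back is easy to follow.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Optional

# A probe models one heartbeat inquiry: it returns True if the process
# returned a heartbeat response within the predetermined period.
Probe = Callable[[], bool]


@dataclass
class HeartbeatMonitor:
    """Holds one table of selected processes per control processor."""
    tables: Dict[str, Dict[str, Probe]]

    def in_service(self, cp: str) -> bool:
        # A control processor is declared out-of-service as soon as any
        # one of its selected processes fails to respond.
        return all(probe() for probe in self.tables[cp].values())


@dataclass
class FibSelector:
    """Chooses which FIB the packet forwarding engine should consult."""
    monitor: HeartbeatMonitor
    primary: str = "cp1"            # hypothetical label for processor 104
    backup: str = "cp2"             # hypothetical label for processor 106
    hold_down: float = 30.0         # assumed switch-back delay, in seconds
    active: str = "cp1"
    _recovered_at: Optional[float] = None

    def update(self, now: float) -> str:
        primary_ok = self.monitor.in_service(self.primary)
        backup_ok = self.monitor.in_service(self.backup)
        if self.active == self.primary:
            # Switch to the second FIB only if its maintainer is healthy.
            if not primary_ok and backup_ok:
                self.active = self.backup
                self._recovered_at = None
        elif not primary_ok:
            # Primary is still down: clear any pending switch-back timer.
            self._recovered_at = None
        else:
            if self._recovered_at is None:
                self._recovered_at = now
            # Delay the switch-back while the primary converges, unless the
            # backup itself has failed (an assumption of this sketch).
            waited = now - self._recovered_at
            if waited >= self.hold_down or not backup_ok:
                self.active = self.primary
                self._recovered_at = None
        return self.active


# Usage: the health flags simulate whether each routing process answers.
health = {"cp1": True, "cp2": True}
monitor = HeartbeatMonitor({
    "cp1": {"routing": lambda: health["cp1"]},
    "cp2": {"routing": lambda: health["cp2"]},
})
selector = FibSelector(monitor, hold_down=30.0)
```

Calling selector.update(now) on each polling cycle returns the label of the control processor whose FIB should be used. Passing the time in explicitly, rather than reading a clock inside the class, keeps the hold-down logic deterministic.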
PCT/CA2002/000424 2001-03-27 2002-03-27 High availability packet forwarding apparatus and method WO2002078250A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP02716569A EP1374500A2 (en) 2001-03-27 2002-03-27 High availability packet forwarding apparatus and method
CA002441470A CA2441470A1 (en) 2001-03-27 2002-03-27 High availability packet forwarding apparatus and method

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US27909901P 2001-03-27 2001-03-27
US60/279,099 2001-03-27
US10/025,496 US7206309B2 (en) 2001-03-27 2001-12-26 High availability packet forward apparatus and method
US10/025,496 2001-12-26

Publications (2)

Publication Number Publication Date
WO2002078250A2 WO2002078250A2 (en) 2002-10-03
WO2002078250A3 WO2002078250A3 (en) 2003-03-27

Family

ID=26699810

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CA2002/000424 WO2002078250A2 (en) 2001-03-27 2002-03-27 High availability packet forwarding apparatus and method

Country Status (4)

Country Link
US (2) US7206309B2 (en)
EP (1) EP1374500A2 (en)
CA (1) CA2441470A1 (en)
WO (1) WO2002078250A2 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007006196A1 (en) * 2005-07-08 2007-01-18 Huawei Technologies Co., Ltd. A method for forwarding service of the data communication device and the forwarding apparatus
WO2007038856A1 (en) * 2005-10-05 2007-04-12 Nortel Networks Limited Provider link state bridging
US7440394B2 (en) 2002-06-24 2008-10-21 Nokia Corporation Method and system for redundant IP forwarding in a telecommunications network
US8059647B2 (en) 2005-10-05 2011-11-15 Nortel Networks Limited Multicast implementation in a link state protocol controlled ethernet network
US8274989B1 (en) 2006-03-31 2012-09-25 Rockstar Bidco, LP Point-to-multipoint (P2MP) resilience for GMPLS control of ethernet
EP1955459A4 (en) * 2005-11-30 2017-03-15 Cisco Technology, Inc. Method and apparatus providing prioritized recursion resolution of border gateway protocol forwarding information bases
CN108173779A (en) * 2017-11-23 2018-06-15 吴英 A kind of method for the automation level for improving router

Families Citing this family (71)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7236490B2 (en) 2000-11-17 2007-06-26 Foundry Networks, Inc. Backplane interface adapter
US7596139B2 (en) 2000-11-17 2009-09-29 Foundry Networks, Inc. Backplane interface adapter with error control and redundant fabric
WO2002091203A1 (en) * 2001-05-03 2002-11-14 Nokia Inc. Method and system for implementing mpls redundancy
US6839866B2 (en) * 2001-05-31 2005-01-04 Sycamore Networks, Inc. System and method for the use of reset logic in high availability systems
GB0118172D0 (en) * 2001-07-26 2001-09-19 British Telecomm A telecommunications network
US7286533B2 (en) * 2001-12-27 2007-10-23 Alcatel-Lucent Canada Inc. Method and apparatus for routing data frames
US7433969B2 (en) * 2001-12-31 2008-10-07 Redback Networks Inc. Method and apparatus for representing label switched paths
US20120155466A1 (en) 2002-05-06 2012-06-21 Ian Edward Davis Method and apparatus for efficiently processing data packets in a computer network
US7187687B1 (en) 2002-05-06 2007-03-06 Foundry Networks, Inc. Pipeline method and system for switching packets
US7468975B1 (en) 2002-05-06 2008-12-23 Foundry Networks, Inc. Flexible method for processing data packets in a network routing system for enhanced efficiency and monitoring capability
CA2428517A1 (en) * 2002-05-13 2003-11-13 Tropic Networks Inc. System and method for distributed resource reservation protocol - traffic engineering (rsvp-te) hitless restart in multi-protocol label switching (mpls) network
US7417987B2 (en) * 2002-06-04 2008-08-26 Lucent Technologies Inc. Distribution of forwarding information in a network node
FI20021235A0 (en) * 2002-06-24 2002-06-24 Nokia Corp A method and system for redundant IP forwarding in a telecommunications network
US7010716B2 (en) * 2002-07-10 2006-03-07 Nortel Networks, Ltd Method and apparatus for defining failover events in a network device
US8144711B1 (en) * 2002-07-15 2012-03-27 Rockstar Bidco, LP Hitless switchover and bandwidth sharing in a communication network
US7424014B2 (en) * 2002-11-12 2008-09-09 Cisco Technology, Inc. System and method for local packet transport services within distributed routers
US7620040B2 (en) * 2002-12-11 2009-11-17 Aspen Networks, Inc. Application non disruptive task migration in a network edge switch
US7570648B2 (en) * 2003-02-03 2009-08-04 At&T Intellectual Property I, L.P. Enhanced H-VPLS service architecture using control word
US7643424B2 (en) * 2003-03-22 2010-01-05 At&T Intellectual Property I, L.P. Ethernet architecture with data packet encapsulation
US6901072B1 (en) 2003-05-15 2005-05-31 Foundry Networks, Inc. System and method for high speed packet transmission implementing dual transmit and receive pipelines
US7535827B2 (en) 2003-10-09 2009-05-19 Alcatel Lucent High availability of resources in telecommunications network using synchronized redundancy mechanism
US8009556B2 (en) * 2003-10-17 2011-08-30 Ip Infusion, Inc. System and method for providing redundant routing capabilities for a network node
US7539131B2 (en) * 2003-11-26 2009-05-26 Redback Networks Inc. Nexthop fast rerouter for IP and MPLS
US7817659B2 (en) 2004-03-26 2010-10-19 Foundry Networks, Llc Method and apparatus for aggregating input data streams
US8730961B1 (en) 2004-04-26 2014-05-20 Foundry Networks, Llc System and method for optimizing router lookup
US7606236B2 (en) * 2004-05-21 2009-10-20 Intel Corporation Forwarding information base lookup method
US7904546B1 (en) 2004-09-27 2011-03-08 Alcatel-Lucent Usa Inc. Managing processes on a network device
US8990365B1 (en) * 2004-09-27 2015-03-24 Alcatel Lucent Processing management packets
US7657703B1 (en) * 2004-10-29 2010-02-02 Foundry Networks, Inc. Double density content addressable memory (CAM) lookup scheme
US7318108B2 (en) * 2004-12-22 2008-01-08 Cisco Technology, Inc. Method and apparatus providing prioritized convergence in border gateway protocol
JP2006254341A (en) * 2005-03-14 2006-09-21 Fujitsu Ltd Bridge device in spanning tree protocol network and control packet processing method
EP1768322A1 (en) * 2005-09-22 2007-03-28 Siemens Aktiengesellschaft Method for reserving bandwidth in a network resource of a communication network
US8448162B2 (en) 2005-12-28 2013-05-21 Foundry Networks, Llc Hitless software upgrades
US8364843B2 (en) * 2006-01-09 2013-01-29 Cisco Technology, Inc. Method and system for minimizing disruption during in-service software upgrade
US7688819B2 (en) * 2006-03-06 2010-03-30 Cisco Technology, Inc. Faster routing protocol convergence using efficient message markup
CN100561978C (en) * 2006-04-26 2009-11-18 华为技术有限公司 A kind of strategy route device and method
US20080107027A1 (en) * 2006-11-02 2008-05-08 Nortel Networks Limited Engineered paths in a link state protocol controlled Ethernet network
US8238255B2 (en) 2006-11-22 2012-08-07 Foundry Networks, Llc Recovering from failures without impact on data traffic in a shared bus architecture
US7978614B2 (en) 2007-01-11 2011-07-12 Foundry Network, LLC Techniques for detecting non-receipt of fault detection protocol packets
JP4356763B2 (en) * 2007-01-30 2009-11-04 トヨタ自動車株式会社 Operating device
US8225134B2 (en) * 2007-04-06 2012-07-17 Cisco Technology, Inc. Logical partitioning of a physical device
JP2008269050A (en) * 2007-04-17 2008-11-06 Hitachi Ltd Compression control device and method
JP4820781B2 (en) * 2007-06-26 2011-11-24 Kddi株式会社 Route management apparatus and computer program
US8037399B2 (en) 2007-07-18 2011-10-11 Foundry Networks, Llc Techniques for segmented CRC design in high speed networks
US8271859B2 (en) 2007-07-18 2012-09-18 Foundry Networks Llc Segmented CRC design in high speed networks
US8509236B2 (en) 2007-09-26 2013-08-13 Foundry Networks, Llc Techniques for selecting paths and/or trunk ports for forwarding traffic flows
US8259569B2 (en) 2008-09-09 2012-09-04 Cisco Technology, Inc. Differentiated services for unicast and multicast frames in layer 2 topologies
US8345536B1 (en) * 2009-01-29 2013-01-01 Force10 Networks, Inc. Multicast high availability enhancements for non-stop forwarding
US8090901B2 (en) 2009-05-14 2012-01-03 Brocade Communications Systems, Inc. TCAM management approach that minimize movements
CN101577638B (en) * 2009-06-04 2011-07-13 中兴通讯股份有限公司 Method for testing Ethernet OAM based on telecom network management system and device
US8599850B2 (en) 2009-09-21 2013-12-03 Brocade Communications Systems, Inc. Provisioning single or multistage networks using ethernet service instances (ESIs)
JP5350293B2 (en) * 2010-02-26 2013-11-27 株式会社日立製作所 Network system
US9014049B2 (en) * 2011-04-27 2015-04-21 Cisco Technology, Inc. Selectively populating forwarding information bases in a packet switch
US9559897B2 (en) 2012-12-21 2017-01-31 Brocade Communications Systems, Inc. Device ID assignment in a system of devices
US9065756B2 (en) * 2013-01-09 2015-06-23 Intel Corporation System and method for providing fast and efficient flushing of a forwarding database in a network processor
US9853889B2 (en) 2013-05-20 2017-12-26 Brocade Communications Systems, Inc. Broadcast and multicast traffic reduction in stacking systems
US9313102B2 (en) 2013-05-20 2016-04-12 Brocade Communications Systems, Inc. Configuration validation in a mixed node topology
US9053216B1 (en) 2013-08-09 2015-06-09 Datto, Inc. CPU register assisted virtual machine screenshot capture timing apparatuses, methods and systems
US10284499B2 (en) * 2013-08-22 2019-05-07 Arris Enterprises Llc Dedicated control path architecture for systems of devices
US9185049B2 (en) 2013-10-31 2015-11-10 Brocade Communications Systems, Inc. Techniques for simplifying stacking trunk creation and management
US9577932B2 (en) 2014-02-12 2017-02-21 Brocade Communications Systems, Inc. Techniques for managing ternary content-addressable memory (TCAM) resources in heterogeneous systems
KR102131863B1 (en) 2014-03-05 2020-07-09 한국전자통신연구원 Method of performing transition of operation mode for a routing processor
US9692695B2 (en) 2014-03-27 2017-06-27 Brocade Communications Systems, Inc. Techniques for aggregating hardware routing resources in a multi-packet processor networking system
US9692652B2 (en) 2014-04-03 2017-06-27 Brocade Communications Systems, Inc. Framework for reliably communicating port information in a system of devices
US10091059B2 (en) 2014-12-16 2018-10-02 Arris Enterprises Llc Handling connections between network devices that support multiple port communication modes
US10404521B2 (en) 2015-01-14 2019-09-03 Datto, Inc. Remotely configurable routers with failover features, and methods and apparatus for reliable web-based administration of same
US9826013B2 (en) 2015-03-19 2017-11-21 Action Streamer, LLC Method and apparatus for an interchangeable wireless media streaming device
US9560100B1 (en) * 2015-03-19 2017-01-31 Action Streamer, LLC Method and system for stabilizing and streaming first person perspective video
US10872016B2 (en) 2015-06-16 2020-12-22 Datto, Inc. Hybrid cloud methods, apparatus and systems for secure file sharing and synchronization with backup and server virtualization
EP3941006B1 (en) * 2020-07-16 2022-10-26 Anapaya Systems AG System and method for carrying and optimizing internet traffic over a source-selected path routing network
US11636214B2 (en) 2020-12-11 2023-04-25 Hewlett Packard Enterprise Development Lp Memory scan-based process monitoring

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0747833A2 (en) * 1992-12-17 1996-12-11 Tandem Computers Incorporated Fault-tolerant multiprocessor system
US6088328A (en) * 1998-12-29 2000-07-11 Nortel Networks Corporation System and method for restoring failed communication services
US20020089980A1 (en) * 2001-01-11 2002-07-11 Alcatel Router providing continuity of service of the state machines associated with the neighboring routers

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US89980A (en) * 1869-05-11 Zinc for organ-pipes and for other
US4710926A (en) * 1985-12-27 1987-12-01 American Telephone And Telegraph Company, At&T Bell Laboratories Fault recovery in a distributed processing system
USH1814H (en) * 1997-09-26 1999-11-02 Browning; Mark David Telephony-support module for a telecommunications switching platform
US7886054B1 (en) * 2000-10-11 2011-02-08 Siddhartha Nag Graphical user interface (GUI) for administering a network implementing media aggregation
US20020103921A1 (en) * 2001-01-31 2002-08-01 Shekar Nair Method and system for routing broadband internet traffic

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7440394B2 (en) 2002-06-24 2008-10-21 Nokia Corporation Method and system for redundant IP forwarding in a telecommunications network
US7801151B2 (en) 2005-07-08 2010-09-21 Huawei Technologies Co., Ltd. Method and apparatus for forwarding service in a data communication device
WO2007006196A1 (en) * 2005-07-08 2007-01-18 Huawei Technologies Co., Ltd. A method for forwarding service of the data communication device and the forwarding apparatus
US8059647B2 (en) 2005-10-05 2011-11-15 Nortel Networks Limited Multicast implementation in a link state protocol controlled ethernet network
CN101322355B (en) * 2005-10-05 2010-05-19 北方电讯网络有限公司 Provider link state bridging Ethernet node and its configuration and operation method, Ethernet bridging network
US7688756B2 (en) 2005-10-05 2010-03-30 Nortel Networks Limited Provider link state bridging
WO2007038856A1 (en) * 2005-10-05 2007-04-12 Nortel Networks Limited Provider link state bridging
US8867366B2 (en) 2005-10-05 2014-10-21 Rockstar Consortium Us Lp Multicast implementation in a link state protocol controlled Ethernet network
US9008088B2 (en) 2005-10-05 2015-04-14 Rpx Clearinghouse Llc Multicast implementation in a link state protocol controlled ethernet network
EP1955459A4 (en) * 2005-11-30 2017-03-15 Cisco Technology, Inc. Method and apparatus providing prioritized recursion resolution of border gateway protocol forwarding information bases
US8274989B1 (en) 2006-03-31 2012-09-25 Rockstar Bidco, LP Point-to-multipoint (P2MP) resilience for GMPLS control of ethernet
US8514878B1 (en) 2006-03-31 2013-08-20 Rockstar Consortium Us Lp Point-to-multipoint (P2MP) resilience for GMPLS control of ethernet
CN108173779A (en) * 2017-11-23 2018-06-15 吴英 A kind of method for the automation level for improving router
CN108173779B (en) * 2017-11-23 2018-11-16 泰山医学院 A kind of router automatic stand-by system

Also Published As

Publication number Publication date
EP1374500A2 (en) 2004-01-02
US20020141429A1 (en) 2002-10-03
US7206309B2 (en) 2007-04-17
US20030198182A1 (en) 2003-10-23
US7342874B2 (en) 2008-03-11
CA2441470A1 (en) 2002-10-03
WO2002078250A3 (en) 2003-03-27

Similar Documents

Publication Publication Date Title
US7206309B2 (en) High availability packet forward apparatus and method
US8189579B1 (en) Distributed solution for managing periodic communications in a multi-chassis routing system
US7155632B2 (en) Method and system for implementing IS-IS protocol redundancy
US6262977B1 (en) High availability spanning tree with rapid reconfiguration
US8467287B2 (en) High available method for border gateway protocol version 4
US6490246B2 (en) System and method for using active and standby routers wherein both routers have the same ID even before a failure occurs
US7558194B2 (en) Virtual private network fault tolerance
CA2499343C (en) Ip redundancy with improved failover notification
US7269132B1 (en) Method and apparatus for achieving transparent redundancy at a hierarchical boundary
US7787365B1 (en) Routing protocol failover between control units within a network router
US7535827B2 (en) High availability of resources in telecommunications network using synchronized redundancy mechanism
US7940694B2 (en) Intelligent filtering of redundant data streams within computer networks
JP2005503055A (en) Method and system for implementing OSPF redundancy
EP3820089A1 (en) Controller provided protection paths
CN101656651A (en) Method and device for correlatively protecting traffic engineering tunnels
CN113615132B (en) Fast flooding topology protection
JP2003318948A (en) Packet relaying apparatus
US20060045004A1 (en) Method for diverting data packets when local link failures are identified
JP3780987B2 (en) Route control method and apparatus, route control program, and storage medium storing route control program
US11349758B2 (en) Multihoming optimizations for fast failover in single-active networks
US20080212610A1 (en) Communication techniques and generic layer 3 automatic switching protection
CN110138656B (en) Service processing method and device
US11558281B2 (en) Shared ethernet segment identifier label allocation for ethernet virtual private network multihoming
US11552821B2 (en) Spanning tree protocol with ethernet virtual private network all-active multihoming
CN113992571A (en) Multi-path service convergence method, device and storage medium in SDN network

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): CA

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR

121 EP: the EPO has been informed by WIPO that EP was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (PCT application filed before 20040101)
WWE WIPO information: entry into national phase

Ref document number: 2441470

Country of ref document: CA

WWE WIPO information: entry into national phase

Ref document number: 2002716569

Country of ref document: EP

WWP WIPO information: published in national office

Ref document number: 2002716569

Country of ref document: EP

WWW WIPO information: withdrawn in national office

Ref document number: 2002716569

Country of ref document: EP