USRE43110E1 - Pipelined maximal-sized matching cell dispatch scheduling - Google Patents
Pipelined maximal-sized matching cell dispatch scheduling Download PDFInfo
- Publication number
- USRE43110E1 USRE43110E1 US12/072,900 US7290008A USRE43110E US RE43110 E1 USRE43110 E1 US RE43110E1 US 7290008 A US7290008 A US 7290008A US RE43110 E USRE43110 E US RE43110E
- Authority
- US
- United States
- Prior art keywords
- cell
- subscheduler
- subschedulers
- output queue
- indicator
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime, expires
Links
- 238000000034 method Methods 0.000 claims description 69
- 230000009977 dual effect Effects 0.000 claims description 7
- 238000013459 approach Methods 0.000 abstract description 6
- 238000010586 diagram Methods 0.000 description 13
- 230000003139 buffering effect Effects 0.000 description 8
- 230000005540 biological transmission Effects 0.000 description 4
- 238000004891 communication Methods 0.000 description 4
- 239000004744 fabric Substances 0.000 description 4
- 230000011664 signaling Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 230000001960 triggered effect Effects 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 230000000903 blocking effect Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 230000003247 decreasing effect Effects 0.000 description 2
- 230000006855 networking Effects 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 235000008733 Citrus aurantifolia Nutrition 0.000 description 1
- 235000011941 Tilia x europaea Nutrition 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000004571 lime Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000002040 relaxant effect Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L49/00—Packet switching elements
- H04L49/15—Interconnection of switching modules
- H04L49/1553—Interconnection of ATM switching modules, e.g. ATM switching fabrics
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L49/00—Packet switching elements
- H04L49/30—Peripheral units, e.g. input or output ports
- H04L49/3045—Virtual queuing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L49/00—Packet switching elements
- H04L49/30—Peripheral units, e.g. input or output ports
- H04L49/3063—Pipelined operation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L12/00—Data switching networks
- H04L12/54—Store-and-forward switching systems
- H04L12/56—Packet switching systems
- H04L12/5601—Transfer mode dependent, e.g. ATM
- H04L2012/5678—Traffic aspects, e.g. arbitration, load balancing, smoothing, buffer management
- H04L2012/5681—Buffer or queue management
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L49/00—Packet switching elements
- H04L49/50—Overload detection or protection within a single switching element
- H04L49/505—Corrective measures
- H04L49/508—Head of Line Blocking Avoidance
Definitions
- the present invention concerns the communication of data over networks, such as the Internet for example. More specifically, the present invention concerns scheduling the servicing (e.g., dispatching) of cells or packets buffered at input ports of a switch.
- servicing e.g., dispatching
- Switches and routers are used in networks, such as the Internet for example, to forward data towards its destination.
- the need for high-speed switches and routers is introduced in ⁇ 1.2.1 below.
- input buffering, as used in high-speed switches, is introduced in ⁇ 1.2.2 below.
- Input buffering is a cost effective approach for high-speed switches. This is because input-buffered switches do not require internal speedup, nor do they allocate buffers at each crosspoint. They also relax memory-bandwidth and memory-size constraints.
- HOL head-of-line
- FIFO First-In-First-Out
- a fixed-size cell is sent from any input to any output, provided that, in a given time slot, no more than one cell is sent from the same input, and no more than one cell is received by the same output.
- Each input port has N VOQs, one for each of N output ports.
- the HOL cell in each VOQ can be selected for transmission across the switch in each time slot. Therefore, every time slot, a scheduler has to determine one set of matching. That is, for each of the output ports, the scheduler may match one of the corresponding VOQs with the output port.
- RRGS Round-Robin Greedy Scheduling
- N input round-robin operations that select its cell to be transmitted at a given time slot T
- N input round-robin operations are allocated into the different previous N time slots ⁇ T ⁇ N, T ⁇ N+1, . . . , T ⁇ 1 ⁇ in a simple cyclic manner so that RRGS can avoid output contention.
- RRGS can't provide max-min fair share for a best-effort service. For example, let ⁇ (i,j) be the input offered load to VOQ(i,j) and let ⁇ (i,j) be the acceptable transmission rate from VOQ(i,j).
- traffic is not balanced, some inputs can unfairly send more cells than others.
- the present invention may be used to meet the unmet needs outlined in ⁇ 1.2.3 above.
- the present invention may do so by operating, in a pipelined manner, several subschedulers within a main scheduler. This permits each subscheduler to take more than one time slot to complete its matching. For each time slot, one of the subschedulers provides the matching result. Since the subschedulers collectively generate one match per time slot, the subschedulers may adopt a pre-existing maximal matching algorithm, such as iSLIP or DRRM for example, thereby taking advantage of the desirable properties of these algorithms—namely 100% throughput under uniform traffic and fairness for best-effort traffic—found in their non-pipelined versions.
- a main scheduler may have K subschedulers. Assuming that cells arrive at the switch in a uniform fashion, each subscheduler may begin arbitration in successive time slots. Since there are K subschedulers, each one may use K time slots to perform a match. Accordingly, the main scheduler may produce a match every time slot.
- FIG. 1 is a block diagram of an exemplary input buffered switch with which the present invention may be used.
- FIG. 2 is a bubble chart illustrating operations that may be performed by the present invention.
- FIG. 3 is a flow diagram illustrating an exemplary virtual output queue method that may be performed under the present invention.
- FIGS. 4A and 4B are flow diagrams illustrating exemplary main scheduler methods that may be performed under the present invention.
- FIG. 5 is a flow diagram illustrating an exemplary matching method that may be performed under the present invention.
- FIG. 6 is a block diagram of an exemplary apparatus, as well as inter-component signaling links, that may be used to perform operations under the present invention.
- FIGS. 7(a) through 7(c) provide an example that illustrates an operation of an exemplary pipelined maximal-sized matching scheduling approach under the present invention.
- FIG. 8 is a timing diagram illustrating the operation of subschedulers in the example of FIGS. 7(a) through 7(c) .
- the present invention involves methods, and apparatus for scheduling the dispatch of cells or packets in a high-speed input buffered switch.
- the following description is presented to enable one skilled in the art to make and use the invention, and is provided in the context of particular embodiments and methods. Various modifications to the disclosed embodiments and methods will be apparent to those skilled in the art, and the general principles set forth below may be applied to other embodiments, methods and applications. Thus, the present invention is not intended to be limited to the embodiments and methods shown and the inventors regard their invention as the following disclosed methods, apparatus and materials and any other patentable subject matter to the extent that they are patentable.
- the present invention may be used with an input-buffered switch with virtual output queues (VOQs).
- the switch 100 may include input (ports) 110 , a main scheduler 120 and (e.g., crossbar) switching fabric 130 .
- input (ports) 110 may include input (ports) 110 , a main scheduler 120 and (e.g., crossbar) switching fabric 130 .
- N inputs there may be N inputs.
- these N inputs may be grouped across m input modules (not shown).
- Each input 110 will preferably have a number of VOQs corresponding to the number of outputs. Thus, if there are N outputs, each input will have N VOQs.
- the main scheduler 120 may include N 2 request counters, and K subschedulers. Each of the K subschedulers may have N 2 request flags. Each of the K subschedulers may operate in a pipeline manner, thus allowing each subscheduler to take K time slots, and still produce a match (using, e.g., DRRM, iSLIP, or another maximal matching technique) in every cell time slot.
- a first aspect of the present invention may function to relax the time needed for scheduling by maximal-sized matching algorithms. (Recall that with prior techniques, the maximal matching has to be completed within one cell time slot.)
- the present invention may accomplish this by operating several subschedulers in a pipelined manner. In this way each cell may use more than one cell time slot to reach arbitration, so long as the subschedulers, collectively, finish matching for at least one cell every time slot.
- the present invention may insure that this occurs by providing K subschedulers, where each of the K subschedulers generate a match in no more than K cell time slots.
- the present invention may also function to provide high throughput and fairness in best-effort traffic.
- the present invention may accomplish this by adopting an appropriate maximal matching algorithm, such as iSLIP or DRRM for example. Either algorithm may be implemented because the present invention may provide a match every time slot.
- FIG. 2 is a bubble chart illustrating operations that may be performed in accordance with the present invention.
- Subscheduler k 228 may perform a maximal-sized matching operation 226 . It may do so at a time t, where Kl+k ⁇ t ⁇ K (l+1)+k, where l is an integer.
- F(i,j,k) may be used instead of VOQ requests as occurs in DRRM.
- Each subscheduler k may have its own round-robin pointers. The position of pointers in a subscheduler k may be modified by the results only from subscheduler k.
- the operation of DRRM in a subscheduler k may be the same as that of the non-pipelined DRRM scheme.
- the VOQ operations 215 may be allowed to send the HOL cell in the associated VOQ 115 ′ to output at the next time slot.
- This may be used to ensure that cells from the same VOQ 115 ′ are transmitted in sequence, even if L(i,j) ⁇ C(i,j)>1, where L(i,j) is the occupancy of VOQ(i,j) (Note that
- Timing operations 250 may determine a current cell time slot (t), and using parameter K 255 , may determine control signals based on its present determined value k.
- Exemplary methods that may be used to implement various operations of the present invention are described in ⁇ 4.4.1. Then, exemplary apparatus that may be used to implement various operations of the present invention are described in ⁇ 4.4.2.
- FIG. 3 is a high-level flow diagram of an exemplary method 215 ′ that may be used to effect a virtual output queue operation 215 .
- the order in which the steps are effected is not intended to be limited to the order shown.
- the main scheduler operation(s) 220 may be informed, as indicated by block 320 . (See, e.g., communication 230 of FIG. 2 .)
- the HOL cell in VOQ(i,j) 115 may be sent to output j (e.g., in the next cell time slot).
- a successful match request indication may be provided directly from the subscheduler (k) (See, e.g., communication 240 of FIG. 2 .), or via the main scheduler operation(s) 220 .
- the main scheduler operation(s) 220 serve to manage the request counters 222 , the request flags 224 , and the subscheduler operations 226 .
- the acts related to these management responsibilities may be triggered by events (e.g., the entry of a new cell into a VOQ) and may be run periodically or in accordance with some timing-based control (e.g., based on a value (k) derived from a current time slot (t)).
- the timing operations 250 of FIG. 2 may be thought of as one of the main scheduler operation(s) 220 .
- FIG. 4A is a flow diagram of an exemplary method 220 a′ for event-triggered acts of the main scheduler operation(s).
- FIG. 4B is a flow diagram of an exemplary method 220 b′ for timing-controlled acts of the main scheduler operation(s). Each is described below.
- FIG. 4A is a high-level flow diagram of an exemplary method 220 a′ for effecting main scheduler event-triggered operations.
- this fact is relayed to the main scheduling operation(s) 220 (Recall 310 and 320 of FIG. 3 .) and the count value C(i,j) of the counter RC(i,j) corresponding to the VOQ receiving the cell, is incremented.
- FIG. 4B is a high-level flow diagram of an exemplary method 220 b′ for effecting main scheduler timing-controlled operations.
- the order in which the steps are effected is not intended to be limited to the order shown.
- the main steps of the method 220 b′ are run for each cell time slot (t) (e.g., at the start of each cell time slot).
- the main steps of the method 220 b′ may be run for each VOQ. Note that separate instances of the method 220 b′ could be run in parallel for the various VOQs.
- k is set to t mod K.
- FIG. 5 is a high-level flow diagram of an exemplary method 226 ′ that may be used to effect each of the subscheduler operation(s) 226 .
- each subscheduler k generates a match for each requested output port. An output port is requested if any of the VOQs(_, j) is non-empty.
- FIG. 6 is a high-level block diagram illustrating exemplary components and interconnections of an exemplary apparatus 600 that may be used to effect at least some of the operations of the present invention.
- Each of the inputs 610 includes (or more generally, is associated with a corresponding) a number of virtual output queues (VOQs) 615 corresponding to the number of outputs.
- An exemplary main scheduling module 620 may include (or more generally, be associated with) K subschedulers 650 and N 2 request counters (RCs) 640 , each of the request counters (RCs) being associated with a VOQ (and therefore an input-output pair). To simplify the drawing, only the components associated with the VOQs of one of the inputs are shown.
- Each of the K subschedulers 650 may include (or more generally, be associated with) N 2 request flags (RFs) 660 , each of the request flags (RFs) 660 in each of the subschedulers being associated with a VOQ (and therefore an input-output pair).
- the VOQ operations 215 may be effected on the virtual output queues 615 .
- the main scheduler operations 220 and timing operations 250 may be effected on the scheduling control module 630 .
- the subscheduling operations 226 may be effected on the subschedulers 650 . Naturally, such operations only need to be associated with the VOQs of an input, and can be effected in many other ways.
- the solid lines may be used to indicate the receipt of a cell, and the transmission of the cell to the output via the (e.g., crossbar) switching fabric 680 .
- the signaling line(s), depicted by the dashed line 692 may be used to indicate, to the scheduling control module 630 , the arrival of new cells, and to indicate to the VOQs when a match request was successful.
- the signaling lines, depicted by dotted lines 694 and 696 may be used by the scheduling control module 630 to control the request counters 640 and the request flag 660 values, and to permit communications between the scheduling control module 630 and each of its subschedulers 650 .
- the VOQ operations 215 , the main scheduler operations 220 , and the subscheduler operations 226 may be effected by programmable logic arrays, integrated circuits such as application specific integrated circuits, and/or microprocessors operating in accordance with (e.g., stored) instructions.
- Memory referred to generally as a “machine readable medium” may be used to store the various state information, i.e., the values of the request counters and request flags), used by these components. Similarly, memory can be used to buffer cells at the virtual output queues.
- FIGS. 7(a) through 7(c) illustrate an example of operations of a pipelined maximal-sized matching scheduling method employing subschedulers in accordance with one embodiment of the invention.
- the invention is embodied in an input-buffered switch, such as that illustrated in FIG. 1 .
- FIG. 7a shows the cell stored in VOQ( 0 , 0 ).
- the scheduling control module will read RC( 0 , 0 ) and see that C( 0 , 0 ) is equal to one, and it will read RF( 0 , 0 , 2 ) and see that F( 0 , 0 , 2 ) is equal to zero.
- Dashed lines represent these actions.
- HOL head of line
- the main scheduler was given more time to complete the matching by allowing three time slots instead of just one.
- the subschedulers may implement maximal matching algorithms that provide 100% throughput under uniform traffic, and maintains fairness for best-effort traffic.
- the scheduling technique and apparatus of the present invention relax the time for scheduling, thereby enabling large scale and/or high throughput switching, while maintaining fairness for best-effort traffic.
- Other approaches such as WRRGS, may fail for best-effort traffic.
- T the allowable arbitration time per iteration, a port speed, a cell size, and the number of iterations are T, C, L, and I, respectively.
- T the allowable arbitration time per iteration, a port speed, a cell size, and the number of iterations are T, C, L, and I, respectively.
- T (KL)/(CI)
- K is the number of subschedulers.
- K will essentially be equal one.
- the foregoing equation shows that by adding subschedulers, the time for arbitration will increase.
- CMOS technology in which, for example, typical gate-delay time is about 100 ps.
- typical gate-delay time is about 100 ps.
- Texas Instruments “GS40 0:15-micrometer CMOS, Standard Cell/Gate Array,” http://www.ti.com/, version 0.2, May 2000. Incorporated herein by reference. Therefore, the present invention achieves the desired number iterations even when the number of inputs increases or the port speed increases.
- the present invention can achieve 100% throughput under uniform traffic, and maintain fairness for best-effort traffic.
- the known maximal-sized matching algorithms that meet both of these conditions require that a match be made at the end of every time slot. This becomes difficult, if not impossible, with decreasing arbitration times.
- the present invention advantageously allows for longer arbitration times, while still producing a match at the end of every time slot.
- the present invention may allow the increase of inputs and output ports, and/or may allow for the increase in port speeds.
- the present invention can provide a scalable switch that has high throughput and maintains fairness for best-effort traffic.
- the present invention can avoid the problems of (i) the increasing arbitration time as switch size increases, and/or (ii) decreasing cell time slots as port speed increases, by permitting the matching algorithm to use more time.
Abstract
A Pipelined-based Maximal-sized Matching (PMM) scheduling approach for input-buffered switches relaxes the timing constraint for arbitration with a maximal matching scheme. In the PMM approach, arbitration may operate in a pipelined manner. Each subscheduler is allowed to take more than one time slot for its matching. Every time slot, one of them provides the matching result. The subscheduler can adopt a pre-existing efficient maximal matching algorithm such as iSLIP and DRRM. PMM maximizes the efficiency of the adopted arbitration scheme by allowing sufficient time for a number of iterations. PMM preserves 100% throughput under uniform traffic and fairness for best-effort traffic.
Description
BenefitThis application is a reissue of U.S. Pat. No. 7,006,514, where benefit is claimed, under 35 U.S.C. § 119(e)(1), to the filing date of: provisional patent application Ser. No. 60/294,877, entitled “PMM: A PIPELINED MAXIMAL-SIZED MATCHING SCHEDULING APPROACH FOR INPUT BUFFERED SWITCHES”, filed on May 31, 2001 and listing Eiji Oki, Roberto Rojas-Cessa and Jonathan Chao as the inventors, for any inventions disclosed in the manner provided by 35 U.S.C. § 112, ¶ 1. This provisional application is expressly incorporated herein by reference.
§ 1.1 Field of the Invention
The present invention concerns the communication of data over networks, such as the Internet for example. More specifically, the present invention concerns scheduling the servicing (e.g., dispatching) of cells or packets buffered at input ports of a switch.
§ 1.2 Related Art
Switches and routers are used in networks, such as the Internet for example, to forward data towards its destination. The need for high-speed switches and routers is introduced in § 1.2.1 below. Then, input buffering, as used in high-speed switches, is introduced in § 1.2.2 below.
§ 1.2.1 The Need for Large-Scale and High-Speed (e.g., Terabit) Routers and Switches
Many expect that Internet traffic will continue to grow explosively. Given this assumption, it is expected that high-speed switches and routers (e.g., those having a throughput over one Terabit per second) will become necessary. Most high-speed packet switches adopt a fixed-size cell in the switch fabric. If variable length packets are to be supported in the network, such packets may be segmented and/or padded into fixed-sized cells upon arrival, switched through the fabric of the switch, and reassembled into packets before departure. Input buffering is introduced below in § 1.2.2 as a way to handle these incoming cells.
§ 1.2.2 Buffering in High-Speed Routers and Switches
There are various types of buffering strategies in switch architectures: input buffering, output buffering, or crosspoint buffering. Information on these strategies can be found in the following articles: G. Nong and M. Hamdi, “On the Provision of Quality-of-Service Guarantees for Input Queued Switches,” IEEE Commun. Mag., Vol. 38, No. 12, pp. 62-69 (Dec. 2000); E. Oki, N. Yamanaka, Y. Ohtomo, K. Okazaki, and R. Kawano, “A 10-Gb/s (1.25 Gb/s×8) 4×2 0.25-micrometer CMOS/SIMOX ATM Switch Based on Scalable Distribution Arbitration,” IEEE J. Solid-State Circuits, Vol. 34, No. 12, pp. 1921-1934 (Dec. 1999); and J. Turner and N. Yamanaka, “Architectural Choices in Large Scale ATM Switches,” IEICE Trans. Commun., Vol. E81-B, No. 2, pp. 120-137 (Feb. 1998). Each of these articles is incorporated herein by reference. Input buffering is a cost effective approach for high-speed switches. This is because input-buffered switches do not require internal speedup, nor do they allocate buffers at each crosspoint. They also relax memory-bandwidth and memory-size constraints.
§ 1.2.2.1 The Use of Virtual Output Queues to Avoid Head-of-Line Blocking
It is well known that head-of-line (“HOL”) blocking limits the maximum throughput (e.g., to 58.6%) in an input-buffered switch with a First-In-First-Out (FIFO) structure. See, e.g., the article, M. J. Karol, M. G. Hluchyj, and S. P. Morgan, “Input Versus Output Queuing on a Space-Division Packet Switch,” IEEE Trans. Commun., Vol. COM-35, pp. 1347-1356 (1987). This article is incorporated herein by reference. The article, N. Mckeown, “The iSLIP Scheduling Algorithm for Input-Queued Switches,” IEEE/ACM Trans. Networking, Vol. 7, No. 2, pp. 188-200 (April 1999), shows using a Virtual-Output-Queue (VOQ) structure to overcome HOL-blocking. This article is incorporated herein by reference.
In an input-buffered switch that uses VOQs, a fixed-size cell is sent from any input to any output, provided that, in a given time slot, no more than one cell is sent from the same input, and no more than one cell is received by the same output. Each input port has N VOQs, one for each of N output ports. The HOL cell in each VOQ can be selected for transmission across the switch in each time slot. Therefore, every time slot, a scheduler has to determine one set of matching. That is, for each of the output ports, the scheduler may match one of the corresponding VOQs with the output port.
§ 1.2.2.2 Maximum-Sized and Maximal-Sized Matching Algorithms in High Speed Switches
Maximum-sized matching algorithms to schedule the input-output matching for input-buffered switches with VOQs, that achieve 100% throughput have been proposed. See, e.g., the articles: J. E. Hoperoft and R. M. Karp, “An Algorithm for Maximum Matching in Bipartite Graphs,” Soc. Ind. Appl. Math J. Computation, Vol. 2, pp. 225-231 (1973); and N. Mckeon, A. Mekkittikul, V. Anantharam, and J. Walrand, “Achieving 100% Throughput in Input-Queued Switches,” IEEE Trans. Commun., Vol. 47, No. 8, pp. 1260-1267 (August 1999). These articles are incorporated herein by reference. Unfortunately, these algorithms are hard to implement in high-speed switches because of their high computing time complexity.
Maximal-sized matching algorithms have been proposed as an alternative to the maximum-sized matching ones. Two of these algorithms, iSLIP and Dual Round-Robin Matching (DRRM), are described in the articles: N. Mckeown, “The iSLIP Scheduling Algorithm for Input-Queued Switches,” IEEE/ACM Trans. Networking, Vol. 7, No. 2, pp. 188-200 (April 1999); H. J. Chao and J. S. Park, “Centralized Contention Resolution Schemes for a Large-Capacity Optical ATM Switch,” Proc. IEEE ATM Workshop '97, Fairfax, Va. (May 1998); and H. J. Chao, “Saturn: A Terabit Packet Switch Using Dual Round-Robin,” IEEE Commun. Mag., Vol. 38, No. 12, pp. 78-84 (December 2000). These articles are incorporated herein by reference. The computing complexity of the iSLIP and DRRM methods are less than maximum matching methods. Moreover, the iSLIP and DRRM methods provide 100% throughput under uniform traffic and complete fairness for best-effort traffic. However, in each of these methods, the maximal matching is to be completed within one cell time slot. Such a constraint may become unacceptable as the switch size increases and/or the port speed becomes high, because the arbitration time becomes longer than one time slot or the time slot shrinks, respectively. For example, for a 64-byte fixed-length cell at a port speed of 40 Gbit/s (OC-768), the computation time for completing maximal-sized matching is only 12.8 ns. Existing proposals for relaxing the time constraints are discussed below in § 1.2.2.3.
§ 1.2.2.3 Round-Robin Greedy Scheduling (RRGS)
To relax the scheduling timing constraint, a pipelined-based scheduling algorithm called Round-Robin Greedy Scheduling (RRGS) is proposed in the article: A. Smiljanic, R. Fan, and G. Ramamurthy, “RRGS—Round-Robin Greedy Scheduling for Electronic/Optical Terabit Switches,” Proc. IEEE Globecom '99, pp. 1244-1250 (1999). This article is incorporated herein by reference. With RRGS, each input has only to perform one round-robin arbitration within one lime slot to select one VOQ. However, if a given switch has N inputs, then N input round-robin operations (that select its cell to be transmitted at a given time slot T) are allocated into the different previous N time slots {T−N, T−N+1, . . . , T−1} in a simple cyclic manner so that RRGS can avoid output contention.
Unfortunately, RRGS can't provide max-min fair share for a best-effort service. For example, let λ(i,j) be the input offered load to VOQ(i,j) and let μ(i,j) be the acceptable transmission rate from VOQ(i,j). Consider a 3×3 switch in which λ(0,0)=λ(1,0)=1.0 and in which other input offered loads λ(i,j)=0. According to the RRGS algorithm, the acceptable transmission rate is obtained as μ(0,0)=⅔ and μ(1,0)=⅓. Thus, when traffic is not balanced, some inputs can unfairly send more cells than others. The article, A. Similjanic, “Flexible Bandwidth Allocation in Terabit Packet Switches,” Proc. IEEE Workshop on High Performance Switching and Routing 2000, pp. 233-239 (2000) proposes weighted-RRGS (“WRRGS”), which guarantees pre-reserved bandwidth. This article is incorporated herein by reference. However, even with WRRGS, fairness is not guaranteed for best-effort traffic. In addition, once every N time-slot cycles, an idle time slot is produced when N is an even number. This means that RRGS does not maximize its use of the switching capacity.
§ 1.2.3 Unmet Needs
In view of the foregoing limits of proposed maximal matching scheduling schemes, one that (i) relaxes the scheduling time into more than one time slot, (ii) provides high throughput, and/or (iii) maintains fairness for best-effort traffic, is still desired.
The present invention may be used to meet the unmet needs outlined in § 1.2.3 above. The present invention may do so by operating, in a pipelined manner, several subschedulers within a main scheduler. This permits each subscheduler to take more than one time slot to complete its matching. For each time slot, one of the subschedulers provides the matching result. Since the subschedulers collectively generate one match per time slot, the subschedulers may adopt a pre-existing maximal matching algorithm, such as iSLIP or DRRM for example, thereby taking advantage of the desirable properties of these algorithms—namely 100% throughput under uniform traffic and fairness for best-effort traffic—found in their non-pipelined versions.
Assuming that it takes each subscheduler K (or between K−1 and K) time slots to perform the matching, in one embodiment, a main scheduler may have K subschedulers. Assuming that cells arrive at the switch in a uniform fashion, each subscheduler may begin arbitration in successive time slots. Since there are K subschedulers, each one may use K time slots to perform a match. Accordingly, the main scheduler may produce a match every time slot.
The present invention involves methods, and apparatus for scheduling the dispatch of cells or packets in a high-speed input buffered switch. The following description is presented to enable one skilled in the art to make and use the invention, and is provided in the context of particular embodiments and methods. Various modifications to the disclosed embodiments and methods will be apparent to those skilled in the art, and the general principles set forth below may be applied to other embodiments, methods and applications. Thus, the present invention is not intended to be limited to the embodiments and methods shown and the inventors regard their invention as the following disclosed methods, apparatus and materials and any other patentable subject matter to the extent that they are patentable.
§ 4.1 Exemplary Environment
The present invention may be used with an input-buffered switch with virtual output queues (VOQs). For example, as shown in FIG. 1 , the switch 100 may include input (ports) 110, a main scheduler 120 and (e.g., crossbar) switching fabric 130. As shown in FIG. 1 , there may be N inputs. In one embodiment, these N inputs may be grouped across m input modules (not shown). Each input 110 will preferably have a number of VOQs corresponding to the number of outputs. Thus, if there are N outputs, each input will have N VOQs.
In an exemplary embodiment of the invention, the main scheduler 120 may include N2 request counters, and K subschedulers. Each of the K subschedulers may have N2 request flags. Each of the K subschedulers may operate in a pipeline manner, thus allowing each subscheduler to take K time slots, and still produce a match (using, e.g., DRRM, iSLIP, or another maximal matching technique) in every cell time slot.
In the explanation of an exemplary embodiment of the invention, the following terms will be used:
-
- N=the number of input ports and output ports;
- K=the number of subschedulers in each input module;
- VOQ(i,j)=the VOQ of input i that stores cells destined for output j, where 0≦i≦N−1, and 0≦j≦N−1;
- RC(i,j)=the request counter associated with VOQ(i,j);
- Lmax=the maximum VOQ occupancy;
- C(i,j)=the value of RC(i,j)—the number of accumulated requests associated with VOQ(i,j) that have not been sent to any subscheduler, where 0≦C(i,j)≦Lmax;
- RF(i,j,k)=the request flag associated with VOQ(i,j) and subscheduler k, where 0≦k≦K−1;
- F(i,j,k)=the value of RF(i,j,k), where 0≦F(i,j,k)≦1;
When F(i,j,k) is set to one, this indicates that input i has a request to output j in subscheduler k. When F(i,j,k) is set to zero, this indicates that input i has no request to output j in subscheduler k.
§ 4.2 Functions that may be Performed
A first aspect of the present invention may function to relax the time needed for scheduling by maximal-sized matching algorithms. (Recall that with prior techniques, the maximal matching has to be completed within one cell time slot.) The present invention may accomplish this by operating several subschedulers in a pipelined manner. In this way each cell may use more than one cell time slot to reach arbitration, so long as the subschedulers, collectively, finish matching for at least one cell every time slot. The present invention may insure that this occurs by providing K subschedulers, where each of the K subschedulers generate a match in no more than K cell time slots.
The present invention may also function to provide high throughput and fairness in best-effort traffic. The present invention may accomplish this by adopting an appropriate maximal matching algorithm, such as iSLIP or DRRM for example. Either algorithm may be implemented because the present invention may provide a match every time slot. These algorithms have been proven to provide 100% throughput under uniform traffic, and maintain fairness for best-effort traffic.
§ 4.3 Exemplary Operations
At the beginning of every cell time slot t, the main scheduler operation(s) 220 may check to see if there are any cells in a VOQ that are waiting dispatch arbitration. If it finds that there is a cell waiting for arbitration, and that the next subscheduler is not already handling a cell in the same VOQ (i.e., if C(i,j)>0 and F(i,j,k)=0, where k=t mod K), it may decrement the value (C) of the associated request counter (RC) 222 and set the associated request flag (RF) 224 (i.e., C(i,j)=C(i,j)−1 and F(i,j,k)=1) If the foregoing conditions are not met (i.e., if C(i,j)=0, or if F(i,j,k=1)) it may leave the value (C) of the request counter (RC) 222 and the value (F) of the request flag (RF) 224 unchanged.
Subscheduler k 228 may perform a maximal-sized matching operation 226. It may do so at a time t, where Kl+k≦t<K (l+1)+k, where l is an integer. In one embodiment, to apply the DRRM maximal matching algorithm as a matching algorithm in a subscheduler k, F(i,j,k) may be used instead of VOQ requests as occurs in DRRM. Each subscheduler k may have its own round-robin pointers. The position of pointers in a subscheduler k may be modified by the results only from subscheduler k. The operation of DRRM in a subscheduler k may be the same as that of the non-pipelined DRRM scheme.
By the end of every time slot t, the subscheduler k 228 that started the matching operation K time slots previously (i.e., where k=(t−(K−1)) mod K), will have completed the matching operation 226. When an input-output pair (i,j) is matched, the main scheduler operation(s) 220 may unset the kth subscheduler's corresponding flag (i.e., F(i,j,k)=0). In this case, the VOQ operations 215 may be allowed to send the HOL cell in the associated VOQ 115′ to output at the next time slot. This may be used to ensure that cells from the same VOQ 115′ are transmitted in sequence, even if L(i,j)−C(i,j)>1, where L(i,j) is the occupancy of VOQ(i,j) (Note that
When an input-output pair is not matched, the flag is not changed. Note that in one alternative, the subscheduler operation(s) 226 may reset the flag (F(i,j,k)) directly.
Timing operations 250 may determine a current cell time slot (t), and using parameter K 255, may determine control signals based on its present determined value k.
Having described various operations that may be performed in accordance with the present invention, exemplary apparatus, methods and data structures for implementing such operations arc now described in § 4.4 below.
§ 4.4 Exemplary Methods and Apparatus for Performing the Exemplary Operations
Exemplary methods that may be used to implement various operations of the present invention are described in § 4.4.1. Then, exemplary apparatus that may be used to implement various operations of the present invention are described in § 4.4.2.
§ 4.4.1 Exemplary Methods
The main scheduler operation(s) 220 serve to manage the request counters 222, the request flags 224, and the subscheduler operations 226. The acts related to these management responsibilities may be triggered by events (e.g., the entry of a new cell into a VOQ) and may be run periodically or in accordance with some timing-based control (e.g., based on a value (k) derived from a current time slot (t)). Indeed, the timing operations 250 of FIG. 2 may be thought of as one of the main scheduler operation(s) 220. FIG. 4A is a flow diagram of an exemplary method 220a′ for event-triggered acts of the main scheduler operation(s). FIG. 4B is a flow diagram of an exemplary method 220b′ for timing-controlled acts of the main scheduler operation(s). Each is described below.
§ 4.4.2 Exemplary Apparatus
There are various signaling lines and links that may be provided. The solid lines may be used to indicate the receipt of a cell, and the transmission of the cell to the output via the (e.g., crossbar) switching fabric 680. The signaling line(s), depicted by the dashed line 692, may be used to indicate, to the scheduling control module 630, the arrival of new cells, and to indicate to the VOQs when a match request was successful. The signaling lines, depicted by dotted lines 694 and 696, may be used by the scheduling control module 630 to control the request counters 640 and the request flag 660 values, and to permit communications between the scheduling control module 630 and each of its subschedulers 650.
The VOQ operations 215, the main scheduler operations 220, and the subscheduler operations 226 may be effected by programmable logic arrays, integrated circuits such as application specific integrated circuits, and/or microprocessors operating in accordance with (e.g., stored) instructions. Memory (referred to generally as a “machine readable medium”) may be used to store the various state information, i.e., the values of the request counters and request flags), used by these components. Similarly, memory can be used to buffer cells at the virtual output queues.
§ 4.5 Examples Illustrating Operations Performed by an Exemplary Embodiment
Two previous cells have arrived and are being matched by subscheduler 0 and subscheduler 1. In FIG. 7a , these cells can be seen waiting in VOQ (9,0) and VOQ (9,9), respectively. As indicated, the request flags of RF(9,0,0) and RF(9,9,1) have been set to “1”. Also indicated, the request counters RC(9,0) and RC(9,9) have been incremented upon arrival of the cells at their VOQs, and decremented upon assignment to the subschedulers.
Still referring to FIG. 7a , in time slot t=1 a new cell arrives in input 0. The cell is to leave the switch on output port 0. Accordingly, it will be stored in VOQ(0,0). FIG. 7a shows the cell stored in VOQ(0,0). The main scheduler will become informed of this and will increment the request counter RC(0,0) corresponding to VOQ(0,0), (C(0,0)=C(0,0)+1). (Recall, e.g., FIG. 4A .)
Referring to FIG. 7b , at the beginning of time slot t=2, when k=2 (=2 mod 3), the scheduling control module will read RC(0,0) and see that C(0,0) is equal to one, and it will read RF(0,0,2) and see that F(0,0,2) is equal to zero. (Recall, e.g., 470 of FIG. 4b .) Dashed lines represent these actions. RC(0,0) will be decremented, C(0,0)=C(0,0)−1, and RF(0,0,2) will be set to one. (Recall, e.g., 474 and 472 of FIG. 4b .) Solid lines represent these actions. Within this cell time slot, subscheduler 2 begins to operate the maximal-sized matching according to the adopted algorithm. (Recall, e.g., 510, 520 and 530 of FIG. 5 .) This is because 2≧2 (2≧Kl+k=3 (0)+2) and 2<5 (2<K(l+1)+k=3(1)+2). Subscheduler 2 will have completed its matching operation by the end of the fifth time slot (t=5).
Referring to FIG. 7c , at the end of time slot t=3, when k=1 (=(3−(3−1)) mod 3), the k=1 subscheduler will have completed its matching operation. Recall that earlier cell arrived in input port 9, at time slot t=0, and is to leave on output port 9. If the input-output pair (9,9) is matched, the flag RF(9,9,1) will be unset (i.e., F(9,9,1)=0), and the VOQ(9,9) will be informed that it won arbitration and that it can send its head of line (HOL) cell to output 0 (e.g., in the next time slot). (Recall, e.g., 550, 560, and 570 of FIG. 5 .) Similarly, as indicated by FIG. 8 , at the end of cell time slot t=4, subscheduler 2 will finish matching, and at the end of cell time slot t=5, subscheduler 0 will finish matching. If new cells continue to arrive, this pattern will continue and the main scheduler will be able to produce a match at the end of every cell time slot.
Therefore, as this example illustrates, the main scheduler was given more time to complete the matching by allowing three time slots instead of just one. In addition, the subschedulers may implement maximal matching algorithms that provide 100% throughput under uniform traffic, and maintains fairness for best-effort traffic.
§ 4.6 Conclusions
In view of the foregoing, the scheduling technique and apparatus of the present invention relax the time for scheduling, thereby enabling large scale and/or high throughput switching, while maintaining fairness for best-effort traffic. Other approaches, such as WRRGS, may fail for best-effort traffic. For example, if the allowable arbitration time per iteration, a port speed, a cell size, and the number of iterations are T, C, L, and I, respectively, then, T=(KL)/(CI), where K is the number of subschedulers. In the non-pipelined DRRM scheme, K will essentially be equal one. The foregoing equation shows that by adding subschedulers, the time for arbitration will increase. For example if the time for arbitration for a non-pipelined DRRM scheme was 3.2 ns, that time can be increased to 9.6 ns by implementing three subschedulers. Having more time for arbitration may become important, because it is difficult to implement round-robin arbitration that supports large numbers of inputs in hardware using available CMOS technology, in which, for example, typical gate-delay time is about 100 ps. Refer to Texas Instruments, “GS40 0:15-micrometer CMOS, Standard Cell/Gate Array,” http://www.ti.com/, version 0.2, May 2000. Incorporated herein by reference. Therefore, the present invention achieves the desired number iterations even when the number of inputs increases or the port speed increases.
To reiterate, the present invention can achieve 100% throughput under uniform traffic, and maintain fairness for best-effort traffic. The known maximal-sized matching algorithms that meet both of these conditions require that a match be made at the end of every time slot. This becomes difficult, if not impossible, with decreasing arbitration times. The present invention advantageously allows for longer arbitration times, while still producing a match at the end of every time slot.
Given these conditions the present invention may allow the increase of inputs and output ports, and/or may allow for the increase in port speeds. In view of the foregoing, the present invention can provide a scalable switch that has high throughput and maintains fairness for best-effort traffic. The present invention can avoid the problems of (i) the increasing arbitration time as switch size increases, and/or (ii) decreasing cell time slots as port speed increases, by permitting the matching algorithm to use more time.
Claims (60)
1. For use with a switch having a first number of input ports, a second number of output ports, each of the input ports having the second number of virtual output queues, and a third number of subschedulers, each of the third number of subschedulers being able to arbitrate matching to each of the second number of output ports, a method for scheduling the dispatch of cells or packets stored in the virtual output queues, the method comprising:
a) for each of the virtual output queues, maintaining a first indicator for indicating whether the virtual output queue is storing a cell awaiting dispatch arbitration;
b) for each of the subschedulers, maintaining a second indicator F(i,j,k) for indicating whether the subscheduler is available or reserved; and
c) for each of the subschedulers, performing a matching operation, if it has been reserved, to match a cell buffered at a virtual output queue with its corresponding output port,
wherein each of the subschedulers requires more than one cell time slot to generate a match from its matching operation, and
wherein the subschedulers can collectively generate a match result for each output port in each cell time slot.
2. The method of claim 1 wherein each of the subschedulers requires the third number of cell time slots to generate a match from its matching operation.
3. The method of claim 1 wherein each of the subschedulers require no more than the third number of cell time slots to generate a match results from its matching operation.
4. The method of claim 1 wherein fairness for best-effort traffic is maintained.
5. The method of claim 1 wherein the matching operation is a matching operation selected from a group of matching operations consisting of (A) Dual Round-Robin Matching, and (B) iterative-SLIP.
6. The method of claim 1 further comprising:
d) if a cell buffered at a virtual output queue has been successfully matched with its corresponding output port, informing the virtual output queue.
7. The method of claim 6 further comprising:
e) for each of the virtual output queues, if the virtual output queue has been informed that it has been successfully matched with its corresponding output port, then dispatching its head of line cell.
8. The method of claim 7 wherein the head of line cell is dispatched in a next cell time slot.
9. The method of claim 1 further comprising:
e) if a cell buffered at a virtual output queue has been successfully matched with its corresponding output port, then dispatching its head of line cell.
10. The method of claim 9 wherein the head of line cell is dispatched in a next cell time slot.
11. The method of claim 1 wherein the first indicator, for each of the virtual output queues, for indicating whether the virtual output queue is storing a cell awaiting dispatch, is a count,
wherein the count is incremented upon learning that a new cell has arrived at the virtual output queue.
12. The method of claim 11 wherein the count is decremented when an available subscheduler is reserved for considering a head of line cell at a corresponding virtual output queue.
13. The method of claim 1 wherein the second indicator, for each of the subschedulers, is set to indicate that the associated subscheduler is reserved if the first indicator indicates that a corresponding virtual output queue is storing a cell awaiting dispatch arbitration.
14. The method of claim 1 wherein the second indicator, for each of the subschedulers, is set to indicate that the associated subscheduler is available if the associated subscheduler matches a cell buffered at a virtual output queue with its corresponding output port.
15. The method of claim 1 wherein the second indicator is set to indicate that a kth subscheduler is reserved if the first indicator indicates that a corresponding virtual output queue is storing a cell awaiting dispatch arbitration,
wherein k is set to the current cell time slot modulo the third number.
16. For use with a switch including a first number of output ports, a second number of input ports, and the first number of virtual output queues associated with each of the second number of input ports, a dispatch scheduler comprising:
a) a third number of subschedulers;
b) a first indicator, associated with each of the virtual output queues, for indicating whether the virtual output queue is storing a cell awaiting dispatch arbitration; and
c) a second indicator, for each of the subschedulers, indicating whether the subscheduler is available or reserved,
wherein each of the subschedulers is adapted to perform a matching operation, if it has been reserved, to match a cell buffered at a virtual output queue with its corresponding output port,
wherein each of the subschedulers requires more than one cell time slot to generate a match from its matching operation, and
wherein the subschedulers can collectively generate a match result for each output port in each cell time slot.
17. The dispatch scheduler of claim 16 wherein each of the subschedulers requires the third number of cell time slots to generate a match from its matching operation.
18. The dispatch scheduler of claim 16 wherein each of the subschedulers require no more than the third number of cell time slots to generate a match results from its matching operation.
19. The dispatch scheduler of claim 16 wherein fairness for best-effort traffic is maintained.
20. The dispatch scheduler of claim 16 wherein the matching operation is a matching operation selected from a group of matching operations consisting of (A) Dual Round-Robin Matching, and (B) iterative-SLIP.
21. The dispatch scheduler of claim 16 wherein if a cell buffered at a virtual output queue has been successfully matched with its corresponding output port, the virtual output queue is so informed.
22. The dispatch scheduler of claim 16 wherein if a cell buffered at a virtual output queue has been successfully matched with its corresponding output port, its head of line cell is dispatched.
23. The dispatch scheduler of claim 22 wherein the head of line cell is dispatched in a next cell time slot.
24. The dispatch scheduler of claim 16 wherein the first indicator, for each of the virtual output queues, for indicating whether the virtual output queue is storing a cell awaiting dispatch arbitration, is a count,
wherein the count is incremented upon learning that a new cell has arrived at the virtual output queue.
25. The dispatch scheduler of claim 24 wherein the count is decremented when an available subscheduler is reserved for considering a head of line cell at a corresponding virtual output queue.
26. The dispatch scheduler of claim 16 wherein the second indicator, for each of the subschedulers, is set to indicate that the associated subscheduler is reserved if the first indicator indicates that a corresponding virtual output queue is storing a cell awaiting dispatch arbitration.
27. The dispatch scheduler of claim 16 wherein the second indicator, for each of the subschedulers, is set to indicate that the associated subscheduler is available if the associated subscheduler matches a cell buffered at a virtual output queue with its corresponding output port.
28. The dispatch scheduler of claim 16 wherein the second indicator is set to indicate that a kth subscheduler is reserved if the first indicator indicates that a corresponding virtual output queue is storing a cell awaiting dispatch,
wherein k is set to the current cell time slot modulo the third number.
29. For use with a switch having a first number of input ports, a second number of output ports, each of the input ports having the second number of virtual output queues, and a third number of subschedulers, each of the third number of subschedulers being able to arbitrate matching to each of the second number of output ports, a method for scheduling the dispatch of cells or packets stored in the virtual output queues, the method comprising for each of the subschedulers, performing a matching operation, if it has been reserved, to match a cell buffered at a virtual output queue with its corresponding output port,
wherein each of the subschedulers requires more than one cell time slot to generate a match from its matching operation,
wherein the subschedulers can collectively generate a match result for each output port in each cell time slot, and
fairness is maintained for best-effort traffic.
30. The method of claim 29 wherein each of the subschedulers requires the third number of cell time slots to generate a match from its matching operation.
31. The method of claim 29 wherein each of the subschedulers require no more than the third number of cell time slots to generate a match results from its matching operation.
32. The method of claim 29 wherein the matching operation is a matching operation selected from a group of matching operations consisting of (A) Dual Round-Robin Matching, and (B) iterative-SLIP.
33. A method for scheduling the dispatch of cells stored in an output queue of a switch, the method comprising:
maintaining a first indicator for indicating whether the output queue is storing a cell awaiting dispatch arbitration;
maintaining a second indicator for indicating whether a subscheduler is available or reserved; and
performing a matching operation to match a cell buffered at the output queue with a corresponding output port if, based on the first indicator the output queue is storing the cell awaiting dispatch arbitration and if, based on the second indicator, the subscheduler has been reserved,
wherein the subscheduler requires more than one cell time slot to generate a match from the matching operation, and wherein the subscheduler collectively generates a match result in each cell time slot for the output port.
34. The method of claim 33, wherein the subscheduler requires a number of cell time slots equal to a total number of subschedulers to generate the match from the matching operation.
35. The method of claim 33, wherein the subscheduler requires no more than a number of cell time slots equal to a total number of subschedulers to generate the match results from the matching operation.
36. The method of claim 33, wherein fairness for best-effort traffic is maintained.
37. The method of claim 33, wherein the matching operation comprises at least one of the following: a Dual Round-Robin Matching operation and an iterative-SLIP operation.
38. The method of claim 33, further comprising, if the cell buffered at the output queue has been successfully matched with the corresponding output port, informing the output queue.
39. The method of claim 38 further comprising, if the output queue has been informed that it has been successfully matched with the corresponding output port, dispatching a head of line cell.
40. The method of claim 39, wherein the head of line cell is dispatched in a next cell time slot.
41. The method of claim 33, further comprising, if the cell buffered at the output queue has been successfully matched with the corresponding output port, dispatching a head of line cell.
42. The method of claim 41, wherein the head of line cell is dispatched in a next cell time slot.
43. The method of claim 33, wherein the first indicator for indicating whether the output queue is storing a cell awaiting dispatch, is a count, and wherein the count is incremented upon learning that a new cell has arrived at the output queue.
44. The method of claim 43, wherein the count is decremented when the subscheduler is reserved for considering a head of line cell at the corresponding output queue.
45. The method of claim 33, wherein the second indicator is set to indicate that the subscheduler is reserved if the first indicator indicates that the corresponding output queue is storing a cell awaiting dispatch arbitration.
46. The method of claim 33, wherein the second indicator is set to indicate that the subscheduler is available if the associated subscheduler matches a cell buffered at the output queue with the corresponding output port.
47. The method of claim 33 further comprising a plurality of subschedulers that define an order, wherein each of the subschedulers have a respective position in the order, and wherein the second indicator is set to indicate the subscheduler at the respective position is reserved if the first indicator indicates that the corresponding output queue is storing a cell awaiting dispatch arbitration, wherein the respective position is set to the current cell time slot modulo a total number of the plurality of subschedulers.
48. A dispatch scheduler for use with a switch, the dispatch scheduler comprising:
a) a subscheduler;
b) a first indicator module having a first indicator, associated with an output queue, for indicating whether the output queue is storing a cell awaiting dispatch arbitration; and
c) a second indicator module having a second indicator for indicating whether the subscheduler is available or reserved,
wherein the subscheduler is configured to perform a matching operation to match a cell buffered at the output queue with a corresponding output port if, based on the first indicator the output queue is storing the cell awaiting dispatch arbitration and if, based on the second indicator, the subscheduler has been reserved, wherein the subscheduler requires more than one cell time slot to generate a match from the matching operation, and wherein the subscheduler collectively generates a match result in each cell time slot for the output port.
49. The dispatch scheduler of claim 48, wherein the subscheduler requires a number of cell time slots equal to a total number of subschedulers to generate the match from the matching operation.
50. The dispatch scheduler of claim 48, wherein the subscheduler requires no more than a number of cell time slots equal to a total number of subschedulers to generate the match results from the matching operation.
51. The dispatch scheduler of claim 48, wherein fairness for best-effort traffic is maintained.
52. The dispatch scheduler of claim 48, wherein the matching operation comprises at least one of the following: a Dual Round-Robin Matching operation and an iterative-SLIP operation.
53. The dispatch scheduler of claim 48, wherein if a cell buffered at the output queue has been successfully matched with the corresponding output port, the output queue is so informed.
54. The dispatch scheduler of claim 48, wherein if a cell buffered at the output queue has been successfully matched with the corresponding output port, a head of line cell is dispatched.
55. The dispatch scheduler of claim 54, wherein the head of line cell is dispatched in a next cell time slot.
56. The dispatch scheduler of claim 48, wherein the first indicator is a count, and wherein the count is incremented upon learning that a new cell has arrived at the output queue.
57. The dispatch scheduler of claim 56, wherein the count is decremented when the subscheduler is reserved for considering a head of line cell at the corresponding output queue.
58. The dispatch scheduler of claim 48, wherein the second indicator is set to indicate that the subscheduler is reserved if the first indicator indicates that the corresponding output queue is storing a cell awaiting dispatch arbitration.
59. The dispatch scheduler of claim 48, wherein the second indicator is set to indicate that the subscheduler is available if the subscheduler matches a cell buffered at the output queue with the corresponding output port.
60. The dispatch scheduler of claim 48 further comprising a plurality of subschedulers that define an order, wherein each of the subschedulers have a respective position in the order, and wherein the second indicator is set to indicate the subscheduler at the respective position is reserved if the first indicator indicates that the corresponding output queue is storing a cell awaiting dispatch arbitration, wherein the respective position is set to the current cell time slot modulo a total number of the plurality of subschedulers.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/072,900 USRE43110E1 (en) | 2001-05-31 | 2008-02-28 | Pipelined maximal-sized matching cell dispatch scheduling |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US29487701P | 2001-05-31 | 2001-05-31 | |
US09/873,110 US7006514B2 (en) | 2001-05-31 | 2001-06-01 | Pipelined maximal-sized matching cell dispatch scheduling |
US12/072,900 USRE43110E1 (en) | 2001-05-31 | 2008-02-28 | Pipelined maximal-sized matching cell dispatch scheduling |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/873,110 Reissue US7006514B2 (en) | 2000-11-20 | 2001-06-01 | Pipelined maximal-sized matching cell dispatch scheduling |
Publications (1)
Publication Number | Publication Date |
---|---|
USRE43110E1 true USRE43110E1 (en) | 2012-01-17 |
Family
ID=26968799
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/873,110 Ceased US7006514B2 (en) | 2000-11-20 | 2001-06-01 | Pipelined maximal-sized matching cell dispatch scheduling |
US12/072,900 Expired - Lifetime USRE43110E1 (en) | 2001-05-31 | 2008-02-28 | Pipelined maximal-sized matching cell dispatch scheduling |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US09/873,110 Ceased US7006514B2 (en) | 2000-11-20 | 2001-06-01 | Pipelined maximal-sized matching cell dispatch scheduling |
Country Status (1)
Country | Link |
---|---|
US (2) | US7006514B2 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150200874A1 (en) * | 2002-06-27 | 2015-07-16 | Coriant | Apparatus and Method to Switch Packets Using a Switch Fabric With Memory |
Families Citing this family (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7072353B2 (en) * | 2000-06-15 | 2006-07-04 | At&T Corp. | Flexible bandwidth allocation in high-capacity grooming switches |
US7103056B2 (en) * | 2000-11-20 | 2006-09-05 | Polytechnic University | Scheduling the dispatch of cells in multistage switches using a hierarchical arbitration scheme for matching non-empty virtual output queues of a module with outgoing links of the module |
USRE42600E1 (en) * | 2000-11-20 | 2011-08-09 | Polytechnic University | Scheduling the dispatch of cells in non-empty virtual output queues of multistage switches using a pipelined arbitration scheme |
DE10128686A1 (en) * | 2001-06-13 | 2002-12-19 | Philips Corp Intellectual Pty | Virtual output queue packet switching unit uses port controller status information |
US7203202B2 (en) * | 2001-10-31 | 2007-04-10 | Polytechnic University | Arbitration using dual round robin matching with exhaustive service of winning virtual output queue |
US7292594B2 (en) * | 2002-06-10 | 2007-11-06 | Lsi Corporation | Weighted fair share scheduler for large input-buffered high-speed cross-point packet/cell switches |
KR100468946B1 (en) * | 2002-12-20 | 2005-02-02 | 한국전자통신연구원 | Input Buffered Switches and Its Contention Method Using Pipelined Simple Matching |
EP1668928A1 (en) * | 2003-09-29 | 2006-06-14 | BRITISH TELECOMMUNICATIONS public limited company | Matching process |
ATE323363T1 (en) * | 2003-10-21 | 2006-04-15 | Cit Alcatel | SWITCHING UNIT WITH SCALABLE AND ßQOSß-CONSIDERATE DATA FLOW CONTROL |
JP2006099731A (en) * | 2004-08-30 | 2006-04-13 | Matsushita Electric Ind Co Ltd | Resource management apparatus |
US7852866B2 (en) * | 2006-12-29 | 2010-12-14 | Polytechnic Institute of New York Universiity | Low complexity scheduling algorithm for a buffered crossbar switch with 100% throughput |
US7912068B2 (en) * | 2007-07-20 | 2011-03-22 | Oracle America, Inc. | Low-latency scheduling in large switches |
JP5096102B2 (en) * | 2007-10-17 | 2012-12-12 | 株式会社ユニバーサルエンターテインメント | Radio communication tag and radio communication system |
GB2464310B (en) | 2008-10-10 | 2012-10-17 | Micron Technology Inc | Switching device |
US8995456B2 (en) * | 2009-04-08 | 2015-03-31 | Empire Technology Development Llc | Space-space-memory (SSM) Clos-network packet switch |
US8675673B2 (en) | 2009-07-29 | 2014-03-18 | New Jersey Institute Of Technology | Forwarding cells of partitioned data through a three-stage Clos-network packet switch with memory at each stage |
US8274988B2 (en) * | 2009-07-29 | 2012-09-25 | New Jersey Institute Of Technology | Forwarding data through a three-stage Clos-network packet switch with memory at each stage |
US11516151B2 (en) * | 2019-12-31 | 2022-11-29 | Infinera Oy | Dynamically switching queueing systems for network switches |
WO2022035935A1 (en) * | 2020-08-11 | 2022-02-17 | Georgia Tech Research Corporation | Multi-packet sliding window scheduler and method for input-queued switches |
Citations (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5495476A (en) | 1995-01-26 | 1996-02-27 | International Business Machines Corporation | Parallel algorithm to set up benes switch; trading bandwidth for set up time |
US6061330A (en) | 1996-03-15 | 2000-05-09 | Telefoanktiebolaget Lm Ericsson | Flow and congestion control in packet switched networks |
US6072772A (en) | 1998-01-12 | 2000-06-06 | Cabletron Systems, Inc. | Method for providing bandwidth and delay guarantees in a crossbar switch with speedup |
US6141323A (en) | 1996-06-03 | 2000-10-31 | Whittaker Corporation | Closed loop congestion control using a queue measurement system |
US6157643A (en) | 1996-11-06 | 2000-12-05 | Nokia Telecommunications Oy | Switching fabric |
US20010023469A1 (en) | 2000-03-20 | 2001-09-20 | Gab-Joong Jeong | Distributed type input buffer switch system for transmitting arbitration information efficiently and method for processing input data using the same |
US6295295B1 (en) | 1995-11-27 | 2001-09-25 | Telefonaktiebolaget Lm Ericsson | Scheduler for an information packet switch |
US6324165B1 (en) | 1997-09-05 | 2001-11-27 | Nec Usa, Inc. | Large capacity, multiclass core ATM switch architecture |
US20020061020A1 (en) * | 2000-11-20 | 2002-05-23 | Polytechnic University | Scheduling the dispatch of cells in multistage switches |
US20020110134A1 (en) | 2000-12-15 | 2002-08-15 | Glenn Gracon | Apparatus and methods for scheduling packets in a broadband data stream |
US20020110135A1 (en) * | 2000-11-20 | 2002-08-15 | Polytechnic University | Scheduling the dispatch of cells in non-empty virtual output queues of multistage switches using a pipelined arbitration scheme |
US20020122428A1 (en) | 2001-01-03 | 2002-09-05 | Nec Usa, Inc. | Pipeline scheduler with fairness and minimum bandwidth guarantee |
US20020136230A1 (en) | 2000-12-15 | 2002-09-26 | Dell Martin S. | Scheduler for a packet routing and switching system |
US20020176431A1 (en) | 2001-02-17 | 2002-11-28 | Golla Prasad N. | Multiserver scheduling system and method for a fast switching element |
US20030007498A1 (en) | 1999-05-14 | 2003-01-09 | Bay Networks, Nc. | Multicast and unicast scheduling for a network device |
US20030031193A1 (en) | 2001-04-06 | 2003-02-13 | Andrews Daniel M. | Scalable weight-based terabit switch scheduling method |
US6657959B1 (en) | 1998-06-27 | 2003-12-02 | Intel Corporation | Systems and methods for implementing ABR with guaranteed MCR |
US6687225B1 (en) | 1998-11-10 | 2004-02-03 | Fujitsu Limited | Bandwidth control apparatus |
US6725267B1 (en) | 2000-05-31 | 2004-04-20 | Prediwave Corporation | Prefetched data in a digital broadcast system |
US20040120321A1 (en) | 2002-12-20 | 2004-06-24 | Han Man Soo | Input buffered switches using pipelined simple matching and method thereof |
US6795442B1 (en) | 1998-04-23 | 2004-09-21 | Emulex Design & Manufacturing Corporation | System and method for scheduling message transmission and processing in a digital data network |
US6810031B1 (en) | 2000-02-29 | 2004-10-26 | Celox Networks, Inc. | Method and device for distributing bandwidth |
-
2001
- 2001-06-01 US US09/873,110 patent/US7006514B2/en not_active Ceased
-
2008
- 2008-02-28 US US12/072,900 patent/USRE43110E1/en not_active Expired - Lifetime
Patent Citations (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5495476A (en) | 1995-01-26 | 1996-02-27 | International Business Machines Corporation | Parallel algorithm to set up benes switch; trading bandwidth for set up time |
US6295295B1 (en) | 1995-11-27 | 2001-09-25 | Telefonaktiebolaget Lm Ericsson | Scheduler for an information packet switch |
US6061330A (en) | 1996-03-15 | 2000-05-09 | Telefoanktiebolaget Lm Ericsson | Flow and congestion control in packet switched networks |
US6141323A (en) | 1996-06-03 | 2000-10-31 | Whittaker Corporation | Closed loop congestion control using a queue measurement system |
US6157643A (en) | 1996-11-06 | 2000-12-05 | Nokia Telecommunications Oy | Switching fabric |
US6324165B1 (en) | 1997-09-05 | 2001-11-27 | Nec Usa, Inc. | Large capacity, multiclass core ATM switch architecture |
US6072772A (en) | 1998-01-12 | 2000-06-06 | Cabletron Systems, Inc. | Method for providing bandwidth and delay guarantees in a crossbar switch with speedup |
US6795442B1 (en) | 1998-04-23 | 2004-09-21 | Emulex Design & Manufacturing Corporation | System and method for scheduling message transmission and processing in a digital data network |
US6657959B1 (en) | 1998-06-27 | 2003-12-02 | Intel Corporation | Systems and methods for implementing ABR with guaranteed MCR |
US6687225B1 (en) | 1998-11-10 | 2004-02-03 | Fujitsu Limited | Bandwidth control apparatus |
US20030007498A1 (en) | 1999-05-14 | 2003-01-09 | Bay Networks, Nc. | Multicast and unicast scheduling for a network device |
US6810031B1 (en) | 2000-02-29 | 2004-10-26 | Celox Networks, Inc. | Method and device for distributing bandwidth |
US20010023469A1 (en) | 2000-03-20 | 2001-09-20 | Gab-Joong Jeong | Distributed type input buffer switch system for transmitting arbitration information efficiently and method for processing input data using the same |
US6725267B1 (en) | 2000-05-31 | 2004-04-20 | Prediwave Corporation | Prefetched data in a digital broadcast system |
US20020110135A1 (en) * | 2000-11-20 | 2002-08-15 | Polytechnic University | Scheduling the dispatch of cells in non-empty virtual output queues of multistage switches using a pipelined arbitration scheme |
US20020061020A1 (en) * | 2000-11-20 | 2002-05-23 | Polytechnic University | Scheduling the dispatch of cells in multistage switches |
US20020136230A1 (en) | 2000-12-15 | 2002-09-26 | Dell Martin S. | Scheduler for a packet routing and switching system |
US20020110134A1 (en) | 2000-12-15 | 2002-08-15 | Glenn Gracon | Apparatus and methods for scheduling packets in a broadband data stream |
US20020122428A1 (en) | 2001-01-03 | 2002-09-05 | Nec Usa, Inc. | Pipeline scheduler with fairness and minimum bandwidth guarantee |
US20020176431A1 (en) | 2001-02-17 | 2002-11-28 | Golla Prasad N. | Multiserver scheduling system and method for a fast switching element |
US20030031193A1 (en) | 2001-04-06 | 2003-02-13 | Andrews Daniel M. | Scalable weight-based terabit switch scheduling method |
US20040120321A1 (en) | 2002-12-20 | 2004-06-24 | Han Man Soo | Input buffered switches using pipelined simple matching and method thereof |
Non-Patent Citations (25)
Title |
---|
A. Smiljanic, "Flexible Bandwidth Allocation in Terabit Packet Switches", pp. 233-239. |
A. Smiljanic, R. Fan and G. Ramamurthy, "RRGS-Round-Robin Greedy Scheduling for Electronic/Optical Terabit Switches", Global Telecommunications Conference-Globecom '99, pp. 1244-1250 (May 1999). |
A. Smiljanić, R. Fan and G. Ramamurthy, "RRGS-Round-Robin Greedy Scheduling for Electronic/Optical Terabit Switches", Global Telecommunications Conference—Globecom '99, pp. 1244-1250 (May 1999). |
Chao et al., "Centralized Contention Resolution Schemes for a Large-Capacity Optical ATM Switch," Proc. IEEE Atm Workshop '97, 1998, 11-16. |
Chao et al., "Saturn: A Terabit Packet Switch Using Dual Round-Robin," IEEE Communications Magazine, 2000, 78-84. |
E. Oki, N. Yamanaka, Y. Ohtomo, K. Okazaki and R. Kawano, "A 10-Gb/s (1.25 Gb/s×8) 4×2 0.25-mum CMOS/SIMOX ATM Switch Based on Scalable Distributed Arbitration", IEEE J. of Solid-State Circuits, vol. 34, No. 12, pp. 1921-1934 (Dec. 1999). |
E. Oki, N. Yamanaka, Y. Ohtomo, K. Okazaki and R. Kawano, "A 10-Gb/s (1.25 Gb/s×8) 4×2 0.25-μm CMOS/SIMOX ATM Switch Based on Scalable Distributed Arbitration", IEEE J. of Solid-State Circuits, vol. 34, No. 12, pp. 1921-1934 (Dec. 1999). |
G. Nong and M. Hamdi, "On the Provision of Quality-of-Service Guarantees for Input Queued Switches", IEEE Commun. Mag., pp. 62-69 (Dec. 2000). |
G. Nong, J. K. Muppals and M. Hamdi, "Analysis of Nonblocking ATM Switches with Multiple Input Queues", IEEE/ACM Transactions on Networking, vol. 7, No. 1, pp. 60-74 (Feb. 1999). |
H.J. Chao and J-S Park, "Centralized Contention Resolution Schemes for a Large-Capacity Optical ATM Switch", Proc. IEEE ATM Workshop '97, pp. 11-16 (Fairfax, VA, May 1998). |
J. Chao, "Saturn: A Terabit Packet Switch Using Dual Round-Robin", IEEE Communications Magazine, pp. 78-84 (Dec. 2000). |
J. Turner and N. Yamanaka, "Architectural Choices in Large Scale ATM Switches", IEICE Trans. Commun., vol. E81-B, No. 2, pp. 120-137 (Feb. 1998). |
McKeown et al., "Achieving 100% Throughout in an Input-Queued Switch," IEEE Trans. On Communications, vol. 47, No. 8, 1260-1267. |
McKeown, N., "The iSLIP Scheduling Algorithm for Input-Queued Switches," IEEE/ACM Transactions on Networking, 1999, vol. 7, No. 2, 188-201. |
N. McKeown, "The iSLIP Scheduling Algorithm for Input-Queued Switches", IEEE/ACM Transactions on Networking, vol. 7, No. 2, pp. 188-201 (Apr. 1999). |
N. McKeown, A. Mekkittikul, V. Anantharam, and J. Walrand, "Achieving 100% Throughput in an Input-Queued Switch", IEEE Trans. on Communications, vol. 47, No. 8, pp. 1260-1267 (Aug. 1999). |
N. Yamanaka, E. Oki, S. Yasukawa, R. Kawano and K. Okazaki, "OPTIMA: Scalable, Multi-Stage, 640-Gbit/s ATM Switching System Based on Advanced Electronic and Optical WDM Technologies", IEICE Trans. Commun., vol. E83-B, No. 7, pp. 1488-1496 (Jul. 2000). |
Nong et al., "Analysis of Nonblocking ATM Switches with Multiple Input Queues," IEEE/ACM Transactions on Networking, 1999, vol. 7, No. 1, 60-74. |
Nong et al., "On the Provision of Quality-of-Service Guarentees for Input Queued Switches," IEEE Commun. Mag., 2000, 62-69. |
Oki et al., "A 10-Gb/s (1.25 Gb/s×8) 4×2 0.25-mum CMOS/SIMOX ATM Switch based on Scalable Distributed Arbitration," IEEE J. Of Solid-State Circuits, 1999, vol. 34, No. 12, 1921-1934. |
Oki et al., "A 10-Gb/s (1.25 Gb/s×8) 4×2 0.25-μm CMOS/SIMOX ATM Switch based on Scalable Distributed Arbitration," IEEE J. Of Solid-State Circuits, 1999, vol. 34, No. 12, 1921-1934. |
Smiljanic et al., "RRGS-Round-Robin Greedy Scheduling for Electronic/Optical Terabit Switches," Global Telecommunications Conference-Globecom '99, 1999, 1244-1250. |
Smiljanic, A., "Flexible Bandwidth Allocation in Terabit Packet Switches," 223-239. |
Turner et al., "Architectural Choices in Large Scale ATM Switches," IEICE Trans. Commun., 1998, vol. E81-B, No. 2, 120-137. |
Yamanaka et al., "OPTIMA: Scalable, Multi-State, 640-Gbit/s ATM Switching System Based on Advanced Electronic and Optical WDM Technologies," IEICE Trans. Commun., 2000, vol. E83-B, No. 7, 1488-1496. |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150200874A1 (en) * | 2002-06-27 | 2015-07-16 | Coriant | Apparatus and Method to Switch Packets Using a Switch Fabric With Memory |
Also Published As
Publication number | Publication date |
---|---|
US7006514B2 (en) | 2006-02-28 |
US20020181483A1 (en) | 2002-12-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
USRE43110E1 (en) | Pipelined maximal-sized matching cell dispatch scheduling | |
USRE43466E1 (en) | Scheduling the dispatch of cells in non-empty virtual output queues of multistage switches using a pipelined hierarchical arbitration scheme | |
US6134217A (en) | Traffic scheduling system and method for packet-switched networks with fairness and low latency | |
EP1009189B1 (en) | RRGS-round-robin greedy scheduling for input/output buffered terabit switches | |
US6940851B2 (en) | Scheduling the dispatch of cells in non-empty virtual output queues of multistage switches using a pipelined arbitration scheme | |
US5859835A (en) | Traffic scheduling system and method for packet-switched networks | |
US6351466B1 (en) | Switching systems and methods of operation of switching systems | |
US7173931B2 (en) | Scheduling the dispatch of cells in multistage switches | |
US6810012B1 (en) | Queue service interval based cell schedular with hierarchical queuing configurations | |
Chuang et al. | Practical algorithms for performance guarantees in buffered crossbars | |
EP1262085B1 (en) | Packet switching | |
US7852769B2 (en) | Flexible bandwidth allocation in high-capacity packet switches | |
EP1061763B1 (en) | A pipelined packet scheduler for high speed optical switches | |
US20060285548A1 (en) | Matching process | |
Oki et al. | A pipeline-based approach for maximal-sized matching scheduling in input-buffered switches | |
US8737414B2 (en) | Scheduling, including distributed scheduling, for a buffered crossbar switch | |
US7203202B2 (en) | Arbitration using dual round robin matching with exhaustive service of winning virtual output queue | |
US20030035427A1 (en) | Method and apparatus for arbitration scheduling with a penalty for a switch fabric | |
CN100490383C (en) | A high-speed Crossbar scheduling method for supporting multipriority | |
US20080031262A1 (en) | Load-balanced switch architecture for reducing cell delay time | |
Pan et al. | Localized independent packet scheduling for buffered crossbar switches | |
Schoenen et al. | Weighted arbitration algorithms with priorities for input-queued switches with 100% throughput | |
US7103056B2 (en) | Scheduling the dispatch of cells in multistage switches using a hierarchical arbitration scheme for matching non-empty virtual output queues of a module with outgoing links of the module | |
Kleban et al. | CRRD-OG: A packet dispatching algorithm with open grants for three-stage buffered Clos-network switches | |
Mhamdi et al. | Practical scheduling algorithms for high-performance packet switches |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
CC | Certificate of correction | ||
FPAY | Fee payment |
Year of fee payment: 8 |
|
FEPP | Fee payment procedure |
Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
FPAY | Fee payment |
Year of fee payment: 12 |