US 7394809 B2 Résumé A packet classifier having a forest of hash tables data structure. The forest of hash tables data structure includes a number of hash tables, each hash table having a bit mask corresponding to an equivalent set of rules. Each hash table includes a number of entries, wherein an entry of a hash table may correspond to a rule. One or more of the hash tables may include a marker in one entry, wherein the marker identifies another one of the hash tables. The hash table identified by the marker is a descendant of the hash table in which the marker is placed.
Revendications 1. A data structure stored on a computer accessible medium comprising:
a first bit mask;
a number of entries, each entry including a key formed using the first bit mask; and
hash tables representing a root node, wherein at least one of the entries includes a marker, the marker identifying a hash table having a second bit mask, wherein the second bit mask is a descendant of the first bit mask,
wherein the marker identifies an array of descriptors, one of the descriptors identifying the hash table having the second bit mask.
2. The data structure of
a rule identifier associated with one of the entries, the rule identifier identifying a rule associated with the key of the one entry.
3. The data structure of
a priority class associated with the one entry, the priority class identifying a priority of the associated rule.
4. The data structure of
5. An apparatus comprising:
a processing system; and
a memory coupled with the processing system, the memory having a data structure stored therein, the data structure including
a first bit mask, and
a number of entries, each entry including a key formed using the first bit mask, wherein at least one of the entries includes a marker, the marker identifying a hash table having a second bit mask, wherein the second bit mask is a descendant of the first bit mask, wherein the marker of the data structure identifies an array of descriptors, one of the descriptors identifying the hash table having the second bit mask.
6. The apparatus of
a rule identifier associated with one of the entries, the rule identifier identifying a rule associated with the key of the one entry.
7. The apparatus of
a priority class associated with the one entry, the priority class identifying a priority of the associated rule.
8. The apparatus of
9. The apparatus of
10. The apparatus of
11. The apparatus of
12. A data structure stored on a computer accessible medium comprising:
a number of hash tables, each hash table including a bit mask and a number of entries, each entry including a key formed using the bit mask, at least one of the hash tables representing a root node; and
a number of markers, each marker associated with one of the entries of one of the hash tables, the marker of an entry of a hash table identifying another hash table that is a descendant of that hash table, wherein the marker of the entry of the hash table identifies an array of descriptors.
13. The data structure of
14. The data structure of
15. The data structure of
16. An apparatus comprising:
a processing system; and
a memory coupled with the processing system, the memory having a data structure stored therein, the data structure including
a number of hash tables, each hash table including a bit mask and a number of entries, each entry including a key formed using the bit mask, at least one of the hash tables representing a root node; and
a number of markers, each marker associated with one of the entries of one of the hash tables, the marker of an entry of a hash table identifying another hash table that is a descendant of that hash table, wherein the marker of the entry of the hash table identifies an array of descriptors.
17. The apparatus of
18. The apparatus of
19. The apparatus of
20. The apparatus of
21. The apparatus of
22. The apparatus of
23. A method comprising:
selecting from a number of hash tables a hash table having a bit mask corresponding to a bit mask of a rule;
adding an entry for the rule to the selected hash table;
if the selected hash table has an ancestor hash table that is a root node, placing a marker in the root ancestor hash table, the marker identifying the selected hash table; and
creating a search string based upon the rule and a bit mask of the root ancestor hash table.
24. The method of
25. The method of
26. The method of
27. The method of
comparing the search string with each of a number of entries of the root ancestor hash table to determine whether one of the entries has a key matching the search string.
28. The method of
if none of the entries of the root ancestor hash table has a key matching the search string, adding a new entry to the root ancestor hash table; and
placing a key in the new entry, the key corresponding to the search string;
wherein the marker identifying the selected hash table is placed in the new entry of the root ancestor hash table.
29. The method of
placing the marker identifying the selected hash table in the matching entry of the root ancestor hash table if a threshold number of markers will not be exceeded at the matching entry.
30. The method of
accessing an existing marker in the matching entry of the root ancestor hash table; and
placing the marker in a hash table identified by the existing marker, wherein the identified hash table is an ancestor to the selected hash table.
31. The method of
creating a search string based upon the rule and a bit mask of the hash table identified by the existing marker; and
comparing the search string with each of a number of entries of the hash table identified by the existing maker,
wherein the marker is placed in one of the entries having a key matching the search string.
32. The method of
overriding the threshold number of markers; and
placing the marker identifying the selected hash table in the matching entry of the root ancestor hash table.
33. The method of
34. The method of
35. The method of
36. An apparatus comprising:
a memory system having stored thereon a set of instructions; and
a processing system coupled with the memory system, wherein the set of instructions, when executed on the processing system, causes the processing system to
select from a number of hash tables a hash table having a bit mask corresponding to a bit mask of a rule;
add an entry for the rule to the selected hash table; and
if the selected hash table has an ancestor hash table that is a root node, place a marker in the root ancestor hash table, the marker identifying the selected hash table, wherein execution of the set of instructions farther causes the processing system to create a search string based upon the rule and a bit mask of the root ancestor hash table.
37. The apparatus of
38. The apparatus of
39. The apparatus of
40. The apparatus of
to determine whether one of the entries has a key matching the search string.
41. The apparatus of
if none of the entries of the root ancestor hash table has a key matching the search string, add a new entry to the root ancestor hash table; and
place a key in the new entry, the key corresponding to the search string;
wherein the-marker identifying the selected hash table is placed in the new entry of the root ancestor hash table.
42. The apparatus of
place the marker identifying the selected hash table in the matching entry of the root ancestor hash table if a threshold number of markers will not be exceeded at the matching entry.
43. The apparatus of
access an existing marker in the matching entry of the root ancestor hash table; and
place the marker in a hash table identified by the existing marker, wherein the identified hash table is an ancestor to the selected hash table.
44. The apparatus of
create a search string based upon the rule and a bit mask of the hash table identified by the existing marker; and
compare the search string with each of a number of entries of the hash table identified by the existing maker, wherein the marker is placed in one of the entries having a key matching the search string.
45. The apparatus of
override the threshold number of markers; and
place the marker identifying the selected hash table in the matching entry of the root ancestor hash table.
46. The apparatus of
47. The apparatus of
48. The apparatus of
49. An article of manufacture comprising:
a computer accessible medium providing content that, when accessed by a computer, causes the computer to:
select from a number of hash tables a hash table having a bit mask corresponding to a bit mask of a rule;
add an entry for the rule to the selected hash table; and
if the selected hash table has an ancestor hash table that is a root node, place a marker in the root ancestor hash table, the marker identifying the selected hash table, wherein the content, when accessed, further causes the computer to create a search string based upon the rule and a bit mask of the root ancestor hash table.
50. The article of manufacture of
51. The article of manufacture of
52. The article of manufacture of
53. The article of manufacture of
54. The article of manufacture of
if none of the entries of the root ancestor hash table has a key matching the search string,
add a new entry to the root ancestor hash table; and
place a key in the new entry, the key corresponding to the search string;
wherein the marker identifying the selected hash table is placed in the new entry of the root ancestor hash table.
55. The article of manufacture of
place the marker identifying the selected hash table in the matching entry of the root ancestor hash table if a threshold number of markers will not be exceeded at the matching entry.
56. The article of manufacture of
access an existing marker in the matching entry of the root ancestor hash table; and
place the marker in a hash table identified by the existing marker, wherein the identified hash table is an ancestor to the selected hash table.
57. The article of manufacture of
create a search string based upon the rule and a bit mask of the hash table identified by the existing marker; and
compare the search string with each of a number of entries of the hash table identified by the existing maker, wherein the marker is placed in one of the entries having a key matching the search string.
58. The article of manufacture of
override the threshold number of markers; and
place the marker identifying the selected hash table in the matching entry of the root ancestor hash table.
59. The article of manufacture of
60. The article of manufacture of
61. The article of manufacture of
Description The invention relates generally to computer networking and, more particularly, to a method and apparatus for classifying packets. Traditionally, packet routing in computer networks was based solely on the destination address of a packet. This routing technique essentially provided “best effort” delivery, and all traffic going to the same address was treated identically. However, packet routing based on destination address alone is insufficient to meet growing demands for greater bandwidth, enhanced security, and increased flexibility and service differentiation. To meet these objectives, equipment vendors and service providers are providing more discriminating forms of routing, including firewalls, quality of service (QoS) based routing, and bandwidth and/or resource reservation. Generally, a firewall comprises any component, or combination of components, capable of blocking certain classes of traffic (e.g., “unwanted” or “suspicious” traffic). Firewalls are often utilized in corporate networks and other enterprise networks, and the firewall is usually implemented at the entry and/or exit points—i.e., the “trust boundary”—of the network. A typical firewall includes a series of packet filters that are designed to carry out a desired security policy. Network service providers may have a wide array of customers, each requiring different services, service priorities, and pricing. To provide differentiated services to a number of different customers—or, more generally, to provide preferential treatment to certain classes of network traffic—service providers have implemented a variety of mechanisms, including QoS based routing and bandwidth/resource reservation. The goal of QoS based routing is to provide service differentiation for a number of different customers and/or traffic types. QoS based routing may include, for example, routing based upon class of service, special queuing procedures (e.g., per-flow queuing), and fair scheduling methods. Integrally tied with QoS routing is bandwidth or resource reservation. Bandwidth reservation generally includes reserving a specified bandwidth for certain types of traffic. For example, bandwidth reservation may be applied to traffic between two points, or bandwidth reservation may be applied to traffic relating to a certain application (e.g., multimedia, video, etc.). To implement the above-described routing methodologies (e.g., firewalls, QoS routing, bandwidth reservation) that provide more discriminating routing of network traffic, as well as to perform other policy-based packet forwarding techniques, it is necessary to classify packets. Generally, packet classification comprises distinguishing between packets belonging to different flows or between packets associated with different traffic types. As used herein, a “flow” is a series of packets that share at least some common header characteristics (e.g., packets flowing between two specific addresses). A packet is usually classified based upon one or more fields in the packet's header. One or more filters, or “rules,” are applied to this header information to determine which flow the packet corresponds with or what type of traffic the packet is associated with. A number of methods—both hardware and software implementations—for performing packet classification based upon header data are known in the art. However, packet classification is often the bottleneck in routers, especially those routers supporting high speed links (e.g., gigabit capacity), as packet classification techniques struggle to keep pace with advances in link speeds. Further, some conventional packet classification schemes lack the ability to efficiently handle a large number of classification filters (or rules) and may also have large memory requirements. Embodiments of a packet classifier are disclosed herein. The disclosed embodiments of the packet classifier are described below in the context of a router implementing a firewall. However, it should be understood that the disclosed embodiments are not so limited in application and, further, that the embodiments of a packet, classifier described in the following text and figures are generally applicable to any device, system, and/or circumstance where classification of packets or other communications is needed. Illustrated in The router 200 is coupled via a plurality of links 130—including links 130 a, 130 b, . . . , 130 n—with a number of nodes 110 and/or a number of subnets 120. A node 110 comprises any addressable device. For example, a node 110 may comprise a computer system or other computing device, such as a server, a desktop computer, a laptop computer, or a hand-held computing device (e.g., a personal digital assistant or PDA). A subnet 120 may comprise a collection of other nodes, and a subnet 120 may also include other routers or switches. Each of the links 130 a-n may be established over any suitable medium,—e.g., wireless, copper wire, fiber optic, or a combination thereof—using any suitable protocol—e.g., TCP/IP (Transmission Control Protocol/Internet Protocol), HTTP (Hyper-Text Transmission Protocol), as well as others. The network 100 may comprise any type of network, such as a Local Area Network (LAN), a Metropolitan Area Network (MAN), a Wide Area Network (WAN), a Wireless LAN (WLAN), or other network. The router 200 also couples the network 100 with another network (or networks) 5, such as, by way of example, the Internet and/or another LAN, MAN, LAN, or WLAN. Router 200 may be coupled with the other network 5 via any suitable medium, including a wireless, copper wire, and/or fiber optic connection using any suitable protocol (e.g., TCP/IP, HTTP, etc.). It should be understood that the network 100 shown in In one embodiment, the router 200 comprises any suitable computer system, and the packet classifier 500 comprises a software application that may be implemented or executed on this computer system. An embodiment of such a computer system is illustrated in Referring to Coupled with bus 205 is a processing device (or devices) 300. The processing device 300 may comprise any suitable processing device or system, including a microprocessor, a network processor, an application specific integrated circuit (ASIC), or a field programmable gate array (FPGA), or similar device. An embodiment of the processing device 300 is illustrated below in Computer system 200 also includes system memory 210 coupled with bus 205, the system memory 210-comprising, for example, any suitable type of random access memory (e.g., dynamic random access memory, or DRAM). During operation of computer system 200 an operating system 214, the packet classifier 500, as well as other programs 218 may be resident in the system memory 210. Computer system 200 may further include a read-only memory (ROM) 220 coupled with the bus 205. During operation, the ROM 220 may store temporary instructions and variables for processing device 300, and ROM 220 may also have resident thereon a system BIOS (Basic Input/Output System). The computer system 200 may also include a storage device 230 coupled with the bus 205. The storage device 230 comprises any suitable non-volatile memory, such as, for example, a hard disk drive. The packet classifier 500, as well as operating system 214 and other programs 218 (e.g., a software implementation of firewall 201), may be stored in the storage device 230. Further, a device 240 for accessing removable storage media (e.g., a floppy disk drive or a CD ROM drive) may be coupled with bus 205. The computer system 200 may include one or more input devices 250 coupled with the bus 205. Common input devices 250 include keyboards, pointing devices such as a mouse, and scanners or other data entry devices. One or more output devices 260 may also be coupled with the bus 205. Common output devices 260 include video monitors, printing devices, and audio output devices (e.g.; a sound card and speakers). Computer system 200 further comprises a network interface 270 coupled with bus 205. The network interface 270 comprises any suitable hardware, software, or combination of hardware and software capable of coupling the computer system 200.with the network (or networks) 5. The computer system 200 also includes a link interface 280. Link interface 280 comprises any suitable hardware, software, or combination of hardware and software capable of coupling the computer system 200 with each of the links 130 a-n. It should be understood that the computer system 200 illustrated in In one embodiment, the packet classifier 500 comprises a set of instructions (i.e., a software application) run on a computer system—e.g., the computer system 200 of In another embodiment, the packet classifier 500 comprises a set of instructions stored on a machine accessible medium, such as, for example, a magnetic media (e.g., a floppy disk or magnetic tape), an optically accessible disk (e.g., a CD-ROM disk), a flash memory device, etc. To run packet classifier 500 on, for example, computer system 200, the device 240 for accessing removable storage media may access the instructions on the machine accessible medium, and the instructions may then be executed in processing device 300. In this embodiment, the instructions (or a portion thereof) may again be downloaded to system memory 210. In another embodiment, the packet classifier 500 is implemented in hardware or a combination of hardware and software (e.g., firmware). For example, the packet classifier 500 may be implemented in an ASIC, an FPGA, or other similar device that has been programmed in accordance with the disclosed embodiments. As previously noted, an embodiment of processing device 300 is illustrated in Turning now to A core 310 and a number of processing engines 320 (e.g., processing engines 320 a, 320 b, . . . , 320 k) are coupled with the local bus 305. In one embodiment, the core 310 comprises a general purpose processing system, which may execute operating system 214. Core 310 may also control operation of processing device 300 and perform a variety of management functions, such as dispensing instructions to the processing engines 320 for execution. Each of the processing engines 320 a-k comprises any suitable processing system, and each may include an arithmetic and logic unit (ALU), a controller, and a number of registers (for storing data during read/write operations). Also, in one embodiment, each processing engine 320 a-k provides for multiple threads of execution (e.g., four). Also coupled with the local bus 305 is an on-chip memory subsystem 330. Although depicted as a single unit, it should be understood that the on-chip memory subsystem 330 may—and, in practice, likely does—comprise a number of distinct memory units and/or memory types. For example, such on-chip memory may include SDRAM (synchronous dynamic random access memory), SRAM (static random access memory), and/or flash memory (e.g., FlashROM). It should be understood that, in addition to on-chip memory, the processing device 300 may be coupled with off-chip memory (e.g., ROM 220, off-chip cache memory, etc.). Processing device 300 further includes a bus interface 340 coupled with local bus 305. Bus interface 340 provides an interface with other components of computer system 200, including bus 205. For simplicity, bus interface 340 is depicted as a single functional unit; however, it should be understood that, in practice, the processing device 300 may include multiple bus interfaces. For example, the processing device 300 may includes a PCI bus interface, an IX (Internet Exchange) bus interface, as well as others, and the bus interface 340 is intended to represent a collection of one or more such interfaces. It should be understood that the embodiment of processing device 300 illustrated and described with respect to Referring now to Illustrated in Rule database 510 includes a number of rules 600, including rules 600 a, 600 b, . . . , 600 y. The collection of rules 600 a-y is designed to implement a desired policy-based routing scheme (e.g., a firewall, QoS routing, and/or resource reservation, etc.), as described above. Various embodiments of a rule 600 are illustrated in Search agent 530 provides packet classifier 500 with the ability to search the hash table data structure 520. More specifically, search agent 530 can identify one or more rules 600 that are to be applied to an incoming packet based upon information (e.g., header data) contained in the packet. Embodiments of a method of searching the hash table data structure 520 are presented below in Turning to Each rule 600 may be represented by a bit mask 610 and a value set 620. The bit mask 610 is a bit array having bits set (e.g., a “1-bit”) for those bits that the rule 600 “cares for,” whereas those bits that the rule does “not care for” are not set (e.g., a “0-bit”). The value set 620 is a bit array that contains, at those bits set in the bit mask 610 (i.e., at those bits the rules cares for), the actual value of these bits, respectively, in the rule. At those bits of value set 620 corresponding to bits that are not set (e.g., a “0-bit”) in the bit mask 610, the value set 620 contains a “0-bit”. Any group of two or more rules is said to be “equivalent” if they have the same bit mask (although they may not have the same value set), and a group of equivalent rules comprises an “equivalent set.” As will be explained in greater detail below, the bit mask 610 and value set 620 of each rule 600 facilitate organization of the rules 600 into the hash table data structure 520—each hash table 900 including an equivalent set of rules—and also provides a mechanism for indexing into the hash table data structure during the search process. Referring to Illustrated in The value set 720 includes a portion 722 a corresponding to the source address, a portion 722 b corresponding to the destination address, a portion 722 c corresponding to the protocol, a portion 722 d corresponding to the source port, and a portion 722 e corresponding to the destination port. At those bits in the value set 720 corresponding to the bits that have been set (e.g., a “1-bit”) in the bit mask 710, the value set 720 includes actual values from the rule (e.g., the source port 80 is specified in binary notation as “0000000001010000”). To assist the reader, in In Another example of the 5-tuple rule shown in Turning now to In one embodiment, each of the entries 930 a-r includes a key 932, a priority 934, a rule identifier 936, and one or more markers 938 (i.e., entry 930 a includes key 932 a, priority 934 a, rule identifier 936 a, and marker(s) 938 a, and so on). It should be understood, however, that As will be explained below, if a rule has a bit mask matching the bit mask 910 of a hash table 900, the search agent 530 will compare the rule against each entry 930 in this hash table to look for a match, and it is the key 932 of each entry 930 that is used in this comparison. Essentially, the keys 932 provide a mechanism for indexing and retrieving rules within the hash table data structure 520. The priority 634 gives the priority of a rule corresponding to an entry 630, and the rule identifier 636 identifies the corresponding rule (e.g., a memory location or other identifier of the rule in rule database 510). As noted above, one or more markers 938 may also be present in each entry 930 of hash table 900. When searching the hash table data structure 520 for a rule corresponding to a received packet, if there is a match between this packet an en entry 930 of the hash table, the marker (or markers) 938 of that entry is used to identify other hash tables that need to be searched. These additional hash tables that are to be searched are “descendants” of the hash table, which hash table is an “ancestor” to all it's descendants, as will be explained in greater detail below. Generally, a marker 938 comprises a pointer to a memory location of the other hash table. However, in another embodiment, a marker 938 identifies a memory location of an array of hash table descriptors 940 (i.e., entry 930 b has a corresponding array of descriptors 940 b, and so on). The array of descriptors 940 includes a number of descriptors, each identifying a memory location of another hash table. An entry 930 of hash table 900 may include any desired number of markers 938. In one embodiment, a threshold number of markers is set for the entries of the hash tables, such that markers can be “pushed” lower down into the forest of hash tables data structure (although, in some instances, a marker may still be placed in an entry even though the threshold is exceeded). The use of a threshold to determine at what level to enter a marker in the forest of hash tables data structure is explained in greater detail below. Referring now to Each of the rules 1000 a-f specifies a source address and a destination port—i.e., each of the rules 1000 a-f has a dimension of two (2). The rules 1000 a-f of A first rule is a “descendant” of a second rule if the second rule—i.e., the “ancestor” of the first rule—has a bit mask comprising a subset of the first rule's bit mask. The bit mask of the second rule is a subset of the bit mask of the first rule if the second rule's bit mask includes at least one set bit (e.g., a “1-bit”) in common with the first rule's bit mask. For example, referring to Shown in Referring to block 1202 in With reference to block 1206 in Referring to blocks 1208 and 1210, ancestor-descendent relationships are determined for the hash tables, and the root nodes are identified. The ancestor-descendent relationship for these equivalent sets (i.e., for the hash tables shown in Referring now to block 1212 of Referring to block 1220, it is then determined whether this hash table (i.e., the hash table to which the entry for the current rule has been added) is a root hash table. Hash Table A is a root hash table (see Rule B is now selected (see block 1212) and the hash table having a bit mask corresponding to Rule B's bit mask—i.e., Hash Table B—is identified (see block 1214). A key for Rule B is provided (see block 1216), and an entry including this key added to Hash Table B (see block 1218). As shown in Referring now to block 1224 in As shown at block 1228, a search string is created. The search string is generated by performing an “AND” operation between the bit mask of the selected ancestor hash table and the value set of the selected rule. Thus, for Rule B, the search string is created by performing the “AND” operation between bit mask 1010 a of Hash Table A and value set 1020 b of Rule B, wherein the result produces the search string “11111111 00000000 00000000 00000000 000000000101000”. This search string is then compared against each entry of the selected ancestor hash table to determine whether the search string matches the key of any entry, which is shown at block 1230. In our example, the search string matches the key of the first entry 1331 a of Hash Table A. If a matching entry is identified (see block 1230), it is determined whether the matching entry has a number of markers that meets or exceeds the allowed threshold number of markers per entry, as shown at block 1232. Returning to the example, there would be no markers in the first entry 1331 a of Hash Table A at this point and, therefore, the threshold has not been reached. As illustrated at block 1236, if the threshold is not exceeded, a marker identifying the descendent hash table is added to the matching entry of the ancestor hash table. This marker identifies the descendent hash table, thereby indicating that the descendent hash table needs to be searched any time there is a “hit” at this entry of the ancestor hash table (embodiments of a search method are described below in Note that, when the selected ancestor hash table does not have an entry matching the search string (see block 1230), an entry may be created in the ancestor hash table, as shown at block 1234. The key for this new entry will be equivalent to the search string created by applying the bit mask of the selected ancestor hash table to the value set of the current rule (see block 1228). A marker pointing to the descendant hash table is then placed in this newly created entry, which is again shown at block 1236. It should be noted here that, where the ancestor does not have an entry matching the search string, a new entry is created in the ancestor hash table for inclusion of a marker. Thus, it is possible to have an entry in a hash table that, while identifying additional hash tables to search, does not directly correspond to any rule. After the addition of any marker, the method again looks at whether there are additional rules to consider (see block 1220) and, if there are one or more other rules, one of these rules is selected (see block 1212). The above-described procedure is then repeated. In our example, Rule C is selected next. The bit mask 1010 c of Hash Table C matches the bit mask of Rule C, and a first entry 1331 c is entered into Hash Table C for this rule. The key of entry 1331 c comprises-the value set 1020 c of Rule C, and the rule identifier of this entry identifies Rule C. Because Hash Table C is not a root node (see block 1220), a marker for Rule C needs to be entered in the data structure. The only root node ancestor hash table to Hash Table C is Hash Table A (see block 1224), and this root node is selected (see block 1226). Applying the bit mask 1010 a of Hash Table A to the value set 1020 c of Rule C to create a search string (see block 1228), and comparing this search string against each entry of Hash Table A yields a match with the key of the first entry 1331 a of Hash Table A (see block 1230). Referring again to block 1232, the number of markers in the first entry 1331 a of Hash Table A is one (i.e., a maker to Hash Table B), which equals the threshold of one (1), and the addition of another marker at this entry will exceed this threshold. Thus, the method will attempt to “push” the marker farther down into the hash table data structure to a hash table beneath the root node. Referring now to block 1238, the marker (or markers) in the matching entry of the selected ancestor hash table are accessed in order to identify other ancestor nodes (i.e., ancestors of the hash table into which the current rule has been entered) into which the new marker may potentially be placed. In our example, the first entry 1331 a of Hash Table A includes a marker to Hash Table B. If any other ancestor nodes are found in the accessed markers—see block 1240—any of these ancestor nodes is selected, as shown at block 1242. Returning again to the example, Hash Table B is selected. Returning again to block 1228 in Another rule, Rule D, is selected (see blocks 1222 and 1212), and the method continues. The bit mask 1010 d of Rule D matches Hash Table D's bit mask, and an entry 1331 d is entered into Hash Table D for Rule D (see blocks 1214 through 1218), as shown in Rule E is now selected, which has a bit mask 1010 e matching that of Hash Table E. An entry 1331 e is placed in Hash Table E (see Selecting Rule F (see blocks 1222, 1212), which has a bit mask 1010 f matching that of Hash Table F, an entry 1331 f is entered into Hash Table F for rule F, as shown in The markers in the matching entry 1331 a of Hash Table A are accessed to identify other ancestors of Hash Table F (see block 1238), which yields a marker to Hash Table B. Hash Table B is then selected (see blocks 1240, 1242), and a search string created based upon the bit mask 1010 b of this hash table and the value set 1020 f of Rule F (the rule currently being considered). This search string (i.e., “11111111 11111111 00000000 00000000 0000000001010000”) is compared against the entries of Hash Table B (see block 1230), and a match is found at the first entry 1331 b of Hash Table B. However, this entry 1331 b of Hash Table B also includes a marker (i.e., a marker to Hash Table C), and the addition of another marker would violate the threshold number of markers for an entry (see block 1232). Thus, the markers in the matching entry of Hash Table B are accessed to reveal additional ancestors to Hash Table F (see block 1238), which leads to Hash Table C (note that had the matching entry in Hash Table A included other markers to nodes in addition to Hash Table B, those markers would also be considered). Hash Table C is then selected (see block 1240, 1242), and a search string created by applying the bit mask 101 0 c of this hash table to the value set 1020 f of Rule F (see block 1228). Comparing this search string (i.e., “11111111 11111111 00000000 0000000 0000000001010000”) against Hash Table C will yield a match at the first entry 1331 c of this hash table (see block 1230). There are presently no markers in this entry (see block 1232), and a marker to Hash Table F is added to the first entry 1331 c of Hash Table C (see block 1236), as shown in Rule G (see Rule H is selected next. This rule has a bit mask 1010 h matching the bit mask 1010 c of Hash Table C, and an entry 1332 c is entered in this hash table for Rule H. The key for the second entry 1332 c of Hash Table C is the value set 1020 h of Rule H, and this entry also includes a rule identifier for Rule H. Hash Table C has one root ancestor in Hash Table A, and a search string is created by performing an “AND” operation between the bit mask 1010 a of Hash Table A and the value set 1020 h of Rule H. This search string (i.e., “10000000 00000000 00000000 00000000 0000000000010101”) matches the key at the second entry 1332 a of Hash Table A, where a marker to Hash Table B was previously entered (see Rule I is now considered. Rule I has a bit mask 1010 i that matches Hash Table D's bit mask 1010 d, and an entry 1332 d is entered into Hash Table D for this rule, wherein this entry 1332 d includes a key (i.e., the value set 1020 i for Rule I) and a rule identifier for Rule I. Hash Table D is a root node and, therefore, no additional marker is needed. Rule J, which is selected next in our example, has a bit mask 1010 j matching the bit mask 1010 e of Hash Table E. A key (i.e., the value set 1020 j of Rule J) and a rule identifier for Rule J are entered into a new entry 1332 e of Hash Table E. Hash Table E has one root ancestor in Hash Table D (see Rule K is selected, and this rule has a bit mask 1010 k that also matches the bit mask 1010 e of Hash Table E, and an entry 1333 e is added to Hash Table E for this rule. A key and rule identifier for Rule K are placed in this entry 1333 e, wherein the key comprises the value set 1020 k of Rule K. Again, the root ancestor to Hash Table E is Hash Table D. Performing an “AND” operation between the bit mask 1010 d of Hash Table D and the value set 1020 k of Rule K produces a search string (i.e., “11111111 11000000 00000000 00000000 0000000000000000”). Comparing this search string against each entry of Hash Table D yields no matches at this time; thus, a new entry 1333 d is created in Hash Table D (see blocks 1230, 1234 in Rule L, which is now selected, has a bit mask 1010 l that matches the bit mask 1010 f of Hash Table F. Accordingly, an entry 1332 f for Rule L is entered into Hash Table F, the entry 1332 f including a key.(i.e., the value set 1020 l of Rule L) and a rule identifier for this rule. As noted above, Hash Table F is not a root node and, in fact, has multiple root node ancestors in Hash Tables A and D. Either of these root nodes can be selected (see block 1226), and Hash Table A is chosen. Creating a search string based upon Hash Table A's bit mask 1010 a and Rule L's value set 1020 l and comparing this search string (i.e., “10000000 0000000 00000000 00000000 0000000000010101”) to Hash Table A will yield a match at the second entry 1332 a. However, this entry 1332 a of Hash Table A includes one marker (to Hash Table B), and the addition of another marker will exceed the threshold (see block 1232). The markers in the matching entry 1332 a of Hash Table A are accessed to reveal any additional ancestors, which in our example will lead to Hash Table B. Generating a search string based upon Hash Table B's bit mask and Rule L's value set and then comparing this search string (i.e., “10000000 10000000 00000000 00000000 0000000000010101”) to Hash Table B will yield a match at the second entry 1332 b of this hash table. The threshold will again be exceeded if a marker is added to matching entry 1332 b of Hash Table B, and the markers in this entry are accessed to identify other ancestor hash tables (see blocks 1232, 1238, 1240, and 1242 in Rule M is selected next (see Rule N, which is considered next, also has a bit mask 1010 n matching the bit mask 1010 f of Hash Table F. An entry 1334 f for Rule N is entered into Hash Table F, this entry including a key (i.e., the value set 1020 n of Rule N) and a Rule N identifier. As Hash Table F is not a root node, one of Hash Table F's two root nodes (i.e., nodes A and D) will be selected to receive the marker. Root node A is selected. Application of the above-described procedure (i.e., blocks 1228 through 1242) using the appropriate search string will yield a match at entry 1331 a of Hash Table A, where the threshold would be exceeded by the addition of a marker, and at entry 1331 b of Hash Table B, where the threshold would also be exceeded by the addition of a marker. Looking to Hash Table C and using the appropriate search string, a match is found at entry 1333 c of Hash Table C. Entry 1333 c of Hash Table C already includes a marker to Hash Table F (which is the hash table we are attempting to add a marker for) and, therefore, an additional marker is not required. Note that, in this situation where there is multiple root ancestor nodes, another root node could be selected and a marker to Hash Table F entered in this root node (or one of it's descendents). However, it is desirable to minimize the number of markers, such that the number of hash look-ups is correspondingly minimized during a search of the hash table data structure. Accordingly, when the desired marker is already present in the hash table data structure, an additional marker will generally not be added, which is shown in block 1236 of Rule O is now selected. Rule O has a bit mask 1010 o that also matches the bit mask 1010 e of Hash Table E, and an entry 1334 e for this rule is placed in Hash Table E. The new entry 1334 e includes a key (i.e., the value set 1020 o for Rule O) and a Rule O identifier. Hash Table E's only root node—Hash Table D—is selected. Applying the bit mask 1010 d of Hash Table D to the value set 1020 o of Rule O to create a search string (i.e., “11111111 11111111 11000000 000000000 0000000000000000”), and comparing this search string with the entries of Hash Table D yields a match at entry 1334 d.However, this entry 1334 d of Hash Table D already includes a marker (to Hash Table F), and the addition of another marker will exceed the threshold (see block 1232). Further, accessing the markers in entry 1334 d of Hash Table D (see block 1238) yields no other ancestor hash tables. Note here that Hash Table F, which corresponds to the only marker in entry 1334 d of Hash Table D, is not an ancestor of Hash Table E (it is a descendent). Thus, all possible ancestor nodes have been considered (see block 1240). Accordingly, as shown at block 1244 of Rule P, which is selected next, includes a bit mask 1010 p that matches the bit mask 1010 c of Hash Table C. An entry for Rule P is, therefore, needed in Hash Table C. However, note that an earlier created entry—i.e., entry 1333 c having a marker to Hash Table F—has a key matching the value set 1020 p of Rule P. Thus, a new entry is not required, and a rule identifier for Rule P is simply added to this entry 1333 c of Hash Table C (see block 1218 in Rule Q is now considered. Rule Q includes a bit mask 1010 q that matches the bit mask 1010 f of Hash Table F, and an entry 1335 f for Rule Q is entered into this hash table, wherein the entry includes the value set 1020 q as the key and a rule identifier for Rule Q. Hash Table D, which is one of Hash Table F's two root nodes, is selected, and a search string is created based upon the bit mask 1010 d of this root node and the value set 1020 q of Rule Q. This search string (i.e., “11111111 11000000 00000000 00000000 0000000000000000”) matches the third entry 1333 d of Hash Table D. This entry 1333 dof the root node already has a marker (to Hash Table E), and the addition of another marker will exceed the threshold. Accordingly, the markers at the third entry 1333 d of Hash Table D are accessed to identify other root nodes, which leads to Hash Table E. Another search string is created by applying the bit mask 1010 e of Hash Table E to Rule Q's value set 1020 q, and this search string (i.e., “11111111 11000000 00000000 00000000 0000000000000000”) is compared to the entries of Hash Table E. A match with the search string is found at entry 1333 e of Hash Table E (which has no markers at this point), and a marker to Hash Table F is placed in this entry 1333 e. Rule R, which is selected next, includes a bit mask 1010 r that also matches the bit mask 1010 f of Hash Table F. An entry 1336 f including the value set 1020 r as a key and a rule identifier for Rule R is entered into Hash Table F. Hash Table A is selected from Hash Table F's two root nodes, and a search string is created from the bit mask 1010 a of Hash Table A and Rule R's value set 1020 r. This search string (i.e., “11000000 00000000 00000000 00000000 0000000000010111”) does not match any entry of Hash Table A. Thus, a new entry 1333 a is created in Hash Table A—this entry 1333 a having the search string as a key—and a marker to Hash Table F is placed in this entry, as shown in At this point, all rules have been considered (see block 1222), and the forest of hash tables data structure is complete and/or updated (see block 1290). A schematic view of the forest of hash tables data structure 520 generated during the above-described example is illustrated in Turning now to Referring to block 1505, the list of hash tables 540 is initialized to the root nodes (nodes A and D in the example above). As shown at block 1510, header data in a received packet is accessed. For our example, the source address and destination port are read. Assume, for example, that the source address is “192.128.0.0” and the destination port is “23”. As shown at block 1515, a hash table in the list of hash tables to search 1540 is selected. Hash Table A is selected. The bit mask 1010 a of Hash Table A is applied (i.e., an “AND” operation) to the accessed header data to create a search key, which is shown at block 1520. The search key for Hash Table A is “11000000 00000000 00000000 00000000 0000000000010111”. The search key is then compared against each entry of the selected hash table, which is illustrated at block 1525. If there is a match—see block 1530—the rule corresponding to the matching entry (i.e., the rule identified by the rule identifier, if any) is compared with the “best” matched rule 550, which is shown at block 1535, and if the priority of the newly identified rule is greater than that of the “best” matched rule, the rule is stored as the best matched rule, as illustrated at block 1540. Returning to the example, the search key matches the key of the third entry 1333 a of Hash Table A; however, this particular entry contains no rule identifier. Referring to block 1545 in Referring to block 1555, the accessed hash table—i.e., Hash Table A in our example—is then removed from the list of hash tables to search 540. Note that, were there no markers in the matching entry (see block 1545), the algorithm would proceed to block 1555. Similarly, if there was no match found in the selected hash table (see block 1530), the method would proceed to block 1555, at which point that hash table would be removed from the list of hash tables 540. If the list of hash tables to search 540 is not empty—see block 1560—another hash table is selected from the list 540, and this hash table is accessed, as shown at block 1515. For the example, Hash Table D is selected from the list of hash tables 540, and this hash table is accessed. The bit mask 1010 d of Hash Table D is applied to the header data (see block 1520) to create a search key. The search key for. Hash Table D is “11000000 10000000 000000000 0000000 0000000000000000”, and this search key is compared against each entry of Hash Table D (see block 1525). For this search key, there is a match at the second entry 1332 d of Hash Table D (see block 1530), and the rule corresponding to this entry—i.e., Rule I—is compared against the “best” matched rule 550 (see block 1535). At this point, no rule has been stored as the “best” matched rule and, therefore, Rule I is stored as the “best” matched rule. The second entry 1332 d of Hash Table D includes a marker (see block 1545), and the hash table identified by this marker—i.e., Hash Table E—is added to the list of hash tables to search 540 (see block 1550). Hash Table D is then deleted from the list of hash tables 540 (see block 1555). The list of hash tables 540 now includes Hash Tables E and F, and the “best” matched rule 550 is Rule I. Another hash table—e.g., Hash Table E—is selected from the list of hash tables 540 and accessed (see blocks 1560 and 1515). The bit mask 1010 e of Hash Table E is applied to the header data to create a search key—i.e., “11000000 10000000 00000000 00000000 0000000000000000” (see block 1520). The search key is then compared against each entry of Hash Table E (see block 1525), which yields a match at the second entry 1332 e (see block 1530). The rule corresponding to this entry 1332 e is Rule J, and Rule J is compared against the “best” matched rule 550—i.e., Rule I—and the rule with the greater priority is stored as the “best” matched rule (see blocks 1535, 1540). The second entry 1332 e of Hash Table E also includes a marker to Hash Table F, and this hash table is added to the list of hash tables to search 540 (see blocks 1545 and 1550). Note, however, that Hash Table F is already in the list of hash tables 540, so the marker in Hash Table E will have no affect on the list of hash tables 540. Hash Table E is removed from the list of hash tables to search 1555. Thus, the list of hash tables 540 now includes Hash Table F, and the “best” match rule is either Rule I or Rule J, depending upon which has a greater priority. The final hash table, Hash Table F, in the list of hash tables 540 is then selected and accessed (see block 1515). Applying the bit mask 1010 f of Hash Table F to the header data creates the search key “11000000 10000000 00000000 00000000 0000000000010111” (see block 1520), and the search key is compared against the entries of Hash Table F (see block 1525), which yields a match at the sixth entry 1336 f of this hash table (see block 1530). The rule corresponding with this entry 1334 f is Rule R, and this rule is compared with the “best” matched rule 550 to determine which rule is to be applied to the received packet (see blocks 1535, 1540). No markers are present in Hash Table F (see block 1545), and Hash Table F is removed from the list of hash tables 540 (see block 1555). The list of hash tables to search 540 is now empty (see block 1560), and the “best” matched rule—i.e., the greatest in priority of Rules I, J, and R—can be applied to the received packet, as shown at block 1565. In our example, any one of three rules (i.e., Rules I, J, and R) may be the “best” matched rule, depending upon which has the greatest priority. Any suitable policy and/or criteria may be utilized to assess the priority of a rule. In one embodiment, as described above, the priority of a rule may be stored along with the rule in that rule's corresponding hash table (see Note that, in the example presented above, Hash Tables B and C were not searched. In particular, by starting at the root nodes and accessing only those hash tables identified by markers, unnecessary searches that would yield no results are eliminated. The Hash Tables B and C are, therefore, “pruned” from the search process through the use of markers. Accordingly, a look-up operation into each of the pruned hash tables is eliminated, thereby increasing the speed and efficiency of the search algorithm. Although only two hash tables were pruned in the simple example presented herein, many more hash tables may be pruned in practice, as real-world hash table data structures may includes tens or even hundreds of hash tables. Efficiency and speed may be further improved using parallel searches of the hash table data structure 520, and an embodiment of the method of searching the hash table data structure utilizing such parallel search capabilities is illustrated in Referring to The foregoing detailed description and accompanying drawings are only illustrative and not restrictive. They have been provided primarily for a clear and comprehensive understanding of the disclosed embodiments and no unnecessary limitations are to be understood therefrom. Numerous additions, deletions, and modifications to the embodiments described herein, as well as alternative arrangements, may be devised by those skilled in the art without departing from the spirit of the disclosed embodiments and the scope of the appended claims. Citations de brevets
Citations hors brevets
Référencé par
Classifications
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||