WO2012106914A1 - Dynamic tunnel fault diagnosis method, device and system - Google Patents

Dynamic tunnel fault diagnosis method, device and system Download PDF

Info

Publication number
WO2012106914A1
WO2012106914A1 PCT/CN2011/077469 CN2011077469W WO2012106914A1 WO 2012106914 A1 WO2012106914 A1 WO 2012106914A1 CN 2011077469 W CN2011077469 W CN 2011077469W WO 2012106914 A1 WO2012106914 A1 WO 2012106914A1
Authority
WO
WIPO (PCT)
Prior art keywords
dynamic tunnel
tunnel
dynamic
routing information
interface
Prior art date
Application number
PCT/CN2011/077469
Other languages
French (fr)
Chinese (zh)
Inventor
刘德安
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to PCT/CN2011/077469 priority Critical patent/WO2012106914A1/en
Priority to CN201180001572.6A priority patent/CN102308524B/en
Publication of WO2012106914A1 publication Critical patent/WO2012106914A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/0677Localisation of faults
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/069Management of faults, events, alarms or notifications using logs of notifications; Post-processing of notifications

Definitions

  • the present invention relates to the field of communications technologies, and in particular, to a dynamic tunnel fault diagnosis method, device and system.
  • a static tunnel (Tunnel) or a dynamic tunnel is usually used in a bearer network to complete service transmission.
  • VPN Virtual Private Network Tunnel is generally referred to as a VPN node (generally referred to as a Provider Edge) of a packet switched network (PSN) backbone network, or between a VPN node and a user.
  • PSN packet switched network
  • Tunnel is an indispensable part of building a VPN, which is used to transparently transfer VPN packets from one VPN node to another.
  • Dynamic tunnels use the routing protocol negotiation mode to establish data forwarding routes. For example, if there is a dynamic tunnel between the source and sink NEs, the routing information of the dynamic tunnel is determined before the fault occurs. The signaling negotiation may fail after the fault occurs. For example, when an intermediate node or intermediate link fails, you need to check the possible paths (often many) to find the fault point. If the network topology is complex, the troubleshooting usually takes a very long time. The fault is also very difficult, and in the prior art, it is generally manually checked by a technician, and it is difficult to troubleshoot in time and effectively.
  • the embodiments of the present invention provide a dynamic tunnel fault diagnosis method, a device, and a system, so as to reduce the complexity of troubleshooting a dynamic tunnel and improve the efficiency of troubleshooting dynamic tunnels.
  • a dynamic tunnel fault diagnosis method includes:
  • the NMS saves the routing information of the dynamic tunnel.
  • the latest routing information corresponding to the dynamic tunnel saved before the dynamic tunnel failure is queried; Parsing the interface and the network element through which the dynamic tunnel passes according to the latest routing information of the dynamic tunnel obtained by the query;
  • a network management device including:
  • a memory configured to save routing information of the dynamic tunnel
  • a querying module configured to query, after the dynamic tunnel fault is found, the latest routing information of the corresponding dynamic tunnel saved by the memory before the dynamic tunnel is faulty;
  • a route parsing module configured to parse out an interface and a network element that the dynamic tunnel passes through according to the latest routing information of the dynamic tunnel that is obtained by the query module;
  • the diagnosis module is configured to check an interface and a network element that the dynamic tunnel passes through to learn a suspected fault point in the dynamic tunnel.
  • a fault diagnosis system comprising:
  • the network management device as described in the above embodiments.
  • the network management system saves the routing information of the dynamic tunnel tunnel; in this case, when the dynamic tunnel is faulty, the network management system can query the latest routing information corresponding to the dynamic tunnel saved before the dynamic tunnel failure; Obtaining the latest routing information of the dynamic tunnel to parse the interface and the network element through which the dynamic tunnel passes; and then checking the interface and the network element that the dynamic tunnel passes through to obtain the suspected fault point in the dynamic tunnel, so that Troubleshoot the dynamic tunnels automatically, which can reduce the complexity of troubleshooting dynamic tunnels and improve the troubleshooting efficiency of dynamic tunnels.
  • FIG. 1 is a schematic diagram of a dynamic tunnel fault according to an embodiment of the present invention
  • FIG. 1 is a schematic diagram of a cloud network after a dynamic tunnel failure according to an embodiment of the present invention
  • 2 is a schematic flowchart of a dynamic tunnel fault diagnosis method according to an embodiment of the present invention
  • FIG. 3 is a schematic diagram of a module structure of a network management system according to an embodiment of the present invention
  • FIG. 4 is a schematic diagram of a network management device according to an embodiment of the present invention.
  • FIG. 5-a is a schematic diagram of a diagnostic module of a network management device according to an embodiment of the present invention
  • FIG. 5-b is a schematic diagram of a diagnostic module of another network management device according to an embodiment of the present invention
  • FIG. 5-c is a schematic diagram of the present invention
  • the embodiment provides a schematic diagram of a diagnostic module of another network management device.
  • the embodiments of the present invention provide a dynamic tunnel fault diagnosis method, a device, and a system, which are used to reduce the complexity of troubleshooting dynamic tunnel faults and improve the efficiency of troubleshooting dynamic tunnels.
  • An embodiment of the dynamic tunnel fault diagnosis method of the present invention may include: the network management saves the dynamic tunnel
  • the latest routing information of the dynamic tunnel that was saved before the dynamic tunnel is faulty is queried.
  • the dynamic routing information is parsed according to the latest routing information of the dynamic tunnel.
  • the network management saves routing information of the dynamic tunnel.
  • the network administrator can save part or all of the routing information corresponding to the dynamic tunnel to the database, for example.
  • the network management can be, for example, Keep or delete the historical routing information of the dynamic tunnel that the database has previously saved.
  • the NMS sends a notification that the dynamic tunnel is in the Up state for the first time.
  • the first node of the dynamic tunnel can send a notification to the NMS indicating that the dynamic tunnel becomes
  • the routing information of the dynamic tunnel can be saved. For example, when the user queries the current routing information of the dynamic tunnel, if the current routing information of the dynamic tunnel is different from the saved routing information corresponding to the dynamic tunnel, and the routing information storage instruction of the user is received, the network management The current routing information of the dynamic tunnel can be saved.
  • the network management can also automatically Save the current routing information of the dynamic tunnel. For example, after receiving the route change notification corresponding to the dynamic tunnel, the network management system can save the routing information of the dynamic tunnel change carried in the route change notification after receiving the routing information storage instruction of the user (of course, After receiving the route change notification corresponding to the dynamic tunnel, the NMS can also automatically save the routing information after the dynamic tunnel change carried in the route change notification. For example, the network management system can automatically save the routing information after the dynamic tunnel change under the instruction of the user instruction when the routing information of the dynamic tunnel is changed.
  • the network management system may establish a first thread and a second thread; after receiving the route change notification corresponding to the dynamic tunnel, the first thread queries the cache whether there is a route change notification corresponding to the dynamic tunnel, and if not, The route change notification corresponding to the dynamic tunnel is stored in the cache; if yes, after the existing route change notification corresponding to the dynamic tunnel is deleted in the cache, the route change corresponding to the dynamic tunnel is received.
  • the notification is stored in the cache, where the route change notification carries the routing information after the dynamic tunnel change; the second thread reads the route change notification corresponding to the dynamic tunnel from the cache, and carries the read route change notification
  • the routing information after the dynamic tunnel change is stored in the database. In this way, the processing of the route change notification of the dynamic thread by the first thread and the second thread forms a survivor consumer working mode, and only the last change event is cached in the cache, which is beneficial to reducing the number of times of storing the library.
  • the NMS finds that the dynamic tunnel is faulty, the NMS queries the latest routing information of the dynamic tunnel that is saved before the dynamic tunnel is faulty.
  • the network management system can detect that the dynamic tunnel is faulty, or determine that the dynamic tunnel is faulty according to the related fault report reported by the managed network element.
  • the network management device parses the interface and the network element that the dynamic tunnel passes through according to the latest routing information of the dynamic tunnel obtained by the query.
  • the network management system checks the interface and the network element that the dynamic tunnel passes through to learn the suspected fault point in the dynamic tunnel.
  • the NMS can check the interfaces and NEs that the dynamic tunnel passes through in various ways to learn the suspected fault points in the dynamic tunnel.
  • the network administrator can generate a test command set (eg, can generate a test command corresponding to a ping and/or a Trace Route per hop from the source node of the dynamic tunnel to the destination node); test the dynamic with the generated test command set
  • the interface and network element that the tunnel passes through to learn the suspected fault point of the dynamic tunnel.
  • the network management system can query all the alarms on the route that the dynamic tunnel passes and all the alarms on the dynamic tunnel according to the interface and the network element that the dynamic tunnel passes through; and the query is based on the preset deduction rules. Correlation analysis is performed on all the alarms to obtain a root cause alarm.
  • the derivation rule may include at least one of the following rules:
  • the upstream alarm is a root cause alarm of the downstream alarm
  • the interface alarm is a root cause alarm of the dynamic tunnel alarm.
  • the physical interface alarm is the root cause alarm of the logical interface alarm that is bound to it.
  • the network management system can check the correctness of the configuration data of the interface and the network element that the dynamic tunnel passes through (for example, may include one or more of the following information: check the laser status, the interface enable status, and whether the upstream and downstream IP addresses are On the same network segment, whether the routing constraint information matches the actual routing information, whether the tunnel on each routing node is successfully established, etc.); the suspected fault point in the dynamic tunnel is determined according to the detection result. It can be understood that the network management system can be used to check the interface and the network element that the dynamic tunnel passes through to learn the suspected fault point in the dynamic tunnel, and is not limited to the above example manner.
  • the user can further confirm and troubleshoot the fault problem of the suspected fault point.
  • the network management system in the embodiment of the present invention may be a physical device or may include a network management system composed of multiple physical devices.
  • the network management saves the routing information of the dynamic tunnel tunnel;
  • the NMS can query the latest routing information of the dynamic tunnel that was saved before the dynamic tunnel is faulty.
  • the interface and the NE that the dynamic tunnel passes through are parsed according to the latest routing information of the dynamic tunnel.
  • the interface and the network element that the dynamic tunnel passes through can be checked to obtain the suspected fault point in the dynamic tunnel, so that the fault diagnosis of the dynamic tunnel can be automatically implemented, thereby reducing the complexity of troubleshooting the dynamic tunnel fault.
  • Improve the efficiency of troubleshooting dynamic tunnels For a better understanding of the technical solution of the embodiment of the present invention, an example of the network management module architecture is taken as an example for further detailed introduction.
  • a dynamic tunnel routing processing module may be added to the network management system to support querying the current routing information of the dynamic tunnel, and storing and querying historical routing information of the dynamic tunnel.
  • the alarm processing module interacts with the dynamic tunnel routing processing module to analyze the dynamic routing information of the dynamic tunnel, and obtain the alarm information of the network element and the interface, the network element and the interface, and the dynamic tunnel of the dynamic tunnel. And all the alarms are analyzed based on the preset deduction rules to obtain a root cause alarm, wherein the derivation rule may include at least one of the following rules:
  • the upstream alarm is downstream
  • the root cause alarm of the alarm is the root cause alarm of the dynamic tunnel alarm
  • the physical interface alarm is the root cause alarm of the logical interface alarm bound to it.
  • the network management system can check the correctness of the configuration data of the interface and the network element that the dynamic tunnel passes through; and determine the suspected fault point in the dynamic tunnel according to the detection result.
  • the fault diagnosis module can interact with the dynamic tunnel routing processing module.
  • the fault diagnosis module can automatically generate a test command set, and can use the generated test command set to perform link-pass test such as Ping/Trace Route to automatically analyze the suspected fault point.
  • the dynamic tunnel routing processing module can provide one or more of the following functions, for example:
  • the historical routing information of the dynamic tunnel is saved and the query interface is provided.
  • the alarm processing module may add one or more of the following functions: According to the routing information obtained by the dynamic tunnel routing processing module, all alarms on the tunnel route are queried from the database and the dynamic tunnel is All alarms;
  • the alarm correlation rule analysis is performed, and the root cause alarm is analyzed.
  • the fault diagnosis module may add one or more of the following functions: automatically generate a test command set according to the dynamic tunnel routing information obtained by the dynamic tunnel routing processing module, and may utilize the generated test command set. Perform link continuity test such as Ping/Trace Route; analyze test results based on the upstream and downstream relationship of the route to automatically analyze the suspected fault point.
  • a flag may be added to the dynamic tunnel state data to mark whether the dynamic tunnel is in the UP state for the first time.
  • the dynamic tunnel routing processing module receives the notification that the dynamic tunnel reported by the managed network element changes to the UP state, the dynamic If the tunnel routing processing module determines that it is UP for the first time, it saves the current routing information to the database (or the repository file).
  • the dynamic tunnel routing processing module may generate a prompt message to remind the user to save the new routing information.
  • the instruction for saving the route the dynamic tunnel routing processing module saves the new routing information of the dynamic tunnel to the database. If the current routing information of the dynamic tunnel is different from the saved historical routing information, the dynamic tunnel routing processing module can automatically save the new routing information of the dynamic tunnel to the database.
  • the dynamic tunnel routing processing module may generate a prompt message to remind the user to save the new routing information, and if the user selects to save the route, The command, dynamic tunnel routing processing module saves the new route to the database. In this way, for the same dynamic tunnel, only one route change notification is recorded, and the route change notification is cached in the memory, until the route change notification is confirmed by the user to change the new route change notification of the tunnel, so that the network is in the managed network.
  • the network management implements the merging of multiple change events of the same tunnel, which is beneficial to effectively reduce the network management.
  • the tunnel routing information storage operation can be performed only when the tunnel is UP for the first time, or when the route is updated when the user selects, thereby effectively reducing the number of disk reads and increasing the data. Processing efficiency.
  • the alarm processing module may obtain the historical routing information of the faulty tunnel from the dynamic tunnel routing processing module, where the dynamic tunnel routing processing module may query the historical routing information of the dynamic tunnel from the database, for example, and may be based on the entire network.
  • the interface information is used to translate the IP address in the historical routing information into the interface information.
  • the alarm processing module can query all the alarms on the route that the tunnel passes and all the alarms on the tunnel according to the interface information.
  • the alarm processing module can alarm according to the deduction rules. Correlation analysis is performed to obtain the root cause alarm.
  • the derivation rule includes at least one of the following rules:
  • the upstream alarm is the root cause alarm of the downstream alarm
  • the interface alarm is the root cause alarm of the tunnel alarm
  • the physical interface alarm is bound.
  • the alarm processing module can effectively find the root cause alarm that causes the tunnel fault, such as the Ethernet signal loss (ETH_LOS) alarm, the laser alarm is the root cause alarm of the tunnel alarm, and the laser alarm is the upstream alarm of the ETH_LOS alarm.
  • the alarm processing module can automatically infer that the tunnel fault is caused by a laser alarm.
  • the fault diagnosis module provides intelligent diagnostic functions to find suspected points where tunnel faults occur. For example, the fault diagnosis module obtains the historical routing information of the faulty tunnel from the dynamic tunnel routing processing module, where the dynamic tunnel routing processing module can query the historical routing information of the dynamic tunnel from the database, and can perform historical routing information according to the information of the entire network interface.
  • the IP address is converted into interface information; the fault diagnosis module automatically generates a test command set according to the historical routing information of the dynamic tunnel (which may include a ping/Trace Route command for generating each hop from the source to the destination) and executes to analyze the execution.
  • the point of failure ie, the suspected point of failure).
  • the fault diagnosis module checks the configuration data according to the routing information of the dynamic tunnel.
  • the method may include: checking the status of the laser, the interface enabling state, whether the upstream and downstream IP addresses are in the same network segment, and whether the routing constraint information matches the actual routing information, and each routing node is configured. Whether the tunnel is successfully established, etc., and can prompt the user with the wrong configuration data.
  • the fault diagnosis module combines the routing information of the dynamic tunnel routing processing module for comprehensive analysis, and usually locates the root cause of the dynamic tunnel failure in one click.
  • the network management system can also implement dynamic tunnel fault diagnosis by using other module architectures. Not at all - enumeration.
  • the user can further confirm and troubleshoot the fault problem of the suspected fault point.
  • the network management system saves the routing information of the dynamic tunnel tunnel; in this case, when the dynamic tunnel is faulty, the network management system can query the latest routing information corresponding to the dynamic tunnel saved before the dynamic tunnel failure; Obtaining the latest routing information of the dynamic tunnel to parse the interface and the network element through which the dynamic tunnel passes; and then checking the interface and the network element that the dynamic tunnel passes through to obtain the suspected fault point in the dynamic tunnel, so that Troubleshoot the dynamic tunnel automatically, which helps reduce the complexity of troubleshooting dynamic tunnels and improves the efficiency of troubleshooting dynamic tunnels.
  • the network management device 400 may include: a memory 410, a query module 420, a route parsing module 430, and a diagnostic module 440.
  • the storage 410 is configured to save routing information of the dynamic tunnel.
  • the memory 410 may be specifically configured to: when receiving the notification that the dynamic tunnel first becomes the UP state, save the routing information of the dynamic tunnel at the time; when the user queries the current routing information of the dynamic tunnel, If the current routing information of the dynamic tunnel is different from the saved routing information corresponding to the dynamic tunnel, and the routing information storage instruction of the user is received, the current routing information of the dynamic tunnel is saved; and the dynamic tunnel corresponding to the dynamic tunnel is received.
  • the route change notification if the user saves the route information save command, the route change is saved. The routing information after the dynamic tunnel change carried in the knowledge.
  • the network management device 400 receives the notification that the dynamic tunnel is changed to the UP state for the first time.
  • the first node of the dynamic tunnel can send the dynamic tunnel to the network administrator to indicate that the dynamic tunnel is changed.
  • the memory 410 can save the routing information of the dynamic tunnel at the time. For example, when the user queries the current routing information of the dynamic tunnel, if the current routing information of the dynamic tunnel is different from the saved routing information corresponding to the dynamic tunnel, and the user receives the routing information saving instruction, the memory is received. The 410 can save the current routing information of the dynamic tunnel.
  • the memory 410 can also be used. Automatically save the current routing information of the dynamic tunnel). For example, after receiving the route change notification corresponding to the dynamic tunnel, the network management device 400 can save the routing information after the dynamic tunnel change carried in the route change notification, if the routing information storage instruction of the user is received. Certainly, after receiving the route change notification corresponding to the dynamic tunnel, the memory 410 may also automatically save the routing information after the dynamic tunnel change carried in the route change notification). For example, when the network management device 400 finds that the routing information of the dynamic tunnel is changed, the memory 410 automatically saves the routing information after the dynamic tunnel change under the instruction of the user instruction.
  • the network management device 400 can establish a first thread and a second thread.
  • the first thread queries the cache whether there is a route change notification corresponding to the dynamic tunnel. Then, the route change notification corresponding to the dynamic tunnel is stored in the cache; if yes, after the existing route change notification corresponding to the dynamic tunnel is deleted in the cache, the corresponding dynamic tunnel is received.
  • the route change notification is stored in the cache, where the route change notification carries the routing information after the dynamic tunnel change; the second thread reads the route change notification corresponding to the dynamic tunnel from the cache, and notifies the read route change notification
  • the routing information changed by the dynamic tunnel carried in the database is stored in the database. In this way, the processing of the route change notification of the dynamic thread by the first thread and the second thread forms a survivor consumer working mode, and only the last change event is cached in the cache, which is beneficial to reducing the number of times of storing the library.
  • the querying module 420 is configured to query the latest routing information corresponding to the dynamic tunnel saved by the memory 410 before the dynamic tunnel failure, when the dynamic tunnel fault is found;
  • the route parsing module 430 is configured to parse the interface and the network element that the dynamic tunnel passes through according to the latest routing information of the dynamic tunnel that is obtained by the query module 420.
  • the diagnosis module 440 is configured to check the interface and the network element that the dynamic tunnel passes through to learn the suspected fault point in the dynamic tunnel.
  • the diagnostic module 440 can include: a generating submodule 441 and a testing submodule 442;
  • the generating submodule 441 is configured to generate a test command set
  • the test sub-module 442 is configured to test the interface and the network element that the dynamic tunnel passes through by using the test command set generated by the generating sub-module 441 to obtain the suspected fault point of the dynamic tunnel.
  • the diagnostic module 440 can include: a query sub-module 443 and a derivation sub-module 444.
  • the query sub-module 443 is configured to query, according to the interface and the network element that the dynamic tunnel passes, all the alarms on the route that the dynamic tunnel passes and all the alarms on the dynamic tunnel.
  • the initiating sub-module 444 is configured to perform correlation analysis on all the alarms that are queried based on the preset derivation rules to obtain a root cause alarm, where the derivation rule may include, for example, at least one of the following rules: The root cause alarm of the downstream alarm, the interface alarm is the root cause alarm of the dynamic tunnel alarm, and the physical interface alarm is the root cause alarm of the logical interface alarm bound to it.
  • the diagnostic module 440 can include: a configuration check sub-module 445 and a determination sub-module 446.
  • the configuration checking sub-module 445 is configured to check the correctness of the configuration data of the interface and the network element that the dynamic tunnel passes through;
  • the determining sub-module 446 is configured to determine a suspected fault point in the dynamic tunnel according to the detection result. It can be seen that, in the embodiment, the network management device 400 saves the routing information of the dynamic tunnel tunnel. When the dynamic tunnel is faulty, the network management device 400 can query the latest routing information of the dynamic tunnel that is saved before the dynamic tunnel is faulty. The interface and the network element through which the dynamic tunnel passes are parsed according to the latest routing information of the dynamic tunnel obtained by the query; and the interface and the network element that the dynamic tunnel passes through are checked to obtain the suspected fault point in the dynamic tunnel. In this way, the fault diagnosis of the dynamic tunnel can be automatically implemented, which is beneficial to reduce the fault of troubleshooting the dynamic tunnel. Increase the efficiency of troubleshooting dynamic tunnels.
  • the embodiment of the invention further provides a fault diagnosis system, which may include a network management device 400.
  • the program may be stored in a computer readable storage medium, and the storage medium may include: Read-only memory, random access memory, disk or optical disk, etc.

Abstract

A dynamic Tunnel fault diagnosis method, device and system. The dynamic Tunnel fault diagnosis method can include: storing routing information about a dynamic Tunnel by a webmaster; when it is discovered that the dynamic Tunnel failed, querying the latest routing information stored before the dynamic Tunnel failed and corresponding to the dynamic Tunnel; parsing the interface and network element through which the dynamic Tunnel passes according to the queried latest routing information about the dynamic Tunnel; and checking the interface and network element through which the dynamic Tunnel passes so as to learn of a suspected fault point in the dynamic Tunnel. The technical solution provided by the embodiments of the present invention is advantageous for reducing the complexity of troubleshooting of faults of the dynamic Tunnel and improving the troubleshooting efficiency of the dynamic Tunnel.

Description

动态隧道故障诊断方法及设备和系统  Dynamic tunnel fault diagnosis method, device and system
技术领域 Technical field
本发明涉及通信技术领域,具体涉及一种动态隧道故障诊断方法及设备和 系统。  The present invention relates to the field of communications technologies, and in particular, to a dynamic tunnel fault diagnosis method, device and system.
背景技术 Background technique
目前, 在承载网络中通常采用静态隧道(Tunnel )或动态 Tunnel来完成业 务传输, 承载网络中存在着数以千计的承载节点。  At present, a static tunnel (Tunnel) or a dynamic tunnel is usually used in a bearer network to complete service transmission. There are thousands of bearer nodes in the bearer network.
Tunnel的功能就是在两个网络节点之间提供一条通路,使数据包能够在这 个通路上透明传输。 虚拟专用网 ( VPN Virtual Private Network ) Tunnel—般是 指在分组交换网络 PSN ( Packet Switched Network ) 骨干网的 VPN节点 (一般 指边缘设备(PE, Provider Edge ) )之间, 或在 VPN节点与用户节点之间建立 的用来传输 VPN数据包的虚拟连接。 Tunnel是构建 VPN不可或缺的部分, 用于 把 VPN数据包从一个 VPN节点透明传送到另一个节点。  The function of the tunnel is to provide a path between the two network nodes so that the data packets can be transparently transmitted on this path. VPN Virtual Private Network Tunnel is generally referred to as a VPN node (generally referred to as a Provider Edge) of a packet switched network (PSN) backbone network, or between a VPN node and a user. A virtual connection established between nodes to transport VPN packets. Tunnel is an indispensable part of building a VPN, which is used to transparently transfer VPN packets from one VPN node to another.
动态 Tunnel采用路由协议协商方式建立数据转发路由。 例如源宿网元间存 在一条动态 Tunnel, Tunnel发生故障前, 该动态 Tunnel经过路由的路由信息是 确定的; Tunnel发生故障后信令协商可能失败, 此时需排除故障。 例如在中间 节点或中间链路发生故障时, 需要对可能的路径(往往为很多条)进行一一排 查来找出故障点, 若网络拓朴复杂, 则故障排查通常耗时都非常长, 排除故障 的也难度非常大, 而且现有技术中一般又都是技术人员手动排查,很难及时有 效的排除故障。  Dynamic tunnels use the routing protocol negotiation mode to establish data forwarding routes. For example, if there is a dynamic tunnel between the source and sink NEs, the routing information of the dynamic tunnel is determined before the fault occurs. The signaling negotiation may fail after the fault occurs. For example, when an intermediate node or intermediate link fails, you need to check the possible paths (often many) to find the fault point. If the network topology is complex, the troubleshooting usually takes a very long time. The fault is also very difficult, and in the prior art, it is generally manually checked by a technician, and it is difficult to troubleshoot in time and effectively.
发明内容 Summary of the invention
本发明实施例提供动态隧道故障诊断方法和设备及系统,以期降低排查动 态 Tunnel故障的复杂度, 提升动态 Tunnel故障排查效率。  The embodiments of the present invention provide a dynamic tunnel fault diagnosis method, a device, and a system, so as to reduce the complexity of troubleshooting a dynamic tunnel and improve the efficiency of troubleshooting dynamic tunnels.
为解决上述技术问题, 本发明实施例中提供的技术方案如下:  To solve the above technical problem, the technical solution provided in the embodiment of the present invention is as follows:
一种动态隧道故障诊断方法, 包括:  A dynamic tunnel fault diagnosis method includes:
网管保存动态隧道 Tunnel的路由信息;  The NMS saves the routing information of the dynamic tunnel.
在发现所述动态 Tunnel故障时, 查询在所述动态 Tunnel故障前所保存的对 应该动态 Tunnel的最新路由信息; 根据查询得到的所述动态 Tunnel的最新路由信息, 解析出所述动态 Tunnel 所经过的接口和网元; When the dynamic tunnel is faulty, the latest routing information corresponding to the dynamic tunnel saved before the dynamic tunnel failure is queried; Parsing the interface and the network element through which the dynamic tunnel passes according to the latest routing information of the dynamic tunnel obtained by the query;
检查所述动态 Tunnel所经过的接口和网元, 以获知所述动态 Tunnel中的 疑似故障点。  Checking the interfaces and network elements that the dynamic tunnel passes through to learn the suspected fault points in the dynamic tunnel.
一种网管设备, 包括:  A network management device, including:
存储器, 用于保存动态隧道 Tunnel的路由信息;  a memory, configured to save routing information of the dynamic tunnel;
查询模块, 用于在发现所述动态 Tunnel故障时, 查询所述存储器在所述动 态 Tunnel故障前所保存的对应该动态 Tunnel的最新路由信息;  a querying module, configured to query, after the dynamic tunnel fault is found, the latest routing information of the corresponding dynamic tunnel saved by the memory before the dynamic tunnel is faulty;
路由解析模块, 用于根据所述查询模块查询得到的所述动态 Tunnel的最新 路由信息, 解析出所述动态 Tunnel所经过的接口和网元;  a route parsing module, configured to parse out an interface and a network element that the dynamic tunnel passes through according to the latest routing information of the dynamic tunnel that is obtained by the query module;
诊断模块, 用于检查所述动态 Tunnel所经过的接口和网元, 以获知所述 动态 Tunnel中的疑似故障点。  The diagnosis module is configured to check an interface and a network element that the dynamic tunnel passes through to learn a suspected fault point in the dynamic tunnel.
一种故障诊断系统, 包括:  A fault diagnosis system, comprising:
如上述实施例所述的网管设备。  The network management device as described in the above embodiments.
由上可见, 本发明实施例中网管保存动态隧道 Tunnel的路由信息; 这样在 该动态 Tunnel故障时, 网管便可查询在该动态 Tunnel故障前所保存的对应该动 态 Tunnel的最新路由信息; 根据查询得到的该动态 Tunnel的最新路由信息解析 出该动态 Tunnel所经过的接口和网元; 进而可通过检查该动态 Tunnel所经过的 接口和网元, 以获知该动态 Tunnel中的疑似故障点, 这样便可自动实现动态 Tunnel的故障排查, 进而有利于期降低排查动态 Tunnel故障的复杂度, 提升动 态 Tunnel故障排查效率。  It can be seen that, in the embodiment of the present invention, the network management system saves the routing information of the dynamic tunnel tunnel; in this case, when the dynamic tunnel is faulty, the network management system can query the latest routing information corresponding to the dynamic tunnel saved before the dynamic tunnel failure; Obtaining the latest routing information of the dynamic tunnel to parse the interface and the network element through which the dynamic tunnel passes; and then checking the interface and the network element that the dynamic tunnel passes through to obtain the suspected fault point in the dynamic tunnel, so that Troubleshoot the dynamic tunnels automatically, which can reduce the complexity of troubleshooting dynamic tunnels and improve the troubleshooting efficiency of dynamic tunnels.
附图说明 DRAWINGS
为了更清楚地说明本发明实施例和现有技术中的技术方案,下面将对实施 例和现有技术描述中所需要使用的附图作筒单地介绍,显而易见地, 下面描述 中的附图仅仅是本发明的一些实施例, 对于本领域普通技术人员来讲,在不付 出创造性劳动性的前提下, 还可以根据这些附图获得其他的附图。  In order to more clearly illustrate the embodiments of the present invention and the technical solutions in the prior art, the drawings used in the embodiments and the prior art description will be briefly described below. Obviously, the drawings in the following description It is merely some embodiments of the present invention, and other drawings may be obtained from those skilled in the art without departing from the drawings.
图 1-a是本发明实施例提供了一种动态 Tunnel故障示意图;  FIG. 1 is a schematic diagram of a dynamic tunnel fault according to an embodiment of the present invention;
图 1-b是本发明实施例提供了一种动态 Tunnel故障后的云网络示意图; 图 2是本发明实施例提供了一种动态隧道故障诊断方法的流程示意图; 图 3是本发明实施例提供了一种网管的模块架构示意图; FIG. 1 is a schematic diagram of a cloud network after a dynamic tunnel failure according to an embodiment of the present invention; 2 is a schematic flowchart of a dynamic tunnel fault diagnosis method according to an embodiment of the present invention; FIG. 3 is a schematic diagram of a module structure of a network management system according to an embodiment of the present invention;
图 4是本发明实施例提供了一种网管设备的示意图;  4 is a schematic diagram of a network management device according to an embodiment of the present invention;
图 5-a是本发明实施例提供了一种网管设备的诊断模块的示意图; 图 5-b是本发明实施例提供了另一种网管设备的诊断模块的示意图; 图 5-c是本发明实施例提供了另一种网管设备的诊断模块的示意图。  FIG. 5-a is a schematic diagram of a diagnostic module of a network management device according to an embodiment of the present invention; FIG. 5-b is a schematic diagram of a diagnostic module of another network management device according to an embodiment of the present invention; FIG. 5-c is a schematic diagram of the present invention; The embodiment provides a schematic diagram of a diagnostic module of another network management device.
具体实施方式 detailed description
本发明实施例提供了一种动态隧道故障诊断方法及设备和系统,以期降低 排查动态 Tunnel故障的复杂度, 提升动态 Tunnel故障排查效率。  The embodiments of the present invention provide a dynamic tunnel fault diagnosis method, a device, and a system, which are used to reduce the complexity of troubleshooting dynamic tunnel faults and improve the efficiency of troubleshooting dynamic tunnels.
下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清 楚、 完整地描述, 显然, 所描述的实施例仅仅是本发明一部分实施例, 而不是 全部的实施例。基于本发明中的实施例, 本领域普通技术人员在没有作出创造 性劳动前提下所获得的所有其他实施例, 都属于本发明保护的范围。  BRIEF DESCRIPTION OF THE DRAWINGS The technical solutions in the embodiments of the present invention will be described in detail below with reference to the accompanying drawings. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative work are within the scope of the present invention.
下面通过具体实施例, 分别进行详细的说明。  The detailed description will be respectively made below through specific embodiments.
首先参见图 1-a和图 1-b, 图 1-a所示拓朴结构中的源宿网元间存在一条动态 Referring first to Figure 1-a and Figure 1-b, there is a dynamic between source and sink NEs in the topology shown in Figure 1-a.
Tunnel, 当该动态 Tunnel故障后, 例如图 1-b所示, 源宿网元间的网络变为不可 预见的云, 此场景下,现有技术则需检查该拓朴图中的所有网元及链路来排查 故障, 而往往实际网络中的拓朴结构往往复杂得多,基于现有技术来排查故障 的复杂度将很高。 本发明动态隧道故障诊断方法的一个实施例, 可包括: 网管保存动态隧道After the dynamic tunnel is faulty, for example, as shown in Figure 1-b, the network between the source and sink NEs becomes an unforeseen cloud. In this scenario, the existing technology needs to check all the NEs in the topology map. And the link to troubleshoot the fault, and often the topology in the actual network is often much more complicated, and the complexity of troubleshooting based on the existing technology will be high. An embodiment of the dynamic tunnel fault diagnosis method of the present invention may include: the network management saves the dynamic tunnel
Tunnel的路由信息; 在发现该动态 Tunnel故障时, 查询在该动态 Tunnel故障前 所保存的对应该动态 Tunnel的最新路由信息; 根据查询得到的该动态 Tunnel的 最新路由信息, 解析出该动态 Tunnel所经过的接口和网元; 检查该动态 Tunnel 所经过的接口和网元, 以获知该动态 Tunnel中的疑似故障点。 If the dynamic tunnel is faulty, the latest routing information of the dynamic tunnel that was saved before the dynamic tunnel is faulty is queried. The dynamic routing information is parsed according to the latest routing information of the dynamic tunnel. The interface and the NE that have passed through; check the interface and NE that the dynamic tunnel passes through to learn the suspected fault point in the dynamic tunnel.
参见图 2, 具体步骤可包括:  Referring to Figure 2, specific steps may include:
201、 网管保存动态隧道 Tunnel的路由信息;  201. The network management saves routing information of the dynamic tunnel.
在实际应用中, 网管例如可将其获得的部分或全部对应该动态 Tunnel的路 由信息保存到数据库中。 在保存该动态 Tunnel最新的路由信息时, 网管例如可 保留或删除数据库此前已保存的该动态 Tunnel的历史路由信息。 In practical applications, the network administrator can save part or all of the routing information corresponding to the dynamic tunnel to the database, for example. When saving the latest routing information of the dynamic tunnel, the network management can be, for example, Keep or delete the historical routing information of the dynamic tunnel that the database has previously saved.
举例来说, 网管在接收到动态 Tunnel首次变为 UP状态的通知(其中, 在动 态 Tunnel的路由协商成功且能够正常传递数据后, 该动态 Tunnel的首节点可向 网管发送指示该动态 Tunnel变为 UP状态的通知 )后, 可保存该动态 Tunnel当时 的路由信息。 又例如, 当用户查询该动态 Tunnel当前的路由信息时, 若该动态 Tunnel当前的路由信息与已保存的对应该动态 Tunnel的路由信息存在差异, 且 又接收到用户的路由信息保存指令, 则网管可保存该动态 Tunnel当前的路由信 息 (当然, 当用户查询该动态 Tunnel当前的路由信息时, 若该动态 Tunnel当前 的路由信息与已保存的对应该动态 Tunnel的路由信息存在差异, 网管亦可以自 动保存该动态 Tunnel当前的路由信息)。 又例如, 网管在接收到对应该动态 Tunnel的路由变更通知后, 若接收到用户的路由信息保存指令, 则网管可保存 该路由变更通知中携带的该动态 Tunnel变更后的路由信息(当然, 在接收到对 应该动态 Tunnel的路由变更通知后, 网管亦可自动保存该路由变更通知中携带 的该动态 Tunnel变更后的路由信息)。又例如, 网管可在发现该动态 Tunnel的路 由信息变更时,在用户指令的指示下或者自动保存该动态 Tunnel变更后的路由 信息。  For example, the NMS sends a notification that the dynamic tunnel is in the Up state for the first time. After the route negotiation of the dynamic tunnel is successful and the data is successfully transmitted, the first node of the dynamic tunnel can send a notification to the NMS indicating that the dynamic tunnel becomes After the notification of the UP status, the routing information of the dynamic tunnel can be saved. For example, when the user queries the current routing information of the dynamic tunnel, if the current routing information of the dynamic tunnel is different from the saved routing information corresponding to the dynamic tunnel, and the routing information storage instruction of the user is received, the network management The current routing information of the dynamic tunnel can be saved. (When the user queries the current routing information of the dynamic tunnel, if the current routing information of the dynamic tunnel is different from the saved routing information corresponding to the dynamic tunnel, the network management can also automatically Save the current routing information of the dynamic tunnel). For example, after receiving the route change notification corresponding to the dynamic tunnel, the network management system can save the routing information of the dynamic tunnel change carried in the route change notification after receiving the routing information storage instruction of the user (of course, After receiving the route change notification corresponding to the dynamic tunnel, the NMS can also automatically save the routing information after the dynamic tunnel change carried in the route change notification. For example, the network management system can automatically save the routing information after the dynamic tunnel change under the instruction of the user instruction when the routing information of the dynamic tunnel is changed.
又例如, 网管可建立第一线程和第二线程; 第一线程在接收到对应该动态 Tunnel的路由变更通知后, 在緩存中查询是否已有对应该动态 Tunnel的路由变 更通知, 若没有, 则将该接收到对应该动态 Tunnel的路由变更通知存储到緩存 中; 若有, 则在删除緩存中该已有的对应该动态 Tunnel的路由变更通知后, 将 该接收到对应该动态 Tunnel的路由变更通知存储到緩存中, 其中, 该路由变更 通知携带有该动态 Tunnel变更后的路由信息; 第二线程从緩存中读取对应该动 态 Tunnel的路由变更通知, 将读取的该路由变更通知中携带的该动态 Tunnel变 更后的路由信息存入数据库。 如此, 第一线程和第二线程对动态 Tunnel的路由 变更通知的处理形成了一种生存者消费者工作模式,并且緩存中只緩存了最后 一次变更事件, 有利于减少存库次数。  For example, the network management system may establish a first thread and a second thread; after receiving the route change notification corresponding to the dynamic tunnel, the first thread queries the cache whether there is a route change notification corresponding to the dynamic tunnel, and if not, The route change notification corresponding to the dynamic tunnel is stored in the cache; if yes, after the existing route change notification corresponding to the dynamic tunnel is deleted in the cache, the route change corresponding to the dynamic tunnel is received. The notification is stored in the cache, where the route change notification carries the routing information after the dynamic tunnel change; the second thread reads the route change notification corresponding to the dynamic tunnel from the cache, and carries the read route change notification The routing information after the dynamic tunnel change is stored in the database. In this way, the processing of the route change notification of the dynamic thread by the first thread and the second thread forms a survivor consumer working mode, and only the last change event is cached in the cache, which is beneficial to reducing the number of times of storing the library.
202、 网管在发现上述动态 Tunnel故障时,查询在该动态 Tunnel故障前所保 存的对应该动态 Tunnel的最新路由信息; 在实际应用中, 网管可主动检测到该动态 Tunnel发生故障, 或可根据被管 网元上报的相关故障报告确定该动态 Tunnel发生故障。 202. When the NMS finds that the dynamic tunnel is faulty, the NMS queries the latest routing information of the dynamic tunnel that is saved before the dynamic tunnel is faulty. In the actual application, the network management system can detect that the dynamic tunnel is faulty, or determine that the dynamic tunnel is faulty according to the related fault report reported by the managed network element.
203、 网管根据查询得到的该动态 Tunnel的最新路由信息, 解析出该动态 Tunnel所经过的接口和网元;  203. The network management device parses the interface and the network element that the dynamic tunnel passes through according to the latest routing information of the dynamic tunnel obtained by the query.
204、 网管检查该动态 Tunnel所经过的接口和网元, 以获知该动态 Tunnel 中的疑似故障点。  204. The network management system checks the interface and the network element that the dynamic tunnel passes through to learn the suspected fault point in the dynamic tunnel.
在实际应用中, 网管可通过多种方式来检查该动态 Tunnel所经过的接口和 网元, 以获知该动态 Tunnel中的疑似故障点。  In actual applications, the NMS can check the interfaces and NEs that the dynamic tunnel passes through in various ways to learn the suspected fault points in the dynamic tunnel.
举例来说, 网管可生成测试命令集合(例如可生成对应从动态 Tunnel的源 节点到目的节点之间的每跳的 ping和 /或 Trace Route等测试命令 ); 利用生成的 测试命令集合测试该动态 Tunnel所经过的接口和网元, 以获知该动态 Tunnel的 疑似故障点。 又例如, 网管可根据该动态 Tunnel所经过的接口和网元, 查询出 该动态 Tunnel所经过路由上的所有告警和该动态 Tunnel上的所有告警; 并基于 预设的推演规则对该查询出的所有告警进行相关性分析, 以得出根因告警, 其 中, 该推演规则例如可至少包括如下规则中的一种: 上游告警是下游告警的根 因告警、 接口告警是动态 Tunnel告警的根因告警、 物理接口告警是其绑定的逻 辑接口告警的根因告警。 又例如, 网管可检查该动态 Tunnel所经过的接口和网 元的配置数据的正确性(例如可能包括检查如下信息的一种或多种: 检查激光 器状态、 接口使能状态、 上下游 IP地址是否在同一网段, 路由约束信息是否匹 配实际路由信息, 每个路由节点上的 Tunnel是否成功建立等); 根据检测结果 确定该动态 Tunnel中的疑似故障点。 可以理解, 网管可结合上述举例方式的一 种或多种, 来检查该动态 Tunnel所经过的接口和网元以获知该动态 Tunnel中的 疑似故障点, 当然也不限于上述举例方式。  For example, the network administrator can generate a test command set (eg, can generate a test command corresponding to a ping and/or a Trace Route per hop from the source node of the dynamic tunnel to the destination node); test the dynamic with the generated test command set The interface and network element that the tunnel passes through to learn the suspected fault point of the dynamic tunnel. For example, the network management system can query all the alarms on the route that the dynamic tunnel passes and all the alarms on the dynamic tunnel according to the interface and the network element that the dynamic tunnel passes through; and the query is based on the preset deduction rules. Correlation analysis is performed on all the alarms to obtain a root cause alarm. The derivation rule may include at least one of the following rules: The upstream alarm is a root cause alarm of the downstream alarm, and the interface alarm is a root cause alarm of the dynamic tunnel alarm. The physical interface alarm is the root cause alarm of the logical interface alarm that is bound to it. For example, the network management system can check the correctness of the configuration data of the interface and the network element that the dynamic tunnel passes through (for example, may include one or more of the following information: check the laser status, the interface enable status, and whether the upstream and downstream IP addresses are On the same network segment, whether the routing constraint information matches the actual routing information, whether the tunnel on each routing node is successfully established, etc.); the suspected fault point in the dynamic tunnel is determined according to the detection result. It can be understood that the network management system can be used to check the interface and the network element that the dynamic tunnel passes through to learn the suspected fault point in the dynamic tunnel, and is not limited to the above example manner.
此外, 网管确定出动态 Tunnel中的疑似故障点后, 用户可进一步对该疑似 故障点的故障问题进行确认和故障排除。  In addition, after the network management determines the suspected fault point in the dynamic tunnel, the user can further confirm and troubleshoot the fault problem of the suspected fault point.
需要说明的是, 本发明实施例中所指的网管, 可能是一个物理设备或者亦 可能包括由多个物理设备组成的网管系统。  It should be noted that the network management system in the embodiment of the present invention may be a physical device or may include a network management system composed of multiple physical devices.
由上可见, 在本实施例中网管保存动态隧道 Tunnel的路由信息; 这样在该 动态 Tunnel故障时, 网管便可查询在该动态 Tunnel故障前所保存的对应该动态 Tunnel的最新路由信息; 根据查询得到的该动态 Tunnel的最新路由信息解析出 该动态 Tunnel所经过的接口和网元; 进而可通过检查该动态 Tunnel所经过的接 口和网元,以获知该动态 Tunnel中的疑似故障点,这样便可自动实现动态 Tunnel 的故障排查,进而有利于期降低排查动态 Tunnel故障的复杂度,提升动态 Tunnel 故障排查效率。 为更好的理解本发明实施例的技术方案,下面以一种举例的网管模块架构 为例做进一步详细的介绍。 It can be seen that, in this embodiment, the network management saves the routing information of the dynamic tunnel tunnel; When the dynamic tunnel is faulty, the NMS can query the latest routing information of the dynamic tunnel that was saved before the dynamic tunnel is faulty. The interface and the NE that the dynamic tunnel passes through are parsed according to the latest routing information of the dynamic tunnel. Then, the interface and the network element that the dynamic tunnel passes through can be checked to obtain the suspected fault point in the dynamic tunnel, so that the fault diagnosis of the dynamic tunnel can be automatically implemented, thereby reducing the complexity of troubleshooting the dynamic tunnel fault. Improve the efficiency of troubleshooting dynamic tunnels. For a better understanding of the technical solution of the embodiment of the present invention, an example of the network management module architecture is taken as an example for further detailed introduction.
参见图 3 ,在本发明实施例的一个应用例中,网管上例如可增加动态 Tunnel 路由处理模块, 以支持查询动态 Tunnel的当前路由信息, 支持保存与查询动态 Tunnel的历史路由信息等。  Referring to FIG. 3, in an application example of the embodiment of the present invention, a dynamic tunnel routing processing module may be added to the network management system to support querying the current routing information of the dynamic tunnel, and storing and querying historical routing information of the dynamic tunnel.
告警处理模块与动态 Tunnel路由处理模块交互, 可通过解析动态 Tunnel的 历史路由信息, 得到动态 Tunnel的历史路由所经过的网元和接口、 路由上各个 网元和接口的告警信息以及该动态 Tunnel上的所有告警; 并基于预设的推演规 则对该查询出的所有告警进行相关性分析, 以得出根因告警, 其中, 该推演规 则例如可至少包括如下规则中的一种: 上游告警是下游告警的根因告警、接口 告警是动态 Tunnel告警的根因告警、物理接口告警是其绑定的逻辑接口告警的 根因告警。 又例如, 网管可检查该动态 Tunnel所经过的接口和网元的配置数据 的正确性; 根据检测结果确定该动态 Tunnel中的疑似故障点。  The alarm processing module interacts with the dynamic tunnel routing processing module to analyze the dynamic routing information of the dynamic tunnel, and obtain the alarm information of the network element and the interface, the network element and the interface, and the dynamic tunnel of the dynamic tunnel. And all the alarms are analyzed based on the preset deduction rules to obtain a root cause alarm, wherein the derivation rule may include at least one of the following rules: The upstream alarm is downstream The root cause alarm of the alarm, the interface alarm is the root cause alarm of the dynamic tunnel alarm, and the physical interface alarm is the root cause alarm of the logical interface alarm bound to it. For example, the network management system can check the correctness of the configuration data of the interface and the network element that the dynamic tunnel passes through; and determine the suspected fault point in the dynamic tunnel according to the detection result.
故障诊断模块可与动态 Tunnel路由处理模块交互, 故障诊断模块可自动生 成测试命令集合,并可利用生成的测试命令集合进行 Ping/Trace Route等链路通 断测试, 以自动分析出疑似故障点。  The fault diagnosis module can interact with the dynamic tunnel routing processing module. The fault diagnosis module can automatically generate a test command set, and can use the generated test command set to perform link-pass test such as Ping/Trace Route to automatically analyze the suspected fault point.
在一种应用场景下, 动态 Tunnel路由处理模块例如可提供以下功能的一种 或多种:  In an application scenario, the dynamic tunnel routing processing module can provide one or more of the following functions, for example:
查询动态 Tunnel的当前路由信息;  Query the current routing information of the dynamic tunnel.
保存动态 Tunnel的历史路由信息并提供查询接口;  The historical routing information of the dynamic tunnel is saved and the query interface is provided.
对比动态 Tunnel的当前路由信息与历史路由信息差异;  Compare the difference between the current routing information of the dynamic tunnel and the historical routing information.
将动态 Tunnel的路由信息解析为接口信息; 路由变更事件处理。 Parsing the routing information of the dynamic tunnel into interface information. Route change event processing.
在一种应用场景下, 告警处理模块例如可新增以下功能的一种或多种: 根据动态 Tunnel路由处理模块获得的路由信息, 从数据库查询到 Tunnel路 由上的所有告警以及该动态 Tunnel上的所有告警;  In an application scenario, the alarm processing module may add one or more of the following functions: According to the routing information obtained by the dynamic tunnel routing processing module, all alarms on the tunnel route are queried from the database and the dynamic tunnel is All alarms;
根据推演规则(例如可包括根据路由上下游关系等等 )进行告警相关性规 则分析, 分析出根因告警。  According to the deduction rules (for example, including the upstream and downstream relationship of the route, etc.), the alarm correlation rule analysis is performed, and the root cause alarm is analyzed.
在一种应用场景下, 故障诊断模块例如可新增以下功能的一种或多种: 根据动态 Tunnel路由处理模块获得的动态 Tunnel路由信息, 自动生成测试 命令集合, 并可利用生成的测试命令集合进行 Ping/Trace Route等链路通断测 试; 根据路由上下游关系等分析测试结果, 以自动分析出疑似故障点。  In an application scenario, the fault diagnosis module may add one or more of the following functions: automatically generate a test command set according to the dynamic tunnel routing information obtained by the dynamic tunnel routing processing module, and may utilize the generated test command set. Perform link continuity test such as Ping/Trace Route; analyze test results based on the upstream and downstream relationship of the route to automatically analyze the suspected fault point.
在实际应用中,例如可在动态 Tunnel状态数据中增加标记,标记动态 Tunnel 是否首次变为 UP状态,当动态 Tunnel路由处理模块接收到被管网元上报的动态 Tunnel变 UP状态的通知时, 动态 Tunnel路由处理模块若判断出若是首次变 UP 状态, 将当时的路由信息保存到数据库(或存库文件) 中。  In a practical application, for example, a flag may be added to the dynamic tunnel state data to mark whether the dynamic tunnel is in the UP state for the first time. When the dynamic tunnel routing processing module receives the notification that the dynamic tunnel reported by the managed network element changes to the UP state, the dynamic If the tunnel routing processing module determines that it is UP for the first time, it saves the current routing information to the database (or the repository file).
当用户查询该动态 Tunnel当前路由时, 若该动态 Tunnel的当前路由信息与 已保存的历史路由信息存在差异, 动态 Tunnel路由处理模块可生成提示信息以 提醒用户保存新路由信息, 若接收到用户选择保存路由的指令, 动态 Tunnel 路由处理模块将该动态 Tunnel的新路由信息保存到数据库。 当然, 当用户查询 该动态 Tunnel当前路由时, 若该动态 Tunnel的当前路由信息与已保存的历史路 由信息存在差异, 动态 Tunnel路由处理模块亦可自动将该动态 Tunnel的新路由 信息保存到数据库。  When the current route information of the dynamic tunnel is queried by the user, if the current routing information of the dynamic tunnel is different from the saved historical routing information, the dynamic tunnel routing processing module may generate a prompt message to remind the user to save the new routing information. The instruction for saving the route, the dynamic tunnel routing processing module saves the new routing information of the dynamic tunnel to the database. If the current routing information of the dynamic tunnel is different from the saved historical routing information, the dynamic tunnel routing processing module can automatically save the new routing information of the dynamic tunnel to the database.
在接收到被管网元上报的路由变更通知(其中携带该动态 Tunnel的新路由 信息 )后, 动态 Tunnel路由处理模块可生成提示信息以提醒用户保存新路由信 息, 若接收到用户选择保存路由的指令, 动态 Tunnel路由处理模块将新路由保 存到数据库。 如此, 对于同一条动态 Tunnel, 只记录一条路由变更通知, 并在 内存中緩存该路由变更通知, 直到该路由变更通知被用户确认变更, 才记录该 Tunnel新的路由变更通知, 这样在被管网元上报大量路由变更事件时, 网管实 现了对同一条 Tunnel的多个变更事件的归并处理, 进而有利于有效的减少网管 需处理的数据量; 基于上述机制, 可以只在 Tunnel第一次 UP时, 或在用户选择 时更新路由时, 才进行 Tunnel路由信息存库操作, 进而可以有效的减少磁盘读 取次数, 提高数据处理效率。 After receiving the route change notification reported by the managed network element (which carries the new routing information of the dynamic tunnel), the dynamic tunnel routing processing module may generate a prompt message to remind the user to save the new routing information, and if the user selects to save the route, The command, dynamic tunnel routing processing module saves the new route to the database. In this way, for the same dynamic tunnel, only one route change notification is recorded, and the route change notification is cached in the memory, until the route change notification is confirmed by the user to change the new route change notification of the tunnel, so that the network is in the managed network. When a large number of routing change events are reported, the network management implements the merging of multiple change events of the same tunnel, which is beneficial to effectively reduce the network management. The amount of data to be processed; based on the above mechanism, the tunnel routing information storage operation can be performed only when the tunnel is UP for the first time, or when the route is updated when the user selects, thereby effectively reducing the number of disk reads and increasing the data. Processing efficiency.
在实际应用中, 例如告警处理模块可从动态 Tunnel路由处理模块获取故障 Tunnel的历史路由信息, 其中, 动态 Tunnel路由处理模块例如可从数据库查询 到该动态 Tunnel的历史路由信息, 并可根据全网接口信息将历史路由信息中的 IP地址转换为接口信息; 告警处理模块可根据接口信息查询到该 Tunnel所经过 路由上的所有告警及该 Tunnel上的所有告警; 告警处理模块可根据推演规则对 告警进行相关性分析, 得出根因告警, 推演规则至少包括如下规则中的一种: 上游告警是下游告警的根因告警, 接口告警是 Tunnel告警的根因告警, 物理接 口告警是其绑定的逻辑接口告警的根因告警等。  In an actual application, for example, the alarm processing module may obtain the historical routing information of the faulty tunnel from the dynamic tunnel routing processing module, where the dynamic tunnel routing processing module may query the historical routing information of the dynamic tunnel from the database, for example, and may be based on the entire network. The interface information is used to translate the IP address in the historical routing information into the interface information. The alarm processing module can query all the alarms on the route that the tunnel passes and all the alarms on the tunnel according to the interface information. The alarm processing module can alarm according to the deduction rules. Correlation analysis is performed to obtain the root cause alarm. The derivation rule includes at least one of the following rules: The upstream alarm is the root cause alarm of the downstream alarm, the interface alarm is the root cause alarm of the tunnel alarm, and the physical interface alarm is bound. The root cause alarm of the logical interface alarm.
通过以上推演分析,告警处理模块能够有效找到引起该 Tunnel故障的根因 告警, 例如以太网信号丟失(ETH_LOS )告警、 激光器告警是 Tunnel告警的 根因告警, 激光器告警是 ETH_LOS告警的上游告警, 最终告警处理模块能够 自动推断出该 Tunnel故障是由激光器告警引起。  Through the above analysis, the alarm processing module can effectively find the root cause alarm that causes the tunnel fault, such as the Ethernet signal loss (ETH_LOS) alarm, the laser alarm is the root cause alarm of the tunnel alarm, and the laser alarm is the upstream alarm of the ETH_LOS alarm. The alarm processing module can automatically infer that the tunnel fault is caused by a laser alarm.
在实际应用中, 故障诊断模块提供智能诊断功能, 以找到 Tunnel故障发生 的疑似点。 例如故障诊断模块从动态 Tunnel路由处理模块获取故障 Tunnel的历 史路由信息, 其中, 动态 Tunnel路由处理模块例如可从数据库查询到该动态 Tunnel的历史路由信息, 并可根据全网接口信息将历史路由信息中的 IP地址转 换为接口信息; 故障诊断模块根据该动态 Tunnel的历史路由信息自动生成测试 命令集合(可包括生成从源到目的的每跳的 ping/Trace Route命令 )并执行, 以 分析出执行失败的点 (即疑似故障点)。 故障诊断模块根据动态 Tunnel的路由 信息检查配置数据, 例如可包括: 检查激光器状态、 接口使能状态、 上下游 IP 地址是否在同一网段,路由约束信息是否匹配实际路由信息,每个路由节点上 的 Tunnel是否成功建立等等, 并可将错误的配置数据提示给用户。  In practical applications, the fault diagnosis module provides intelligent diagnostic functions to find suspected points where tunnel faults occur. For example, the fault diagnosis module obtains the historical routing information of the faulty tunnel from the dynamic tunnel routing processing module, where the dynamic tunnel routing processing module can query the historical routing information of the dynamic tunnel from the database, and can perform historical routing information according to the information of the entire network interface. The IP address is converted into interface information; the fault diagnosis module automatically generates a test command set according to the historical routing information of the dynamic tunnel (which may include a ping/Trace Route command for generating each hop from the source to the destination) and executes to analyze the execution. The point of failure (ie, the suspected point of failure). The fault diagnosis module checks the configuration data according to the routing information of the dynamic tunnel. For example, the method may include: checking the status of the laser, the interface enabling state, whether the upstream and downstream IP addresses are in the same network segment, and whether the routing constraint information matches the actual routing information, and each routing node is configured. Whether the tunnel is successfully established, etc., and can prompt the user with the wrong configuration data.
在实际应用中, 故障诊断模块结合动态 Tunnel路由处理模块的路由信息进 行综合分析, 通常能够一键式定位出导致动态 Tunnel故障的根本原因。  In the actual application, the fault diagnosis module combines the routing information of the dynamic tunnel routing processing module for comprehensive analysis, and usually locates the root cause of the dynamic tunnel failure in one click.
需要说明的是, 网管还可采用其它模块架构来实现动态隧道故障诊断, 此 处不在——列举。 It should be noted that the network management system can also implement dynamic tunnel fault diagnosis by using other module architectures. Not at all - enumeration.
此外, 网管确定出动态 Tunnel中的疑似故障点后, 用户可进一步对该疑似 故障点的故障问题进行确认和故障排除。  In addition, after the network management determines the suspected fault point in the dynamic tunnel, the user can further confirm and troubleshoot the fault problem of the suspected fault point.
由上可见, 在本应用例中网管保存动态隧道 Tunnel的路由信息; 这样在该 动态 Tunnel故障时, 网管便可查询在该动态 Tunnel故障前所保存的对应该动态 Tunnel的最新路由信息; 根据查询得到的该动态 Tunnel的最新路由信息解析出 该动态 Tunnel所经过的接口和网元; 进而可通过检查该动态 Tunnel所经过的接 口和网元,以获知该动态 Tunnel中的疑似故障点,这样便可自动实现动态 Tunnel 的故障排查,进而有利于期降低排查动态 Tunnel故障的复杂度,提升动态 Tunnel 故障排查效率。  It can be seen that, in this application example, the network management system saves the routing information of the dynamic tunnel tunnel; in this case, when the dynamic tunnel is faulty, the network management system can query the latest routing information corresponding to the dynamic tunnel saved before the dynamic tunnel failure; Obtaining the latest routing information of the dynamic tunnel to parse the interface and the network element through which the dynamic tunnel passes; and then checking the interface and the network element that the dynamic tunnel passes through to obtain the suspected fault point in the dynamic tunnel, so that Troubleshoot the dynamic tunnel automatically, which helps reduce the complexity of troubleshooting dynamic tunnels and improves the efficiency of troubleshooting dynamic tunnels.
需要说明的是, 对于前述的各方法实施例, 为了筒单描述, 故将其都表述 为一系列的动作组合,但是本领域技术人员应该知悉, 本发明实施例并不受所 描述的动作顺序的限制, 因为依据本发明实施例, 某些步骤可以采用其他顺序 或者同时进行。 其次, 本领域技术人员也应该知悉, 说明书中所描述的实施例 均属于优选实施例, 所涉及的动作和模块并不一定是本发明所必须的。  It should be noted that, for each of the foregoing method embodiments, for the description of the package, it is expressed as a series of action combinations, but those skilled in the art should know that the embodiments of the present invention are not subject to the described action sequence. Limitations, as certain steps may be performed in other orders or concurrently in accordance with embodiments of the present invention. In addition, those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and the actions and modules involved are not necessarily required by the present invention.
在上述实施例中, 对各个实施例的描述都各有侧重, 某个实施例中没 有详述的部分, 可以参见其他实施例的相关描述。  In the above embodiments, the descriptions of the various embodiments are different, and the details are not described in detail in an embodiment, and the related descriptions of other embodiments can be referred to.
为便于更好的实施本发明实施例的上述技术方案,本发明实施例中还提供 实施上述技术方案的相应装置。 参见图 4, 本发明实施例提供的网管设备 400, 可包括: 存储器 410、 查询 模块 420、 路由解析模块 430和诊断模块 440。  In order to facilitate the implementation of the above technical solutions of the embodiments of the present invention, the corresponding devices for implementing the foregoing technical solutions are also provided in the embodiments of the present invention. Referring to FIG. 4, the network management device 400 provided by the embodiment of the present invention may include: a memory 410, a query module 420, a route parsing module 430, and a diagnostic module 440.
其中, 存储器 410, 用于保存动态隧道 Tunnel的路由信息;  The storage 410 is configured to save routing information of the dynamic tunnel.
在一种应用场景下, 存储器 410可具体用于, 在接收到上述动态 Tunnel首 次变为 UP状态的通知后,保存上述动态 Tunnel当时的路由信息; 当用户查询上 述动态 Tunnel当前的路由信息时, 若上述动态 Tunnel当前的路由信息与已保存 的对应上述动态 Tunnel的路由信息存在差异, 且接收到用户的路由信息保存指 令, 则保存上述动态 Tunnel当前的路由信息; 在接收到对应上述动态 Tunnel的 路由变更通知后, 若接收到用户的路由信息保存指令, 则保存上述路由变更通 知中携带的上述动态 Tunnel变更后的路由信息。 In an application scenario, the memory 410 may be specifically configured to: when receiving the notification that the dynamic tunnel first becomes the UP state, save the routing information of the dynamic tunnel at the time; when the user queries the current routing information of the dynamic tunnel, If the current routing information of the dynamic tunnel is different from the saved routing information corresponding to the dynamic tunnel, and the routing information storage instruction of the user is received, the current routing information of the dynamic tunnel is saved; and the dynamic tunnel corresponding to the dynamic tunnel is received. After receiving the route change notification, if the user saves the route information save command, the route change is saved. The routing information after the dynamic tunnel change carried in the knowledge.
举例来说, 网管设备 400在接收到动态 Tunnel首次变为 UP状态的通知(其 中在动态 Tunnel的路由协商成功且能够正常传递数据后, 该动态 Tunnel的首节 点可向网管发送指示该动态 Tunnel变为 UP状态的通知)后, 存储器 410可保存 该动态 Tunnel当时的路由信息。 又例如, 当用户查询该动态 Tunnel当前的路由 信息时, 若该动态 Tunnel当前的路由信息与已保存的对应该动态 Tunnel的路由 信息存在差异, 且又接收到用户的路由信息保存指令, 则存储器 410可保存该 动态 Tunnel当前的路由信息(当然, 当用户查询该动态 Tunnel当前路由信息时, 若该动态 Tunnel当前的路由信息与已保存的对应该动态 Tunnel的路由信息存在 差异, 存储器 410亦可以自动保存该动态 Tunnel当前的路由信息)。 又例如, 网 管设备 400在接收到对应该动态 Tunnel的路由变更通知后, 若接收到用户的路 由信息保存指令, 则存储器 410可保存该路由变更通知中携带的该动态 Tunnel 变更后的路由信息(当然, 在接收到对应该动态 Tunnel的路由变更通知后, 存 储器 410亦可自动保存该路由变更通知中携带的该动态 Tunnel变更后的路由信 息)。 又例如, 网管设备 400可在发现该动态 Tunnel的路由信息变更时, 存储器 410在用户指令的指示下或者自动保存该动态 Tunnel变更后的路由信息。  For example, the network management device 400 receives the notification that the dynamic tunnel is changed to the UP state for the first time. After the routing negotiation of the dynamic tunnel succeeds and the data can be normally transmitted, the first node of the dynamic tunnel can send the dynamic tunnel to the network administrator to indicate that the dynamic tunnel is changed. After being notified of the UP state, the memory 410 can save the routing information of the dynamic tunnel at the time. For example, when the user queries the current routing information of the dynamic tunnel, if the current routing information of the dynamic tunnel is different from the saved routing information corresponding to the dynamic tunnel, and the user receives the routing information saving instruction, the memory is received. The 410 can save the current routing information of the dynamic tunnel. (Of course, when the user queries the current routing information of the dynamic tunnel, if the current routing information of the dynamic tunnel is different from the saved routing information corresponding to the dynamic tunnel, the memory 410 can also be used. Automatically save the current routing information of the dynamic tunnel). For example, after receiving the route change notification corresponding to the dynamic tunnel, the network management device 400 can save the routing information after the dynamic tunnel change carried in the route change notification, if the routing information storage instruction of the user is received. Certainly, after receiving the route change notification corresponding to the dynamic tunnel, the memory 410 may also automatically save the routing information after the dynamic tunnel change carried in the route change notification). For example, when the network management device 400 finds that the routing information of the dynamic tunnel is changed, the memory 410 automatically saves the routing information after the dynamic tunnel change under the instruction of the user instruction.
又例如, 网管设备 400可建立第一线程和第二线程; 第一线程在接收到对 应该动态 Tunnel的路由变更通知后, 在緩存中查询是否已有对应该动态 Tunnel 的路由变更通知, 若没有, 则将该接收到对应该动态 Tunnel的路由变更通知存 储到緩存中; 若有, 则在删除緩存中该已有的对应该动态 Tunnel的路由变更通 知后, 将该接收到对应该动态 Tunnel的路由变更通知存储到緩存中, 其中, 该 路由变更通知携带有该动态 Tunnel变更后的路由信息; 第二线程从緩存中读取 对应该动态 Tunnel的路由变更通知,将读取的该路由变更通知中携带的该动态 Tunnel变更后的路由信息存入数据库。如此,第一线程和第二线程对动态 Tunnel 的路由变更通知的处理形成了一种生存者消费者工作模式,并且緩存中只緩存 了最后一次变更事件, 有利于减少存库次数。  For example, the network management device 400 can establish a first thread and a second thread. After receiving the route change notification corresponding to the dynamic tunnel, the first thread queries the cache whether there is a route change notification corresponding to the dynamic tunnel. Then, the route change notification corresponding to the dynamic tunnel is stored in the cache; if yes, after the existing route change notification corresponding to the dynamic tunnel is deleted in the cache, the corresponding dynamic tunnel is received. The route change notification is stored in the cache, where the route change notification carries the routing information after the dynamic tunnel change; the second thread reads the route change notification corresponding to the dynamic tunnel from the cache, and notifies the read route change notification The routing information changed by the dynamic tunnel carried in the database is stored in the database. In this way, the processing of the route change notification of the dynamic thread by the first thread and the second thread forms a survivor consumer working mode, and only the last change event is cached in the cache, which is beneficial to reducing the number of times of storing the library.
查询模块 420, 用于在发现上述动态 Tunnel故障时, 查询存储器 410在上述 动态 Tunnel故障前所保存的对应该动态 Tunnel的最新路由信息; 路由解析模块 430, 用于根据查询模块 420查询得到的上述动态 Tunnel的最 新路由信息, 解析出上述动态 Tunnel所经过的接口和网元; The querying module 420 is configured to query the latest routing information corresponding to the dynamic tunnel saved by the memory 410 before the dynamic tunnel failure, when the dynamic tunnel fault is found; The route parsing module 430 is configured to parse the interface and the network element that the dynamic tunnel passes through according to the latest routing information of the dynamic tunnel that is obtained by the query module 420.
诊断模块 440 , 用于检查上述动态 Tunnel所经过的接口和网元, 以获知上 述动态 Tunnel中的疑似故障点。  The diagnosis module 440 is configured to check the interface and the network element that the dynamic tunnel passes through to learn the suspected fault point in the dynamic tunnel.
参见图 5-a, 在一种应用场景下, 诊断模块 440可包括: 生成子模块 441和 测试子模块 442;  Referring to FIG. 5-a, in an application scenario, the diagnostic module 440 can include: a generating submodule 441 and a testing submodule 442;
其中, 生成子模块 441 , 用于生成测试命令集合;  The generating submodule 441 is configured to generate a test command set;
测试子模块 442,用于利用生成子模块 441生成的测试命令集合测试上述动 态 Tunnel所经过的接口和网元, 以获知上述动态 Tunnel的疑似故障点;  The test sub-module 442 is configured to test the interface and the network element that the dynamic tunnel passes through by using the test command set generated by the generating sub-module 441 to obtain the suspected fault point of the dynamic tunnel.
参见图 5-b, 在一种应用场景下, 诊断模块 440可包括: 查询子模块 443和 推演子模块 444。  Referring to FIG. 5-b, in an application scenario, the diagnostic module 440 can include: a query sub-module 443 and a derivation sub-module 444.
查询子模块 443 , 用于根据上述动态 Tunnel所经过的接口和网元, 查询出 上述动态 Tunnel所经过路由上的所有告警和该动态 Tunnel上的所有告警;  The query sub-module 443 is configured to query, according to the interface and the network element that the dynamic tunnel passes, all the alarms on the route that the dynamic tunnel passes and all the alarms on the dynamic tunnel.
推演子模块 444, 用于基于预设的推演规则对上述查询出的所有告警进行 相关性分析, 以得出根因告警, 其中, 该推演规则例如可包括如下规则的至少 一种: 上游告警是下游告警的根因告警、 接口告警是动态 Tunnel告警的根因告 警、 物理接口告警是其绑定的逻辑接口告警的根因告警。  The initiating sub-module 444 is configured to perform correlation analysis on all the alarms that are queried based on the preset derivation rules to obtain a root cause alarm, where the derivation rule may include, for example, at least one of the following rules: The root cause alarm of the downstream alarm, the interface alarm is the root cause alarm of the dynamic tunnel alarm, and the physical interface alarm is the root cause alarm of the logical interface alarm bound to it.
参见图 5-c,在一种应用场景下,诊断模块 440可包括: 配置检查子模块 445 和确定子模块 446。  Referring to FIG. 5-c, in an application scenario, the diagnostic module 440 can include: a configuration check sub-module 445 and a determination sub-module 446.
其中, 配置检查子模块 445 , 检查上述动态 Tunnel所经过的接口和网元的 配置数据的正确性;  The configuration checking sub-module 445 is configured to check the correctness of the configuration data of the interface and the network element that the dynamic tunnel passes through;
确定子模块 446 , 用于根据检测结果确定上述动态 Tunnel中的疑似故障点。 由上可见, 本实施例中网管设备 400保存动态隧道 Tunnel的路由信息; 这 样在该动态 Tunnel故障时,网管设备 400便可查询在该动态 Tunnel故障前所保存 的对应该动态 Tunnel的最新路由信息; 根据查询得到的该动态 Tunnel的最新路 由信息解析出该动态 Tunnel所经过的接口和网元; 进而可通过检查该动态 Tunnel所经过的接口和网元, 以获知该动态 Tunnel中的疑似故障点, 这样便可 自动实现动态 Tunnel的故障排查, 进而有利于期降低排查动态 Tunnel故障的复 杂度, 提升动态 Tunnel故障排查效率。 The determining sub-module 446 is configured to determine a suspected fault point in the dynamic tunnel according to the detection result. It can be seen that, in the embodiment, the network management device 400 saves the routing information of the dynamic tunnel tunnel. When the dynamic tunnel is faulty, the network management device 400 can query the latest routing information of the dynamic tunnel that is saved before the dynamic tunnel is faulty. The interface and the network element through which the dynamic tunnel passes are parsed according to the latest routing information of the dynamic tunnel obtained by the query; and the interface and the network element that the dynamic tunnel passes through are checked to obtain the suspected fault point in the dynamic tunnel. In this way, the fault diagnosis of the dynamic tunnel can be automatically implemented, which is beneficial to reduce the fault of troubleshooting the dynamic tunnel. Increase the efficiency of troubleshooting dynamic tunnels.
本发明实施例还提供一种故障诊断系统, 可包括网管设备 400。  The embodiment of the invention further provides a fault diagnosis system, which may include a network management device 400.
本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步 骤是可以通过程序来指令相关的硬件来完成,该程序可以存储于一计算机可读 存储介质中, 存储介质可以包括: 只读存储器、 随机存储器、 磁盘或光盘等。  A person skilled in the art may understand that all or part of the various steps of the foregoing embodiments may be completed by a program instructing related hardware. The program may be stored in a computer readable storage medium, and the storage medium may include: Read-only memory, random access memory, disk or optical disk, etc.
以上对本发明实施例所提供的动态隧道故障诊断方法及设备和系统进行 以上实施例的说明只是用于帮助理解本发明的方法及其核心思想; 同时,对于 本领域的一般技术人员,依据本发明的思想,在具体实施方式及应用范围上均 会有改变之处, 综上, 本说明书内容不应理解为对本发明的限制。  The above description of the dynamic tunnel fault diagnosis method, device and system provided by the embodiments of the present invention is only for helping to understand the method and core idea of the present invention. Meanwhile, for those skilled in the art, according to the present invention The scope of the present invention is not limited by the scope of the present invention.

Claims

权 利 要 求 Rights request
1、 一种动态隧道故障诊断方法, 其特征在于, 包括:  A dynamic tunnel fault diagnosis method, comprising:
网管保存动态隧道 Tunnel的路由信息;  The NMS saves the routing information of the dynamic tunnel.
在发现所述动态 Tunnel故障时, 查询在所述动态 Tunnel故障前所保存的对 应该动态 Tunnel的最新路由信息;  When the dynamic tunnel is faulty, the latest routing information of the corresponding dynamic tunnel saved before the dynamic tunnel fault is queried;
根据查询得到的所述动态 Tunnel的最新路由信息, 解析出所述动态 Tunnel 所经过的接口和网元;  The interface and the network element through which the dynamic tunnel passes are parsed according to the latest routing information of the dynamic tunnel obtained by the query;
检查所述动态 Tunnel所经过的接口和网元, 以获知所述动态 Tunnel中的疑 似故障点。  Checking the interfaces and network elements that the dynamic tunnel passes through to learn the suspected fault points in the dynamic tunnel.
2、 根据权利要求 1所述的方法, 其特征在于,  2. The method of claim 1 wherein
所述检查所述动态 Tunnel所经过的接口和网元, 以获知所述动态 Tunnel中 的疑似故障点, 包括:  Checking the interface and the network element that the dynamic tunnel passes through to learn the suspected fault points in the dynamic tunnel, including:
生成测试命令集合;  Generate a test command set;
利用所述测试命令集合测试所述动态 Tunnel所经过的接口和网元, 以获知 所述动态 Tunnel的疑似故障点。  The interface and the network element through which the dynamic tunnel passes are tested by using the test command set to learn the suspected fault point of the dynamic tunnel.
3、 根据权利要求 2所述的方法, 其特征在于,  3. The method of claim 2, wherein
所述生成测试命令集合包括:  The generating test command set includes:
生成对应从所述动态 Tunnel的源节点到目的节点之间的每跳的 ping和 /或 Trace Route;则试命令。  Generate a ping and/or trace route corresponding to each hop from the source node of the dynamic tunnel to the destination node;
4、 根据权利要求 1所述的方法, 其特征在于,  4. The method of claim 1 wherein:
所述检查所述动态 Tunnel所经过的接口和网元, 以获知所述动态 Tunnel中 的疑似故障点, 包括:  Checking the interface and the network element that the dynamic tunnel passes through to learn the suspected fault points in the dynamic tunnel, including:
根据所述动态 Tunnel所经过的接口和网元, 查询出所述动态 Tunnel所经过 路由上的所有告警和该动态 Tunnel上的所有告警;  Querying all the alarms on the route that the dynamic tunnel passes and all the alarms on the dynamic tunnel according to the interface and the network element that the dynamic tunnel passes through;
基于预设的推演规则对所述查询出的所有告警进行相关性分析,以得出根 因告警, 其中, 所述推演规则至少包括如下规则中的一种: 上游告警是下游告 警的根因告警、 接口告警是动态 Tunnel告警的根因告警、 物理接口告警是其绑 定的逻辑接口告警的根因告警。 Performing correlation analysis on all the queried alarms based on the preset deduction rules to obtain a root cause alarm, wherein the derivation rule includes at least one of the following rules: The upstream alarm is a root cause alarm of the downstream alarm The interface alarm is the root cause alarm of the dynamic tunnel alarm, and the physical interface alarm is the root cause alarm of the logical interface alarm bound to it.
5、 根据权利要求 1所述的方法, 其特征在于, 5. The method of claim 1 wherein:
所述检查所述动态 Tunnel所经过的接口和网元, 以获知所述动态 Tunnel中 的疑似故障点, 包括:  Checking the interface and the network element that the dynamic tunnel passes through to learn the suspected fault points in the dynamic tunnel, including:
检查所述动态 Tunnel所经过的接口和网元的配置数据的正确性;  Checking the correctness of the configuration data of the interface and the NE that the dynamic tunnel passes through;
根据检测结果确定所述动态 Tunnel中的疑似故障点。  A suspected fault point in the dynamic tunnel is determined according to the detection result.
6、 根据权利要求 1至 5任一项所述的方法, 其特征在于,  6. A method according to any one of claims 1 to 5, characterized in that
所述保存动态隧道 Tunnel的路由信息, 包括:  The routing information of the dynamic tunnel is saved, including:
在接收到所述动态 Tunnel首次变为 UP状态的通知后,保存所述动态 Tunnel 当时的路由信息;  After receiving the notification that the dynamic tunnel first becomes the UP state, the routing information of the dynamic tunnel is saved;
当用户查询所述动态 Tunnel当前的路由信息时, 若所述动态 Tunnel当前的 路由信息与已保存的对应所述动态 Tunnel的路由信息存在差异,且接收到用户 的路由信息保存指令, 则保存所述动态 Tunnel当前的路由信息;  When the user queries the current routing information of the dynamic tunnel, if the current routing information of the dynamic tunnel is different from the saved routing information corresponding to the dynamic tunnel, and the routing information storage instruction of the user is received, the saver The current routing information of the dynamic tunnel;
在接收到对应所述动态 Tunnel的路由变更通知后, 若接收到用户的路由信 息保存指令, 则保存所述路由变更通知中携带的所述动态 Tunnel变更后的路由 信息。  After receiving the route change notification corresponding to the dynamic tunnel, if the route information storage command of the user is received, the routing information after the dynamic tunnel change carried in the route change notification is saved.
7、 根据权利要求 1至 5任一项所述的方法, 其特征在于,  7. A method according to any one of claims 1 to 5, characterized in that
所述保存动态隧道 Tunnel的路由信息, 包括:  The routing information of the dynamic tunnel is saved, including:
建立第一线程和第二线程;  Establishing a first thread and a second thread;
第一线程在接收到对应所述动态 Tunnel的路由变更通知后,在緩存中查询 是否已有对应该动态 Tunnel的路由变更通知, 若没有, 则将所述接收到对应所 述动态 Tunnel的路由变更通知存储到緩存中; 若有, 则在删除緩存中该已有的 对应该动态 Tunnel的路由变更通知后, 将所述接收到对应所述动态 Tunnel的路 由变更通知存储到緩存中, 其中, 所述路由变更通知携带有所述动态 Tunnel 变更后的路由信息;  After receiving the route change notification corresponding to the dynamic tunnel, the first thread queries the cache whether there is a route change notification corresponding to the dynamic tunnel, and if not, the route change corresponding to the dynamic tunnel is received. The notification is stored in the cache; if yes, after the existing route change notification corresponding to the dynamic tunnel is deleted in the cache, the route change notification corresponding to the dynamic tunnel is stored in the cache, where The routing change notification carries the routing information after the dynamic tunnel is changed;
第二线程从緩存中读取对应所述动态 Tunnel的路由变更通知, 将读取的所 述路由变更通知中携带的所述动态 Tunnel变更后的路由信息存入数据库。  The second thread reads the route change notification corresponding to the dynamic tunnel from the cache, and stores the changed route information of the dynamic tunnel carried in the read route change notification into the database.
8、 一种网管设备, 其特征在于, 包括:  8. A network management device, comprising:
存储器, 用于保存动态隧道 Tunnel的路由信息; 查询模块, 用于在发现所述动态 Tunnel故障时, 查询所述存储器在所述动 态 Tunnel故障前所保存的对应该动态 Tunnel的最新路由信息; a memory, configured to save routing information of the dynamic tunnel tunnel; a querying module, configured to query, when the dynamic tunnel is faulty, the latest routing information of the dynamic tunnel that is saved before the dynamic tunnel is faulty;
路由解析模块, 用于根据所述查询模块查询得到的所述动态 Tunnel的最新 路由信息, 解析出所述动态 Tunnel所经过的接口和网元;  a route parsing module, configured to parse out an interface and a network element that the dynamic tunnel passes through according to the latest routing information of the dynamic tunnel that is obtained by the query module;
诊断模块, 用于检查所述动态 Tunnel所经过的接口和网元, 以获知所述动 态 Tunnel中的疑似故障点。  The diagnosis module is configured to check an interface and a network element that the dynamic tunnel passes through to learn a suspected fault point in the dynamic tunnel.
9、 根据权利要求 8所述的网管设备, 其特征在于,  9. The network management device according to claim 8, wherein:
所述诊断模块包括:  The diagnostic module includes:
生成子模块, 用于生成测试命令集合;  Generating a submodule for generating a test command set;
测试子模块,用于利用所述生成子模块生成的测试命令集合测试所述动态 a test submodule for testing the dynamics with a test command set generated by the generating submodule
Tunnel所经过的接口和网元, 以获知所述动态 Tunnel的疑似故障点; The interface and the network element that the tunnel passes through to learn the suspected fault point of the dynamic tunnel.
或者,  Or,
所述诊断模块包括:  The diagnostic module includes:
查询子模块, 用于根据所述动态 Tunnel所经过的接口和网元, 查询出所述 动态 Tunnel所经过路由上的所有告警和该动态 Tunnel上的所有告警;  Querying a sub-module, configured to query, according to the interface and the network element that the dynamic tunnel passes, all the alarms on the route that the dynamic tunnel passes and all the alarms on the dynamic tunnel;
推演子模块,用于基于预设的推演规则对所述查询出的所有告警进行相关 性分析, 以得出根因告警, 其中, 所述推演规则至少包括如下规则中的一种: 上游告警是下游告警的根因告警、 接口告警是动态 Tunnel告警的根因告警、 物 理接口告警是其绑定的逻辑接口告警的根因告警;  And a derivation sub-module, configured to perform correlation analysis on all the alarms that are queried based on the preset deduction rules, to obtain a root cause alarm, where the derivation rule includes at least one of the following rules: The root cause alarm of the downstream alarm, the interface alarm is the root cause alarm of the dynamic tunnel alarm, and the physical interface alarm is the root cause alarm of the logical interface alarm bound to it;
或者,  Or,
所述诊断模块包括:  The diagnostic module includes:
配置检查子模块,检查所述动态 Tunnel所经过的接口和网元的配置数据的 正确性;  The configuration check module is configured to check the correctness of the configuration data of the interface and the network element that the dynamic tunnel passes through;
确定子模块, 用于根据检测结果确定所述动态 Tunnel中的疑似故障点。  The determining submodule is configured to determine a suspected fault point in the dynamic tunnel according to the detection result.
10、 根据权利要求 8或 9所述的网管设备, 其特征在于, 10. The network management device according to claim 8 or 9, wherein
所述存储器具体用于, 在接收到所述动态 Tunnel首次变为 UP状态的通知 后, 保存所述动态 Tunnel当时的路由信息; 当用户查询所述动态 Tunnel当前的 路由信息时, 若所述动态 Tunnel当前的路由信息与已保存的对应所述动态 Tunnel的路由信息存在差异, 且接收到用户的路由信息保存指令, 则保存所述 动态 Tunnel当前的路由信息; The memory is specifically configured to: when receiving the notification that the dynamic tunnel first becomes the UP state, save the routing information of the dynamic tunnel at the time; when the user queries the current routing information of the dynamic tunnel, if the dynamic The current routing information of the tunnel corresponds to the saved dynamics. If the routing information of the tunnel is different, and the routing information of the user is saved, the current routing information of the dynamic tunnel is saved.
在接收到对应所述动态 Tunnel的路由变更通知后, 若接收到用户的路由信 息保存指令, 则保存所述路由变更通知中携带的所述动态 Tunnel变更后的路由 信息。  After receiving the route change notification corresponding to the dynamic tunnel, if the route information storage command of the user is received, the routing information after the dynamic tunnel change carried in the route change notification is saved.
11、 一种故障诊断系统, 其特征在于, 包括:  11. A fault diagnosis system, comprising:
如权利要求 8~10任一项所述的网管设备。  A network management device according to any one of claims 8 to 10.
PCT/CN2011/077469 2011-07-22 2011-07-22 Dynamic tunnel fault diagnosis method, device and system WO2012106914A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2011/077469 WO2012106914A1 (en) 2011-07-22 2011-07-22 Dynamic tunnel fault diagnosis method, device and system
CN201180001572.6A CN102308524B (en) 2011-07-22 2011-07-22 Dynamic tunnel fault diagnosis method and device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2011/077469 WO2012106914A1 (en) 2011-07-22 2011-07-22 Dynamic tunnel fault diagnosis method, device and system

Publications (1)

Publication Number Publication Date
WO2012106914A1 true WO2012106914A1 (en) 2012-08-16

Family

ID=45381268

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2011/077469 WO2012106914A1 (en) 2011-07-22 2011-07-22 Dynamic tunnel fault diagnosis method, device and system

Country Status (2)

Country Link
CN (1) CN102308524B (en)
WO (1) WO2012106914A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106951656A (en) * 2017-03-31 2017-07-14 广州电力机车有限公司 A kind of mine vehicle fault handling method based on MATLAB softwares
CN109379241A (en) * 2018-12-27 2019-02-22 新华三技术有限公司 A kind of routing information determines method and device

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103312806B (en) * 2013-06-19 2016-09-28 广东威创视讯科技股份有限公司 Diagnostic method and diagnostic system for distributed system
CN103369044A (en) * 2013-07-11 2013-10-23 无锡交大联云科技有限公司 Mobile terminal user network perception diagnosis and treatment method based on cloud knowledge base
CN107623602B (en) * 2017-10-12 2021-03-26 郑州市景安网络科技股份有限公司 Automatic checking method and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040153572A1 (en) * 2003-01-31 2004-08-05 Walker Anthony Paul Michael Method of indicating a path in a computer network
CN1878322A (en) * 2005-09-27 2006-12-13 华为技术有限公司 Fault positioning method and device
US20090225652A1 (en) * 2008-03-07 2009-09-10 Jean-Philippe Vasseur Locating tunnel failure based on next-next hop connectivity in a computer network
CN101577680A (en) * 2009-06-19 2009-11-11 福建星网锐捷网络有限公司 Updating method, device and system of tunnel routing information

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101436960A (en) * 2008-12-18 2009-05-20 中国移动通信集团浙江有限公司 Test method for monitoring P bearing network quality
CN101707537B (en) * 2009-11-18 2012-01-25 华为技术有限公司 Positioning method of failed link and alarm root cause analyzing method, equipment and system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040153572A1 (en) * 2003-01-31 2004-08-05 Walker Anthony Paul Michael Method of indicating a path in a computer network
CN1878322A (en) * 2005-09-27 2006-12-13 华为技术有限公司 Fault positioning method and device
US20090225652A1 (en) * 2008-03-07 2009-09-10 Jean-Philippe Vasseur Locating tunnel failure based on next-next hop connectivity in a computer network
CN101577680A (en) * 2009-06-19 2009-11-11 福建星网锐捷网络有限公司 Updating method, device and system of tunnel routing information

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106951656A (en) * 2017-03-31 2017-07-14 广州电力机车有限公司 A kind of mine vehicle fault handling method based on MATLAB softwares
CN109379241A (en) * 2018-12-27 2019-02-22 新华三技术有限公司 A kind of routing information determines method and device
CN109379241B (en) * 2018-12-27 2021-12-24 新华三技术有限公司 Path information determination method and device

Also Published As

Publication number Publication date
CN102308524A (en) 2012-01-04
CN102308524B (en) 2014-03-26

Similar Documents

Publication Publication Date Title
WO2017092400A1 (en) Failure recovery method and device, controller, and software defined network
US8605603B2 (en) Route convergence based on ethernet operations, administration, and maintenance protocol
Mohan et al. TCAM-aware local rerouting for fast and efficient failure recovery in software defined networks
WO2017036180A1 (en) Packet processing method and device
WO2021018309A1 (en) Method, device and system for determination of message transmission path, and computer storage medium
JP2019500812A (en) Link detection method, apparatus, network device, and controller
KR20120120472A (en) Optimized fast re-route in mpls ring topologies
JP5530864B2 (en) Network system, management server, and management method
US10764214B1 (en) Error source identification in cut-through networks
Liang et al. On diagnosis of forwarding plane via static forwarding rules in software defined networks
US10862735B2 (en) Method and apparatus for implementing operation, administration, and maintenance function
US20110141914A1 (en) Systems and Methods for Providing Ethernet Service Circuit Management
US11818038B2 (en) Initiator-based data-plane validation for segment routed, multiprotocol label switched (MPLS) networks
JP2009303092A (en) Network equipment and line switching method
WO2012106914A1 (en) Dynamic tunnel fault diagnosis method, device and system
Schmid et al. Polynomial-time what-if analysis for prefix-manipulating MPLS networks
JP4861293B2 (en) COMMUNICATION DEVICE, COMMUNICATION METHOD, AND COMMUNICATION PROGRAM
WO2021143524A1 (en) Fault detection method, and apparatus
US10979328B2 (en) Resource monitoring
Molina et al. Availability improvement of layer 2 seamless networks using openflow
JP5480189B2 (en) Network monitoring apparatus, network test method, path information management method, and program
WO2011124178A2 (en) Fault detection method, route node and system
US10917504B1 (en) Identifying the source of CRC errors in a computing network
CN114500366B (en) Method and device for preventing routing loop between main node and standby node
US11516122B2 (en) Validating active and inactive paths in a multiprotocol label switching (MPLS) network

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 201180001572.6

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11858146

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11858146

Country of ref document: EP

Kind code of ref document: A1