WO2014026374A1 - Server system, management method, and device - Google Patents

Server system, management method, and device Download PDF

Info

Publication number
WO2014026374A1
WO2014026374A1 PCT/CN2012/080293 CN2012080293W WO2014026374A1 WO 2014026374 A1 WO2014026374 A1 WO 2014026374A1 CN 2012080293 W CN2012080293 W CN 2012080293W WO 2014026374 A1 WO2014026374 A1 WO 2014026374A1
Authority
WO
WIPO (PCT)
Prior art keywords
service node
hardware resources
node device
management
information
Prior art date
Application number
PCT/CN2012/080293
Other languages
French (fr)
Chinese (zh)
Inventor
郑力
许利霞
张羽
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to PCT/CN2012/080293 priority Critical patent/WO2014026374A1/en
Priority to CN201280002787.4A priority patent/CN103210379B/en
Publication of WO2014026374A1 publication Critical patent/WO2014026374A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/64Hybrid switching systems
    • H04L12/6418Hybrid transport

Definitions

  • the present invention relates to virtualization technologies, and in particular, to a server system, a management method, and a device. Background technique
  • the server system is generally formed by a plurality of service node device interconnection and combination, and the input/output (10) resources in the service node device are interconnected by a traditional network cable or cable, and the central processing in the service node device
  • the Quick Processing Interconnect (QPI) cable and the dedicated chip are interconnected between the Central Processing Units (CPUs).
  • the server management system divides the logical partitions on the hardware resources of the server system for service use.
  • the embodiment of the invention provides a server system, a management method and a device, which are used to improve the flexibility of using hardware resources in a server system, and further improve the performance of the server system.
  • a first aspect of the present invention provides a server system, including: a management device, a high-speed peripheral component interconnect PCIE switch board, and at least one service node device; the management device through the PCIE switch board and each service Node device connection;
  • the management device is configured to acquire information about hardware resources of each service node device by using the PCIE switch board, construct a physical resource pool according to the obtained information, and establish a logical partition on the physical resource pool.
  • the management device is specifically configured to: present information about a hardware resource included in the physical resource pool to a user, and receive a selected instruction and a division instruction of the user to establish The logical partition, the selected instruction is used to indicate a hardware resource selected by the user from the physical resource pool according to an application requirement, and the dividing instruction is used to indicate the user pair
  • the selected hardware resources are allocated to form the logical partition and the hardware resources included in the logical partition.
  • the management device is further configured to install an operating system in the logical user, where the hardware platform includes hardware resources included in the logical partition.
  • the management device is further configured to: send an early warning message when a load of the hardware resource included in the logical partition is greater than a preset first load threshold.
  • the management device is further configured to: select, from the physical resource pool, when the load of the hardware resource included in the logical partition is greater than a preset first load threshold. Adding hardware resources to the logical partition and triggering a hot plug process of the newly added hardware resource; or when loading a threshold, removing excess hardware resources from the logical partition to the physical resource pool
  • the excess hardware resource is a hardware resource that is not utilized in the logical partition or whose utilization is less than a preset utilization threshold.
  • the management device is specifically connected to the hardware resources of each service node device by using the PCIE switch board;
  • the management device is configured to monitor, by using the PCIE switch board, the status of the hardware resources of each service node device, and obtain information about the hardware resources of each service node device by using the PCIE switch board.
  • the management device is specifically connected to the hardware resources of each service node device by using the PCIE switch board, and through the PCIE switch board and each of the The management controller of the service node device is connected;
  • the management device is configured to monitor, by using the PCIE switch board, a state of a hardware resource of each service node device, and obtain, by using the PCIE switch board, information about hardware resources of each service node device, or Corresponding relationship between the second ports, acquiring information of the hardware resources of each of the service node devices from the management controller of each service node device;
  • the management controller of each service node device is configured to monitor each of the service nodes.
  • the state of the hardware resource of the device; the first port is a port connected to the hardware resource of each service node device on the PCIE switch board, and the second port is the PCIE switch board and each service node device The port to which the management controller is connected.
  • the management device is specifically connected to the hardware resources of each service node device by using the PCIE switch board, and directly managed with each service node device. Controller connection
  • the management device is configured to monitor, by using the PCIE switch board, a state of a hardware resource of each service node device, and obtain, by using the PCIE switch board, information about hardware resources of each service node device; Corresponding relationship between the second ports, acquiring information of the hardware resources of each of the service node devices from the management controller of each service node device;
  • the management controller of each of the service node devices is configured to monitor a state of the hardware resources of each of the service node devices; the first port is a hardware resource connection between the PCIE switch board and each of the service node devices. Port, the second port is a port on the management device that is connected to the management controller of each of the service node devices. .
  • a second aspect of the present invention provides a management device, including:
  • a monitoring module connected to the at least one service node device by the high-speed peripheral component interconnection PCIE switch board, configured to acquire, by using the PCIE switch board, information about hardware resources of each service node device of the at least one service node device, Constructing a physical resource pool according to the obtained information;
  • a partition establishment module is configured to establish a logical partition on the physical resource pool.
  • the partition establishment module is specifically configured to: present information of a hardware resource included in the physical resource pool to a user, and receive a selected instruction and a division instruction of the user, The dividing instruction is used to indicate that the user selects a hardware resource from the physical resource pool according to an application requirement, and the dividing instruction is used to instruct the user to perform the selected hardware resource. Allocating the formed logical partition and the hardware resources included in the logical partition.
  • the partition establishment module is further configured to install an operating system in the user, where the hardware platform includes hardware resources included in the logical partition.
  • the management device further includes: an early warning module, configured to acquire, by the monitoring module, that a load of the hardware resource included in the logical partition is greater than a preset first load gate For a limited time, an early warning message is issued.
  • the partition establishment module is further configured to: when the monitoring module acquires that the load of the hardware resource included in the logical partition is greater than a preset first load threshold, Adding idle hardware resources in the physical resource pool to the logical partition, and triggering a hot plug process of the newly added hardware resources; or
  • the partition establishment module is further configured to: when the monitoring module acquires that the load of the hardware resource included in the logical partition is less than a preset second load threshold, remove excess hardware resources from the logical partition to the physical In the resource pool, the excess hardware resources are hardware resources that are not utilized in the logical partition or whose utilization is less than a preset utilization threshold.
  • the monitoring module includes:
  • a first monitoring unit configured to connect, by using the PCIE switch board, hardware resources of each of the service node devices, to monitor, by using the PCIE switch board, a status of hardware resources of each service node device, by using the PCIE
  • the switch board acquires information about the hardware resources of each of the service node devices; and the constructing unit is configured to construct the physical resource pool according to the information about the hardware resources of each service node acquired by the first monitoring unit.
  • the monitoring module further includes:
  • a second monitoring unit connected by the PCIE switch board or directly to the management controller of each service node device, between the first port and the second port connected according to the stored each service node device Corresponding relationship, obtaining information about hardware resources of each service node device from a management controller of each service node device;
  • the building unit is further configured to: according to the information about the hardware resource of each serving node acquired by the first monitoring unit, and/or the hardware resource of each serving node device acquired by the second monitoring unit Information, constructing the physical resource pool;
  • the management controller of each of the service node devices is configured to monitor a state of the hardware resources of each of the service node devices; the first port is a hardware resource connection between the PCIE switch board and each of the service node devices. Port, the second port is a port connected to the management controller of each serving node device on the management device or on the PCIE switch board.
  • a third aspect of the embodiments of the present invention provides a management method, including: The management device interconnects the PCIE switch board through the high-speed peripheral component to acquire information of hardware resources of each of the at least one service node device;
  • the management device constructs a physical resource pool according to the obtained information
  • the management device establishes a logical partition on the physical resource pool
  • the management device is connected to each of the service node devices through the PCIE switch board.
  • the establishing, by the management device, the logical partition on the physical resource pool includes: receiving a selected instruction and a dividing instruction of the user, to establish the logical partition, and selecting the The instruction is used to instruct the user to select a hardware resource from the physical resource pool according to an application requirement, where the dividing instruction is used to indicate the logical partition formed by the user to allocate the selected hardware resource, and the The hardware resources included in the logical partition.
  • the method further includes: the management device, to enable the user to install an operating system, where the hardware platform includes hardware resources included in the logical partition.
  • the method further includes: the management device sends an early warning message when a load of the hardware resource included in the logical partition is greater than a preset first load threshold.
  • the method further includes: the management device, when the load of the hardware resource included in the logical partition is greater than a preset first load threshold, from the physical resource pool Adding idle hardware resources to the logical partition and triggering a hot plug process of the newly added hardware resource; or threshold, removing excess hardware resources from the logical partition to the physical resource In the pool, the excess hardware resources are hardware resources that are not utilized in the logical partition or whose utilization is less than a preset utilization threshold.
  • the management device is connected to the hardware resources of each service node device by using the PCIE switch board;
  • the information that the management device obtains the hardware resources of each of the at least one service node device by interconnecting the PCIE switch board by the high-speed peripheral component includes:
  • the management device monitors the status of the hardware resources of each of the service node devices through the PCIE switch board, and obtains information about the hardware resources of each service node device through the PCIE switch board.
  • the management device is connected to the hardware resources of each service node device by using the PCIE switch board, and through the PCIE switch board and each service The management device of the node device is connected;
  • the management device accessing the PCIE switch board through the high-speed peripheral component to obtain the hardware resources of each of the service node devices of the at least one service node device includes:
  • the management device monitors, by using the PCIE switch board, the status of the hardware resources of each service node device, and obtains, by using the PCIE switch board, information about hardware resources of each service node device; or
  • the management device acquires, according to the stored correspondence between the first port and the second port of each service node device, the management controller of each service node device Information about hardware resources;
  • the management controller of each of the service node devices is configured to monitor a state of the hardware resources of each of the service node devices; the first port is a hardware resource connection between the PCIE switch board and each of the service node devices. Port, the second port is a port on the PCIE that is connected to the management controller of each of the service node devices.
  • the management device is connected to the hardware resources of each service node device by using the PCIE switch board, and directly controls the management of each service node device. Connection
  • the management device accessing the PCIE switch board through the high-speed peripheral component to obtain the hardware resources of each of the service node devices of the at least one service node device includes:
  • the management device monitors, by using the PCIE switch board, a state of hardware resources of each service node device, and obtains, by using the PCIE switch board, information about hardware resources of each service node device, or
  • the management device acquires each of the service port devices from each of the service node devices according to the stored correspondence between the first port and the second port of each of the service node devices Information about the hardware resources of the service node device;
  • the management controller of each of the service node devices is configured to monitor a state of the hardware resources of each of the service node devices; the first port is a hardware resource connection between the PCIE switch board and each of the service node devices. Port, the second port is a port on the management device that is connected to the management controller of each of the service node devices.
  • the management device and the service node device are connected through a PCIE switch board, and the management device acquires information about hardware resources of each service node device through the PCIE switch board, and builds the information according to the obtained hardware resource information.
  • the physical resource pool is used to establish a logical partition on the physical resource pool.
  • the physical resource pool that shields the physical topology information is implemented at the physical resource level by using the PCIE switch board, which improves the flexibility of the use of hardware resources in the physical resource pool. Improve the performance of the entire system.
  • the management device provided by the second aspect of the present invention is connected to the service node device through the PCIE switch board, obtains the hardware resource information of each service node device through the PCIE switch board, and constructs a physical resource pool according to the acquired hardware resource information.
  • a logical partition is established on the physical resource pool, and the physical resource pool for shielding physical topology information is implemented at the physical resource level by using the PCIE switch board, thereby improving the flexibility of using hardware resources in the physical resource pool, thereby effectively improving the entire system. Performance.
  • the management device obtains the information of the hardware resources of each service node device by interconnecting the PCIE switch board with the high-speed peripheral component, and constructs a physical resource pool according to the acquired hardware resource information, and then A logical partition is created on the physical resource pool to fully utilize the PCIE switch board to implement physical resource pools that shield physical topology information at the physical resource level, thereby improving the flexibility of using hardware resources in the physical resource pool, thereby effectively improving the performance of the entire system. .
  • BRIEF DESCRIPTION OF THE DRAWINGS In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, a brief description of the drawings used in the embodiments or the prior art description will be briefly described below. The drawings are some embodiments of the present invention, and those skilled in the art can obtain other drawings based on these drawings without any inventive labor.
  • FIG. 1 is a schematic structural diagram of a server system according to an embodiment of the present invention.
  • FIG. 2 is a schematic diagram of a specific structure of a server system according to an embodiment of the present invention
  • 3 is a schematic structural diagram of a management device connected to a CPU and a management controller of a computing node through a PCIE switch board according to an embodiment of the present invention
  • FIG. 4 is a schematic structural diagram of a logical partition according to an embodiment of the present invention.
  • FIG. 5 is a schematic diagram of an overall structure of a logical partition built on a physical resource pool and a physical resource pool according to an embodiment of the present invention
  • FIG. 6 is a schematic structural diagram of a management device according to an embodiment of the present invention.
  • FIG. 7 is a schematic structural diagram of a management device according to another embodiment of the present invention.
  • FIG. 8 is a flowchart of a management method according to an embodiment of the present invention.
  • FIG. 9 is a schematic structural diagram of a management method according to another embodiment of the present invention.
  • FIG. 10 is a schematic structural diagram of a management method according to another embodiment of the present invention.
  • the technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention.
  • the embodiments are a part of the embodiments of the invention, and not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
  • FIG. 1 is a schematic structural diagram of a server system according to an embodiment of the present invention.
  • the system of this embodiment includes a management device 11, a Peripheral Component Interconnect Express (PCIE) switchboard 12, and at least one service node device 13.
  • the management device 11 is connected to each service node device 13 through the PCIE switch board 12.
  • the PCIE switchboard 12 has features and advantages such as high bandwidth and low latency.
  • the service node device 13 in this embodiment may be various types of service node devices, for example, a 10-node, a computing node, a storage node, a memory expansion board, a fan board, a power board, and the like.
  • the management device 11 is configured to acquire information about hardware resources of each service node device 13 through the PCIE switch board 12, construct a physical resource pool according to the obtained information of hardware resources of each service node device, and establish a physical resource pool on the physical resource pool.
  • the hardware resources in this document refer to the actual hardware constituting the server system. More specifically, the hardware resources may be, for example, a CPU, a memory card, a hard disk, etc., which may be implemented by a server system to a computing node, a storage node, and 10 Pool management of hardware resources in a node to form a physical resource pool, which can be subdivided into, for example, a CPU Pool, memory pool, 10 resource pool, network card pool, etc.
  • the information about the hardware resources of each service node device 13 includes status information of hardware resources such as network card, CPU, memory, storage device in-position information, fault information, load, temperature, etc., and may further include, for example, a memory size, a CPU.
  • Device attribute information of hardware resources such as frequency and manufacturer.
  • the management device 11 can obtain each service node device through the PCIE switch board 12.
  • the management device and the service node device are connected through the PCIE switch board, and the management device acquires the hardware resource information of each service node device through the PCIE switch board, and builds a physical resource pool according to the acquired hardware resource information, and then Logical partitions are created on the physical resource pool to fully utilize the PCIE switch board to implement physical resource pools that shield physical topology information at the physical resource level, thereby improving the flexibility of using hardware resources in the physical resource pool and dynamically adjusting the logical partitions. Effectively improve the performance of the entire system.
  • each service node device 13 may include a management controller in addition to the foregoing hardware resources.
  • the management controller can be kept in a power-on state, and is mainly used to obtain information about the hardware resources of the service node device 13 for the management device 11 in an abnormal situation. For example, a certain hardware resource in the service node device 13 is powered off to cause the management device.
  • the management device 11 can obtain the information of the hardware resource through the management controller of the service node device 13, and in other cases, obtain the service node device through the management controller. 13 information on hardware resources.
  • the management controller may be a Baseboard Management Controller (BMC) or a logic device that implements a BMC-like function.
  • various types of service node devices may be deployed one or more as needed, and there are usually a plurality of computing nodes, storage nodes, and the like, and the server system may be expanded as needed.
  • the PCIE switch board 12 may include a Gigabit Ethernet (GE) interface, a Fast Ethernet (FE) interface, and/or a PCIE interface, and the management device 11 can pass. At least one of a GE interface, a FE interface, and a PCIE interface is connected to each of the service node devices 13.
  • the GE interface, the FE interface, and the PCIE interface may be implemented on a PCIE switch board 12 or on multiple PCIE switch boards 12 in a specific implementation.
  • Fig. 2 shows a specific structure of a server system.
  • the management device 11 is connected to the hardware resources of the service node device such as the 10-node 131, the compute node 132, and the storage node 133 through the PCIE interface on the PCIE switch board 12.
  • the management device 11 passes the GE interface on the PCIE switch board 12. It is connected to a management controller of 10 nodes 131, compute nodes 132, storage nodes 133, and the like.
  • the shared module such as the fan board 14 in the entire server system is directly connected to the management device 11 through its management controller, but is not limited thereto, and other service node devices other than the fan board 14, for example, 10 nodes 131, calculation
  • the management controller of the node 132, the storage node 133, and the like may also be directly connected to the management device 11. That is to say, the management controller of the service node device can directly connect with the management device 11 without going through the PCIE switch board 12, but the hardware resources of the service node device 13 must be connected to the management device 11 through the PCIE switch board 12.
  • connection manner between the management device 11 and the service node device 13 and the information acquisition manner in each connection mode include but are not limited to the following:
  • the management device 11 can be connected to the hardware resources of each service node device 13 through the PCIE switch board 12, and the management device 11 can monitor the state of the hardware resources of each service node device 13 through the PCIE switch board 12, and then pass through the PCIE switch board. 12 Obtain information of hardware resources of each service node device 13.
  • the management device 11 can also be connected to the hardware resources of each service node device 13 through the PCIE switch board 12, and connected to the management controller of each service node device 13 through the PCIE switch board 12, and then manage the device.
  • the status of the hardware resources of each service node device 13 can be monitored by the PCIE switch board 12, and the information of the hardware resources of each service node device 13 can be obtained through the PCIE switch board 12, or the management device 11 can also be stored according to each Corresponding relationship between the first port and the second port connected by the service node device 13 acquires information of hardware resources of each service node device 13 from the management controller of each service node device 13.
  • the management controller of each service node device 13 is used to monitor the state of the hardware resources of each service node device 13.
  • the first port is a port on the PCIE switch board 12 that is connected to the hardware resources of each service node device 13, and the second port is a port on the PCIE switch board 12 that is connected to the management controller of each service node device 13. That is, the management device 11 can obtain each service node setting through the PCIE switch board.
  • the information of the hardware resources of the standby device 13 may also receive the information of the hardware resources of each of the service node devices 13 that are reported from the management controller of each service node device 13 and according to the stored between the first port and the second port. The corresponding relationship determines the hardware resources corresponding to the information reported by the management controller, thereby obtaining the information of the hardware resources of each service node device 13.
  • the management device 11 can obtain all the information about the hardware resources of each service node device 13 through the PCIE switch board, and can also obtain all the information about the hardware resources of each service node device 13 from the management controller, or obtain the hardware through the PCIE switch board. Part of the resource information, get another part of the information from the management controller. For example, the management controller acquires the in-position information of the hardware resources, the manufacturer information, and the like, and acquires the load information of the hardware resources through the PCIE switch board.
  • management device 11 can specifically pass through the PCIE switch board 12 and each service node device.
  • the hardware resources of the 13 are connected to the management controller of each of the service node devices 13, and the management device 11 can monitor the state of the hardware resources of each of the service node devices 13 through the PCIE switch board 12 through the PCIE switch board 12. Obtaining the information of the hardware resources of each service node device 13, or the management device 11 may also obtain the correspondence between the first port and the second port connected by each of the service node devices 13 from each of the service node devices. The management controller of 13 acquires information of the hardware resources of each of the service node devices 13.
  • the management controller of each service node device 13 is configured to monitor the state of the hardware resources of each service node device 13; the first port is a port on the PCIE switch board 12 that is connected to the hardware resources of each service node device 13, The two ports are ports on the management device 11 that are connected to the management controller of each of the service node devices 13. That is, the management device 11 can obtain the information of the hardware resources of each of the service node devices 13 through the PCIE switch board, and can also receive the information of the hardware resources of each of the service node devices 13 that are reported from the management controller of each service node device 13.
  • the management device 11 can obtain all the information about the hardware resources of each service node device 13 through the PCIE switch board, and can also obtain all the information about the hardware resources of each service node device 13 from the management controller, or obtain the hardware through the PCIE switch board. Part of the resource information, get another part of the information from the management controller. For example, the management controller obtains the in-position information of the hardware resources, the manufacturer information, and the like, and acquires the load information of the hardware resources through the PCIE switch board.
  • the second port is a port on the PCIE switch board 12; if the management node and the management node of the service node device 13 The device 11 is directly connected, and the second port is a port on the management device 11.
  • the CPU of the computing node 132 is interconnected with the management device 11, and the detailed structure of the management controller of the computing node 132 and the management device 11 is as shown in FIG. 3.
  • the management device 11 is connected to the GE interface and the PCIE interface on the PCIE switch board 12, and the management controller of the compute node 132 is connected to the port 10 (PortlO) of the GE interface.
  • the computing node 132 includes a plurality of CPUs, each CPU is connected to the management device 11 through a PCIE interface. Among them, PortlO belongs to the second port, and Port20 belongs to the first port.
  • the management device 11 stores the correspondence between the Port 10 and the Port 20 connected to the computing node 132.
  • the management device 11 can obtain information about the CPU on the computing node 132, such as the slot number of the CPU. Specifically, the management device 11 further stores a correspondence between the identifier of the CPU on the computing node 132 connecting the port 20 and the connection port 20 and the slot number of the CPU. Based on this, the management device 11 can obtain the computing node 132 through the port 10.
  • the management unit reports part of the information of the CPU, such as the in-position information of the CPU, the manufacturer information, and the like, and further obtains the port 20 connected to the CPU according to the correspondence between the port 10 and the port 20, and then according to the acquired identifier of the CPU. Obtain other information such as the slot number and load of the CPU.
  • the process for the management device 11 to establish a logical partition on the physical resource pool is: the management device 11 is configured to present information of the hardware resource included in the physical resource pool to the user, and receive the selected instruction of the user. And dividing instructions to establish the logical partition.
  • the selected instruction is used to indicate a hardware resource selected by the user from the physical resource pool according to the application requirement, and the dividing instruction is used to indicate the hardware resource included in the logical partition and the logical partition formed by the user to allocate the selected hardware resource in the selected instruction. .
  • the function of the management device 11 to present the information of the hardware resources included in the physical resource pool to the user is equivalent to providing the user with a logical partition platform, so that the user uses the PCIE switch board 12 to perform management between the device 11 and the service node device 13.
  • Interconnection which solves the difference between the hardware resources of the service node devices 13 caused by various interconnection methods such as network cables and cables.
  • the topology relationship of the same type of hardware resources is the same for the user, and therefore, When partitioning logical partitions, you can consider only the application requirements, without considering the differences between hardware resources, and without considering the physical topology of each hardware resource, which is beneficial to improve the division of logical partitions. effectiveness.
  • the process of selecting hardware resources in a physical resource pool according to application requirements and performing allocation to form different logical partitions is illustrated.
  • two logical partitions are required to host the desktop cloud terminal service and related database applications.
  • the desktop cloud terminal service requires 8 CPUs, several memory, storage, and 100 virtual network cards;
  • the database application requires 2 CPUs, some memory, storage, and a virtual network card.
  • the user can select the CPU, memory, storage, and network card according to the requirements on the logical partition platform presented by the management device 11, and then divide the selected CPU, memory, storage, network card and other hardware resources into corresponding CPUs, memories, and
  • the storage and network card resources may constitute a logical partition that carries the desktop cloud terminal service and a logical partition of the related database application.
  • the management device 11 is further configured to: after the logical partition is powered on, present a hardware platform corresponding to the logical partition to the user, so that the user installs the operating system. And middleware or upper-level applications.
  • the hardware platform includes hardware resources included in the logical partition.
  • the operating system can be a normal operating system or a virtual operating system.
  • the management device 11 presents the information of the hardware resources in each logical partition to the user, which is equivalent to presenting one hardware platform to the user, and each hardware platform is similar to the ordinary for the user. Server or computer. The user can then choose to install a normal operating system or virtual operating system on the hardware platform.
  • the function of the management device 11 to present the hardware platform corresponding to the logical partition to the user is equivalent to providing an enhanced adaptation layer on the hardware resource (which can be referred to as the hardware layer) of the logical partition.
  • the enhanced adaptation layer here provides the role of the BIOS for this logical partition, in addition to providing open hardware platform support, various virtualization software support, and more.
  • FIG. 4 the structure of a logical partition is as shown in FIG. 4, which includes a hardware layer, an enhanced adaptation layer, and an operating system in order from bottom to top.
  • the embodiment provides an overall structure of a logical partition established by the management device 11 on a physical resource pool, as shown in FIG. 5.
  • the physical resource pool mainly includes a CPU pool composed of CPU resources, 10 pools composed of 10 resources, and a memory (MEM) pool composed of MEM resources, wherein the CPU resources, 10 resources, and MEM resources are service node devices.
  • Each logical partition uses the included hardware resources as a hardware platform, and the hardware platform includes enhanced adaptation. Layers, operating systems, and applications.
  • the operating system of logical partition 1 is mainly a common operating system, for example, Windows/Linux/unix, etc.;
  • the operating system of logical partition N is a virtual operating system, and a virtualized operating system, such as ESX of vmware, needs to be installed first. , create a virtual machine (VM) on it, and then install the operating system and applications for the VM.
  • the VMware ESX server is a virtual host software that partitions and consolidates the system in a common environment.
  • the PCIE switch board is used for the interconnection between the management device and the service node device, the characteristics and advantages of the high bandwidth and low delay of the PCIE switch board are fully utilized, so that the built
  • the physical resource pool shields the physical topology information of each hardware resource, and the virtualization management software layer can be omitted, which solves the problem that the prior art must shield the hardware resource difference caused by the physical topology through the virtualization technology, and reduce the problem. Manage costs while improving performance.
  • the management device 11 is further configured to issue the early warning information when the load of the hardware resource included in the logical partition is greater than a preset first load threshold.
  • the management device 11 is further configured to: when the load of the hardware resource included in the logical partition is greater than a preset first load threshold, select an idle hardware resource to be added from the constructed physical resource pool. Go to the logical partition and trigger the hot plug process of the newly added hardware resources.
  • the hot plug process of the newly added hardware resources will be different depending on the newly added hardware resources. For example, if the newly added hardware resource is a NIC resource, the corresponding hot plug process is mainly to add a driver for the newly added hardware resource. If the newly added hardware resource is a CPU, the corresponding hot plug process is mainly the CPU hot swap, online, offline, and the like.
  • the management device 11 may acquire the load status of the hardware resources in the logical partition by monitoring the state of the hardware resources of each service node device 13 during the operation of the logical partition, and include the logical partition in the discovery logical partition.
  • the load of the hardware resource is greater than the preset first load threshold, the warning information is sent and/or the hardware resource is newly added for the logical partition, and the hot plug process is triggered after the new hardware resource is added.
  • the first load threshold may be an upper threshold in the load threshold, but is not limited thereto.
  • the management device 11 may further remove excess hardware resources from the logical partition into the physical resource pool when the load of the hardware resources included in the logical partition is less than the preset second load threshold.
  • the second load threshold may be a lower threshold in the load threshold.
  • the excess hardware resource refers to The excess hardware resources in the partition to the physical resource pool is beneficial to improve the utilization of hardware resources.
  • the hardware resources with lower utilization ratios are usually the hardware resources whose utilization is lower than the preset usage threshold.
  • the thresholds can be set according to different application scenarios.
  • first load threshold is greater than the second load threshold, and the specific values of the first load threshold and the second load threshold are not limited.
  • the management device 11 of the embodiment can flexibly adjust the hardware resources in the logical partition according to the information of the hardware resources, and manage the hardware resources such as the CPU and the CPU by acquiring the information of the hardware resources in the logical partition when the logical partition is running. Hot swapping, enabling dynamic management of logical partitions.
  • the dynamic management of logical partitions can be specifically implemented by the enhanced adaptation layer provided by the management device 11.
  • the embodiment of the present invention implements interconnection between the management device and the service node device through the PCIE switch board, and fully utilizes the PCIE switch board to implement free scheduling of hardware resources in the system, so that the logical partition is not subject to physical resource topology.
  • the impact of the creation and use efficiency of the logical partition is improved; the user does not need to perceive the difference of the PCIE switch architecture, and the logical partition operation can be directly performed on the physical resource pool, which can save the virtualization management software layer, reduce the management cost, and improve the time. Performance;
  • the dynamic logical partition management scheme is realized, and the flexible allocation and hot swapping of 10 physical resources are realized.
  • FIG. 6 is a schematic structural diagram of a management device according to an embodiment of the present invention. As shown in FIG. 6, the management device of this embodiment includes: a monitoring module 61 and a partition establishing module 62.
  • the monitoring module 61 is connected to the at least one service node device through the PCIE switch board, and is configured to acquire, by using the PCIE switch board, information about hardware resources of each service node device in the at least one service node device, and construct the information according to the obtained information.
  • the partition establishment module 62 is connected to the monitoring module 61 for establishing a logical partition on the physical resource pool constructed by the monitoring module 61.
  • the service node device in this embodiment may be various types of service node devices, for example, a 10-node, a computing node, a storage node, a memory expansion board, a fan board, a power board, and the like.
  • the hardware resources of the 10 nodes refer to 10 resources
  • the hardware resources of the computing nodes refer to CPU resources
  • the hardware resources of the storage nodes refer to various storage resources such as HDD and SSD.
  • the management device of the embodiment is connected to the service node device through the PCIE switch board, and the management device obtains the hardware resource information of each service node device through the PCIE switch board, and according to the acquired hardware. Resource information builds a physical resource pool, and then establishes logical partitions on the physical resource pool to make full use of
  • the PCIE switch board implements a physical resource pool that blocks physical topology information at the physical resource level, which improves the flexibility of using hardware resources in the physical resource pool, thereby effectively improving the performance of the entire system.
  • FIG. 7 is a schematic structural diagram of a management device according to another embodiment of the present invention. The embodiment is implemented based on the embodiment shown in FIG. 6. As shown in FIG. 7, the management device of this embodiment also includes: a monitoring module 61 and a partition establishing module 62.
  • each service node device may include a management controller in addition to hardware resources.
  • the management controller can be kept in the power-on state, and is mainly used for the management device to obtain the information of the hardware resources of the service node device under abnormal conditions. Of course, under normal circumstances, the management controller can also obtain information about the hardware resources of the service node device.
  • the monitoring module 61 of the embodiment specifically includes a first monitoring unit 611 and a construction unit 613.
  • the first monitoring unit 611 is connected to the hardware resources of each service node device through the PCIE switch board, and is configured to monitor the state of the hardware resources of each service node device through the PCIE switch board, and obtain each service node through the PCIE switch board. Information about the hardware resources of the device.
  • the PCIe switch board may include GE, FE, and/or PCIE interfaces, and the first monitoring unit 611 may connect to the hardware resources of each serving node device through any one of the GE, FE, and PCIE interfaces.
  • the building unit 613 is connected to the first monitoring unit 611, and is configured to construct the physical resource pool according to the information of the hardware resources of each service node acquired by the first monitoring unit 611.
  • the monitoring module 61 of the embodiment further includes: a second monitoring unit 612.
  • the second monitoring unit 612 is connected to the management controller of each service node device through the PCIE switch board, and is configured to use the correspondence between the first port and the second port connected to each of the stored service node devices.
  • the information of the hardware resources of each service node device is obtained on the management controller of each service node device.
  • the second monitoring unit 612 can also connect to each service node device through any one of the GE, FE, and PCIE interfaces. Management controller connection.
  • the second monitoring unit 612 can also be directly connected to the management controller of the serving node device without passing through the PCIE switchboard.
  • the building unit 613 is also connected to the second monitoring unit 612, and is also used for information of hardware resources of each serving node acquired according to the first monitoring unit 611 and/or hardware of each serving node device acquired by the second monitoring unit 612. Resource information, build a pool of physical resources.
  • the management controller of each service node device is configured to monitor the state of the hardware resources of each service node device; the first port is a port connected to the hardware resource of the service node device on the PCIE switch board, and the second port is The port to which the management node device's management controller is connected.
  • the second port may be a port on the PCIE switch board 12 if the second monitoring unit 612 is directly connected to the management controller of the service node device.
  • the second port may be a port on the management device (more specifically, the port may be a port on the second monitoring unit 612).
  • the information of the hardware resources of each service node device may be acquired by the first monitoring unit 611 through the PCIE switch board, or may be received by the second monitoring unit 612 from the management controller of each service node device.
  • the information of the hardware resource is determined according to the correspondence between the stored first port and the second port, and the hardware resources corresponding to the information reported by the management controller are determined, thereby obtaining the information of the hardware resources of each service node device.
  • All the information of the hardware resources of each service node device may be acquired by the first monitoring unit 611 through the PCIE switch board, and all the information of the hardware resources of each service node device may also be obtained by the second monitoring unit 612 from the management controller.
  • a portion of the hardware resources may be acquired by the first monitoring unit 611 through the PCIE switchboard, and another portion of the information may be acquired by the second monitoring unit 612 from the management controller.
  • the computing node of the computing node is connected to the management controller of the computing node by the first monitoring unit 611 and the second monitoring unit 612 respectively.
  • the structure is shown in Figure 3.
  • the partition establishment module 62 of the embodiment is specifically configured to present the information of the hardware resources included in the physical resource pool constructed by the monitoring module 61 to the user, and receive the selected instruction and the partitioning instruction issued by the user to establish the logical partition.
  • the selected instruction is used to indicate a hardware resource selected by the user from the physical resource pool according to the application requirement
  • the dividing instruction is used to indicate the logical partition formed by the user to allocate the selected hardware resource and the hardware resource included in the logical partition.
  • the function of the partition establishment module 62 to present the information of the hardware resources included in the physical resource pool to the user is equivalent to providing the user with a logical partition platform, so that the user performs logical partition division on the logical partition platform.
  • the management device uses the interconnection between the PCIE switch board and the service node device, the difference between the hardware resources of each service node device caused by the various interconnection modes such as the network cable and the cable is solved, and the extension of the same type of hardware resources is solved.
  • the relationship between the PCs is the same for the user. Therefore, when logical partitioning is performed, only the application requirements can be considered without considering the difference between the hardware resources. Different, it is also unnecessary to consider the physical topology of each hardware resource, which is beneficial to improve the efficiency of partitioning logical partitions.
  • the partition establishment module 62 of the embodiment is further configured to: after the logical partition is powered on, present a hardware platform corresponding to the logical partition to the user, so that the user installs the operating system.
  • the hardware platform includes hardware resources included in the logical partition.
  • the operating system can be a normal operating system or a virtual operating system.
  • the partition establishing module 62 presents the hardware resource information in each logical partition to the user, which is equivalent to presenting one hardware platform to the user, and each hardware platform is similar to the ordinary user. Server or computer. Users can then choose to install a normal operating system or virtual operating system on the hardware platform.
  • the function of the partitioning module 62 to present the hardware platform corresponding to the logical partition to the user is equivalent to providing an enhanced adaptation layer on the hardware resource (which may be referred to as the hardware layer) of the logical partition.
  • the enhanced adaptation layer here provides the role of the BIOS for the logical partition, in addition to providing open hardware platform support, various virtualization software support, and so on.
  • a structure of the logical partition constructed by the monitoring device 61 and the partition establishing module 62 of the embodiment management device is as shown in FIG.
  • the physical structure of the physical resource pool constructed by the monitoring module 61 and the partition establishing module 62 and the logical partition established on the physical resource pool in this embodiment are as shown in FIG. 5.
  • the management device of this embodiment further includes: an early warning module 63.
  • the warning module 63 is connected to the monitoring module 61, and is configured to issue the warning information when the monitoring module 61 acquires that the load of the hardware resource included in the logical partition is greater than the preset first load threshold.
  • the early warning module 63 is connected to the first monitoring unit 611 and the second monitoring unit 612, respectively.
  • the monitoring module 61 can monitor the state of the hardware resources of each service node device through the PCIE switch board, and obtain the load status of the hardware resources in the logical partition. Alternatively, the monitoring module 61 may obtain the state of the hardware resources of each service node device through the management controller of each service node device, thereby acquiring the load status of the hardware resources in the logical partition.
  • the partition establishment module 62 of the embodiment is further configured to: when the monitoring module 61 acquires that the load of the hardware resource included in the logical partition is greater than a preset first load threshold, select an idle hardware resource from the physical resource pool to add to the In the logical partition, and trigger hot swap of newly added hardware resources Process.
  • the hot plug process of newly added hardware resources will vary depending on the newly added hardware resources. For example, if the newly added hardware resource is a NIC resource, the corresponding hot plug process is mainly to add a driver for the newly added hardware resource. If the newly added hardware resource is a CPU, the corresponding hot plug process is mainly the CPU hot swap, online, offline, and the like.
  • the first load threshold may be an upper threshold in the load threshold, but is not limited thereto.
  • the partition establishment module 62 may also remove excess hardware resources from the logical partition into the physical resource pool when the load of the hardware resources included in the logical partition is less than the preset second load threshold.
  • the second load threshold may be a lower threshold in the load threshold.
  • the excess hardware resources in addition to the excess hardware resources in the logical partition to the physical resource pool, are beneficial to improve the utilization of hardware resources.
  • the lower-utilization hardware resources usually refer to hardware resources whose utilization is less than the preset utilization threshold.
  • the utilization threshold can be set to different values according to actual requirements.
  • first load threshold is greater than the second load threshold, and the specific values of the first load threshold and the second load threshold are not limited.
  • the partition establishment module 62 of the embodiment can flexibly adjust the hardware resources in the logical partition according to the information of the hardware resources, and manage the hardware such as the CPU and the CPU by acquiring the information of the hardware resources in the logical partition when the logical partition is running. Hot swapping of resources to achieve dynamic management of logical partitions.
  • dynamic management of logical partitions may be specifically implemented by an enhanced adaptation layer provided by partition creation module 62.
  • the management device in this embodiment implements interconnection with the service node device through the PCIE switch board, and implements free scheduling of hardware resources in the system by using the PCIE switch board, so that the logical partition is not affected by the physical resource topology.
  • the efficiency of creating and using logical partitions is improved; the user does not need to perceive the difference of the PCIE switching architecture, and can perform logical partitioning operations directly on the physical resource pool, thereby eliminating the virtualization management software layer, reducing management costs, and improving performance.
  • the management scheme of the dynamic logical partition is realized through the monitoring of the hardware resources, and the flexible allocation and hot plugging of the 10 physical resources are realized.
  • FIG. 8 is a flowchart of a management method according to an embodiment of the present invention. As shown in FIG. 8, the method in this embodiment includes:
  • Step 801 The management device acquires each of the at least one service node device through the PCIE switch board. Information about the hardware resources of the service node device.
  • Step 802 The management device builds a physical resource pool according to the obtained information.
  • Step 803 The management device establishes a logical partition on the foregoing physical resource pool.
  • the management device is connected to each service node device through a PCIE switch board.
  • the service node device in this embodiment may be various types of service node devices, for example, 10 nodes, a computing node, a storage node, a memory expansion board, a fan board, a power board, and the like.
  • the hardware resources of the 10 nodes refer to 10 resources
  • the hardware resources of the compute nodes refer to CPU resources
  • the hardware resources of the storage nodes refer to various storage resources such as HDD and SSD.
  • the management device is connected to the hardware resources of each service node device through a PCIE switch board.
  • the specific implementation manner of step 801 includes: the management device monitors the state of the hardware resources of each service node device through the PCIE switch board, and obtains the hardware resource information of each service node device through the PCIE switch board. Specifically, the interaction between the management device and each service node device is implemented through a PCIE switch board.
  • the management device can be connected to the management controller of each service node device through the PCIE switch board, or directly to the management controller of each service node device.
  • the management device is connected to the hardware resources of each service node device through the PCIE switch board, and is connected to the management controller of each service node device through the PCIE switch board.
  • the specific implementation manner of step 801 includes: the management device monitors the state of the hardware resources of each service node device through the PCIE switch board, acquires the hardware resource information of each service node device through the PCIE switch board, or manages the device according to the storage. Corresponding relationship between the first port and the second port of each service node device connection, and acquiring information of hardware resources of each service node device from a management controller of each service node device.
  • the management controller of each service node device is used to monitor the status of hardware resources of each service node device.
  • the first port is a port connected to the hardware resource of each service node device on the PCIE switch board
  • the second port is a port connected to the management controller of each service node device on the PCIE switch board. That is, the management device may obtain the information of the hardware resources of each service node device through the PCIE switch board, and may also receive the information of the hardware resources of each service node device that is reported from the management controller of each service node device, and according to the storage. Corresponding relationship between the first port and the second port determines the hardware resources corresponding to the information reported by the management controller, thereby obtaining the information of the hardware resources of each service node device.
  • the management device can obtain all the information about the hardware resources of each service node device through the PCIE switch board. It is also possible to obtain all the information of the hardware resources of each service node device from the management controller, or obtain a part of the hardware resources through the PCIE switch board, and obtain another part of the information from the management controller.
  • the management device is connected to the hardware resources of each service node device through the PCIE switch board, and is directly connected to the management controller of the service node device.
  • the specific implementation manner of the step 801 includes: the management device monitors the state of the hardware resources of each service node device through the PCIE switch board, acquires the hardware resource information of each service node device through the PCIE switch board, or manages the device according to the storage. Corresponding relationship between the first port and the second port of each service node device connection, and acquiring information of hardware resources of each service node device from a management controller of each service node device.
  • the management controller of each service node device is used to monitor the status of hardware resources of each service node device.
  • the first port is a port on the PCIE switch board that is connected to the hardware resource of each service node device
  • the second port is a port on the management device that is connected to the management controller of each service node device. That is, the second port can be a port on the PCIE switch board or a port on the management device.
  • the second port is a port on the PCIE switch board; when the management device is directly connected to the management controller of the service node device, but not through the PCIE switch board The second port is the port on the management device.
  • the management device may obtain the information of the hardware resources of each service node device through the PCIE switch board, and may also receive the information of the hardware resources of each service node device that is reported from the management controller of each service node device, and according to the storage. Corresponding relationship between the first port and the second port determines the hardware resources corresponding to the information reported by the management controller, thereby obtaining the information of the hardware resources of each service node device.
  • the management device can obtain all the information about the hardware resources of each service node device through the PCIE switch board, or obtain all the hardware resources of each service node device from the management controller, or obtain a part of the hardware resources through the PCIE switch board. Information, get another piece of information from the management controller.
  • the implementation manner of step 803 includes: the management device presents the information of the hardware resource included in the physical resource pool to the user, and receives the selected instruction and the partitioning instruction of the user to establish the logical partition.
  • the selected instruction is used to indicate a hardware resource selected by the user from the physical resource pool according to the application requirement
  • the dividing instruction is used to indicate the logical partition formed by the user to allocate the selected hardware resource and the hardware resource included in the logical partition. .
  • This process is equivalent to providing a logical partition platform to the user, so that the user can divide the logical partition.
  • the management device and the service node device are connected through the PCIE switch board, and the management device acquires the hardware resource information of each service node device through the PCIE switch board, and constructs the physical resource according to the acquired hardware resource information.
  • the pool, and then the logical partition is established on the physical resource pool, and the PCIE switch board is utilized to implement the physical resource pool for shielding physical topology information at the physical resource level, thereby improving the flexibility of using hardware resources in the physical resource pool, thereby effectively improving the pool.
  • the performance of the entire system is performed by the PCIE switch board.
  • FIG. 9 is a schematic structural diagram of a management method according to another embodiment of the present invention. As shown in FIG. 9, the method in this embodiment includes:
  • Step 901 In the case of standby power supply, the management device acquires information about hardware resources of each service node device.
  • the management device can monitor the state of the hardware resources of each service node device through the PCIE switch board, and interact with each service node device through the PCIE switch board to obtain information about the hardware resources of each service node device.
  • the management device may also receive, by using a management controller of each service node device, information about hardware resources of the service node device where the management controller is reported by the management controller.
  • Step 902 The management device constructs a physical resource pool according to information about hardware resources of each service node device.
  • Step 903 The management device presents the physical resource pool to the user by using a UI or the like, so that the user performs logical partition division.
  • Step 904 The user divides the logical partition according to the application requirement.
  • Step 905 After the logical partition is powered on, the enhanced adaptation layer provided by the management device takes effect, and the hardware platform corresponding to the logical partition is presented to the user.
  • the hardware platform corresponding to each logical partition may include: CPU, 10, MEM, and the like.
  • Step 906 The user installs an operating system, an intermediate layer, and an upper layer application software on the hardware platform to perform an application.
  • the management device and the service node device are connected through the PCIE switch board.
  • the management device obtains the information of the hardware resources of each service node device through the PCIE switch board, and constructs a physical resource pool according to the acquired hardware resource information, and then the physical resource.
  • a logical partition is created on the pool to fully utilize the PCIE switch board to implement a physical resource pool that blocks physical topology information at the physical resource level, that is, no Restricted by the interconnection mode in the prior art, the hardware resources in the physical resource pool can be flexibly allocated, and the logical partition can be dynamically adjusted, thereby effectively improving the performance of the entire system.
  • the management method shown in FIG. 10 further includes:
  • Step 907 The management device monitors a load condition of a hardware resource of the logical partition.
  • Step 908 The management device determines whether the load of the hardware resource included in the logical partition is greater than a preset first load threshold. If the determination result is yes, go to step 909; if the structure is negative, go back to step 907.
  • Step 909 The management device selects an idle hardware resource from the physical resource pool to be added to the logical partition, and triggers a hot plugging process of the newly added hardware resource.
  • the newly added hardware resource is of the same type as the hardware resource whose load exceeds the first load threshold. For example, if the hardware resource whose load exceeds the first load threshold is the CPU, the newly added hardware resource is also the CPU resource; The hardware resource of the load threshold is 10, and the newly added hardware resources are also 10, and so on.
  • step 908 may be: the management device sends the early warning information.
  • the first load threshold may be an upper threshold in the load threshold, but is not limited thereto.
  • the second load threshold may be preset, excess hardware resources are removed from the logical partition into the physical resource pool.
  • the second load threshold may be a lower threshold in the load threshold.
  • the excess hardware resources in addition to the excess hardware resources in the logical partition to the physical resource pool, are beneficial to improve the utilization of hardware resources.
  • the lower-utilization hardware resources usually refer to hardware resources whose utilization is less than the preset utilization threshold.
  • the utilization threshold can be set to different values according to actual requirements.
  • first load threshold is greater than the second load threshold, and the specific values of the first load threshold and the second load threshold are not limited.
  • the management method provided in this embodiment when the logical partition is running, obtains the hardware resources in the logical partition according to the information of the hardware resources, and manages the hardware resources in the logical partition according to the information of the hardware resources. Hot swapping, enabling dynamic management of logical partitions.
  • the dynamic management of logical partitions can be implemented by an enhanced adaptation layer provided by the management device.
  • the foregoing program may be stored in a computer readable storage medium, and when executed, the program includes the steps of the foregoing method embodiment; and the foregoing storage medium includes: ROM, RAM , a variety of media that can store program code, such as a disk or an optical disk.

Abstract

Embodiments of the present invention provide a server system, a management method, and a device. The system comprises a management device, a PCIE exchange plate, and at least one service node device. The management device is connected to each service node device through the PCIE exchange plate. The management device is used for acquiring hardware resource information of each service node device through the PCIE exchange plate, building a physical resource pool according to the acquired information, and establishing a logical partition on the physical resource pool. In the technical solutions of the present invention, the management device and the service node device are connected through the PCIE exchange plate, thereby improving usage flexibility of hardware resources in the server system, and further improving performance of the whole system.

Description

服务器系统、 管理方法及设备 技术领域 本发明涉及虚拟化技术, 尤其涉及一种服务器系统、 管理方法及设备。 背景技术  The present invention relates to virtualization technologies, and in particular, to a server system, a management method, and a device. Background technique
在服务器技术发展中, 逻辑分区概念的提出有效的提高了服务器硬件资 源的利用率。 目前, 服务器系统一般都是由多个服务节点设备互联组合形成, 服务节点设备中的输入 /输出 (Input/Output, 10 )资源之间采用传统的网线或 电缆互联, 服务节点设备中的中央处理单元( Central Processing Unit, CPU ) 之间采用快速通道互联 ( QuickPath Interconnect, QPI ) 线缆以及专用芯片互 联。 服务器管理系统在服务器系统的硬件资源上划分逻辑分区提供给业务使 用。  In the development of server technology, the concept of logical partitioning effectively improves the utilization of server hardware resources. At present, the server system is generally formed by a plurality of service node device interconnection and combination, and the input/output (10) resources in the service node device are interconnected by a traditional network cable or cable, and the central processing in the service node device The Quick Processing Interconnect (QPI) cable and the dedicated chip are interconnected between the Central Processing Units (CPUs). The server management system divides the logical partitions on the hardware resources of the server system for service use.
但是, 现有服务器系统中硬件资源的使用不够灵活, 使得服务器系统的 性能无法进一步提高。 发明内容  However, the use of hardware resources in existing server systems is not flexible enough to further improve the performance of the server system. Summary of the invention
本发明实施例提供一种服务器系统、 管理方法及设备, 用以提高服务器 系统中硬件资源使用的灵活性, 进一步提高服务器系统的性能。  The embodiment of the invention provides a server system, a management method and a device, which are used to improve the flexibility of using hardware resources in a server system, and further improve the performance of the server system.
根据本发明实施例的第一方面提供一种服务器系统, 包括: 管理设备、 高速外设组件互连 PCIE交换板和至少一个服务节点设备;所述管理设备通过 所述 PCIE交换板与每个服务节点设备连接;  A first aspect of the present invention provides a server system, including: a management device, a high-speed peripheral component interconnect PCIE switch board, and at least one service node device; the management device through the PCIE switch board and each service Node device connection;
所述管理设备,用于通过所述 PCIE交换板获取所述每个服务节点设备的 硬件资源的信息, 根据获取的所述信息构建物理资源池, 并在所述物理资源 池上建立逻辑分区。  The management device is configured to acquire information about hardware resources of each service node device by using the PCIE switch board, construct a physical resource pool according to the obtained information, and establish a logical partition on the physical resource pool.
在所述第一方面的一可选实施方式中, 所述管理设备具体用于将所述物 理资源池包括的硬件资源的信息呈现给用户, 接收所述用户的选中指令和划 分指令, 以建立所述逻辑分区, 所述选中指令用于指示所述用户根据应用需 求从所述物理资源池中选择的硬件资源, 所述划分指令用于指示所述用户对 所述选择的硬件资源进行分配形成的所述逻辑分区和所述逻辑分区包括的硬 件资源。 In an optional implementation manner of the first aspect, the management device is specifically configured to: present information about a hardware resource included in the physical resource pool to a user, and receive a selected instruction and a division instruction of the user to establish The logical partition, the selected instruction is used to indicate a hardware resource selected by the user from the physical resource pool according to an application requirement, and the dividing instruction is used to indicate the user pair The selected hardware resources are allocated to form the logical partition and the hardware resources included in the logical partition.
在所述第一方面的一可选实施方式中, 所述管理设备还用于在所述逻辑 户安装操作系统, 所述硬件平台包括所述逻辑分区包括的硬件资源。  In an optional implementation manner of the first aspect, the management device is further configured to install an operating system in the logical user, where the hardware platform includes hardware resources included in the logical partition.
在所述第一方面的一可选实施方式中, 所述管理设备还用于在所述逻辑 分区包括的硬件资源的负载大于预设第一负载门限时, 发出预警信息。  In an optional implementation manner of the first aspect, the management device is further configured to: send an early warning message when a load of the hardware resource included in the logical partition is greater than a preset first load threshold.
在所述第一方面的一可选实施方式中, 所述管理设备还用于在所述逻辑 分区包括的硬件资源的负载大于预设第一负载门限时, 从所述物理资源池中 选择空闲的硬件资源添加到所述逻辑分区中, 并触发所述新添加的硬件资源 的热插拔流程; 或者 二负载门限时,从所述逻辑分区中移除过剩的硬件资源到所述物理资源池中 , 所述过剩的硬件资源是在所述逻辑分区中没有被利用或者利用率小于预设利 用率门限的硬件资源。  In an optional implementation manner of the first aspect, the management device is further configured to: select, from the physical resource pool, when the load of the hardware resource included in the logical partition is greater than a preset first load threshold. Adding hardware resources to the logical partition and triggering a hot plug process of the newly added hardware resource; or when loading a threshold, removing excess hardware resources from the logical partition to the physical resource pool The excess hardware resource is a hardware resource that is not utilized in the logical partition or whose utilization is less than a preset utilization threshold.
在所述第一方面的一可选实施方式中,所述管理设备具体通过所述 PCIE 交换板与所述每个服务节点设备的硬件资源连接;  In an optional implementation manner of the first aspect, the management device is specifically connected to the hardware resources of each service node device by using the PCIE switch board;
所述管理设备具体用于通过所述 PCIE 交换板监控所述每个服务节点设 备的硬件资源的状态,通过所述 PCIE交换板获取所述每个服务节点设备的硬 件资源的信息。  The management device is configured to monitor, by using the PCIE switch board, the status of the hardware resources of each service node device, and obtain information about the hardware resources of each service node device by using the PCIE switch board.
在所述第一方面的一可选实施方式中,所述管理设备具体通过所述 PCIE 交换板与所述每个服务节点设备的硬件资源连接,并通过所述 PCIE交换板与 所述每个服务节点设备的管理控制器连接;  In an optional implementation manner of the first aspect, the management device is specifically connected to the hardware resources of each service node device by using the PCIE switch board, and through the PCIE switch board and each of the The management controller of the service node device is connected;
所述管理设备具体用于通过所述 PCIE 交换板监控所述每个服务节点设 备的硬件资源的状态,通过所述 PCIE交换板获取所述每个服务节点设备的硬 件资源的信息, 或者 口和第二端口之间的对应关系, 从所述每个服务节点设备的管理控制器获取 所述每个服务节点设备的硬件资源的信息;  The management device is configured to monitor, by using the PCIE switch board, a state of a hardware resource of each service node device, and obtain, by using the PCIE switch board, information about hardware resources of each service node device, or Corresponding relationship between the second ports, acquiring information of the hardware resources of each of the service node devices from the management controller of each service node device;
其中, 所述每个服务节点设备的管理控制器用于监控所述每个服务节点 设备的硬件资源的状态;第一端口为所述 PCIE交换板上与所述每个服务节点 设备的硬件资源连接的端口,第二端口为所述 PCIE交换板上与所述每个服务 节点设备的管理控制器连接的端口。 The management controller of each service node device is configured to monitor each of the service nodes. The state of the hardware resource of the device; the first port is a port connected to the hardware resource of each service node device on the PCIE switch board, and the second port is the PCIE switch board and each service node device The port to which the management controller is connected.
在所述第一方面的一可选实施方式中, 所述管理设备具体通过所述 PCIE 交换板与所述每个服务节点设备的硬件资源连接, 并直接与所述每个服务节 点设备的管理控制器连接;  In an optional implementation manner of the first aspect, the management device is specifically connected to the hardware resources of each service node device by using the PCIE switch board, and directly managed with each service node device. Controller connection
所述管理设备具体用于通过所述 PCIE 交换板监控所述每个服务节点设 备的硬件资源的状态,通过所述 PCIE交换板获取所述每个服务节点设备的硬 件资源的信息; 或者 口和第二端口之间的对应关系, 从所述每个服务节点设备的管理控制器获取 所述每个服务节点设备的硬件资源的信息;  The management device is configured to monitor, by using the PCIE switch board, a state of a hardware resource of each service node device, and obtain, by using the PCIE switch board, information about hardware resources of each service node device; Corresponding relationship between the second ports, acquiring information of the hardware resources of each of the service node devices from the management controller of each service node device;
其中, 所述每个服务节点设备的管理控制器用于监控所述每个服务节点 设备的硬件资源的状态;第一端口为所述 PCIE交换板上与所述每个服务节点 设备的硬件资源连接的端口, 第二端口为所述管理设备上与所述每个服务节 点设备的管理控制器连接的端口。 。  The management controller of each of the service node devices is configured to monitor a state of the hardware resources of each of the service node devices; the first port is a hardware resource connection between the PCIE switch board and each of the service node devices. Port, the second port is a port on the management device that is connected to the management controller of each of the service node devices. .
根据本发明实施例的第二方面提供一种管理设备, 包括:  A second aspect of the present invention provides a management device, including:
监控模块,通过高速外设组件互连 PCIE交换板与至少一个服务节点设备 连接,用于通过所述 PCIE交换板获取所述至少一个服务节点设备中的每个服 务节点设备的硬件资源的信息, 根据获取的所述信息构建物理资源池;  a monitoring module, connected to the at least one service node device by the high-speed peripheral component interconnection PCIE switch board, configured to acquire, by using the PCIE switch board, information about hardware resources of each service node device of the at least one service node device, Constructing a physical resource pool according to the obtained information;
分区建立模块, 用于在所述物理资源池上建立逻辑分区。  A partition establishment module is configured to establish a logical partition on the physical resource pool.
在所述第二方面的一可选实施方式中, 所述分区建立模块具体用于将所 述物理资源池包括的硬件资源的信息呈现给用户, 并接收所述用户的选中指 令和划分指令, 以建立所述逻辑分区, 所述选中指令用于指示所述用户根据 应用需求从所述物理资源池中选择的硬件资源, 所述划分指令用于指示所述 用户对所述选择的硬件资源进行分配形成的所述逻辑分区和所述逻辑分区包 括的硬件资源。  In an optional implementation manner of the second aspect, the partition establishment module is specifically configured to: present information of a hardware resource included in the physical resource pool to a user, and receive a selected instruction and a division instruction of the user, The dividing instruction is used to indicate that the user selects a hardware resource from the physical resource pool according to an application requirement, and the dividing instruction is used to instruct the user to perform the selected hardware resource. Allocating the formed logical partition and the hardware resources included in the logical partition.
在所述第二方面的一可选实施方式中, 所述分区建立模块还用于在所述 述用户安装操作系统, 所述硬件平台包括所述逻辑分区包括的硬件资源。 在所述第二方面的一可选实施方式中, 所述管理设备还包括: 预警模块, 用于在所述监控模块获取到所述逻辑分区包括的硬件资源的负载大于预设第 一负载门限时, 发出预警信息。 In an optional implementation manner of the second aspect, the partition establishment module is further configured to install an operating system in the user, where the hardware platform includes hardware resources included in the logical partition. In an optional implementation of the second aspect, the management device further includes: an early warning module, configured to acquire, by the monitoring module, that a load of the hardware resource included in the logical partition is greater than a preset first load gate For a limited time, an early warning message is issued.
在所述第二方面的一可选实施方式中, 所述分区建立模块还用于在所述 监控模块获取到所述逻辑分区包括的硬件资源的负载大于预设第一负载门限 时, 从所述物理资源池中选择空闲的硬件资源添加到所述逻辑分区中, 并触 发所述新添加的硬件资源的热插拔流程; 或者  In an optional implementation of the second aspect, the partition establishment module is further configured to: when the monitoring module acquires that the load of the hardware resource included in the logical partition is greater than a preset first load threshold, Adding idle hardware resources in the physical resource pool to the logical partition, and triggering a hot plug process of the newly added hardware resources; or
所述分区建立模块还用于在所述监控模块获取到所述逻辑分区包括的硬 件资源的负载小于预设第二负载门限时, 从所述逻辑分区中移除过剩的硬件 资源到所述物理资源池中, 所述过剩的硬件资源是在所述逻辑分区中没有被 利用或者利用率小于预设利用率门限的硬件资源。  The partition establishment module is further configured to: when the monitoring module acquires that the load of the hardware resource included in the logical partition is less than a preset second load threshold, remove excess hardware resources from the logical partition to the physical In the resource pool, the excess hardware resources are hardware resources that are not utilized in the logical partition or whose utilization is less than a preset utilization threshold.
在所述第二方面的一可选实施方式中, 所述监控模块包括:  In an optional implementation manner of the second aspect, the monitoring module includes:
第一监控单元,通过所述 PCIE交换板与所述每个服务节点设备的硬件资 源连接,用于通过所述 PCIE交换板监控所述每个服务节点设备的硬件资源的 状态, 通过所述 PCIE交换板获取所述每个服务节点设备的硬件资源的信息; 构建单元, 用于根据所述第一监控单元获取的所述每个服务节点的硬件 资源的信息, 构建所述物理资源池。  a first monitoring unit, configured to connect, by using the PCIE switch board, hardware resources of each of the service node devices, to monitor, by using the PCIE switch board, a status of hardware resources of each service node device, by using the PCIE The switch board acquires information about the hardware resources of each of the service node devices; and the constructing unit is configured to construct the physical resource pool according to the information about the hardware resources of each service node acquired by the first monitoring unit.
在所述第二方面的一可选实施方式中, 所述监控模块还包括:  In an optional implementation manner of the second aspect, the monitoring module further includes:
第二监控单元,通过所述 PCIE交换板或直接与所述每个服务节点设备的 管理控制器连接, 用于根据存储的所述每个服务节点设备连接的第一端口和 第二端口之间的对应关系, 从所述每个服务节点设备的管理控制器上获取所 述每个服务节点设备的硬件资源的信息;  a second monitoring unit, connected by the PCIE switch board or directly to the management controller of each service node device, between the first port and the second port connected according to the stored each service node device Corresponding relationship, obtaining information about hardware resources of each service node device from a management controller of each service node device;
所述构建单元, 还用于根据所述第一监控单元获取的所述每个服务节点 的硬件资源的信息和 /或所述第二监控单元获取的所述每个服务节点设备的 硬件资源的信息, 构建所述物理资源池;  The building unit is further configured to: according to the information about the hardware resource of each serving node acquired by the first monitoring unit, and/or the hardware resource of each serving node device acquired by the second monitoring unit Information, constructing the physical resource pool;
其中, 所述每个服务节点设备的管理控制器用于监控所述每个服务节点 设备的硬件资源的状态;第一端口为所述 PCIE交换板上与所述每个服务节点 设备的硬件资源连接的端口,第二端口为所述管理设备上或所述 PCIE交换板 上与所述每个服务节点设备的管理控制器连接的端口。  The management controller of each of the service node devices is configured to monitor a state of the hardware resources of each of the service node devices; the first port is a hardware resource connection between the PCIE switch board and each of the service node devices. Port, the second port is a port connected to the management controller of each serving node device on the management device or on the PCIE switch board.
根据本发明实施例的第三方面提供一种管理方法, 包括: 管理设备通过高速外设组件互连 PCIE 交换板获取至少一个服务节点设 备中的每个服务节点设备的硬件资源的信息; A third aspect of the embodiments of the present invention provides a management method, including: The management device interconnects the PCIE switch board through the high-speed peripheral component to acquire information of hardware resources of each of the at least one service node device;
所述管理设备根据获取的所述信息, 构建物理资源池;  The management device constructs a physical resource pool according to the obtained information;
所述管理设备在所述物理资源池上建立逻辑分区;  The management device establishes a logical partition on the physical resource pool;
其中, 所述管理设备通过所述 PCIE 交换板与所述每个服务节点设备连 接。  The management device is connected to each of the service node devices through the PCIE switch board.
在所述第三方面的一可选实施方式中, 所述管理设备在所述物理资源池 上建立逻辑分区包括: 收所述用户的选中指令和划分指令, 以建立所述逻辑分区, 所述选中指令用 于指示所述用户根据应用需求从所述物理资源池中选择的硬件资源, 所述划 分指令用于指示所述用户对所述选择的硬件资源进行分配形成的所述逻辑分 区和所述逻辑分区包括的硬件资源。  In an optional implementation manner of the third aspect, the establishing, by the management device, the logical partition on the physical resource pool includes: receiving a selected instruction and a dividing instruction of the user, to establish the logical partition, and selecting the The instruction is used to instruct the user to select a hardware resource from the physical resource pool according to an application requirement, where the dividing instruction is used to indicate the logical partition formed by the user to allocate the selected hardware resource, and the The hardware resources included in the logical partition.
在所述第三方面的一可选实施方式中, 所述方法还包括: 所述管理设备 以使所述用户安装操作系统, 所述硬件平台包括所述逻辑分区包括的硬件资 源。  In an optional implementation manner of the third aspect, the method further includes: the management device, to enable the user to install an operating system, where the hardware platform includes hardware resources included in the logical partition.
在所述第三方面的一可选实施方式中, 所述方法还包括: 所述管理设备 在所述逻辑分区包括的硬件资源的负载大于预设第一负载门限时, 发出预警 信息。  In an optional implementation manner of the third aspect, the method further includes: the management device sends an early warning message when a load of the hardware resource included in the logical partition is greater than a preset first load threshold.
在所述第三方面的一可选实施方式中, 所述方法还包括: 所述管理设备 在所述逻辑分区包括的硬件资源的负载大于预设第一负载门限时, 从所述物 理资源池中选择空闲的硬件资源添加到所述逻辑分区中, 并触发所述新添加 的硬件资源的热插拔流程; 或者 门限时, 从所述逻辑分区中移除过剩的硬件资源到所述物理资源池中, 所述 过剩的硬件资源是在所述逻辑分区中没有被利用或者利用率小于预设利用率 门限的硬件资源。  In an optional implementation manner of the third aspect, the method further includes: the management device, when the load of the hardware resource included in the logical partition is greater than a preset first load threshold, from the physical resource pool Adding idle hardware resources to the logical partition and triggering a hot plug process of the newly added hardware resource; or threshold, removing excess hardware resources from the logical partition to the physical resource In the pool, the excess hardware resources are hardware resources that are not utilized in the logical partition or whose utilization is less than a preset utilization threshold.
在所述第三方面的一可选实施方式中 ,所述管理设备通过所述 PCIE交换 板与所述每个服务节点设备的硬件资源连接; 所述管理设备通过高速外设组件互连 PCIE 交换板获取至少一个服务节 点设备中的每个服务节点设备的硬件资源的信息包括: In an optional implementation manner of the third aspect, the management device is connected to the hardware resources of each service node device by using the PCIE switch board; The information that the management device obtains the hardware resources of each of the at least one service node device by interconnecting the PCIE switch board by the high-speed peripheral component includes:
所述管理设备通过所述 PCIE 交换板监控所述每个服务节点设备的硬件 资源的状态,通过所述 PCIE交换板获取所述每个服务节点设备的硬件资源的 信息。  The management device monitors the status of the hardware resources of each of the service node devices through the PCIE switch board, and obtains information about the hardware resources of each service node device through the PCIE switch board.
在所述第三方面的一可选实施方式中 ,所述管理设备通过所述 PCIE交换 板与所述每个服务节点设备的硬件资源连接,并通过所述 PCIE交换板与所述 每个服务节点设备的管理控制器连接;  In an optional implementation manner of the third aspect, the management device is connected to the hardware resources of each service node device by using the PCIE switch board, and through the PCIE switch board and each service The management device of the node device is connected;
所述管理设备通过高速外设组件互连 PCIE 交换板获取至少一个服务节 点设备中的每个服务节点设备的硬件资源的信息包括:  The management device accessing the PCIE switch board through the high-speed peripheral component to obtain the hardware resources of each of the service node devices of the at least one service node device includes:
所述管理设备通过所述 PCIE 交换板监控所述每个服务节点设备的硬件 资源的状态,通过所述 PCIE交换板获取所述每个服务节点设备的硬件资源的 信息; 或者  The management device monitors, by using the PCIE switch board, the status of the hardware resources of each service node device, and obtains, by using the PCIE switch board, information about hardware resources of each service node device; or
所述管理设备根据存储的所述每个服务节点设备连接的第一端口和第二 端口之间的对应关系, 从所述每个服务节点设备的管理控制器获取所述每个 服务节点设备的硬件资源的信息;  The management device acquires, according to the stored correspondence between the first port and the second port of each service node device, the management controller of each service node device Information about hardware resources;
其中, 所述每个服务节点设备的管理控制器用于监控所述每个服务节点 设备的硬件资源的状态;第一端口为所述 PCIE交换板上与所述每个服务节点 设备的硬件资源连接的端口,第二端口为所述 PCIE上与所述每个服务节点设 备的管理控制器连接的端口。  The management controller of each of the service node devices is configured to monitor a state of the hardware resources of each of the service node devices; the first port is a hardware resource connection between the PCIE switch board and each of the service node devices. Port, the second port is a port on the PCIE that is connected to the management controller of each of the service node devices.
在所述第三方面的一可选实施方式中 ,所述管理设备通过所述 PCIE交换 板与所述每个服务节点设备的硬件资源连接, 并直接与所述每个服务节点设 备的管理控制器连接;  In an optional implementation manner of the third aspect, the management device is connected to the hardware resources of each service node device by using the PCIE switch board, and directly controls the management of each service node device. Connection
所述管理设备通过高速外设组件互连 PCIE 交换板获取至少一个服务节 点设备中的每个服务节点设备的硬件资源的信息包括:  The management device accessing the PCIE switch board through the high-speed peripheral component to obtain the hardware resources of each of the service node devices of the at least one service node device includes:
所述管理设备通过所述 PCIE 交换板监控所述每个服务节点设备的硬件 资源的状态,通过所述 PCIE交换板获取所述每个服务节点设备的硬件资源的 信息, 或者  The management device monitors, by using the PCIE switch board, a state of hardware resources of each service node device, and obtains, by using the PCIE switch board, information about hardware resources of each service node device, or
所述管理设备根据存储的所述每个服务节点设备连接的第一端口和第二 端口之间的对应关系, 从所述每个服务节点设备的管理控制器获取所述每个 服务节点设备的硬件资源的信息; The management device acquires each of the service port devices from each of the service node devices according to the stored correspondence between the first port and the second port of each of the service node devices Information about the hardware resources of the service node device;
其中, 所述每个服务节点设备的管理控制器用于监控所述每个服务节点 设备的硬件资源的状态;第一端口为所述 PCIE交换板上与所述每个服务节点 设备的硬件资源连接的端口, 第二端口为所述管理设备上与所述每个服务节 点设备的管理控制器连接的端口。  The management controller of each of the service node devices is configured to monitor a state of the hardware resources of each of the service node devices; the first port is a hardware resource connection between the PCIE switch board and each of the service node devices. Port, the second port is a port on the management device that is connected to the management controller of each of the service node devices.
本发明实施例第一方面提供的服务器系统, 管理设备与服务节点设备通 过 PCIE交换板连接, 管理设备通过 PCIE交换板获取各个服务节点设备的硬 件资源的信息, 并根据获取的硬件资源的信息构建物理资源池, 进而在物理 资源池上建立逻辑分区,充分利用 PCIE交换板在物理资源层面实现了屏蔽物 理拓朴信息的物理资源池, 提高了物理资源池中的硬件资源使用的灵活性, 从而有效提高了整个系统的性能。  In the server system provided by the first aspect of the present invention, the management device and the service node device are connected through a PCIE switch board, and the management device acquires information about hardware resources of each service node device through the PCIE switch board, and builds the information according to the obtained hardware resource information. The physical resource pool is used to establish a logical partition on the physical resource pool. The physical resource pool that shields the physical topology information is implemented at the physical resource level by using the PCIE switch board, which improves the flexibility of the use of hardware resources in the physical resource pool. Improve the performance of the entire system.
本发明实施例第二方面提供的管理设备,与服务节点设备通过 PCIE交换 板连接,通过 PCIE交换板获取各个服务节点设备的硬件资源的信息,并根据 获取的硬件资源的信息构建物理资源池,进而在物理资源池上建立逻辑分区 , 充分利用 PCIE 交换板在物理资源层面实现了屏蔽物理拓朴信息的物理资源 池, 提高了物理资源池中的硬件资源使用的灵活性, 从而有效提高了整个系 统的性能。  The management device provided by the second aspect of the present invention is connected to the service node device through the PCIE switch board, obtains the hardware resource information of each service node device through the PCIE switch board, and constructs a physical resource pool according to the acquired hardware resource information. In addition, a logical partition is established on the physical resource pool, and the physical resource pool for shielding physical topology information is implemented at the physical resource level by using the PCIE switch board, thereby improving the flexibility of using hardware resources in the physical resource pool, thereby effectively improving the entire system. Performance.
本发明实施例第三方面提供的管理方法, 管理设备通过高速外设组件互 连 PCIE交换板获取各个服务节点设备的硬件资源的信息,并根据获取的硬件 资源的信息构建物理资源池, 进而在物理资源池上建立逻辑分区, 充分利用 PCIE交换板在物理资源层面实现了屏蔽物理拓朴信息的物理资源池,提高了 物理资源池中的硬件资源使用的灵活性, 从而有效提高了整个系统的性能。 附图说明 为了更清楚地说明本发明实施例或现有技术中的技术方案, 下面将对实 施例或现有技术描述中所需要使用的附图作一简单地介绍, 显而易见地, 下 面描述中的附图是本发明的一些实施例, 对于本领域普通技术人员来讲, 在 不付出创造性劳动性的前提下, 还可以根据这些附图获得其他的附图。  According to the management method provided by the third aspect of the present invention, the management device obtains the information of the hardware resources of each service node device by interconnecting the PCIE switch board with the high-speed peripheral component, and constructs a physical resource pool according to the acquired hardware resource information, and then A logical partition is created on the physical resource pool to fully utilize the PCIE switch board to implement physical resource pools that shield physical topology information at the physical resource level, thereby improving the flexibility of using hardware resources in the physical resource pool, thereby effectively improving the performance of the entire system. . BRIEF DESCRIPTION OF THE DRAWINGS In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, a brief description of the drawings used in the embodiments or the prior art description will be briefly described below. The drawings are some embodiments of the present invention, and those skilled in the art can obtain other drawings based on these drawings without any inventive labor.
图 1为本发明一实施例提供的服务器系统的结构示意图;  1 is a schematic structural diagram of a server system according to an embodiment of the present invention;
图 2为本发明一实施例给出的一种服务器系统的具体结构的示意图; 图 3为本发明一实施例提供的管理设备通过 PCIE交换板与计算节点的 CPU和管理控制器分别连接的结构示意图; 2 is a schematic diagram of a specific structure of a server system according to an embodiment of the present invention; 3 is a schematic structural diagram of a management device connected to a CPU and a management controller of a computing node through a PCIE switch board according to an embodiment of the present invention;
图 4为本发明一实施例提供的逻辑分区的一种结构示意图;  4 is a schematic structural diagram of a logical partition according to an embodiment of the present invention;
图 5为本发明一实施例提供的在物理资源池以及在物理资源池上构建的 逻辑分区的整体结构示意图;  FIG. 5 is a schematic diagram of an overall structure of a logical partition built on a physical resource pool and a physical resource pool according to an embodiment of the present invention;
图 6为本发明一实施例提供的管理设备的结构示意图;  FIG. 6 is a schematic structural diagram of a management device according to an embodiment of the present invention;
图 7为本发明另一实施例提供的管理设备的结构示意图;  FIG. 7 is a schematic structural diagram of a management device according to another embodiment of the present invention;
图 8为本发明一实施例提供的管理方法的流程图;  FIG. 8 is a flowchart of a management method according to an embodiment of the present invention;
图 9为本发明另一实施例提供的管理方法的结构示意图;  FIG. 9 is a schematic structural diagram of a management method according to another embodiment of the present invention;
图 10为本发明又一实施例提供的管理方法的结构示意图。 具体实施方式 为使本发明实施例的目的、 技术方案和优点更加清楚, 下面将结合本发 明实施例中的附图, 对本发明实施例中的技术方案进行清楚、 完整地描述, 显然, 所描述的实施例是本发明一部分实施例, 而不是全部的实施例。 基于 本发明中的实施例, 本领域普通技术人员在没有作出创造性劳动前提下所获 得的所有其他实施例, 都属于本发明保护的范围。  FIG. 10 is a schematic structural diagram of a management method according to another embodiment of the present invention. The technical solutions in the embodiments of the present invention are clearly and completely described in the following with reference to the accompanying drawings in the embodiments of the present invention. The embodiments are a part of the embodiments of the invention, and not all of the embodiments. All other embodiments obtained by those skilled in the art based on the embodiments of the present invention without creative efforts are within the scope of the present invention.
图 1为本发明一实施例提供的服务器系统的结构示意图。 如图 1所示, 本实施例的系统包括:管理设备 11、高速外设组件互连(Peripheral Component Interconnect Express, PCIE ) 交换板 12和至少一个服务节点设备 13。 其中, 管理设备 11通过 PCIE交换板 12与每个服务节点设备 13连接。 PCIE交换板 12具有高带宽、 低延迟等特性与优点。  FIG. 1 is a schematic structural diagram of a server system according to an embodiment of the present invention. As shown in FIG. 1, the system of this embodiment includes a management device 11, a Peripheral Component Interconnect Express (PCIE) switchboard 12, and at least one service node device 13. The management device 11 is connected to each service node device 13 through the PCIE switch board 12. The PCIE switchboard 12 has features and advantages such as high bandwidth and low latency.
其中, 本实施例中的服务节点设备 13可以是各种类型的服务节点设备, 例如, 10节点, 计算节点, 存储节点, 内存扩展板, 风扇板, 电源板等模块。  The service node device 13 in this embodiment may be various types of service node devices, for example, a 10-node, a computing node, a storage node, a memory expansion board, a fan board, a power board, and the like.
管理设备 11 , 用于通过 PCIE交换板 12获取每个服务节点设备 13的硬 件资源的信息, 根据获取的每个服务节点设备的硬件资源的信息构建物理资 源池, 并在所述物理资源池上建立逻辑分区。 本文中的硬件资源指的是构成 服务器系统的实际硬件, 更具体地, 硬件资源可以是例如一颗 CPU, —张内 存卡、 一块硬盘等, 可以由服务器系统实现对计算节点、 存储节点、 10节点 中的硬件资源的池化管理,形成物理资源池,资源池内部可以细分为例如 CPU 池、 内存池、 10资源池、 网卡池等。 其中, 每个服务节点设备 13 的硬件资 源的信息包括例如网卡、 CPU、 内存、 存储设备的在位信息、 故障信息、 负 载、 温度等硬件资源的状态信息, 进一步还可以包括例如内存大小、 CPU频 率、 生产厂家等硬件资源的设备属性信息。 The management device 11 is configured to acquire information about hardware resources of each service node device 13 through the PCIE switch board 12, construct a physical resource pool according to the obtained information of hardware resources of each service node device, and establish a physical resource pool on the physical resource pool. Logical partition. The hardware resources in this document refer to the actual hardware constituting the server system. More specifically, the hardware resources may be, for example, a CPU, a memory card, a hard disk, etc., which may be implemented by a server system to a computing node, a storage node, and 10 Pool management of hardware resources in a node to form a physical resource pool, which can be subdivided into, for example, a CPU Pool, memory pool, 10 resource pool, network card pool, etc. The information about the hardware resources of each service node device 13 includes status information of hardware resources such as network card, CPU, memory, storage device in-position information, fault information, load, temperature, etc., and may further include, for example, a memory size, a CPU. Device attribute information of hardware resources such as frequency and manufacturer.
具体的, 管理设备 11可以通过 PCIE交换板 12获取每个服务节点设备 Specifically, the management device 11 can obtain each service node device through the PCIE switch board 12.
13的硬件资源的信息, 并根据硬件资源的信息中包含的硬件资源的容量、 型 号等属性信息以及状态信息, 用其中可用的硬件资源构成物理资源池, 然后 在所构建的物理资源池上建立逻辑分区。 13 hardware resource information, and according to the hardware resource capacity, model and other attribute information and status information contained in the hardware resource information, use the available hardware resources to form a physical resource pool, and then establish logic on the constructed physical resource pool Partition.
在本实施例中,管理设备与服务节点设备通过 PCIE交换板连接,管理设 备通过 PCIE交换板获取各个服务节点设备的硬件资源的信息,并根据获取的 硬件资源的信息构建物理资源池, 进而在物理资源池上建立逻辑分区, 充分 利用 PCIE交换板在物理资源层面实现了屏蔽物理拓朴信息的物理资源池,提 高了物理资源池中的硬件资源使用的灵活性, 使逻辑分区动态可调, 从而有 效提高了整个系统的性能。  In this embodiment, the management device and the service node device are connected through the PCIE switch board, and the management device acquires the hardware resource information of each service node device through the PCIE switch board, and builds a physical resource pool according to the acquired hardware resource information, and then Logical partitions are created on the physical resource pool to fully utilize the PCIE switch board to implement physical resource pools that shield physical topology information at the physical resource level, thereby improving the flexibility of using hardware resources in the physical resource pool and dynamically adjusting the logical partitions. Effectively improve the performance of the entire system.
进一步, 本实施例中的 10节点的硬件资源是指 10资源, 计算节点的硬 件资源是指 CPU 资源, 存储节点的硬件资源是指硬盘驱动器 (Hard Disk Drive, HDD ) 、 固态硬盘 ( Solid State Disk, SSD )等各种存储资源。 可选 的, 每个服务节点设备 13除了包括上述硬件资源之外, 还可以包括管理控制 器。 管理控制器可以一直保持上电状态, 主要用于在异常情况下, 供管理设 备 11获取服务节点设备 13的硬件资源的信息, 例如, 服务节点设备 13中的 某个硬件资源断电导致管理设备 11无法从硬件资源获取该硬件资源的信息, 则管理设备 11可以通过该服务节点设备 13的管理控制器获取该硬件资源的 信息, 另外在正常情况下, 也可以通过管理控制器获取服务节点设备 13的硬 件资源的信息。 可选的, 管理控制器可以是基板管理控制器 (Baseboard Management Controller, BMC ) , 还可以是实现类 BMC功能的逻辑器件。  Further, the hardware resources of the 10 nodes in the embodiment refer to 10 resources, and the hardware resources of the computing nodes refer to CPU resources, and the hardware resources of the storage nodes refer to hard disk drives (HDDs) and solid state disks (Solid State Disk). , SSD) and other storage resources. Optionally, each service node device 13 may include a management controller in addition to the foregoing hardware resources. The management controller can be kept in a power-on state, and is mainly used to obtain information about the hardware resources of the service node device 13 for the management device 11 in an abnormal situation. For example, a certain hardware resource in the service node device 13 is powered off to cause the management device. If the information of the hardware resource cannot be obtained from the hardware resource, the management device 11 can obtain the information of the hardware resource through the management controller of the service node device 13, and in other cases, obtain the service node device through the management controller. 13 information on hardware resources. Optionally, the management controller may be a Baseboard Management Controller (BMC) or a logic device that implements a BMC-like function.
在本实施例提供的服务器系统中, 各种类型的服务节点设备根据需要可 以部署一个或者多个, 计算节点、 存储节点等通常为多个, 可以根据需要进 行服务器系统的扩展。  In the server system provided in this embodiment, various types of service node devices may be deployed one or more as needed, and there are usually a plurality of computing nodes, storage nodes, and the like, and the server system may be expanded as needed.
PCIE交换板 12上可以包括千 M以太网 ( Gigabit Ethernet, GE )接口、 快速以太网 ( Fast Ethernet, FE )接口和 /或 PCIE接口, 管理设备 11可以通 过 GE接口、 FE接口和 PCIE接口中的至少一种与每个服务节点设备 13连接。 在此说明,上述 GE接口、 FE接口和 PCIE接口在具体实现上可以在一块 PCIE 交换板 12上, 也可以是在多块 PCIE交换板 12上。 The PCIE switch board 12 may include a Gigabit Ethernet (GE) interface, a Fast Ethernet (FE) interface, and/or a PCIE interface, and the management device 11 can pass. At least one of a GE interface, a FE interface, and a PCIE interface is connected to each of the service node devices 13. The GE interface, the FE interface, and the PCIE interface may be implemented on a PCIE switch board 12 or on multiple PCIE switch boards 12 in a specific implementation.
基于上述, 图 2给出一种服务器系统的具体结构。 在图 2中, 管理设备 11通过 PCIE交换板 12上的 PCIE接口与 10节点 131、 计算节点 132、 存储 节点 133等服务节点设备的硬件资源连接, 管理设备 11通过 PCIE交换板 12 上的 GE接口与 10节点 131、 计算节点 132、 存储节点 133等的管理控制器 连接。 在图 2中, 整个服务器系统中的风扇板 14等共用模块直接通过其管理 控制器与管理设备 11连接, 但不限于此, 风扇板 14之外的其他服务节点设 备, 例如 10节点 131、 计算节点 132、 存储节点 133等的管理控制器也可以 直接与管理设备 11连接。 也就是说, 服务节点设备的管理控制器可以直接与 管理设备 11连接, 而不用通过 PCIE交换板 12, 但服务节点设备 13的硬件 资源一定要通过 PCIE交换板 12与管理设备 11连接。  Based on the above, Fig. 2 shows a specific structure of a server system. In FIG. 2, the management device 11 is connected to the hardware resources of the service node device such as the 10-node 131, the compute node 132, and the storage node 133 through the PCIE interface on the PCIE switch board 12. The management device 11 passes the GE interface on the PCIE switch board 12. It is connected to a management controller of 10 nodes 131, compute nodes 132, storage nodes 133, and the like. In FIG. 2, the shared module such as the fan board 14 in the entire server system is directly connected to the management device 11 through its management controller, but is not limited thereto, and other service node devices other than the fan board 14, for example, 10 nodes 131, calculation The management controller of the node 132, the storage node 133, and the like may also be directly connected to the management device 11. That is to say, the management controller of the service node device can directly connect with the management device 11 without going through the PCIE switch board 12, but the hardware resources of the service node device 13 must be connected to the management device 11 through the PCIE switch board 12.
基于上述可知, 管理设备 11与服务节点设备 13之间的连接方式以及每 种连接方式下的信息获取方式包括但不限于如下几种:  Based on the above, the connection manner between the management device 11 and the service node device 13 and the information acquisition manner in each connection mode include but are not limited to the following:
管理设备 11具体可以通过 PCIE交换板 12与每个服务节点设备 13的硬 件资源连接,则管理设备 11可以通过 PCIE交换板 12监控每个服务节点设备 13的硬件资源的状态, 进而通过 PCIE交换板 12获取每个服务节点设备 13 的硬件资源的信息。  The management device 11 can be connected to the hardware resources of each service node device 13 through the PCIE switch board 12, and the management device 11 can monitor the state of the hardware resources of each service node device 13 through the PCIE switch board 12, and then pass through the PCIE switch board. 12 Obtain information of hardware resources of each service node device 13.
除此之外,管理设备 11具体还可以通过 PCIE交换板 12与每个服务节点 设备 13的硬件资源连接, 并通过 PCIE交换板 12与每个服务节点设备 13的 管理控制器连接,则管理设备 11可以通过 PCIE交换板 12监控每个服务节点 设备 13的硬件资源的状态,进而通过 PCIE交换板 12获取每个服务节点设备 13 的硬件资源的信息, 或者, 管理设备 11也可以根据存储的每个服务节点 设备 13 连接的第一端口和第二端口之间的对应关系, 从每个服务节点设备 13 的管理控制器获取每个服务节点设备 13 的硬件资源的信息。 其中, 每个 服务节点设备 13的管理控制器用于监控每个服务节点设备 13的硬件资源的 状态。第一端口为 PCIE交换板 12上与每个服务节点设备 13的硬件资源连接 的端口, 第二端口为 PCIE交换板 12上与每个服务节点设备 13的管理控制 器连接的端口。 即, 管理设备 11可以通过 PCIE交换板获取每个服务节点设 备 13的硬件资源的信息, 也可以从每个服务节点设备 13的管理控制器接收 上报的每个服务节点设备 13的硬件资源的信息,并根据存储的第一端口和第 二端口之间的对应关系, 确定管理控制器上报的信息所对应的硬件资源, 从 而获得每个服务节点设备 13的硬件资源的信息。 管理设备 11可以通过 PCIE 交换板获取每个服务节点设备 13的硬件资源的全部信息,也可以从管理控制 器获取每个服务节点设备 13的硬件资源的全部信息, 也可以通过 PCIE交换 板获取硬件资源的一部分信息, 从管理控制器获取另一部分信息。 例如, 通 过管理控制器获取硬件资源的在位信息、厂家信息等,通过 PCIE交换板获取 硬件资源的负载信息等。 In addition, the management device 11 can also be connected to the hardware resources of each service node device 13 through the PCIE switch board 12, and connected to the management controller of each service node device 13 through the PCIE switch board 12, and then manage the device. The status of the hardware resources of each service node device 13 can be monitored by the PCIE switch board 12, and the information of the hardware resources of each service node device 13 can be obtained through the PCIE switch board 12, or the management device 11 can also be stored according to each Corresponding relationship between the first port and the second port connected by the service node device 13 acquires information of hardware resources of each service node device 13 from the management controller of each service node device 13. The management controller of each service node device 13 is used to monitor the state of the hardware resources of each service node device 13. The first port is a port on the PCIE switch board 12 that is connected to the hardware resources of each service node device 13, and the second port is a port on the PCIE switch board 12 that is connected to the management controller of each service node device 13. That is, the management device 11 can obtain each service node setting through the PCIE switch board. The information of the hardware resources of the standby device 13 may also receive the information of the hardware resources of each of the service node devices 13 that are reported from the management controller of each service node device 13 and according to the stored between the first port and the second port. The corresponding relationship determines the hardware resources corresponding to the information reported by the management controller, thereby obtaining the information of the hardware resources of each service node device 13. The management device 11 can obtain all the information about the hardware resources of each service node device 13 through the PCIE switch board, and can also obtain all the information about the hardware resources of each service node device 13 from the management controller, or obtain the hardware through the PCIE switch board. Part of the resource information, get another part of the information from the management controller. For example, the management controller acquires the in-position information of the hardware resources, the manufacturer information, and the like, and acquires the load information of the hardware resources through the PCIE switch board.
另外, 管理设备 11具体可以通过 PCIE交换板 12与每个服务节点设备 In addition, the management device 11 can specifically pass through the PCIE switch board 12 and each service node device.
13 的硬件资源连接, 并直接与每个服务节点设备 13 的管理控制器连接, 则 管理设备 11具体可以通过 PCIE交换板 12监控每个服务节点设备 13的硬件 资源的状态,通过 PCIE交换板 12获取每个服务节点设备 13的硬件资源的信 息, 或者, 管理设备 11也可以根据存储的每个服务节点设备 13连接的第一 端口和第二端口之间的对应关系,从每个服务节点设备 13的管理控制器获取 每个服务节点设备 13的硬件资源的信息。 其中, 每个服务节点设备 13的管 理控制器用于监控每个服务节点设备 13的硬件资源的状态;第一端口为 PCIE 交换板 12上与每个服务节点设备 13的硬件资源连接的端口, 第二端口为管 理设备 11上与每个服务节点设备 13的管理控制器连接的端口。 即, 管理设 备 11可以通过 PCIE交换板获取每个服务节点设备 13的硬件资源的信息,也 可以从每个服务节点设备 13 的管理控制器接收上报的每个服务节点设备 13 的硬件资源的信息, 并根据存储的第一端口和第二端口之间的对应关系, 确 定管理控制器上报的信息所对应的硬件资源, 从而获得每个服务节点设备 13 的硬件资源的信息。 管理设备 11可以通过 PCIE交换板获取每个服务节点设 备 13 的硬件资源的全部信息, 也可以从管理控制器获取每个服务节点设备 13的硬件资源的全部信息, 也可以通过 PCIE交换板获取硬件资源的一部分 信息, 从管理控制器获取另一部分信息。 例如, 通过管理控制器获取硬件资 源的在位信息、 厂家信息等, 通过 PCIE交换板获取硬件资源的负载信息等。 The hardware resources of the 13 are connected to the management controller of each of the service node devices 13, and the management device 11 can monitor the state of the hardware resources of each of the service node devices 13 through the PCIE switch board 12 through the PCIE switch board 12. Obtaining the information of the hardware resources of each service node device 13, or the management device 11 may also obtain the correspondence between the first port and the second port connected by each of the service node devices 13 from each of the service node devices. The management controller of 13 acquires information of the hardware resources of each of the service node devices 13. The management controller of each service node device 13 is configured to monitor the state of the hardware resources of each service node device 13; the first port is a port on the PCIE switch board 12 that is connected to the hardware resources of each service node device 13, The two ports are ports on the management device 11 that are connected to the management controller of each of the service node devices 13. That is, the management device 11 can obtain the information of the hardware resources of each of the service node devices 13 through the PCIE switch board, and can also receive the information of the hardware resources of each of the service node devices 13 that are reported from the management controller of each service node device 13. And determining, according to the correspondence between the stored first port and the second port, the hardware resources corresponding to the information reported by the management controller, thereby obtaining the information of the hardware resources of each service node device 13. The management device 11 can obtain all the information about the hardware resources of each service node device 13 through the PCIE switch board, and can also obtain all the information about the hardware resources of each service node device 13 from the management controller, or obtain the hardware through the PCIE switch board. Part of the resource information, get another part of the information from the management controller. For example, the management controller obtains the in-position information of the hardware resources, the manufacturer information, and the like, and acquires the load information of the hardware resources through the PCIE switch board.
其中,如果服务节点设备 13的管理控制器与 PCIE交换板 12连接,则第 二端口是 PCIE交换板 12上的端口;如果服务节点设备 13的管理控制器与管 理设备 11直接连接, 则第二端口是管理设备 11上的端口。 其中, 以计算节 点 132作为服务节点设备 13为例, 计算节点 132的一个 CPU与管理设备 11 互联、 以及计算节点 132的管理控制器与管理设备 11互联的详细结构如图 3 所示。 管理设备 11分别与 PCIE交换板 12上的 GE接口和 PCIE接口连接, 计算节点 132的管理控制器与 GE接口的端口 10 ( PortlO )连接, 计算节点 132的该 CPU与 PCIE接口的端口 20 ( Port20 )连接。 图 3中仅示出一个 CPU 通过 PCIE接口与管理设备 11连接, 但不限于此, 当计算节点 132包括多个 CPU时,每个 CPU分别通过一个 PCIE接口与管理设备 11连接。其中, PortlO 属于第二端口, Port20属于第一端口。 管理设备 11上存储有该计算节点 132 连接的 PortlO与 Port20的对应关系, 管理设备 11通过该对应关系可以获取 计算节点 132上该 CPU的信息, 例如该 CPU的槽号等。 具体的, 管理设备 11上还存储有 Port20与连接 Port20的计算节点 132上的 CPU的标识与该 CPU 的槽号之间的对应关系,基于此,管理设备 11可以通过 PortlO获取到计算节 点 132的管理控制器上报的该 CPU的部分信息, 例如 CPU的在位信息、 厂 家信息等, 进一步还可以根据 PortlO与 Port20之间的对应关系获知该 CPU 所连接的 Port20, 然后根据获取的该 CPU的标识获取该 CPU的槽号、 负载 等其他信息。 Wherein, if the management controller of the service node device 13 is connected to the PCIE switch board 12, the second port is a port on the PCIE switch board 12; if the management node and the management node of the service node device 13 The device 11 is directly connected, and the second port is a port on the management device 11. For example, taking the computing node 132 as the serving node device 13 as an example, the CPU of the computing node 132 is interconnected with the management device 11, and the detailed structure of the management controller of the computing node 132 and the management device 11 is as shown in FIG. 3. The management device 11 is connected to the GE interface and the PCIE interface on the PCIE switch board 12, and the management controller of the compute node 132 is connected to the port 10 (PortlO) of the GE interface. The CPU of the node 132 and the port 20 of the PCIE interface (Port 20) )connection. Only one CPU is connected to the management device 11 through the PCIE interface in FIG. 3, but is not limited thereto. When the computing node 132 includes a plurality of CPUs, each CPU is connected to the management device 11 through a PCIE interface. Among them, PortlO belongs to the second port, and Port20 belongs to the first port. The management device 11 stores the correspondence between the Port 10 and the Port 20 connected to the computing node 132. The management device 11 can obtain information about the CPU on the computing node 132, such as the slot number of the CPU. Specifically, the management device 11 further stores a correspondence between the identifier of the CPU on the computing node 132 connecting the port 20 and the connection port 20 and the slot number of the CPU. Based on this, the management device 11 can obtain the computing node 132 through the port 10. The management unit reports part of the information of the CPU, such as the in-position information of the CPU, the manufacturer information, and the like, and further obtains the port 20 connected to the CPU according to the correspondence between the port 10 and the port 20, and then according to the acquired identifier of the CPU. Obtain other information such as the slot number and load of the CPU.
在本实施例的一个可选实施方式中,管理设备 11在物理资源池上建立逻 辑分区的过程为:管理设备 11用于将物理资源池包括的硬件资源的信息呈现 给用户, 接收用户的选中指令和划分指令, 以建立所述逻辑分区。 其中, 选 中指令用于指示用户根据应用需求从物理资源池中选择的硬件资源, 划分指 令用于指示用户对选中指令中所选择的硬件资源进行分配形成的逻辑分区和 逻辑分区所包括的硬件资源。 其中, 管理设备 11向用户呈现物理资源池包括 的硬件资源的信息的功能相当于向用户提供了一个逻辑分区平台, 以使用户 使用 PCIE交换板 12进行管理设备 11和服务节点设备 13之间的互联, 解决 了使用网线、线缆等各种互联方式造成的各服务节点设备 13的硬件资源之间 的差异, 同一类型的硬件资源的拓朴关系对用户来说是相同的, 因此, 在进 行逻辑分区划分时可以仅考虑应用需求, 而不需要考虑各硬件资源之间的差 异, 也不需要考虑各硬件资源构成的物理拓朴, 有利于提高逻辑分区的划分 效率。 In an optional implementation manner of the embodiment, the process for the management device 11 to establish a logical partition on the physical resource pool is: the management device 11 is configured to present information of the hardware resource included in the physical resource pool to the user, and receive the selected instruction of the user. And dividing instructions to establish the logical partition. The selected instruction is used to indicate a hardware resource selected by the user from the physical resource pool according to the application requirement, and the dividing instruction is used to indicate the hardware resource included in the logical partition and the logical partition formed by the user to allocate the selected hardware resource in the selected instruction. . The function of the management device 11 to present the information of the hardware resources included in the physical resource pool to the user is equivalent to providing the user with a logical partition platform, so that the user uses the PCIE switch board 12 to perform management between the device 11 and the service node device 13. Interconnection, which solves the difference between the hardware resources of the service node devices 13 caused by various interconnection methods such as network cables and cables. The topology relationship of the same type of hardware resources is the same for the user, and therefore, When partitioning logical partitions, you can consider only the application requirements, without considering the differences between hardware resources, and without considering the physical topology of each hardware resource, which is beneficial to improve the division of logical partitions. effectiveness.
对用户根据应用需求选择物理资源池中的硬件资源, 并进行分配形成不 同逻辑分区的过程进行举例说明。 例如, 在部署桌面云应用时, 需要两个逻 辑分区, 分别承载桌面云终端业务和相关的数据库应用。 根据评估得到: 桌 面云终端业务需要 8个 CPU, 若干内存、 存储, 100张虚拟网卡; 数据库应 用需要 2个 CPU, 若干内存、 存储, 一张虚拟网卡。 则用户可以在管理设备 11呈现出的逻辑分区平台上,根据要求进行 CPU、 内存、存储、 网卡的选择, 然后再将选择的 CPU、 内存、存储、 网卡等硬件资源划分相应的 CPU、 内存、 存储、 网卡资源组成承载桌面云终端业务的逻辑分区和相关的数据库应用的 逻辑分区即可。  The process of selecting hardware resources in a physical resource pool according to application requirements and performing allocation to form different logical partitions is illustrated. For example, when deploying a desktop cloud application, two logical partitions are required to host the desktop cloud terminal service and related database applications. According to the evaluation: The desktop cloud terminal service requires 8 CPUs, several memory, storage, and 100 virtual network cards; the database application requires 2 CPUs, some memory, storage, and a virtual network card. The user can select the CPU, memory, storage, and network card according to the requirements on the logical partition platform presented by the management device 11, and then divide the selected CPU, memory, storage, network card and other hardware resources into corresponding CPUs, memories, and The storage and network card resources may constitute a logical partition that carries the desktop cloud terminal service and a logical partition of the related database application.
在本实施例的一个可选实施方式中, 在逻辑分区建立之后, 管理设备 11 还用于在逻辑分区上电后, 向用户呈现与所述逻辑分区对应的硬件平台, 以 使用户安装操作系统以及中间件或上层应用等。 其中, 硬件平台包括所述逻 辑分区包括的硬件资源。 其中, 操作系统可以是普通操作系统也可以是虚拟 操作系统。 其中, 在逻辑分区建立后, 管理设备 11通过向用户呈现每个逻辑 分区中的硬件资源的信息, 相当于向用户呈现一个个硬件平台, 而每个硬件 平台对用户来说就类似于普通的服务器或计算机。 然后, 用户可以在硬件平 台上自行选择安装普通的操作系统或虚拟操作系统。 其中, 管理设备 11向用 户呈现逻辑分区对应的硬件平台的功能相当于在逻辑分区的硬件资源 (可称 为硬件层)上提供一个增强适配层。 这里的增强适配层提供针对该逻辑分区 的 BIOS 的作用, 除此之外还提供开放硬件平台支持, 各种虚拟化软件支持 等。  In an optional implementation manner of the embodiment, after the logical partition is established, the management device 11 is further configured to: after the logical partition is powered on, present a hardware platform corresponding to the logical partition to the user, so that the user installs the operating system. And middleware or upper-level applications. The hardware platform includes hardware resources included in the logical partition. The operating system can be a normal operating system or a virtual operating system. After the logical partition is established, the management device 11 presents the information of the hardware resources in each logical partition to the user, which is equivalent to presenting one hardware platform to the user, and each hardware platform is similar to the ordinary for the user. Server or computer. The user can then choose to install a normal operating system or virtual operating system on the hardware platform. The function of the management device 11 to present the hardware platform corresponding to the logical partition to the user is equivalent to providing an enhanced adaptation layer on the hardware resource (which can be referred to as the hardware layer) of the logical partition. The enhanced adaptation layer here provides the role of the BIOS for this logical partition, in addition to providing open hardware platform support, various virtualization software support, and more.
基于上述, 一种逻辑分区的结构如图 4所示, 从底向上依次包括硬件层、 增强适配层和操作系统。  Based on the above, the structure of a logical partition is as shown in FIG. 4, which includes a hardware layer, an enhanced adaptation layer, and an operating system in order from bottom to top.
基于上述,本实施例提供一种管理设备 11在物理资源池上建立的逻辑分 区的整体结构, 如图 5所示。 其中, 物理资源池主要包括有 CPU资源构成的 CPU池、由 10资源构成的 10池和由 MEM资源构成的内存( Memory, MEM ) 池, 其中的 CPU资源、 10资源和 MEM资源为服务节点设备中的硬件资源; 在该物理资源池上建立了 N个逻辑分区, 分别为逻辑分区 1-逻辑分 Ν。 每 个逻辑分区以所包括的硬件资源作为硬件平台, 硬件平台之上包括增强适配 层、 操作系统和应用程序。 如图 5所示, 逻辑分区 1的操作系统主要是普通 操作系统, 例如, Windows/Linux/unix等; 逻辑分区 N的操作系统是虚拟操 作系统, 需要先安装虚拟化操作系统, 如 vmware的 ESX, 在其上创建虚拟 机( Virtual machine , VM ) , 然后才能为 VM安装操作系统和应用程序等。 其中, VMware ESX服务器是一种在通用环境下分区和整合系统的虚拟主机 软件。 由此可见, 在本实施例的服务器系统中, 由于使用 PCIE交换板进行管 理设备与服务节点设备之间的互联,充分利用 PCIE交换板具有的高带宽、低 延迟等特性与优点, 使得所构建的物理资源池屏蔽了各硬件资源的物理拓朴 信息, 可以省去虚拟化管理软件层, 解决了现有技术中必须通过虚拟化技术 来屏蔽由于物理拓朴导致的硬件资源差异的问题, 降低了管理成本, 同时提 高了性能。 Based on the above, the embodiment provides an overall structure of a logical partition established by the management device 11 on a physical resource pool, as shown in FIG. 5. The physical resource pool mainly includes a CPU pool composed of CPU resources, 10 pools composed of 10 resources, and a memory (MEM) pool composed of MEM resources, wherein the CPU resources, 10 resources, and MEM resources are service node devices. Hardware resources in the network; N logical partitions are established on the physical resource pool, which are logical partitions 1 - logical branches. Each logical partition uses the included hardware resources as a hardware platform, and the hardware platform includes enhanced adaptation. Layers, operating systems, and applications. As shown in Figure 5, the operating system of logical partition 1 is mainly a common operating system, for example, Windows/Linux/unix, etc.; the operating system of logical partition N is a virtual operating system, and a virtualized operating system, such as ESX of vmware, needs to be installed first. , create a virtual machine (VM) on it, and then install the operating system and applications for the VM. Among them, the VMware ESX server is a virtual host software that partitions and consolidates the system in a common environment. Therefore, in the server system of the embodiment, since the PCIE switch board is used for the interconnection between the management device and the service node device, the characteristics and advantages of the high bandwidth and low delay of the PCIE switch board are fully utilized, so that the built The physical resource pool shields the physical topology information of each hardware resource, and the virtualization management software layer can be omitted, which solves the problem that the prior art must shield the hardware resource difference caused by the physical topology through the virtualization technology, and reduce the problem. Manage costs while improving performance.
在本实施例的一个可选实施方式中,管理设备 11还用于在逻辑分区包括 的硬件资源的负载大于预设第一负载门限时, 发出预警信息。  In an optional implementation manner of the embodiment, the management device 11 is further configured to issue the early warning information when the load of the hardware resource included in the logical partition is greater than a preset first load threshold.
在本实施例的一个可选实施方式中,管理设备 11还用于在逻辑分区包括 的硬件资源的负载大于预设第一负载门限时, 从所构建的物理资源池中选择 空闲的硬件资源添加到逻辑分区中,并触发新添加的硬件资源的热插拔流程。 其中,新添加的硬件资源的热插拔流程会因新添加的硬件资源的不同而不同。 举例说明, 如果新添加的硬件资源为网卡资源, 则相应的热插拔流程主要是 为新添加的硬件资源增加驱动等操作。 如果新添加的硬件资源为 CPU, 则相 应的热插拔流程主要是 CPU的热插拔、 上线( online ) , 下线( offline )等流 程。  In an optional implementation manner of this embodiment, the management device 11 is further configured to: when the load of the hardware resource included in the logical partition is greater than a preset first load threshold, select an idle hardware resource to be added from the constructed physical resource pool. Go to the logical partition and trigger the hot plug process of the newly added hardware resources. The hot plug process of the newly added hardware resources will be different depending on the newly added hardware resources. For example, if the newly added hardware resource is a NIC resource, the corresponding hot plug process is mainly to add a driver for the newly added hardware resource. If the newly added hardware resource is a CPU, the corresponding hot plug process is mainly the CPU hot swap, online, offline, and the like.
在上述可选实施方式中, 管理设备 11可以在逻辑分区运行过程中, 通过 监控每个服务节点设备 13的硬件资源的状态,获取逻辑分区中的硬件资源的 负载情况, 并在发现逻辑分区包括的硬件资源的负载大于预设第一负载门限 时, 发出预警信息和 /或为逻辑分区新添加硬件资源, 并在新添加硬件资源后 触发热插拔流程。  In the above optional implementation manner, the management device 11 may acquire the load status of the hardware resources in the logical partition by monitoring the state of the hardware resources of each service node device 13 during the operation of the logical partition, and include the logical partition in the discovery logical partition. When the load of the hardware resource is greater than the preset first load threshold, the warning information is sent and/or the hardware resource is newly added for the logical partition, and the hot plug process is triggered after the new hardware resource is added.
上述第一负载门限可以是负载门限中的上门限, 但不限于此。  The first load threshold may be an upper threshold in the load threshold, but is not limited thereto.
进一步,管理设备 11还可以在逻辑分区包括的硬件资源的负载小于预设 第二负载门限时, 从逻辑分区中移除过剩的硬件资源到所述物理资源池中。 其中, 第二负载门限可以是负载门限中的下门限。 所述过剩的硬件资源是指 辑分区中过剩的硬件资源到物理资源池有利于提高硬件资源的利用率。 上述 利用率较低的硬件资源通常是指利用率低于预设利用率门限的硬件资源 , 所 述利用率门限可以根据不同应用场景设置不同的取值。 Further, the management device 11 may further remove excess hardware resources from the logical partition into the physical resource pool when the load of the hardware resources included in the logical partition is less than the preset second load threshold. The second load threshold may be a lower threshold in the load threshold. The excess hardware resource refers to The excess hardware resources in the partition to the physical resource pool is beneficial to improve the utilization of hardware resources. The hardware resources with lower utilization ratios are usually the hardware resources whose utilization is lower than the preset usage threshold. The thresholds can be set according to different application scenarios.
在此说明, 上述第一负载门限大于第二负载门限, 关于第一负载门限和 第二负载门限的具体数值不做限定。  It is noted that the first load threshold is greater than the second load threshold, and the specific values of the first load threshold and the second load threshold are not limited.
由上述可见, 本实施例的管理设备 11可以在逻辑分区运行时, 通过获取 逻辑分区中的硬件资源的信息, 根据硬件资源的信息灵活调整逻辑分区中的 硬件资源,管理 10、 CPU等硬件资源的热插拔, 实现对逻辑分区的动态管理。 结合上述,对逻辑分区的动态管理具体可以由管理设备 11提供的增强适配层 实现。  It can be seen from the above that the management device 11 of the embodiment can flexibly adjust the hardware resources in the logical partition according to the information of the hardware resources, and manage the hardware resources such as the CPU and the CPU by acquiring the information of the hardware resources in the logical partition when the logical partition is running. Hot swapping, enabling dynamic management of logical partitions. In combination with the above, the dynamic management of logical partitions can be specifically implemented by the enhanced adaptation layer provided by the management device 11.
由上述可见,本发明实施例通过 PCIE交换板实现管理设备与服务节点设 备之间的互联, 充分利用 PCIE交换板实现了系统中的硬件资源的自由调度, 使逻辑分区不受物理资源拓朴的影响, 提升了逻辑分区的创建与使用效率; 使得用户不需要感知 PCIE交换架构的差异,可以在物理资源池上直接进行逻 辑分区操作, 可以省去虚拟化管理软件层, 降低管理成本, 同时提高了性能; 另外, 在逻辑分区运行过程中, 通过对硬件资源的监控, 实现了动态逻辑分 区的管理方案, 实现了 10等物理资源的灵活分配、 热插拔等。  It can be seen from the foregoing that the embodiment of the present invention implements interconnection between the management device and the service node device through the PCIE switch board, and fully utilizes the PCIE switch board to implement free scheduling of hardware resources in the system, so that the logical partition is not subject to physical resource topology. The impact of the creation and use efficiency of the logical partition is improved; the user does not need to perceive the difference of the PCIE switch architecture, and the logical partition operation can be directly performed on the physical resource pool, which can save the virtualization management software layer, reduce the management cost, and improve the time. Performance; In addition, during the operation of the logical partition, through the monitoring of hardware resources, the dynamic logical partition management scheme is realized, and the flexible allocation and hot swapping of 10 physical resources are realized.
图 6为本发明一实施例提供的管理设备的结构示意图。 如图 6所示, 本 实施例的管理设备包括: 监控模块 61和分区建立模块 62。  FIG. 6 is a schematic structural diagram of a management device according to an embodiment of the present invention. As shown in FIG. 6, the management device of this embodiment includes: a monitoring module 61 and a partition establishing module 62.
其中, 监控模块 61 , 通过 PCIE交换板与至少一个服务节点设备连接, 用于通过 PCIE 交换板获取至少一个服务节点设备中的每个服务节点设备的 硬件资源的信息, 根据获取的所述信息构建物理资源池。 分区建立模块 62, 与监控模块 61连接,用于在监控模块 61构建的物理资源池上建立逻辑分区。  The monitoring module 61 is connected to the at least one service node device through the PCIE switch board, and is configured to acquire, by using the PCIE switch board, information about hardware resources of each service node device in the at least one service node device, and construct the information according to the obtained information. Physical resource pool. The partition establishment module 62 is connected to the monitoring module 61 for establishing a logical partition on the physical resource pool constructed by the monitoring module 61.
本实施例中的服务节点设备可以是各种类型的服务节点设备, 例如, 10 节点, 计算节点, 存储节点, 内存扩展板, 风扇板, 电源板等模块。 其中, 10节点的硬件资源是指 10资源, 计算节点的硬件资源是指 CPU资源, 存储 节点的硬件资源是指 HDD、 SSD等各种存储资源。  The service node device in this embodiment may be various types of service node devices, for example, a 10-node, a computing node, a storage node, a memory expansion board, a fan board, a power board, and the like. The hardware resources of the 10 nodes refer to 10 resources, the hardware resources of the computing nodes refer to CPU resources, and the hardware resources of the storage nodes refer to various storage resources such as HDD and SSD.
本实施例的管理设备与服务节点设备通过 PCIE交换板连接,管理设备通 过 PCIE交换板获取各个服务节点设备的硬件资源的信息,并根据获取的硬件 资源的信息构建物理资源池, 进而在物理资源池上建立逻辑分区, 充分利用The management device of the embodiment is connected to the service node device through the PCIE switch board, and the management device obtains the hardware resource information of each service node device through the PCIE switch board, and according to the acquired hardware. Resource information builds a physical resource pool, and then establishes logical partitions on the physical resource pool to make full use of
PCIE交换板在物理资源层面实现了屏蔽物理拓朴信息的物理资源池,提高了 物理资源池中的硬件资源的使用灵活性, 从而有效提高了整个系统的性能。 The PCIE switch board implements a physical resource pool that blocks physical topology information at the physical resource level, which improves the flexibility of using hardware resources in the physical resource pool, thereby effectively improving the performance of the entire system.
图 7为本发明另一实施例提供的管理设备的结构示意图。 本实施例基于 图 6所示实施例实现, 如图 7所示, 本实施例的管理设备也包括: 监控模块 61和分区建立模块 62。  FIG. 7 is a schematic structural diagram of a management device according to another embodiment of the present invention. The embodiment is implemented based on the embodiment shown in FIG. 6. As shown in FIG. 7, the management device of this embodiment also includes: a monitoring module 61 and a partition establishing module 62.
可选的, 每个服务节点设备除了包括硬件资源之外, 还可以包括管理控 制器。 管理控制器可以一直保持上电状态, 主要用于在异常情况下, 供管理 设备获取服务节点设备的硬件资源的信息。 当然在正常情况下, 管理控制器 也可以获取服务节点设备的硬件资源的信息。  Optionally, each service node device may include a management controller in addition to hardware resources. The management controller can be kept in the power-on state, and is mainly used for the management device to obtain the information of the hardware resources of the service node device under abnormal conditions. Of course, under normal circumstances, the management controller can also obtain information about the hardware resources of the service node device.
基于上述,本实施例的监控模块 61具体包括第一监控单元 611和构建单 元 613。  Based on the above, the monitoring module 61 of the embodiment specifically includes a first monitoring unit 611 and a construction unit 613.
其中, 第一监控单元 611 , 通过 PCIE交换板与每个服务节点设备的硬件 资源连接, 用于通过 PCIE交换板监控每个服务节点设备的硬件资源的状态, 通过 PCIE交换板获取每个服务节点设备的硬件资源的信息。 PCIE交换板上 可以包括 GE、 FE和 /或 PCIE接口 , 第一监控单元 611可以通过 GE、 FE和 PCIE接口中的任何一种接口与每个服务节点设备的硬件资源连接。  The first monitoring unit 611 is connected to the hardware resources of each service node device through the PCIE switch board, and is configured to monitor the state of the hardware resources of each service node device through the PCIE switch board, and obtain each service node through the PCIE switch board. Information about the hardware resources of the device. The PCIe switch board may include GE, FE, and/or PCIE interfaces, and the first monitoring unit 611 may connect to the hardware resources of each serving node device through any one of the GE, FE, and PCIE interfaces.
构建单元 613 , 与第一监控单元 611 连接, 用于根据第一监控单元 611 获取的每个服务节点的硬件资源的信息, 构建上述物理资源池。  The building unit 613 is connected to the first monitoring unit 611, and is configured to construct the physical resource pool according to the information of the hardware resources of each service node acquired by the first monitoring unit 611.
进一步, 本实施例的监控模块 61还包括: 第二监控单元 612。 第二监控 单元 612,通过 PCIE交换板或者直接与每个服务节点设备的管理控制器连接, 用于根据存储的每个服务节点设备连接的第一端口和第二端口之间的对应关 系, 从每个服务节点设备的管理控制器上获取每个服务节点设备的硬件资源 的信息。 同理,如果第二监控单元 612通过 PCIE交换板与服务节点设备的管 理控制器连接,则第二监控单元 612也可以通过 GE、 FE和 PCIE接口中的任 何一种接口与每个服务节点设备的管理控制器连接。此外,第二监控单元 612 也可以直接与服务节点设备的管理控制器连接, 而不通过 PCIE交换板。  Further, the monitoring module 61 of the embodiment further includes: a second monitoring unit 612. The second monitoring unit 612 is connected to the management controller of each service node device through the PCIE switch board, and is configured to use the correspondence between the first port and the second port connected to each of the stored service node devices. The information of the hardware resources of each service node device is obtained on the management controller of each service node device. Similarly, if the second monitoring unit 612 is connected to the management controller of the service node device through the PCIE switch board, the second monitoring unit 612 can also connect to each service node device through any one of the GE, FE, and PCIE interfaces. Management controller connection. In addition, the second monitoring unit 612 can also be directly connected to the management controller of the serving node device without passing through the PCIE switchboard.
构建单元 613 , 还与第二监控单元 612连接, 还用于根据第一监控单元 611获取的每个服务节点的硬件资源的信息和 /或第二监控单元 612获取的每 个服务节点设备的硬件资源的信息, 构建物理资源池。 其中, 每个服务节点设备的管理控制器用于监控每个服务节点设备的硬 件资源的状态;第一端口为 PCIE交换板上与服务节点设备的硬件资源连接的 端口, 第二端口为与所述服务节点设备的管理控制器连接的端口。 如果第二 监控单元 612通过 PCIE交换板与服务节点设备的管理控制器连接,则第二端 口可以是 PCIE交换板 12上的端口, 如果第二监控单元 612直接与服务节点 设备的管理控制器连接, 则第二端口可以是管理设备上的端口 (更为具体的 该端口可以是第二监控单元 612上的端口) 。 可以由第一监控单元 611通过 PCIE交换板获取每个服务节点设备的硬件资源的信息,也可以由第二监控单 元 612从每个服务节点设备的管理控制器接收上报的每个服务节点设备的硬 件资源的信息, 并根据存储的第一端口和第二端口之间的对应关系, 确定管 理控制器上报的信息所对应的硬件资源, 从而获得每个服务节点设备的硬件 资源的信息。可以由第一监控单元 611通过 PCIE交换板获取每个服务节点设 备的硬件资源的全部信息, 也可以由第二监控单元 612从管理控制器获取每 个服务节点设备的硬件资源的全部信息, 也可以由第一监控单元 611 通过 PCIE交换板获取硬件资源的一部分信息, 而由第二监控单元 612从管理控制 器获取另一部分信息。 The building unit 613 is also connected to the second monitoring unit 612, and is also used for information of hardware resources of each serving node acquired according to the first monitoring unit 611 and/or hardware of each serving node device acquired by the second monitoring unit 612. Resource information, build a pool of physical resources. The management controller of each service node device is configured to monitor the state of the hardware resources of each service node device; the first port is a port connected to the hardware resource of the service node device on the PCIE switch board, and the second port is The port to which the management node device's management controller is connected. If the second monitoring unit 612 is connected to the management controller of the service node device through the PCIE switch board, the second port may be a port on the PCIE switch board 12 if the second monitoring unit 612 is directly connected to the management controller of the service node device. The second port may be a port on the management device (more specifically, the port may be a port on the second monitoring unit 612). The information of the hardware resources of each service node device may be acquired by the first monitoring unit 611 through the PCIE switch board, or may be received by the second monitoring unit 612 from the management controller of each service node device. The information of the hardware resource is determined according to the correspondence between the stored first port and the second port, and the hardware resources corresponding to the information reported by the management controller are determined, thereby obtaining the information of the hardware resources of each service node device. All the information of the hardware resources of each service node device may be acquired by the first monitoring unit 611 through the PCIE switch board, and all the information of the hardware resources of each service node device may also be obtained by the second monitoring unit 612 from the management controller. A portion of the hardware resources may be acquired by the first monitoring unit 611 through the PCIE switchboard, and another portion of the information may be acquired by the second monitoring unit 612 from the management controller.
其中, 以计算节点作为服务节点设备为例, 本实施例的监控模块 61通过 第一监控单元 611和第二监控单元 612分别与计算节点的每个 CPU资源和计 算节点的管理控制器互联的详细结构如图 3所示。  For example, the computing node of the computing node is connected to the management controller of the computing node by the first monitoring unit 611 and the second monitoring unit 612 respectively. The structure is shown in Figure 3.
进一步, 本实施例的分区建立模块 62具体用于将监控模块 61构建的物 理资源池包括的硬件资源的信息呈现给用户, 接收用户发出的选中指令和划 分指令, 以建立所述逻辑分区。 其中, 选中指令用于指示用户根据应用需求 从物理资源池中选择的硬件资源, 划分指令用于指示用户对所选择的硬件资 源进行分配形成的所述逻辑分区和所述逻辑分区包括的硬件资源。 其中, 分 区建立模块 62 向用户呈现物理资源池包括的硬件资源的信息的功能相当于 向用户提供了一个逻辑分区平台, 以使用户在逻辑分区平台上进行逻辑分区 的划分。 在这里, 由于管理设备使用 PCIE交换板和服务节点设备之间互联, 解决了使用网线、 线缆等各种互联方式造成的各服务节点设备的硬件资源之 间差异, 同一类型的硬件资源的拓朴关系对用户来说是相同的, 因此, 在进 行逻辑分区划分时可以仅考虑应用需求, 而不需要考虑各硬件资源之间的差 异, 也不需要考虑各硬件资源构成的物理拓朴, 有利于提高逻辑分区的划分 效率。 Further, the partition establishment module 62 of the embodiment is specifically configured to present the information of the hardware resources included in the physical resource pool constructed by the monitoring module 61 to the user, and receive the selected instruction and the partitioning instruction issued by the user to establish the logical partition. The selected instruction is used to indicate a hardware resource selected by the user from the physical resource pool according to the application requirement, and the dividing instruction is used to indicate the logical partition formed by the user to allocate the selected hardware resource and the hardware resource included in the logical partition. . The function of the partition establishment module 62 to present the information of the hardware resources included in the physical resource pool to the user is equivalent to providing the user with a logical partition platform, so that the user performs logical partition division on the logical partition platform. Here, since the management device uses the interconnection between the PCIE switch board and the service node device, the difference between the hardware resources of each service node device caused by the various interconnection modes such as the network cable and the cable is solved, and the extension of the same type of hardware resources is solved. The relationship between the PCs is the same for the user. Therefore, when logical partitioning is performed, only the application requirements can be considered without considering the difference between the hardware resources. Different, it is also unnecessary to consider the physical topology of each hardware resource, which is beneficial to improve the efficiency of partitioning logical partitions.
对用户根据应用需求选择物理资源池中的硬件资源, 并进行分配形成不 同逻辑分区的过程进行的举例说明可参见上述系统实施例的描述。  For an example of the process of selecting a hardware resource in a physical resource pool according to an application requirement and performing a process of allocating different logical partitions, refer to the description of the foregoing system embodiment.
进一步, 本实施例的分区建立模块 62还用于在逻辑分区上电后, 向用户 呈现与逻辑分区对应的硬件平台, 以使用户安装操作系统。 其中, 硬件平台 包括逻辑分区包括的硬件资源。 其中, 操作系统可以是普通操作系统也可以 是虚拟操作系统。 其中, 在逻辑分区建立后, 分区建立模块 62通过向用户呈 现每个逻辑分区中的硬件资源的信息, 相当于向用户呈现一个个硬件平台, 而每个硬件平台对用户来说就类似于普通的服务器或计算机。 然后, 用户可 以在硬件平台上自行选择安装普通的操作系统或虚拟操作系统。 其中, 分区 建立模块 62 向用户呈现逻辑分区对应的硬件平台的功能相当于在逻辑分区 的硬件资源 (可称为硬件层)上提供一个增强适配层。 这里的增强适配层提 供针对该逻辑分区的 BIOS 的作用, 除此之外还提供开放硬件平台支持, 各 种虚拟化软件支持等。  Further, the partition establishment module 62 of the embodiment is further configured to: after the logical partition is powered on, present a hardware platform corresponding to the logical partition to the user, so that the user installs the operating system. The hardware platform includes hardware resources included in the logical partition. The operating system can be a normal operating system or a virtual operating system. After the logical partition is established, the partition establishing module 62 presents the hardware resource information in each logical partition to the user, which is equivalent to presenting one hardware platform to the user, and each hardware platform is similar to the ordinary user. Server or computer. Users can then choose to install a normal operating system or virtual operating system on the hardware platform. The function of the partitioning module 62 to present the hardware platform corresponding to the logical partition to the user is equivalent to providing an enhanced adaptation layer on the hardware resource (which may be referred to as the hardware layer) of the logical partition. The enhanced adaptation layer here provides the role of the BIOS for the logical partition, in addition to providing open hardware platform support, various virtualization software support, and so on.
本实施例管理设备通过监控模块 61和分区建立模块 62构建的逻辑分区 的一种结构如图 4所示。本实施例管理设备通过监控模块 61和分区建立模块 62构建的物理资源池以及在物理资源池上建立的逻辑分区的整体结构如图 5 所示。  A structure of the logical partition constructed by the monitoring device 61 and the partition establishing module 62 of the embodiment management device is as shown in FIG. The physical structure of the physical resource pool constructed by the monitoring module 61 and the partition establishing module 62 and the logical partition established on the physical resource pool in this embodiment are as shown in FIG. 5.
进一步, 本实施例的管理设备还包括: 预警模块 63。 该预警模块 63与 监控模块 61连接, 用于在监控模块 61获取到逻辑分区包括的硬件资源的负 载大于预设第一负载门限时, 发出预警信息。 具体的, 预警模块 63分别与第 一监控单元 611和第二监控单元 612连接。  Further, the management device of this embodiment further includes: an early warning module 63. The warning module 63 is connected to the monitoring module 61, and is configured to issue the warning information when the monitoring module 61 acquires that the load of the hardware resource included in the logical partition is greater than the preset first load threshold. Specifically, the early warning module 63 is connected to the first monitoring unit 611 and the second monitoring unit 612, respectively.
其中, 监控模块 61可以通过 PCIE交换板监控每个服务节点设备的硬件 资源的状态, 获取逻辑分区中的硬件资源的负载情况。 或者, 监控模块 61也 可以通过每个服务节点设备的管理控制器获取每个服务节点设备的硬件资源 的状态, 进而获取逻辑分区中的硬件资源的负载情况。  The monitoring module 61 can monitor the state of the hardware resources of each service node device through the PCIE switch board, and obtain the load status of the hardware resources in the logical partition. Alternatively, the monitoring module 61 may obtain the state of the hardware resources of each service node device through the management controller of each service node device, thereby acquiring the load status of the hardware resources in the logical partition.
进一步, 本实施例的分区建立模块 62还用于在监控模块 61获取到逻辑 分区包括的硬件资源的负载大于预设第一负载门限时, 从物理资源池中选择 空闲的硬件资源添加到所述逻辑分区中, 并触发新添加的硬件资源的热插拔 流程。 Further, the partition establishment module 62 of the embodiment is further configured to: when the monitoring module 61 acquires that the load of the hardware resource included in the logical partition is greater than a preset first load threshold, select an idle hardware resource from the physical resource pool to add to the In the logical partition, and trigger hot swap of newly added hardware resources Process.
其中, 新添加的硬件资源的热插拔流程会因新添加的硬件资源的不同而 不同。 举例说明, 如果新添加的硬件资源为网卡资源, 则相应的热插拔流程 主要是为新添加的硬件资源增加驱动等操作。如果新添加的硬件资源为 CPU, 则相应的热插拔流程主要是 CPU的热插拔、 online, offline等流程。  The hot plug process of newly added hardware resources will vary depending on the newly added hardware resources. For example, if the newly added hardware resource is a NIC resource, the corresponding hot plug process is mainly to add a driver for the newly added hardware resource. If the newly added hardware resource is a CPU, the corresponding hot plug process is mainly the CPU hot swap, online, offline, and the like.
上述第一负载门限可以是负载门限中的上门限, 但不限于此。  The first load threshold may be an upper threshold in the load threshold, but is not limited thereto.
进一步,分区建立模块 62还可以在逻辑分区包括的硬件资源的负载小于 预设第二负载门限时, 从逻辑分区中移除过剩的硬件资源到所述物理资源池 中。 其中, 第二负载门限可以是负载门限中的下门限。 所述过剩的硬件资源 除逻辑分区中过剩的硬件资源到物理资源池中有利于提高硬件资源的利用 率。 上述利用率较低的硬件资源通常是指利用率小于预设利用率门限的硬件 资源, 所述利用率门限可以根据实际需求设置不同的值。  Further, the partition establishment module 62 may also remove excess hardware resources from the logical partition into the physical resource pool when the load of the hardware resources included in the logical partition is less than the preset second load threshold. The second load threshold may be a lower threshold in the load threshold. The excess hardware resources, in addition to the excess hardware resources in the logical partition to the physical resource pool, are beneficial to improve the utilization of hardware resources. The lower-utilization hardware resources usually refer to hardware resources whose utilization is less than the preset utilization threshold. The utilization threshold can be set to different values according to actual requirements.
在此说明, 上述第一负载门限大于第二负载门限, 关于第一负载门限和 第二负载门限的具体数值不做限定。  It is noted that the first load threshold is greater than the second load threshold, and the specific values of the first load threshold and the second load threshold are not limited.
由上述可见, 本实施例的分区建立模块 62可以在逻辑分区运行时, 通过 获取逻辑分区中的硬件资源的信息, 根据硬件资源的信息灵活调整逻辑分区 中的硬件资源, 管理 10、 CPU等硬件资源的热插拔, 实现对逻辑分区的动态 管理。 结合上述, 对逻辑分区的动态管理具体可以由分区建立模块 62提供的 增强适配层实现。  It can be seen from the above that the partition establishment module 62 of the embodiment can flexibly adjust the hardware resources in the logical partition according to the information of the hardware resources, and manage the hardware such as the CPU and the CPU by acquiring the information of the hardware resources in the logical partition when the logical partition is running. Hot swapping of resources to achieve dynamic management of logical partitions. In conjunction with the above, dynamic management of logical partitions may be specifically implemented by an enhanced adaptation layer provided by partition creation module 62.
由上述可见,本实施例的管理设备通过 PCIE交换板实现与服务节点设备 之间的互联,利用 PCIE交换板实现了系统中的硬件资源的自由调度,使逻辑 分区不受物理资源拓朴的影响, 提升了逻辑分区的创建与使用效率; 使得用 户不需要感知 PCIE交换架构的差异,可以在物理资源池上直接进行逻辑分区 操作, 可以省去虚拟化管理软件层, 降低管理成本, 同时提高了性能; 另外, 在逻辑分区运行过程中, 通过对硬件资源的监控, 实现了动态逻辑分区的管 理方案, 实现了 10等物理资源的灵活分配、 热插拔等。  It can be seen that the management device in this embodiment implements interconnection with the service node device through the PCIE switch board, and implements free scheduling of hardware resources in the system by using the PCIE switch board, so that the logical partition is not affected by the physical resource topology. The efficiency of creating and using logical partitions is improved; the user does not need to perceive the difference of the PCIE switching architecture, and can perform logical partitioning operations directly on the physical resource pool, thereby eliminating the virtualization management software layer, reducing management costs, and improving performance. In addition, during the operation of the logical partition, the management scheme of the dynamic logical partition is realized through the monitoring of the hardware resources, and the flexible allocation and hot plugging of the 10 physical resources are realized.
图 8为本发明一实施例提供的管理方法的流程图。 如图 8所示, 本实施 例的方法包括:  FIG. 8 is a flowchart of a management method according to an embodiment of the present invention. As shown in FIG. 8, the method in this embodiment includes:
步骤 801、管理设备通过 PCIE交换板获取至少一个服务节点设备中的每 个服务节点设备的硬件资源的信息。 Step 801: The management device acquires each of the at least one service node device through the PCIE switch board. Information about the hardware resources of the service node device.
步骤 802、 管理设备根据获取的所述信息, 构建物理资源池。  Step 802: The management device builds a physical resource pool according to the obtained information.
步骤 803、 管理设备在上述物理资源池上建立逻辑分区。  Step 803: The management device establishes a logical partition on the foregoing physical resource pool.
其中, 管理设备通过 PCIE交换板与每个服务节点设备连接。  The management device is connected to each service node device through a PCIE switch board.
本实施例的服务节点设备可以是各种类型的服务节点设备, 例如, 10节 点, 计算节点, 存储节点, 内存扩展板, 风扇板, 电源板等模块。 其中, 10 节点的硬件资源是指 10资源, 计算节点的硬件资源是指 CPU资源, 存储节 点的硬件资源是指 HDD、 SSD等各种存储资源。  The service node device in this embodiment may be various types of service node devices, for example, 10 nodes, a computing node, a storage node, a memory expansion board, a fan board, a power board, and the like. The hardware resources of the 10 nodes refer to 10 resources, the hardware resources of the compute nodes refer to CPU resources, and the hardware resources of the storage nodes refer to various storage resources such as HDD and SSD.
在本实施例的一个可选实施方式中,管理设备通过 PCIE交换板与每个服 务节点设备的硬件资源连接。 则步骤 801 的具体实施方式包括: 管理设备通 过 PCIE交换板监控每个服务节点设备的硬件资源的状态, 通过 PCIE交换板 获取每个服务节点设备的硬件资源的信息。 具体的, 管理设备与每个服务节 点设备之间的交互通过 PCIE交换板实现。在该实施方式中,管理设备可以通 过 PCIE交换板与每个服务节点设备的管理控制器连接,也可以直接与每个服 务节点设备的管理控制器连接。  In an optional implementation of this embodiment, the management device is connected to the hardware resources of each service node device through a PCIE switch board. The specific implementation manner of step 801 includes: the management device monitors the state of the hardware resources of each service node device through the PCIE switch board, and obtains the hardware resource information of each service node device through the PCIE switch board. Specifically, the interaction between the management device and each service node device is implemented through a PCIE switch board. In this embodiment, the management device can be connected to the management controller of each service node device through the PCIE switch board, or directly to the management controller of each service node device.
在本实施例的一个可选实施方式中,管理设备通过 PCIE交换板与每个服 务节点设备的硬件资源连接,并通过 PCIE交换板与每个服务节点设备的管理 控制器连接。 基于此, 步骤 801 的具体实施方式包括: 管理设备通过 PCIE 交换板监控每个服务节点设备的硬件资源的状态,通过 PCIE交换板获取每个 服务节点设备的硬件资源的信息, 或者管理设备根据存储的每个服务节点设 备连接的第一端口和第二端口之间的对应关系, 从每个服务节点设备的管理 控制器获取每个服务节点设备的硬件资源的信息。 其中, 每个服务节点设备 的管理控制器用于监控每个服务节点设备的硬件资源的状态。 第一端口为 PCIE交换板上与每个服务节点设备的硬件资源连接的端口,第二端口为 PCIE 交换板上与每个服务节点设备的管理控制器连接的端口。 即, 管理设备可以 通过 PCIE交换板获取每个服务节点设备的硬件资源的信息,也可以从每个服 务节点设备的管理控制器接收上报的每个服务节点设备的硬件资源的信息, 并根据存储的第一端口和第二端口之间的对应关系, 确定管理控制器上报的 信息所对应的硬件资源, 从而获得每个服务节点设备的硬件资源的信息。 管 理设备可以通过 PCIE交换板获取每个服务节点设备的硬件资源的全部信息, 也可以从管理控制器获取每个服务节点设备的硬件资源的全部信息, 也可以 通过 PCIE交换板获取硬件资源的一部分信息,从管理控制器获取另一部分信 自 In an optional implementation manner of this embodiment, the management device is connected to the hardware resources of each service node device through the PCIE switch board, and is connected to the management controller of each service node device through the PCIE switch board. Based on this, the specific implementation manner of step 801 includes: the management device monitors the state of the hardware resources of each service node device through the PCIE switch board, acquires the hardware resource information of each service node device through the PCIE switch board, or manages the device according to the storage. Corresponding relationship between the first port and the second port of each service node device connection, and acquiring information of hardware resources of each service node device from a management controller of each service node device. The management controller of each service node device is used to monitor the status of hardware resources of each service node device. The first port is a port connected to the hardware resource of each service node device on the PCIE switch board, and the second port is a port connected to the management controller of each service node device on the PCIE switch board. That is, the management device may obtain the information of the hardware resources of each service node device through the PCIE switch board, and may also receive the information of the hardware resources of each service node device that is reported from the management controller of each service node device, and according to the storage. Corresponding relationship between the first port and the second port determines the hardware resources corresponding to the information reported by the management controller, thereby obtaining the information of the hardware resources of each service node device. The management device can obtain all the information about the hardware resources of each service node device through the PCIE switch board. It is also possible to obtain all the information of the hardware resources of each service node device from the management controller, or obtain a part of the hardware resources through the PCIE switch board, and obtain another part of the information from the management controller.
在本实施例的一个可选实施方式中,管理设备通过 PCIE交换板与每个服 务节点设备的硬件资源连接, 并直接与服务节点设备的管理控制器连接。 基 于此, 步骤 801的具体实施方式包括: 管理设备通过 PCIE交换板监控每个服 务节点设备的硬件资源的状态,通过 PCIE交换板获取每个服务节点设备的硬 件资源的信息, 或者管理设备根据存储的每个服务节点设备连接的第一端口 和第二端口之间的对应关系, 从每个服务节点设备的管理控制器获取每个服 务节点设备的硬件资源的信息。 其中, 每个服务节点设备的管理控制器用于 监控每个服务节点设备的硬件资源的状态。第一端口为 PCIE交换板上与每个 服务节点设备的硬件资源连接的端口, 第二端口为管理设备上与每个服务节 点设备的管理控制器连接的端口。也就是说,第二端口可以是 PCIE交换板上 的端口,也可以是管理设备上的端口。 当管理设备通过 PCIE交换板与服务节 点设备的管理控制器连接时,第二端口为 PCIE交换板上的端口; 当管理设备 直接与服务节点设备的管理控制器连接, 而不通过 PCIE交换板时,第二端口 为管理设备上的端口。 即,管理设备可以通过 PCIE交换板获取每个服务节点 设备的硬件资源的信息, 也可以从每个服务节点设备的管理控制器接收上报 的每个服务节点设备的硬件资源的信息, 并根据存储的第一端口和第二端口 之间的对应关系, 确定管理控制器上报的信息所对应的硬件资源, 从而获得 每个服务节点设备的硬件资源的信息。管理设备可以通过 PCIE交换板获取每 个服务节点设备的硬件资源的全部信息, 也可以从管理控制器获取每个服务 节点设备的硬件资源的全部信息,也可以通过 PCIE交换板获取硬件资源的一 部分信息, 从管理控制器获取另一部分信息。  In an optional implementation manner of this embodiment, the management device is connected to the hardware resources of each service node device through the PCIE switch board, and is directly connected to the management controller of the service node device. Based on this, the specific implementation manner of the step 801 includes: the management device monitors the state of the hardware resources of each service node device through the PCIE switch board, acquires the hardware resource information of each service node device through the PCIE switch board, or manages the device according to the storage. Corresponding relationship between the first port and the second port of each service node device connection, and acquiring information of hardware resources of each service node device from a management controller of each service node device. The management controller of each service node device is used to monitor the status of hardware resources of each service node device. The first port is a port on the PCIE switch board that is connected to the hardware resource of each service node device, and the second port is a port on the management device that is connected to the management controller of each service node device. That is, the second port can be a port on the PCIE switch board or a port on the management device. When the management device is connected to the management controller of the service node device through the PCIE switch board, the second port is a port on the PCIE switch board; when the management device is directly connected to the management controller of the service node device, but not through the PCIE switch board The second port is the port on the management device. That is, the management device may obtain the information of the hardware resources of each service node device through the PCIE switch board, and may also receive the information of the hardware resources of each service node device that is reported from the management controller of each service node device, and according to the storage. Corresponding relationship between the first port and the second port determines the hardware resources corresponding to the information reported by the management controller, thereby obtaining the information of the hardware resources of each service node device. The management device can obtain all the information about the hardware resources of each service node device through the PCIE switch board, or obtain all the hardware resources of each service node device from the management controller, or obtain a part of the hardware resources through the PCIE switch board. Information, get another piece of information from the management controller.
在本实施例的一个可选实施方式中, 步骤 803的实施方式包括: 管理设 备将物理资源池包括的硬件资源的信息呈现给用户, 接收用户的选中指令和 划分指令, 以建立所述逻辑分区。 其中, 选中指令用于指示用户根据应用需 求从物理资源池中选择的硬件资源, 划分指令用于指示用户对所选择的硬件 资源进行分配形成的所述逻辑分区和所述逻辑分区包括的硬件资源。 该过程 相当于向用户提供了一个逻辑分区平台, 以便于用户进行逻辑分区的划分。 在本实施例提供的管理方法中,管理设备与服务节点设备通过 PCIE交换 板连接, 管理设备通过 PCIE 交换板获取各个服务节点设备的硬件资源的信 息, 并根据获取的硬件资源的信息构建物理资源池, 进而在物理资源池上建 立逻辑分区,充分利用 PCIE交换板在物理资源层面实现了屏蔽物理拓朴信息 的物理资源池, 提高了物理资源池中的硬件资源使用的灵活性, 从而有效提 高了整个系统的性能。 In an optional implementation manner of this embodiment, the implementation manner of step 803 includes: the management device presents the information of the hardware resource included in the physical resource pool to the user, and receives the selected instruction and the partitioning instruction of the user to establish the logical partition. . The selected instruction is used to indicate a hardware resource selected by the user from the physical resource pool according to the application requirement, and the dividing instruction is used to indicate the logical partition formed by the user to allocate the selected hardware resource and the hardware resource included in the logical partition. . This process is equivalent to providing a logical partition platform to the user, so that the user can divide the logical partition. In the management method provided by the embodiment, the management device and the service node device are connected through the PCIE switch board, and the management device acquires the hardware resource information of each service node device through the PCIE switch board, and constructs the physical resource according to the acquired hardware resource information. The pool, and then the logical partition is established on the physical resource pool, and the PCIE switch board is utilized to implement the physical resource pool for shielding physical topology information at the physical resource level, thereby improving the flexibility of using hardware resources in the physical resource pool, thereby effectively improving the pool. The performance of the entire system.
图 9为本发明另一实施例提供的管理方法的结构示意图。 如图 9所示, 本实施例的方法包括:  FIG. 9 is a schematic structural diagram of a management method according to another embodiment of the present invention. As shown in FIG. 9, the method in this embodiment includes:
步骤 901、 在旁路(standby )供电情况下, 管理设备获取每个服务节点 设备的硬件资源的信息。  Step 901: In the case of standby power supply, the management device acquires information about hardware resources of each service node device.
具体的,管理设备可以通过 PCIE交换板监控各服务节点设备的硬件资源 的状态,通过 PCIE交换板与每个服务节点设备进行交互,获取每个服务节点 设备的硬件资源的信息。 或者  Specifically, the management device can monitor the state of the hardware resources of each service node device through the PCIE switch board, and interact with each service node device through the PCIE switch board to obtain information about the hardware resources of each service node device. Or
管理设备还可以通过每个服务节点设备的管理控制器, 接收管理控制器 上报的管理控制器所在服务节点设备的硬件资源的信息。  The management device may also receive, by using a management controller of each service node device, information about hardware resources of the service node device where the management controller is reported by the management controller.
步骤 902、 管理设备根据每个服务节点设备的硬件资源的信息, 构建物 理资源池。  Step 902: The management device constructs a physical resource pool according to information about hardware resources of each service node device.
步骤 903、 管理设备通过 UI等方式将物理资源池呈现给用户, 以使用户 进行逻辑分区的划分。  Step 903: The management device presents the physical resource pool to the user by using a UI or the like, so that the user performs logical partition division.
步骤 904、 用户根据应用需求进行逻辑分区的划分。  Step 904: The user divides the logical partition according to the application requirement.
步骤 905、 逻辑分区上电后, 管理设备提供的增强适配层生效, 向用户 呈现该逻辑分区对应的硬件平台。  Step 905: After the logical partition is powered on, the enhanced adaptation layer provided by the management device takes effect, and the hardware platform corresponding to the logical partition is presented to the user.
可选的, 每个逻辑分区对应的硬件平台可以包括: CPU、 10、 MEM等资 源。  Optionally, the hardware platform corresponding to each logical partition may include: CPU, 10, MEM, and the like.
步骤 906、 用户在硬件平台上安装操作系统、 中间及上层应用软件等, 进行应用。  Step 906: The user installs an operating system, an intermediate layer, and an upper layer application software on the hardware platform to perform an application.
由上述可见,管理设备与服务节点设备通过 PCIE交换板连接,管理设备 通过 PCIE交换板获取各个服务节点设备的硬件资源的信息,并根据获取的硬 件资源的信息构建物理资源池, 进而在物理资源池上建立逻辑分区, 充分利 用 PCIE交换板在物理资源层面实现了屏蔽物理拓朴信息的物理资源池,即不 再受现有技术中的互联方式的限制, 使得物理资源池中的硬件资源可以灵活 分配, 使逻辑分区动态可调, 从而有效提高了整个系统的性能。 As shown in the above, the management device and the service node device are connected through the PCIE switch board. The management device obtains the information of the hardware resources of each service node device through the PCIE switch board, and constructs a physical resource pool according to the acquired hardware resource information, and then the physical resource. A logical partition is created on the pool to fully utilize the PCIE switch board to implement a physical resource pool that blocks physical topology information at the physical resource level, that is, no Restricted by the interconnection mode in the prior art, the hardware resources in the physical resource pool can be flexibly allocated, and the logical partition can be dynamically adjusted, thereby effectively improving the performance of the entire system.
进一步, 如图 10所示管理方法, 在步骤 906之后, 还包括:  Further, after the step 906, the management method shown in FIG. 10 further includes:
步骤 907、 管理设备监控逻辑分区的硬件资源的负载情况。  Step 907: The management device monitors a load condition of a hardware resource of the logical partition.
步骤 908、 管理设备判断逻辑分区包括的硬件资源的负载是否大于预设 第一负载门限; 如果判断结果为是, 执行步骤 909; 如果判断结构为否, 返 回继续执行步骤 907。  Step 908: The management device determines whether the load of the hardware resource included in the logical partition is greater than a preset first load threshold. If the determination result is yes, go to step 909; if the structure is negative, go back to step 907.
步骤 909、 管理设备从物理资源池中选择空闲的硬件资源添加到该逻辑 分区中, 并触发所述新添加的硬件资源的热插拔流程。  Step 909: The management device selects an idle hardware resource from the physical resource pool to be added to the logical partition, and triggers a hot plugging process of the newly added hardware resource.
其中, 新添加的硬件资源与负载超过第一负载门限的硬件资源属于同一 类型, 例如, 如果负载超过第一负载门限的硬件资源是 CPU, 则新添加的硬 件资源也是 CPU资源; 如果超过第一负载门限的硬件资源是 10, 则新添加 的硬件资源也是 10, 等等。  The newly added hardware resource is of the same type as the hardware resource whose load exceeds the first load threshold. For example, if the hardware resource whose load exceeds the first load threshold is the CPU, the newly added hardware resource is also the CPU resource; The hardware resource of the load threshold is 10, and the newly added hardware resources are also 10, and so on.
进一步, 步骤 908的一可选实施方式还可以是: 管理设备发出预警信息。 上述第一负载门限可以是负载门限中的上门限, 但不限于此。 预设第二负载门限时, 从逻辑分区中移除过剩的硬件资源到所述物理资源池 中。 其中, 第二负载门限可以是负载门限中的下门限。 所述过剩的硬件资源 除逻辑分区中过剩的硬件资源到物理资源池中有利于提高硬件资源的利用 率。 上述利用率较低的硬件资源通常是指利用率小于预设利用率门限的硬件 资源, 所述利用率门限可以根据实际需求设置不同的值。  Further, an optional implementation of step 908 may be: the management device sends the early warning information. The first load threshold may be an upper threshold in the load threshold, but is not limited thereto. When the second load threshold is preset, excess hardware resources are removed from the logical partition into the physical resource pool. The second load threshold may be a lower threshold in the load threshold. The excess hardware resources, in addition to the excess hardware resources in the logical partition to the physical resource pool, are beneficial to improve the utilization of hardware resources. The lower-utilization hardware resources usually refer to hardware resources whose utilization is less than the preset utilization threshold. The utilization threshold can be set to different values according to actual requirements.
在此说明, 上述第一负载门限大于第二负载门限, 关于第一负载门限和 第二负载门限的具体数值不做限定。  It is noted that the first load threshold is greater than the second load threshold, and the specific values of the first load threshold and the second load threshold are not limited.
由上述可见, 本实施例提供的管理方法, 在逻辑分区运行时, 通过获取 逻辑分区中的硬件资源的信息, 根据硬件资源的信息灵活调整逻辑分区中的 硬件资源,管理 10、 CPU等硬件资源的热插拔, 实现对逻辑分区的动态管理。 结合上述, 对逻辑分区的动态管理具体可以由管理设备提供的增强适配层实 现。  It can be seen from the above that the management method provided in this embodiment, when the logical partition is running, obtains the hardware resources in the logical partition according to the information of the hardware resources, and manages the hardware resources in the logical partition according to the information of the hardware resources. Hot swapping, enabling dynamic management of logical partitions. In combination with the above, the dynamic management of logical partitions can be implemented by an enhanced adaptation layer provided by the management device.
本领域普通技术人员可以理解: 实现上述方法实施例的全部或部分步骤 可以通过程序指令相关的硬件来完成, 前述的程序可以存储于一计算机可读 取存储介质中, 该程序在执行时, 执行包括上述方法实施例的步骤; 而前述 的存储介质包括: ROM、 RAM, 磁碟或者光盘等各种可以存储程序代码的介 质。 One of ordinary skill in the art can understand that all or part of the steps of the above method embodiments are implemented. The foregoing program may be stored in a computer readable storage medium, and when executed, the program includes the steps of the foregoing method embodiment; and the foregoing storage medium includes: ROM, RAM , a variety of media that can store program code, such as a disk or an optical disk.
最后应说明的是: 以上各实施例仅用以说明本发明的技术方案, 而非对 其限制; 尽管参照前述各实施例对本发明进行了详细的说明, 本领域的普通 技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改, 或者对其中部分或者全部技术特征进行等同替换; 而这些修改或者替换, 并 不使相应技术方案的本质脱离本发明各实施例技术方案的范围。  Finally, it should be noted that the above embodiments are only for explaining the technical solutions of the present invention, and are not intended to be limiting thereof; although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art will understand that The technical solutions described in the foregoing embodiments may be modified, or some or all of the technical features may be equivalently replaced; and the modifications or substitutions do not deviate from the technical solutions of the embodiments of the present invention. range.

Claims

权 利 要求 书 claims
1、 一种服务器系统, 其特征在于, 包括: 管理设备、 高速外设组件互连 PCIE交换板和至少一个服务节点设备; 所述管理设备通过所述 PCIE交换板 与所述至少一个服务节点设备中的每个服务节点设备连接; 1. A server system, characterized in that it includes: a management device, a high-speed peripheral component interconnecting a PCIE switching board and at least one service node device; the management device communicates with the at least one service node device through the PCIE switching board Each service node device in the connection is connected;
所述管理设备,用于通过所述 PCIE交换板获取所述每个服务节点设备的 硬件资源的信息, 根据获取的所述信息构建物理资源池, 并在所述物理资源 池上建立逻辑分区。 The management device is configured to obtain information about the hardware resources of each service node device through the PCIE switching board, construct a physical resource pool based on the obtained information, and establish logical partitions on the physical resource pool.
2、 根据权利要求 1所述的服务器系统, 其特征在于, 所述管理设备具体 用于将所述物理资源池包括的硬件资源的信息呈现给用户, 接收所述用户的 选中指令和划分指令, 以建立所述逻辑分区, 所述选中指令用于指示所述用 户根据应用需求从所述物理资源池中选择的硬件资源, 所述划分指令用于指 示所述用户对所述选择的硬件资源进行分配而形成的所述逻辑分区和所述逻 辑分区包括的硬件资源。 2. The server system according to claim 1, wherein the management device is specifically configured to present information about hardware resources included in the physical resource pool to a user, and receive selection instructions and division instructions from the user, To establish the logical partition, the selection instruction is used to instruct the user to select hardware resources from the physical resource pool according to application requirements, and the partition instruction is used to instruct the user to perform operations on the selected hardware resources. The logical partition formed by the allocation and the hardware resources included in the logical partition.
3、 根据权利要求 2所述的服务器系统, 其特征在于, 所述管理设备还用 以使所述用户安装操作系统, 所述硬件平台包括所述逻辑分区包括的硬件资 源。 3. The server system according to claim 2, wherein the management device is also used to enable the user to install an operating system, and the hardware platform includes hardware resources included in the logical partition.
4、 根据权利要求 2或 3所述的服务器系统, 其特征在于, 所述管理设备 还用于在所述逻辑分区包括的硬件资源的负载大于预设第一负载门限时, 发 出预警信息。 4. The server system according to claim 2 or 3, wherein the management device is further configured to issue an early warning message when the load of the hardware resources included in the logical partition is greater than the preset first load threshold.
5、 根据权利要求 2或 3所述的服务器系统, 其特征在于, 一负载门限时, 从所述物理资源池中选择空闲的硬件资源添加到所述逻辑分 区中, 并触发所述新添加的硬件资源的热插拔流程; 或者 二负载门限时,从所述逻辑分区中移除过剩的硬件资源到所述物理资源池中 , 所述过剩的硬件资源是在所述逻辑分区中没有被利用或者利用率小于预设利 用率门限的硬件资源。 5. The server system according to claim 2 or 3, characterized in that, when a load threshold is reached, idle hardware resources are selected from the physical resource pool and added to the logical partition, and trigger the newly added Hot swap process of hardware resources; or when the load threshold is reached, remove excess hardware resources from the logical partition to the physical resource pool, and the excess hardware resources are not utilized in the logical partition. Or hardware resources whose utilization is less than the preset utilization threshold.
6、 根据权利要求 1-5任一项所述的服务器系统, 其特征在于, 所述管理 设备具体通过所述 PCIE交换板与所述每个服务节点设备的硬件资源连接; 所述管理设备具体用于通过所述 PCIE 交换板监控所述每个服务节点设 备的硬件资源的状态,通过所述 PCIE交换板获取所述每个服务节点设备的硬 件资源的信息。 6. The server system according to any one of claims 1 to 5, characterized in that, the management device is specifically connected to the hardware resources of each service node device through the PCIE switching board; The management device is specifically configured to monitor the status of the hardware resources of each service node device through the PCIE switching board, and obtain information about the hardware resources of each service node device through the PCIE switching board.
7、 根据权利要求 1-5任一项所述的服务器系统, 其特征在于, 所述管理 设备具体通过所述 PCIE交换板与所述每个服务节点设备的硬件资源连接,并 通过所述 PCIE交换板与所述每个服务节点设备的管理控制器连接; 7. The server system according to any one of claims 1 to 5, characterized in that, the management device is connected to the hardware resources of each service node device through the PCIE switching board, and is connected to the hardware resources of each service node device through the PCIE switching board. The switching board is connected to the management controller of each service node device;
所述管理设备具体用于通过所述 PCIE 交换板监控所述每个服务节点设 备的硬件资源的状态,通过所述 PCIE交换板获取所述每个服务节点设备的硬 件资源的信息; 或者 口和第二端口之间的对应关系, 从所述每个服务节点设备的管理控制器获取 所述每个服务节点设备的硬件资源的信息; The management device is specifically configured to monitor the status of the hardware resources of each service node device through the PCIE switching board, and obtain the information of the hardware resources of each service node device through the PCIE switching board; or The corresponding relationship between the second ports is to obtain the information of the hardware resources of each service node device from the management controller of each service node device;
其中, 所述每个服务节点设备的管理控制器用于监控所述每个服务节点 设备的硬件资源的状态,第一端口为所述 PCIE交换板上与所述每个服务节点 设备的硬件资源连接的端口,第二端口为所述 PCIE交换板上与所述每个服务 节点设备的管理控制器连接的端口。 Wherein, the management controller of each service node device is used to monitor the status of the hardware resources of each service node device, and the first port is connected to the hardware resources of each service node device on the PCIE switching board. The second port is a port on the PCIE switching board connected to the management controller of each service node device.
8、 根据权利要求 1-5任一项所述的服务器系统, 其特征在于, 所述管理 设备具体通过所述 PCIE交换板与所述每个服务节点设备的硬件资源连接,并 直接与所述每个服务节点设备的管理控制器连接; 8. The server system according to any one of claims 1 to 5, characterized in that, the management device is specifically connected to the hardware resources of each service node device through the PCIE switching board, and is directly connected to the Management controller connections for each service node device;
所述管理设备具体用于通过所述 PCIE 交换板监控所述每个服务节点设 备的硬件资源的状态,通过所述 PCIE交换板获取所述每个服务节点设备的硬 件资源的信息; 或者 口和第二端口之间的对应关系, 从所述每个服务节点设备的管理控制器获取 所述每个服务节点设备的硬件资源的信息; The management device is specifically configured to monitor the status of the hardware resources of each service node device through the PCIE switching board, and obtain the information of the hardware resources of each service node device through the PCIE switching board; or The corresponding relationship between the second ports is to obtain the information of the hardware resources of each service node device from the management controller of each service node device;
其中, 所述每个服务节点设备的管理控制器用于监控所述每个服务节点 设备的硬件资源的状态,第一端口为所述 PCIE交换板上与所述每个服务节点 设备的硬件资源连接的端口, 第二端口为所述管理设备上与所述每个服务节 点设备的管理控制器连接的端口。 Wherein, the management controller of each service node device is used to monitor the status of the hardware resources of each service node device, and the first port is connected to the hardware resources of each service node device on the PCIE switching board. The second port is a port on the management device connected to the management controller of each service node device.
9、 一种管理设备, 其特征在于, 包括: 监控模块,通过高速外设组件互连 PCIE交换板与至少一个服务节点设备 连接,用于通过所述 PCIE交换板获取所述至少一个服务节点设备中的每个服 务节点设备的硬件资源的信息, 根据获取的所述信息构建物理资源池; 9. A management device, characterized by including: a monitoring module, connected to at least one service node device through a high-speed peripheral component interconnection PCIE switching board, and used to obtain information about the hardware resources of each service node device in the at least one service node device through the PCIE switching board, Construct a physical resource pool based on the obtained information;
分区建立模块, 用于在所述物理资源池上建立逻辑分区。 A partition establishment module is used to establish logical partitions on the physical resource pool.
10、 根据权利要求 9所述的管理设备, 其特征在于, 所述分区建立模块 具体用于将所述物理资源池包括的硬件资源的信息呈现给用户, 并接收所述 用户的选中指令和划分指令, 以建立所述逻辑分区, 所述选中指令用于指示 所述用户根据应用需求从所述物理资源池中选择的硬件资源, 所述划分指令 述逻辑分区包括的硬件资源。 10. The management device according to claim 9, wherein the partition establishment module is specifically configured to present the information of the hardware resources included in the physical resource pool to the user, and receive the user's selection instructions and partitions. The instruction is to establish the logical partition. The selection instruction is used to instruct the user to select hardware resources from the physical resource pool according to application requirements. The partition instruction describes the hardware resources included in the logical partition.
11、 根据权利要求 10所述的管理设备, 其特征在于, 所述分区建立模块 还用于在所述逻辑分区上电后, 向所述用户呈现与所述逻辑分区对应的硬件 平台, 以使所述用户安装操作系统, 所述硬件平台包括所述逻辑分区包括的 硬件资源。 11. The management device according to claim 10, wherein the partition establishment module is further configured to present the hardware platform corresponding to the logical partition to the user after the logical partition is powered on, so that the The user installs an operating system, and the hardware platform includes hardware resources included in the logical partition.
12、 根据权利要求 10或 11所述的管理设备, 其特征在于, 还包括: 预警模块, 用于在所述监控模块获取到所述逻辑分区包括的硬件资源的 负载大于预设第一负载门限时, 发出预警信息。 12. The management device according to claim 10 or 11, further comprising: an early warning module, configured to obtain in the monitoring module that the load of the hardware resources included in the logical partition is greater than the preset first load gate For a limited time, an early warning message will be issued.
13、 根据权利要求 10或 11所述的管理设备, 其特征在于, 13. The management device according to claim 10 or 11, characterized in that,
所述分区建立模块还用于在所述监控模块获取到所述逻辑分区包括的硬 件资源的负载大于预设第一负载门限时, 从所述物理资源池中选择空闲的硬 件资源添加到所述逻辑分区中,并触发所述新添加的硬件资源的热插拔流程; 或者 The partition establishment module is also configured to select idle hardware resources from the physical resource pool and add them to the physical resource pool when the monitoring module obtains that the load of the hardware resources included in the logical partition is greater than the preset first load threshold. logical partition, and trigger the hot-swapping process of the newly added hardware resources; or
所述分区建立模块还用于在所述监控模块获取到所述逻辑分区包括的硬 件资源的负载小于预设第二负载门限时, 从所述逻辑分区中移除过剩的硬件 资源到所述物理资源池中, 所述过剩的硬件资源是在所述逻辑分区中没有被 利用或者利用率小于预设利用率门限的硬件资源。 The partition establishment module is also configured to remove excess hardware resources from the logical partition to the physical partition when the monitoring module obtains that the load of the hardware resources included in the logical partition is less than a preset second load threshold. In the resource pool, the excess hardware resources are hardware resources that are not utilized in the logical partition or have a utilization rate less than a preset utilization threshold.
14、 根据权利要求 9-13任一项所述的管理设备, 其特征在于, 所述监控 模块包括: 14. The management device according to any one of claims 9-13, characterized in that the monitoring module includes:
第一监控单元,通过所述 PCIE交换板与所述每个服务节点设备的硬件资 源连接,用于通过所述 PCIE交换板监控所述每个服务节点设备的硬件资源的 状态, 通过所述 PCIE交换板获取所述每个服务节点设备的硬件资源的信息; 构建单元, 用于根据所述第一监控单元获取的所述每个服务节点的硬件 资源的信息, 构建所述物理资源池。 A first monitoring unit is connected to the hardware resources of each service node device through the PCIE switching board, and is used to monitor the hardware resources of each service node device through the PCIE switching board. Status, obtain the information of the hardware resources of each service node device through the PCIE switching board; The construction unit is used to construct the information of the hardware resources of each service node according to the information obtained by the first monitoring unit. Describe the physical resource pool.
15、 根据权利要求 14所述的管理设备, 其特征在于, 所述监控模块还包 括: 15. The management device according to claim 14, characterized in that the monitoring module further includes:
第二监控单元,通过所述 PCIE交换板或直接与所述每个服务节点设备的 管理控制器连接, 用于根据存储的所述每个服务节点设备连接的第一端口和 第二端口之间的对应关系, 从所述每个服务节点设备的管理控制器上获取所 述每个服务节点设备的硬件资源的信息; The second monitoring unit is connected to the management controller of each service node device through the PCIE switching board or directly, and is used to connect the first port and the second port of each service node device according to the storage corresponding relationship, obtaining information about the hardware resources of each service node device from the management controller of each service node device;
所述构建单元, 还用于根据所述第一监控单元获取的所述每个服务节点 的硬件资源的信息和 /或所述第二监控单元获取的所述每个服务节点设备的 硬件资源的信息, 构建所述物理资源池; The building unit is further configured to obtain information about the hardware resources of each service node according to the information obtained by the first monitoring unit and/or obtain information about the hardware resources of each service node device obtained by the second monitoring unit. information to construct the physical resource pool;
其中, 所述每个服务节点设备的管理控制器用于监控所述每个服务节点 设备的硬件资源的状态;第一端口为所述 PCIE交换板上与所述每个服务节点 设备的硬件资源连接的端口,第二端口为所述 PCIE交换板上或所述管理设备 上与所述每个服务节点设备的管理控制器连接的端口。 Wherein, the management controller of each service node device is used to monitor the status of the hardware resources of each service node device; the first port is connected to the hardware resources of each service node device on the PCIE switching board The second port is a port on the PCIE switching board or the management device connected to the management controller of each service node device.
16、 一种管理方法, 其特征在于, 包括: 16. A management method characterized by including:
管理设备通过高速外设组件互连 PCIE 交换板获取至少一个服务节点设 备中的每个服务节点设备的硬件资源的信息; The management device obtains information about the hardware resources of each service node device in at least one service node device through the high-speed peripheral component interconnection PCIE switching board;
所述管理设备根据获取的所述信息, 构建物理资源池; The management device constructs a physical resource pool based on the obtained information;
所述管理设备在所述物理资源池上建立逻辑分区; The management device establishes logical partitions on the physical resource pool;
其中, 所述管理设备通过所述 PCIE 交换板与所述每个服务节点设备连 接。 Wherein, the management device is connected to each service node device through the PCIE switching board.
17、 根据权利要求 16所述的管理方法, 其特征在于, 所述管理设备在所 述物理资源池上建立逻辑分区包括: 收所述用户的选中指令和划分指令, 以建立所述逻辑分区, 所述选中指令用 于指示所述用户根据应用需求从所述物理资源池中选择的硬件资源, 所述划 分指令用于指示所述用户对所述选择的硬件资源进行分配形成的所述逻辑分 区和所述逻辑分区包括的硬件资源。 17. The management method according to claim 16, wherein the management device establishing a logical partition on the physical resource pool includes: receiving a selection instruction and a division instruction from the user to establish the logical partition, so The selection instruction is used to instruct the user to select hardware resources from the physical resource pool according to application requirements, and the partition instruction is used to instruct the user to allocate the logical partitions formed by the selected hardware resources. The hardware resources included in the logical partition.
18、 根据权利要求 17所述的管理方法, 其特征在于, 还包括: 所述管理设备在所述逻辑分区上电后, 向所述用户呈现与所述逻辑分区 对应的硬件平台, 以使所述用户安装操作系统, 所述硬件平台包括所述逻辑 分区包括的硬件资源。 18. The management method according to claim 17, further comprising: after the logical partition is powered on, the management device presents the hardware platform corresponding to the logical partition to the user, so that all The user installs an operating system, and the hardware platform includes hardware resources included in the logical partition.
19、 根据权利要求 17或 18所述的管理方法, 其特征在于, 还包括: 门限时, 发出预警信息。 19. The management method according to claim 17 or 18, further comprising: issuing an early warning message when the threshold is reached.
20、 根据权利要求 17或 18所述的管理方法, 其特征在于, 还包括: 门限时, 从所述物理资源池中选择空闲的硬件资源添加到所述逻辑分区中, 并触发所述新添加的硬件资源的热插拔流程; 或者 门限时, 从所述逻辑分区中移除过剩的硬件资源到所述物理资源池中, 所述 过剩的硬件资源是在所述逻辑分区中没有被利用或者利用率小于预设利用率 门限的硬件资源。 20. The management method according to claim 17 or 18, further comprising: at a threshold, selecting idle hardware resources from the physical resource pool to add to the logical partition, and triggering the new addition. hot-swapping process of hardware resources; or at a threshold, remove excess hardware resources from the logical partition to the physical resource pool, and the excess hardware resources are not utilized in the logical partition or Hardware resources whose utilization is less than the preset utilization threshold.
21、 根据权利要求 16-20任一项所述的服务器方法, 其特征在于, 所述 管理设备通过所述 PCIE交换板与所述每个服务节点设备的硬件资源连接; 所述管理设备通过高速外设组件互连 PCIE 交换板获取至少一个服务节 点设备中的每个服务节点设备的硬件资源的信息包括: 21. The server method according to any one of claims 16 to 20, characterized in that: the management device is connected to the hardware resources of each service node device through the PCIE switching board; the management device is connected through a high-speed The peripheral component interconnection PCIE switching board obtains the information of the hardware resources of each service node device in at least one service node device, including:
所述管理设备通过所述 PCIE 交换板监控所述每个服务节点设备的硬件 资源的状态,通过所述 PCIE交换板获取所述每个服务节点设备的硬件资源的 信息。 The management device monitors the status of the hardware resources of each service node device through the PCIE switching board, and obtains the information of the hardware resources of each service node device through the PCIE switching board.
22、 根据权利要求 16-20任一项所述的管理方法, 其特征在于, 所述管 理设备通过所述 PCIE交换板与所述每个服务节点设备的硬件资源连接,并通 过所述 PCIE交换板与所述每个服务节点设备的管理控制器连接; 22. The management method according to any one of claims 16 to 20, characterized in that: the management device is connected to the hardware resources of each service node device through the PCIE switching board, and is connected to the hardware resources of each service node device through the PCIE switching board. The board is connected to the management controller of each service node device;
所述管理设备通过高速外设组件互连 PCIE 交换板获取至少一个服务节 点设备中的每个服务节点设备的硬件资源的信息包括: The management device obtains the information of the hardware resources of each service node device in at least one service node device through the high-speed peripheral component interconnection PCIE switching board, including:
所述管理设备通过所述 PCIE 交换板监控所述每个服务节点设备的硬件 资源的状态,通过所述 PCIE交换板获取所述每个服务节点设备的硬件资源的 信息; 或者 所述管理设备根据存储的所述每个服务节点设备连接的第一端口和第二 端口之间的对应关系, 从所述每个服务节点设备的管理控制器获取所述每个 服务节点设备的硬件资源的信息; The management device monitors the status of the hardware resources of each service node device through the PCIE switching board, and obtains the information of the hardware resources of each service node device through the PCIE switching board; or The management device obtains the information of each service node device from the management controller of each service node device according to the stored correspondence between the first port and the second port connected to each service node device. Hardware resource information;
其中, 所述每个服务节点设备的管理控制器用于监控所述每个服务节点 设备的硬件资源的状态;第一端口为所述 PCIE交换板上与所述每个服务节点 设备的硬件资源连接的端口,第二端口为所述 PCIE交换板上与所述每个服务 节点设备的管理控制器连接的端口。 Wherein, the management controller of each service node device is used to monitor the status of the hardware resources of each service node device; the first port is connected to the hardware resources of each service node device on the PCIE switching board The second port is a port on the PCIE switching board connected to the management controller of each service node device.
23、 根据权利要求 16-20任一项所述的管理方法, 其特征在于, 所述管 理设备通过所述 PCIE交换板与所述每个服务节点设备的硬件资源连接,并直 接与所述每个服务节点设备的管理控制器连接; 23. The management method according to any one of claims 16 to 20, characterized in that the management device is connected to the hardware resources of each service node device through the PCIE switching board, and is directly connected to the hardware resources of each service node device. Management controller connection for each service node device;
所述管理设备通过高速外设组件互连 PCIE 交换板获取至少一个服务节 点设备中的每个服务节点设备的硬件资源的信息包括: The management device obtains the information of the hardware resources of each service node device in at least one service node device through the high-speed peripheral component interconnection PCIE switching board, including:
所述管理设备通过所述 PCIE 交换板监控所述每个服务节点设备的硬件 资源的状态,通过所述 PCIE交换板获取所述每个服务节点设备的硬件资源的 信息, 或者 The management device monitors the status of the hardware resources of each service node device through the PCIE switching board, and obtains the information of the hardware resources of each service node device through the PCIE switching board, or
所述管理设备根据存储的所述每个服务节点设备连接的第一端口和第二 端口之间的对应关系, 从所述每个服务节点设备的管理控制器获取所述每个 服务节点设备的硬件资源的信息; The management device obtains the information of each service node device from the management controller of each service node device according to the stored correspondence between the first port and the second port connected to each service node device. Hardware resource information;
其中, 所述每个服务节点设备的管理控制器用于监控所述每个服务节点 设备的硬件资源的状态;第一端口为所述 PCIE交换板上与所述每个服务节点 设备的硬件资源连接的端口, 第二端口为所述管理设备上与所述每个服务节 点设备的管理控制器连接的端口。 Wherein, the management controller of each service node device is used to monitor the status of the hardware resources of each service node device; the first port is connected to the hardware resources of each service node device on the PCIE switching board The second port is a port on the management device connected to the management controller of each service node device.
PCT/CN2012/080293 2012-08-17 2012-08-17 Server system, management method, and device WO2014026374A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2012/080293 WO2014026374A1 (en) 2012-08-17 2012-08-17 Server system, management method, and device
CN201280002787.4A CN103210379B (en) 2012-08-17 2012-08-17 Server system, management method and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2012/080293 WO2014026374A1 (en) 2012-08-17 2012-08-17 Server system, management method, and device

Publications (1)

Publication Number Publication Date
WO2014026374A1 true WO2014026374A1 (en) 2014-02-20

Family

ID=48756620

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2012/080293 WO2014026374A1 (en) 2012-08-17 2012-08-17 Server system, management method, and device

Country Status (2)

Country Link
CN (1) CN103210379B (en)
WO (1) WO2014026374A1 (en)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9672167B2 (en) * 2013-07-22 2017-06-06 Futurewei Technologies, Inc. Resource management for peripheral component interconnect-express domains
CN103533306B (en) * 2013-10-11 2017-01-18 浙江宇视科技有限公司 Intelligent distribution device and method of video bandwidth
KR101781769B1 (en) 2013-11-27 2017-09-25 인텔 코포레이션 Method and apparatus for server platform architectures that enable serviceable nonvolatile memory modules
CN103955441B (en) * 2014-04-02 2017-04-12 华为技术有限公司 Equipment management system, equipment management method and IO (Input/Output) expansion interface
CN104168135B (en) * 2014-08-06 2018-11-02 中国船舶重工集团公司第七0九研究所 Network interface card resource pool management method and system
CN104125303B (en) * 2014-08-12 2018-09-04 北京奇虎科技有限公司 Reading and writing data requesting method, client and system
US9934187B2 (en) * 2014-12-17 2018-04-03 Quanta Computer Inc. Hot-pluggable computing system
CN105515870B (en) * 2015-12-18 2019-06-21 华为技术有限公司 A kind of blade server, resource allocation methods and system
CN105512080A (en) * 2015-12-31 2016-04-20 浪潮(北京)电子信息产业有限公司 Server system
CN106126357A (en) * 2016-06-29 2016-11-16 浪潮集团有限公司 A kind of server management system implementation method
CN106445006B (en) * 2016-09-29 2019-05-28 郑州云海信息技术有限公司 A kind of fusion architecture server
CN107205048A (en) * 2017-07-21 2017-09-26 深圳市智网云联科技有限公司 A kind of desktop cloud protocol code system based on PCI E buses
CN107864055A (en) * 2017-10-31 2018-03-30 云宏信息科技股份有限公司 The management method and platform of virtualization system
TWI658367B (en) * 2017-10-31 2019-05-01 英業達股份有限公司 Hardware resource expansion system
CN108337307B (en) * 2018-01-31 2021-06-29 郑州云海信息技术有限公司 Multi-path server and communication method between nodes thereof
CN108388476A (en) * 2018-03-01 2018-08-10 郑州云海信息技术有限公司 A kind of server calculating acceleration resource pool method and system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101436165A (en) * 2007-11-15 2009-05-20 国际商业机器公司 System and method for management of an IOV adapter
US7765552B2 (en) * 2004-09-17 2010-07-27 Hewlett-Packard Development Company, L.P. System and method for allocating computing resources for a grid virtual system
CN102087618A (en) * 2011-02-12 2011-06-08 浪潮(北京)电子信息产业有限公司 Resource management method and system for cloud computing operating system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7765552B2 (en) * 2004-09-17 2010-07-27 Hewlett-Packard Development Company, L.P. System and method for allocating computing resources for a grid virtual system
CN101436165A (en) * 2007-11-15 2009-05-20 国际商业机器公司 System and method for management of an IOV adapter
CN102087618A (en) * 2011-02-12 2011-06-08 浪潮(北京)电子信息产业有限公司 Resource management method and system for cloud computing operating system

Also Published As

Publication number Publication date
CN103210379B (en) 2016-05-25
CN103210379A (en) 2013-07-17

Similar Documents

Publication Publication Date Title
WO2014026374A1 (en) Server system, management method, and device
US11899943B2 (en) Node interconnection apparatus, resource control node, and server system
JP4579298B2 (en) Method and apparatus for providing a virtual server blade
US20180373557A1 (en) System and Method for Virtual Machine Live Migration
US8762999B2 (en) Guest-initiated resource allocation request based on comparison of host hardware information and projected workload requirement
EP3457288B1 (en) Computer system and storage access device
WO2014101010A1 (en) Resource management method of virtual machine system, virtual machine system, and apparatus
US20090276773A1 (en) Multi-Root I/O Virtualization Using Separate Management Facilities of Multiple Logical Partitions
CN105159798A (en) Dual-machine hot-standby method for virtual machines, dual-machine hot-standby management server and system
EP2603857A2 (en) Methods and systems for platform optimized design
WO2016107023A1 (en) Cloud server system
JP2009075718A (en) Method of managing virtual i/o path, information processing system, and program
US9866474B2 (en) Centralized server switch management
CN113312142A (en) Virtualization processing system, method, device and equipment
CN112134752B (en) Method, system, equipment and medium for monitoring switch based on BMC
WO2020108537A1 (en) Software-defined virtual network-based resource allocation system
Chashoo et al. VM_Mig_Framework: virtual machine migration with and without ballooning
CN109002347B (en) Virtual machine memory allocation method, device and system
US20240020174A1 (en) Memory disaggregation in a multi-node environment
US11838149B2 (en) Time division control of virtual local area network (vlan) to accommodate multiple virtual applications
JP2017111597A (en) Bandwidth setting method, bandwidth setting program, information processor, and information processing system
CN116010307A (en) Server resource allocation system, method and device
Partitions ESXi Install

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 12882943

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 12882943

Country of ref document: EP

Kind code of ref document: A1