WO2007033922A2 - Method and device for dynamically adjusting resources - Google Patents

Method and device for dynamically adjusting resources Download PDF

Info

Publication number
WO2007033922A2
WO2007033922A2 PCT/EP2006/066298 EP2006066298W WO2007033922A2 WO 2007033922 A2 WO2007033922 A2 WO 2007033922A2 EP 2006066298 W EP2006066298 W EP 2006066298W WO 2007033922 A2 WO2007033922 A2 WO 2007033922A2
Authority
WO
WIPO (PCT)
Prior art keywords
resource consumption
applications
web
resources
web service
Prior art date
Application number
PCT/EP2006/066298
Other languages
French (fr)
Other versions
WO2007033922B1 (en
WO2007033922A3 (en
Inventor
Ying Chen
Ying Li
Jie Qiu
Original Assignee
International Business Machines Corporation
Ibm United Kingdom Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corporation, Ibm United Kingdom Limited filed Critical International Business Machines Corporation
Priority to US12/067,454 priority Critical patent/US7949756B2/en
Publication of WO2007033922A2 publication Critical patent/WO2007033922A2/en
Publication of WO2007033922A3 publication Critical patent/WO2007033922A3/en
Publication of WO2007033922B1 publication Critical patent/WO2007033922B1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5061Partitioning or combining of resources
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3409Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
    • G06F11/3433Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment for load management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/30Monitoring
    • G06F11/34Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation ; Recording or statistical evaluation of user activity, e.g. usability assessment
    • G06F11/3447Performance evaluation by modeling
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00Arrangements for program control, e.g. control units
    • G06F9/06Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/46Multiprogramming arrangements
    • G06F9/50Allocation of resources, e.g. of the central processing unit [CPU]
    • G06F9/5083Techniques for rebalancing the load in a distributed system
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2201/00Indexing scheme relating to error detection, to error correction, and to monitoring
    • G06F2201/88Monitoring involving counting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2209/00Indexing scheme relating to G06F9/00
    • G06F2209/50Indexing scheme relating to G06F9/50
    • G06F2209/508Monitor

Definitions

  • the present invention relates to the dynamic adjustment of resources among multiple co-existing applications.
  • Every application needs to occupy resources in an electronic equipment, such as central processing unit (CPU) resources, memory resources and the like.
  • resources such as central processing unit (CPU) resources, memory resources and the like.
  • CPU central processing unit
  • memory resources such as random access memory (RAM) resources, etc.
  • the amount of resources needed by every application varies due to specific properties of every application, e.g. time complexity, space complexity and the like.
  • an electronic equipment wherein a plurality of applications co-exist is often faced with a difficult problem of how to rationally adjust resources among all applications.
  • Web service is a brand-new distributed computing environment, the basic idea of which is to utilize open standard technology to realize distributed software development, software engineering and software use on the Internet.
  • service refers to various software distributed on the Internet. Any user, whether he is an entity user or an individual user, can use Web service technology to schedule existing service software on the Internet and construct his own application software on the basis of service demands. Scheduling can be performed among application software of different entities and among application software of customers and entities via the Web service technology, so as to support e-business, customer relation model and other applications.
  • the meaning of the Web service technology is completely different from that of common Web which is only capable of providing data service, the adopted protocols and interfaces belong to the Web technology which has already in common use.
  • XML The basic technology of Web service is XML, which is defined by W3C.
  • XML is an extensible markup language for the use of data description. Different from hypertext markup language (HTML) , XML describes only data contents per se but does not touch upon data display, so it can be used to describe any generalized contents.
  • HTML hypertext markup language
  • XML In Web services, it is XML that is used to describe remote scheduling operation and its enforcement results. This description is loaded in the simple object access protocol (SOAP) , and a SOAP message is usually transmitted in the most common HTTP. Since XML text description is irrelevant to enforcement, interactions can be conducted via Web services among platforms employing different operating systems and different programming languages. The release and description of Web services per se is also implemented using XML.
  • SOAP simple object access protocol
  • SLA Service level agreements
  • the Web service providers To meet the SLAs, the Web service providers often provide excess capacity when statically provisioning resources for their customers because it is hard to accurately predict demands, especially for the scenarios that some customers may occasionally experience a "web storm", i.e. a dramatic temporary surge in demand. For example, a particular web service may experience a heavy request volume caused by an emergent event. The statically provisioned resources according to the average requirements can not deal with such a situation, which will cause the SLA violations. If excess capacity is provided in order to meet the SLAs under various sudden conditions, a great cost of resources is thus paid, which is unacceptable to the Web service provider.
  • existing electronic equipment can provide real-time reports on resource consumption, they are merely able to provide the total amount of resources consumed by the entire progress in a unit of progress or task. It is hard to provide resources consumed by each application respectively when multiple applications co-exist in one process or task. That is to say, the prior art is just able to obtain the amount of resources jointly- consumed by several applications, but can not possess the proportion of resources consumed by each application in the process or task, i.e. resource consumption ratio. Therefore, in this case, existing electronic equipment cannot dynamically adjust resources among each application in one process or task on the basis of the prediction of resource consumption amount .
  • Grid technology which has developed lately allows enterprises to share resources as they form "virtual organizations". That is, the enterprises, which may be geographically widespread locations and which may have heterogeneous computing platforms, share their resources and services to form virtual services.
  • the open grid services architecture (OGSA) developed by academic and scientific communities, along with commercial entities (such as IBM) is an evolution of Grid technology. OGSA enables the grid to provide the enterprises with a set of extensible services collected by the virtual organizations.
  • the emerging Web service resource framework (WSRF) technology is also an evolution of Grid technology. All services on Grid will be Web services with WS-Resources if the business logics of the Web services have a certain state. Thus, in this disclosure, Grid services are called "Web services" to represent the service provisioned on Grid.
  • the Web service When the demands drop down, the Web service can be moved from the node to release the additional resource for other Web services. Thus, it is necessary to observe or monitor the resource consumption of Web services for management, especially SLA enforcement.
  • existing tools of Web services for example, tools developed by AmberPoint Company
  • QoS Quality of Service
  • the SLA measurement can not indicate the root cause of the SLA violation, so that correct remedial measures can not be taken.
  • Web service A is found to be with a delayed response time by the SLA measurement tool, and then it is decided to schedule more resources for Web service A; while the real reason causing the response time delay of Web service A may be that the resource required by Web service A has been consumed or exhausted by Web service B running on the same electronic equipment/server.
  • the detected SLA violation can be used for dynamically provisioning of the Web service, the SLA has already been violated and the customer has experienced the degraded quality of service before supplying the gap by allocating additional resources.
  • the resource consumption of Web services should be measured to support the dynamical adjusting of resources, i.e., to proactively determine when and which Web service should be dynamically provisioned (remote cloned or moved) in commercial grids.
  • a method of dynamically adjusting resources among a plurality of co-existing applications including the steps of: building a relation model between a request number and resource consumption of said plurality of applications; obtaining at multiple sampling moments a request number and resource consumption of each of said plurality of applications; calculating resource consumption ratio of each of said plurality of applications; and analyzing resource consumption of a plurality of currently co-existing applications.
  • a device for dynamically adjusting resources among a plurality of applications comprising: model building means for building a relation model between request numbers and resource consumptions of said plurality of applications; first detecting means for obtaining at multiple sampling moments the request numbers of each of said plurality of applications; second detecting means for obtaining at said multiple sampling moments resource consumptions of said plurality of applications; model learning means for calculating resource consumption ratio of each of said plurality of applications; and analyzing means for analyzing resource consumptions of a plurality of co-existing applications.
  • resource consumption ratios of a plurality of co-existing applications can be obtained, and further, resource consumption of each of currently co-existing applications can be calculated according to the obtained resource consumption ratio, so that dynamically adjusting of resources is realized.
  • the present application is especially suitable for management of Web services on a Web server, and is of great significance for satisfaction with service level agreement of Web services and evaluation of Web services .
  • Fig. 1 is a flow chart of a method of dynamically adjusting resources according to a first embodiment of the present invention
  • Fig 2 is a detailed flow chart of detecting request number in the period of learning
  • Fig. 3 is a detailed flow chart of analyzing resource consumption of a plurality of currently co-existing applications
  • Fig. 4 is a layout view of a device of dynamically adjusting resources according to a second embodiment of the present invention.
  • Fig. 5 is a schematic inner layout view of a first detecting means shown in Fig. 4.
  • the same elements and components are represented by identical reference numerals, repeated depiction thereof being omitted.
  • Fig. 1 is a flow chart of a method of dynamically adjusting resources according to a first embodiment of the present invention.
  • step SlOl a relation model on the relationship between request numbers of n Web services and resource consumptions of these Web services running on a Web server is established.
  • a relation model is called Request Number Resource Consumption (RN-RC) relation model for short.
  • a linear statistic model is taken for example.
  • the linear statistic model uses different resource consumption ratios to represent the resource consumption ratio of each Web service:
  • step S102 the request number to each Web service at multiple moments in a determined period of time is detected.
  • step S103 the corresponding resource consumption situation for the Web server at multiple moments in the above determined period of time is detected.
  • the above determined period of time is called period of learning, and multiple moments in the period of learning are called sampling moments.
  • the request number to each Web service is respectively detected at (n + 1) sampling moments, i.e. ti, t 2 , ..., t n , t (n + i), and the total resource consumption of the n Web services is detected at each sampling moment.
  • the detailed flow of step S102 will be depicted with reference to Fig. 2.
  • the aforesaid request number to each Web service and the resource consumption of the n Web services running on the Web server can also be obtained by reading dedicated learning sequence data, for example, reading pre-stored learning data from metric data buffer.
  • step S104 the request number to each Web service at each sampling moment detected in step S102 and the total resource consumption of the n Web services running on the Web server detected in step S103 are respectively substituted into expression (1) to obtain (n + 1) equations corresponding to the (n + 1) sampling moments, i.e. ti, t 2 , ..., t n/ t (n + ll :
  • Qt 1 As 1 *Rs 1 t 1 + AS 2 *RS2ti+..ASi*RSiti...+ASn*Rs n ti+Ao
  • Qt 2 As 1 *Rs 1 t 2 + AS 2 *Rs 2 t 2 +..ASi*RSit 2 ...+ASn*Rs n t 2 +Ao
  • Qt 1 in the first expression represents the resource consumption of the n Web services at sampling moment ti
  • Qt 2 in the second expression represents the resource consumption of the n Web services at sampling moment t2, ..., and so on and so forth
  • Asi represents the resource consumption ratio of the (i)th Web service
  • Rsitl represents the request number to the (i)th Web service at sampling moment tl
  • Rsit2 represents the request number to the (i)th Web service at sampling moment t 2 , ..., and so on.
  • Qsi represents resources consumed by the (i)th Web service at sampling moment t
  • Qt represents the resource consumption of the Web server consumed by the n Web services at sampling moment t
  • * represents multiplying relation
  • As 1 represents the resource consumption ratio of the (i)th web service
  • Rsit represents the request number to the (i)th Web service at sampling moment t.
  • the calculated resource consumption ratio Asi of these Web services can be checked in step S105. Since the resource consumption ratio Asi represents the ratio of resources consumed by each Web service on the Web server, it should meet:
  • the flow shown in Fig. 1 will return to steps S102 and S103 (not shown in Fig. 1) to re-detect the request number and resource consumption of each Web service at each sampling moment in the period of learning and to thereby re-calculate the resource consumption ratio of each Web service.
  • the above resource consumption ratio might vary with changes of the system configuration, and then, the corresponding resource consumption ratio needs to be obtained by re-learning in the period of learning .
  • the step S105 can be omitted.
  • the validation can be that the consumption ratio must be greater than zero or not be negative.
  • step S106 according to the resource consumption ratio calculated in step S104 and the currently detected request number to each Web service, the resource consumption of a plurality of Web services currently co-existing on the Web server is analyzed and corresponding treatment is taken.
  • the resource consumption of a plurality of Web services which are about to co-exist the Web server is analyzed and corresponding treatment is then taken.
  • step S106 will be depicted in detail.
  • an electronic equipment can, according to the resource consumption ratios of a plurality of Web services obtained in the period of learning, analyze the resource consumption of the plurality of Web service co-existing on the Web server after the period of learning, perform processing based on the analysis results and then accomplish the dynamic adjusting of resources.
  • step S102 shown in Fig. 1 with reference to Fig. 2.
  • Step S102 shown in Fig. 1 can be divided into steps S1021, S1022, and S1023.
  • SOAP simple object access protocol
  • a SOAP routing handler is added to the engine receiving a SOAP message.
  • an intercepted SOAP message is parsed in step S1022.
  • the aforesaid SOAP routing handler parses part of SOAP protocol (mainly the SOAP header) to obtain the end corresponding to SOAP, which represents the name and accessed address of a concrete Web service.
  • the request number to each Web service is obtained by respectively counting the requests to each Web service.
  • step S106 shown in Fig. 1 The detailed flow of step S106 shown in Fig. 1 is as shown in Fig. 3.
  • step S105 the resource consumption ratio of each Web service, which is checked to be measure up, is obtained in step S105 shown in Fig. 1
  • the resource consumption requests to Web service at the current moment t tn t2 ) is detected in step S1061 shown in Fig. 3.
  • step S1062 the current request number to each Web service is detected in step S1062 according to these resource consumption requests.
  • This detecting method is the same as the detecting method shown in Fig. 2, so details are omitted here.
  • the flow as shown in Fig. 3 goes to step S1063.
  • step S1063 the current resource consumption of Web service (s) is (are) calculated according to the current request number to each Web service and the resource consumption ratio of each Web service obtained in the preceding step. Specifically, the request number of each Web service at moment t (n* 2) is substituted into expression (3) so that the resource consumption of each Web service and the resource consumption Qt (nt2 > of the n Web services at moment t (ru2) can be calculated.
  • the request number of each Web service which is about to co-exist with other Web service at a certain moment is known, the resource consumption of each Web service and a plurality of Web services at this moment can also be calculated.
  • step S1064 the calculated resource consumption of a plurality of currently co-existing Web services is compared with its predetermined threshold. If the resource consumption exceeds the predetermined threshold, it means that the resource consumption of a plurality of currently co-existing Web services will go beyond a predetermined resource use range, and then the flow as shown in Fig. 3 will go to step S1065. In step S1065, resources are re-adjusted, that is, the resource consumption ratio corresponding to each Web service is re-adjusted.
  • the resource consumption ratio of the Web service with more requests may be adjusted to be smaller while the resource consumption ratio of the Web service with fewer requests may be adjusted to be larger, so that the resource consumption of these Web services will not go beyond the predetermined resource use range. If the predicted resource consumption in step S1064 does not exceed the predetermined threshold, it means that current resources can meet requirement of each Web service.
  • a predetermined threshold may be set for the resource consumption of n Web services, e.g. 100%; or a predetermined threshold may be set for the resource consumption of a certain Web service, e.g. 30%; or thresholds may be set for both of them at the same time. It is to be understood that selection of numeric values of the predetermined threshold does not constitute limitation to the present invention.
  • RN-RC relation models can be built according to needs.
  • a non-linear statistic model of the neutral network a generic algorithm model and the like can also be built.
  • Selection of a concrete model type does not form limitation to the present invention.
  • BP CPUsi represents the CPU consumption of the (i)th service
  • RSitj represents the request number of the (i)th service at moment tj
  • - CPUtj represents the CPU consumption at moment t ⁇
  • CPUo represents a CPU consumption constant of the Web server
  • steps of the method of dynamically adjusting resources according to the first embodiment of the present invention can be specifically implemented in a computer program product associated with each step running on a processing system.
  • a device of dynamically adjusting resources is further provided, wherein illustration is made still by taking n (n ⁇ 2) Web services running on a Web server for example.
  • Fig. 4 schematically shows the structure of a device of dynamically adjusting resources according to a second embodiment of the present invention.
  • the device 400 of dynamically adjusting resources is used for dynamically adjusting resources for each Web service in the case that only the resource consumption of a plurality of Web services can be known.
  • the device 400 of dynamically adjusting resources may be directly integrated into a Web server, or serve as an external separate device of a Web server .
  • the device 400 of dynamically adjusting resources as shown in Fig. 4 comprises: model building means 401, first detecting means 402, second detecting means 403, model learning means 404, resource consumption ratio checking means 405, analyzing means 406 and metric data storage means 407.
  • the model building means 401 is used for building a relation model on the relationship between the resource consumption of n Web services co-existing on the Web server and the request number of each Web service, i.e. an RN-RC relation model.
  • an RN-RC relation model i.e. an RN-RC relation model.
  • various RN-RC relation models can be built according to needs, such as a linear statistic model, a non-linear statistic model of the neural network and a generic algorithm model . Selection of a concrete model type does not constitute limitation to the present invention.
  • the first detecting means 402 detects the request number to each Web service at multiple moments in a determined period of time.
  • the second detecting means 403 detects the corresponding resource consumption situation for the Web server at multiple moments in the above determined period of time.
  • the functions of the first detecting means 402 and the second detecting means 403 can also be performed by one means.
  • the request number to each Web service is respectively detected at (n+1) sampling moments, i.e. ti, t 2 , ..., t n , t lntl) , and the total resource consumption of the n Web services is detected at each sampling moment.
  • the detailed structure of the first detecting means 402 will be depicted with reference to Fig. 5.
  • the aforesaid request number to each Web service and the corresponding resource consumption of the Web server can also be obtained by reading dedicated learning sequence data, for example, reading pre-stored learning data from the metric data storage means 407.
  • the model learning means 404 substitutes the request number to each Web service at each sampling moment detected in the first detecting means 402 and the total resource consumption of the n Web services running on the Web server detected in the second detecting means 403 into expression (5) to obtain (n+1) equations corresponding to the (n+1) sampling moments, i.e. ti, t2, •• ⁇ , t n , t(ntl) :
  • Qt 1n+D AS ⁇ RS I t 1n+I , + AS 2 *Rs 2 t,n t i) +..ASi*RSi t , nt i,...+As n *Rs n t (n+ i>+Ao ( 6 )
  • Qti in the first expression represents the resource consumption of the n Web services at sampling moment ti
  • Qt 2 in the second expression represents the resource consumption of the n Web services at sampling moment t 2/ ..., and so on.
  • Asi represents the resource consumption ratio of the (i)th Web service
  • Rsiti represents the request number to the (i)th Web service at sampling moment tj.
  • Rsit 2 represents the request number to the (i)th Web service at sampling moment t 2 , ..., and so on.
  • Qsi represents resources consumed by the (i)th Web service
  • Qt represents the resource consumption of the Web server consumed by the n Web services at sampling moment t
  • * represents multiplying relation
  • As 3. represents the resource consumption ratio of the (i)th Web service
  • RSit represents the request number to the (i)th Web service at sampling moment t.
  • the calculated resource consumption ratio As 1 of these Web services can be checked in the resource consumption ratio checking means 405. Since the resource consumption ratio As 1 represents the ratio of resources consumed by each Web service on the Web server, it should meet:
  • the resource consumption ratio checking means 405 will send a notification signal to the first detecting means 402 and the second detecting means 403 to cause the first detecting means 402 and the second detecting means 403 to re-detect the request number to each Web service and the resource consumption of the n Web services running on a Web server at each sampling moment in a period of learning, so that the resource consumption ratio of each Web service can be re-calculated.
  • the first detecting means 402 will detect the current request number to each Web service, the model learning means 404 will calculate the resource consumption of the n Web services currently co-existing on the Web server, and the analyzing means 406 will analyze whether the current resource amount of the Web server meets the requirements of all the currently co-existing Web services, and then corresponding treatment will be taken.
  • the model learning means 404 calculates the resource consumption of the n Web services currently co-existing on the Web server. Specifically, the request number to each Web service at moment t(n+2) is substituted into expression (7) so that the resource consumption of each Web service at moment t (n+2 ) and the resource consumption Qt (n+2) of the n Web services can be calculated.
  • the analyzing means 406 comprises a judging means 4061 and an adjusting means 4062.
  • the judging means 4061 decides whether the current resource amount of the Web server meets the requirements of all the currently Co-existing Web services according to the resource consumption of the currently co-existing n Web services calculated by the model learning means 404.
  • the resource consumption of the n Web services co-existing currently is compared with a predetermined threshold. If the resource consumption exceeds the predetermined threshold, it means that the resource consumption of the n Web services co-existing currently will go beyond a predetermined resource use range.
  • the adjusting means 4062 re-adjusts resources, that is, to re-adjust the resource consumption ratio corresponding to each Web service. For example, the resource consumption ratio of the Web service with more requests may be adjusted to be smaller while the resource consumption ratio of the Web service with fewer requests is adjusted to be larger, so that the resource consumption of these Web services will not go beyond the predetermined resource use range. If the resource consumption of the n Web services co-existing currently does not exceed the predetermined threshold, it means that current allocation of resources can meet requirements of each Web service.
  • a predetermined threshold may be set for the resource consumption of n Web services, e.g. 100%; or a predetermined threshold may be set for the resource consumption of a certain Web service, e.g. 30%; or thresholds may be set for both of them at the same time. It is to be understood that selection of the predetermined threshold does not constitute limitation to the present invention.
  • Fig. 5 schematically shows the inner structure of the first detecting means in Fig. 4.
  • the first detecting means 402 comprises intercepting means 4021, parsing means 4022 and counting means 4023.
  • the intercepting means 4021 intercepts at each sampling moment each SOAP message reaching the Web server. Specifically, SOAP routing handler is added to the engine receiving a SOAP message. Next, an intercepted SOAP message is parsed by the parsing means 4022. That is to say, according to the SOAP standard, the aforesaid SOAP routing handler parses part of the SOAP protocol (mainly the SOAP header) to obtain the end corresponding to SOAP, which represents the name and accessed address of a concrete Web service. Then, the counting means 4023 obtains the request number to each Web service by respectively counting the requests to each Web service.
  • SOAP protocol mainly the SOAP header
  • the present invention has been described above by taking a plurality of Web services co-existing on a Web server for example.
  • the present invention is not limited to this but is also suitable for the circumstance in which a plurality of applications are co-existing on one electronic equipment. It is only an example that a plurality of applications are co-existing in a process as described above. As a matter of fact, the present invention is even suitable for the circumstance in which there is only one task running on an electronic equipment and the resource consumption of a plurality of applications running on the task is already known.
  • the present invention is of special practical value in a computer system which Grid technology supports. Since Grid technology allows multiple computers or servers to share their resources and services, adjusting resources for each application among different computers or servers is an extremely effective measure to improve the network performance.
  • the above-mentioned resources include, but are not limited to, CPU resources and memory resources.
  • the device of dynamically adjusting resources according to an embodiment of the present invention can be easily realized in an existing Web server without interrupting Web services. Moreover, the device of dynamically adjusting resources can provide more information for an administrator of Web services, and the method and device of dynamically adjusting resources are suitable for all kinds of Web services and platforms. The implementation of the method and device of dynamically adjusting resources of the present invention is irrelevant to the implementation of platforms, programming languages, middleware and applications.
  • the present invention is described specifically on the basis of the circumstances of Web server, however, the subject matter that the invention seeks to protect can be implemented in any of information technology systems.
  • Those ordinary skilled in the field of computing may understand that there is correlation between the disclosed embodiments and the various computing circumstances as described.
  • the disclosed method can be implemented by software, hardware or the combination thereof.
  • hardware can be implemented with dedicated logic; software, which might be stored in memory, can be performed by a proper instructions performing system, such as microprocessor, personal computer or mainframe.

Abstract

A method for dynamically adjusting resources among a plurality of co-existing applications comprises the steps of : building a relation model (S101) between a request number and resource consumption of said plurality of applications; obtaining at multiple sampling moments a request number (S102) and resource consumption (S103) of each of said plurality of applications; calculating resource consumption ratio (S104) of each of said plurality of applications; and analyzing resource consumption of a plurality of currently co-existing applications (S106).

Description

METHOD AND DEVICE FOR DYNAMICALLY ADJUSTING RESOOKCES
FIELD OF THE INVENTION
The present invention relates to the dynamic adjustment of resources among multiple co-existing applications.
BACKGROUND OF THE INVENTION
With the development of computer and network technology, it is a common sight nowadays that multiple applications co-exist on the same electronic equipment. For example, multiple applications co-exist on the same computer or multiple Web services are simultaneously provided on one Web server .
Every application needs to occupy resources in an electronic equipment, such as central processing unit (CPU) resources, memory resources and the like. However, the amount of resources needed by every application varies due to specific properties of every application, e.g. time complexity, space complexity and the like. Thus, an electronic equipment wherein a plurality of applications co-exist is often faced with a difficult problem of how to rationally adjust resources among all applications.
To guarantee normal running of every application even under extreme conditions, a usual practice is to provide a maximum of resources for every application. Obviously, such a practice will cause part of resources to leave idle during normal running, thereby a waste phenomenon.
Web service is a brand-new distributed computing environment, the basic idea of which is to utilize open standard technology to realize distributed software development, software engineering and software use on the Internet. Here, "service" refers to various software distributed on the Internet. Any user, whether he is an entity user or an individual user, can use Web service technology to schedule existing service software on the Internet and construct his own application software on the basis of service demands. Scheduling can be performed among application software of different entities and among application software of customers and entities via the Web service technology, so as to support e-business, customer relation model and other applications. Although the meaning of the Web service technology is completely different from that of common Web which is only capable of providing data service, the adopted protocols and interfaces belong to the Web technology which has already in common use. The basic technology of Web service is XML, which is defined by W3C. XML is an extensible markup language for the use of data description. Different from hypertext markup language (HTML) , XML describes only data contents per se but does not touch upon data display, so it can be used to describe any generalized contents. In Web services, it is XML that is used to describe remote scheduling operation and its enforcement results. This description is loaded in the simple object access protocol (SOAP) , and a SOAP message is usually transmitted in the most common HTTP. Since XML text description is irrelevant to enforcement, interactions can be conducted via Web services among platforms employing different operating systems and different programming languages. The release and description of Web services per se is also implemented using XML.
Service level agreements (SLA) are commonly used by VJeb service providers to define their contractual service obligations to their customers. These contractual service obligations usually include response time commitments. If the service obligations are not met, the customer might be entitled to be reduction in the fees owned to the Web service provider, web service providers are therefore highly motivated to meet the commitments in their SLAs.
To meet the SLAs, the Web service providers often provide excess capacity when statically provisioning resources for their customers because it is hard to accurately predict demands, especially for the scenarios that some customers may occasionally experience a "web storm", i.e. a dramatic temporary surge in demand. For example, a particular web service may experience a heavy request volume caused by an emergent event. The statically provisioned resources according to the average requirements can not deal with such a situation, which will cause the SLA violations. If excess capacity is provided in order to meet the SLAs under various sudden conditions, a great cost of resources is thus paid, which is unacceptable to the Web service provider.
Therefore, there is a need to dynamically adjust resources, i.e., to dynamically adjust resources allocated to each Web service according to requirements of each Web service on resources, so as to accomplish rational adjustment of resources.
Though existing electronic equipment can provide real-time reports on resource consumption, they are merely able to provide the total amount of resources consumed by the entire progress in a unit of progress or task. It is hard to provide resources consumed by each application respectively when multiple applications co-exist in one process or task. That is to say, the prior art is just able to obtain the amount of resources jointly- consumed by several applications, but can not possess the proportion of resources consumed by each application in the process or task, i.e. resource consumption ratio. Therefore, in this case, existing electronic equipment cannot dynamically adjust resources among each application in one process or task on the basis of the prediction of resource consumption amount .
Besides, Grid technology which has developed lately allows enterprises to share resources as they form "virtual organizations". That is, the enterprises, which may be geographically widespread locations and which may have heterogeneous computing platforms, share their resources and services to form virtual services. The open grid services architecture (OGSA) developed by academic and scientific communities, along with commercial entities (such as IBM) is an evolution of Grid technology. OGSA enables the grid to provide the enterprises with a set of extensible services collected by the virtual organizations.
The emerging Web service resource framework (WSRF) technology is also an evolution of Grid technology. All services on Grid will be Web services with WS-Resources if the business logics of the Web services have a certain state. Thus, in this disclosure, Grid services are called "Web services" to represent the service provisioned on Grid.
Compared to the normal computing noncommercial Grids that have sufficient resources available to satisfy all concurrent users, commercial grids profit by maintaining the smallest possible set of resources that can still meet user demand. To do so, such commercial grids must efficiently manage competing demands for the same resource set. Also, since the customers of commercial Grids pay for resources, they will not tolerate being denied service or being rescheduled to a different time slot, i.e., they will not tolerate the SLA violation. Thus, it is necessary that Web services providers dynamically provision on the nodes Web services provisioned on these grids according to the present resource consumption and demands. When the demands of one Web service increase to the peak which the present resources assigned can not meet, this Web service can be remote cloned to other available nodes in the commercial Grids to get additional resources. When the demands drop down, the Web service can be moved from the node to release the additional resource for other Web services. Thus, it is necessary to observe or monitor the resource consumption of Web services for management, especially SLA enforcement. While existing tools of Web services (for example, tools developed by AmberPoint Company) focus on the measurement of SLA/Quality of Service (QoS) such as performance and security that is not enough for the Web services dynamically provisioning based on commercial Grids. Firstly, the SLA measurement can not indicate the root cause of the SLA violation, so that correct remedial measures can not be taken. For example, Web service A is found to be with a delayed response time by the SLA measurement tool, and then it is decided to schedule more resources for Web service A; while the real reason causing the response time delay of Web service A may be that the resource required by Web service A has been consumed or exhausted by Web service B running on the same electronic equipment/server. Secondly, even the detected SLA violation can be used for dynamically provisioning of the Web service, the SLA has already been violated and the customer has experienced the degraded quality of service before supplying the gap by allocating additional resources.
Therefore, the resource consumption of Web services should be measured to support the dynamical adjusting of resources, i.e., to proactively determine when and which Web service should be dynamically provisioned (remote cloned or moved) in commercial grids.
SUMMARY OF THE INVENTION
It is an object of the present invention to provide a method and device for dynamically adjusting resources among a plurality of co-existing applications.
To this end, according to an aspect of the present invention, a method of dynamically adjusting resources among a plurality of co-existing applications is provided, the method including the steps of: building a relation model between a request number and resource consumption of said plurality of applications; obtaining at multiple sampling moments a request number and resource consumption of each of said plurality of applications; calculating resource consumption ratio of each of said plurality of applications; and analyzing resource consumption of a plurality of currently co-existing applications.
According to another aspect of the present invention, a device for dynamically adjusting resources among a plurality of applications is provided, the device comprising: model building means for building a relation model between request numbers and resource consumptions of said plurality of applications; first detecting means for obtaining at multiple sampling moments the request numbers of each of said plurality of applications; second detecting means for obtaining at said multiple sampling moments resource consumptions of said plurality of applications; model learning means for calculating resource consumption ratio of each of said plurality of applications; and analyzing means for analyzing resource consumptions of a plurality of co-existing applications.
With the method and device provided by the present invention, resource consumption ratios of a plurality of co-existing applications can be obtained, and further, resource consumption of each of currently co-existing applications can be calculated according to the obtained resource consumption ratio, so that dynamically adjusting of resources is realized.
The present application is especially suitable for management of Web services on a Web server, and is of great significance for satisfaction with service level agreement of Web services and evaluation of Web services .
Other features and advantages of the present invention will become more apparent from the following description, taken in conjunction with the accompanying drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
Embodiments of the invention will now be described, by way of example only, with reference to the accompanying drawings, in which:
Fig. 1 is a flow chart of a method of dynamically adjusting resources according to a first embodiment of the present invention;
Fig 2 is a detailed flow chart of detecting request number in the period of learning;
Fig. 3 is a detailed flow chart of analyzing resource consumption of a plurality of currently co-existing applications;
Fig. 4 is a layout view of a device of dynamically adjusting resources according to a second embodiment of the present invention; and
Fig. 5 is a schematic inner layout view of a first detecting means shown in Fig. 4. In the accompanying drawings, the same elements and components are represented by identical reference numerals, repeated depiction thereof being omitted.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
Hereinafter, the present invention will be described with reference to the accompanying drawings illustrating the preferred embodiments thereof which take n (n ≥ 2) Web services running on a Web server for example, wherein resource consumption of said n Web services on the Web server can be measured.
Fig. 1 is a flow chart of a method of dynamically adjusting resources according to a first embodiment of the present invention.
As shown in Fig. 1, in step SlOl, a relation model on the relationship between request numbers of n Web services and resource consumptions of these Web services running on a Web server is established. Such a relation model is called Request Number Resource Consumption (RN-RC) relation model for short.
In the present embodiment, a linear statistic model is taken for example.
As illustrated by expression (1) , the linear statistic model uses different resource consumption ratios to represent the resource consumption ratio of each Web service:
Q=ASi*RSi+ As2*RS2+...+Asi*Rsi+...+Asn*RSn+A0 (1)
In expression (1) , Q represents the resource consumption of n Web services on the Web server; * represents multiplying relation; Asi is the resource consumption ratio of the (i)th Web service, which represents the ratio of resources on the Web server consumed by the (i)th Web service with respect to each request of the (i)th Web service, i = 1, 2, ..., n, wherein n represents the number of Web services running on the Web server; Rsi represents the request number for the (i)th Web service; and A0 represents a resource consumption constant of the Web server.
After the aforesaid RN-RC relation model is determined, the flow shown in Fig. 1 goes to step S102. In step S102, the request number to each Web service at multiple moments in a determined period of time is detected. Then in step S103, the corresponding resource consumption situation for the Web server at multiple moments in the above determined period of time is detected. In the present embodiment, the above determined period of time is called period of learning, and multiple moments in the period of learning are called sampling moments. In the present embodiment, since there are n Web services running on the Web server, the request number to each Web service is respectively detected at (n + 1) sampling moments, i.e. ti, t2, ..., tn, t(n + i), and the total resource consumption of the n Web services is detected at each sampling moment. The detailed flow of step S102 will be depicted with reference to Fig. 2.
Those ordinary skilled in the art should understand that, the aforesaid request number to each Web service and the resource consumption of the n Web services running on the Web server can also be obtained by reading dedicated learning sequence data, for example, reading pre-stored learning data from metric data buffer.
Afterwards, the flow shown in Fig. 1 goes to step S104. In step S104, the request number to each Web service at each sampling moment detected in step S102 and the total resource consumption of the n Web services running on the Web server detected in step S103 are respectively substituted into expression (1) to obtain (n + 1) equations corresponding to the (n + 1) sampling moments, i.e. ti, t2, ..., tn/ t(n + ll:
Qt1=As1*Rs1t1+ AS2*RS2ti+..ASi*RSiti...+ASn*Rsnti+Ao Qt2=As1*Rs1t2+ AS2*Rs2t2+..ASi*RSit2...+ASn*Rsnt2+Ao
Qt,ntl)=As1*Rs1t,ntl)+ As2*Rs2t(n+1)+..ASi*RSi. t,n+1)...+Asn*Rsnt(ntl)+Ao (2)
where, * represents multiplying relation, Qt1 in the first expression represents the resource consumption of the n Web services at sampling moment ti, Qt2 in the second expression represents the resource consumption of the n Web services at sampling moment t2, ..., and so on and so forth; Asi represents the resource consumption ratio of the (i)th Web service; Rsitl represents the request number to the (i)th Web service at sampling moment tl; Rsit2 represents the request number to the (i)th Web service at sampling moment t2, ..., and so on.
Using the above expression (2), the resource consumption ratio, i.e. Asi of each Web service can be calculated, wherein i = 1, 2, ..., n, so that resources consumed by each Web service and the resource consumption of the n Web services are obtained as follows: Qt=ASi*RSit+AS2*Rs2t+...+ASi*RSit+...+Asn*RSnt+A0 ( 3 )
where, Qsi represents resources consumed by the (i)th Web service at sampling moment t; Qt represents the resource consumption of the Web server consumed by the n Web services at sampling moment t; * represents multiplying relation; As1 represents the resource consumption ratio of the (i)th web service; and Rsit represents the request number to the (i)th Web service at sampling moment t. As for the resource consumption represented by expression (3), the resource consumption ratio Asi (i=l ... n) of each Web service is a known number obtained after calculation according to expression (2) .
The calculated resource consumption ratio Asi of these Web services can be checked in step S105. Since the resource consumption ratio Asi represents the ratio of resources consumed by each Web service on the Web server, it should meet:
O≤ASi≤l (4)
If the resource consumption ratio As1 does not meet the above-stated condition, the flow shown in Fig. 1 will return to steps S102 and S103 (not shown in Fig. 1) to re-detect the request number and resource consumption of each Web service at each sampling moment in the period of learning and to thereby re-calculate the resource consumption ratio of each Web service. The above resource consumption ratio might vary with changes of the system configuration, and then, the corresponding resource consumption ratio needs to be obtained by re-learning in the period of learning .
Of course, if comsumption ratio is not required to be validated, , the step S105 can be omitted. For example, the validation can be that the consumption ratio must be greater than zero or not be negative.
After the resource consumption ratio is checked to measure up, the flow shown in Fig. 1 goes to step S106. In step S106, according to the resource consumption ratio calculated in step S104 and the currently detected request number to each Web service, the resource consumption of a plurality of Web services currently co-existing on the Web server is analyzed and corresponding treatment is taken. Of course, according to the resource consumption ratio calculated in step S104 and the request number to each Web service which is about to co-exist on the Web server, the resource consumption of a plurality of Web services which are about to co-exist the Web server is analyzed and corresponding treatment is then taken.
Referring to Fig. 3, step S106 will be depicted in detail.
Through the method as shown in Fig. 1, an electronic equipment can, according to the resource consumption ratios of a plurality of Web services obtained in the period of learning, analyze the resource consumption of the plurality of Web service co-existing on the Web server after the period of learning, perform processing based on the analysis results and then accomplish the dynamic adjusting of resources.
Hereinafter, detailed depiction will be given to step S102 shown in Fig. 1 with reference to Fig. 2.
Step S102 shown in Fig. 1 can be divided into steps S1021, S1022, and S1023. In step S1021, each simple object access protocol (SOAP) message reaching the Web server is intercepted at each sampling moment, the message serving as the request to each Web service. Specifically, a SOAP routing handler is added to the engine receiving a SOAP message. Next, an intercepted SOAP message is parsed in step S1022. In step S1022, according to the SOAP standard, the aforesaid SOAP routing handler parses part of SOAP protocol (mainly the SOAP header) to obtain the end corresponding to SOAP, which represents the name and accessed address of a concrete Web service. Then, in step S1023, the request number to each Web service is obtained by respectively counting the requests to each Web service.
The detailed flow of step S106 shown in Fig. 1 is as shown in Fig. 3. After the resource consumption ratio of each Web service, which is checked to be measure up, is obtained in step S105 shown in Fig. 1, the resource consumption requests to Web service at the current moment ttn t2) is detected in step S1061 shown in Fig. 3. When the above resource consumption requests are detected, the current request number to each Web service is detected in step S1062 according to these resource consumption requests. This detecting method is the same as the detecting method shown in Fig. 2, so details are omitted here. After that, the flow as shown in Fig. 3 goes to step S1063. In step S1063, the current resource consumption of Web service (s) is (are) calculated according to the current request number to each Web service and the resource consumption ratio of each Web service obtained in the preceding step. Specifically, the request number of each Web service at moment t(n*2) is substituted into expression (3) so that the resource consumption of each Web service and the resource consumption Qt(nt2> of the n Web services at moment t(ru2) can be calculated. Of course, if the request number of each Web service which is about to co-exist with other Web service at a certain moment is known, the resource consumption of each Web service and a plurality of Web services at this moment can also be calculated.
After finishing calculation of the resource consumption, the flow as shown in Fig. 3 goes to step S1064. In step S1064, the calculated resource consumption of a plurality of currently co-existing Web services is compared with its predetermined threshold. If the resource consumption exceeds the predetermined threshold, it means that the resource consumption of a plurality of currently co-existing Web services will go beyond a predetermined resource use range, and then the flow as shown in Fig. 3 will go to step S1065. In step S1065, resources are re-adjusted, that is, the resource consumption ratio corresponding to each Web service is re-adjusted. For example, the resource consumption ratio of the Web service with more requests may be adjusted to be smaller while the resource consumption ratio of the Web service with fewer requests may be adjusted to be larger, so that the resource consumption of these Web services will not go beyond the predetermined resource use range. If the predicted resource consumption in step S1064 does not exceed the predetermined threshold, it means that current resources can meet requirement of each Web service.
A predetermined threshold may be set for the resource consumption of n Web services, e.g. 100%; or a predetermined threshold may be set for the resource consumption of a certain Web service, e.g. 30%; or thresholds may be set for both of them at the same time. It is to be understood that selection of numeric values of the predetermined threshold does not constitute limitation to the present invention.
It should be noted that various RN-RC relation models can be built according to needs. In addition to the above-described linear statistic model, a non-linear statistic model of the neutral network, a generic algorithm model and the like can also be built. Selection of a concrete model type does not form limitation to the present invention.
The CPU resource consumption of a non-linear statistic model of the neural model can be illustrated, for example, as follows: (CPUSi)= BP (((RSitj) , CPUt3, CPU0)) | BP where CPUsi represents the CPU consumption of the (i)th service; RSitj represents the request number of the (i)th service at moment tj,- CPUtj represents the CPU consumption at moment t^; CPUo represents a CPU consumption constant of the Web server; BP is the back propagation learning algorithm for minimizing the mean-square error ^- , , , i = 1 ... n, j = 1 ... N; n represents the number of services running on the Web server; and N represents the number of sampling moments selected in the period of learning.
Figure imgf000013_0001
where i = 1 ... n; k = 1 ... M; j = 1 ... N; 1 = 1 ... n; gki and plk are parameters of the neural network, which can be adjusted by the back propagation learning algorithm in the course of learning, for minimizing the mean-square error ^*
Those skilled in the art should understand that the steps of the method of dynamically adjusting resources according to the first embodiment of the present invention can be specifically implemented in a computer program product associated with each step running on a processing system.
To realize the method of dynamically adjusting resources as shown in Fig. 1, a device of dynamically adjusting resources is further provided, wherein illustration is made still by taking n (n ≥ 2) Web services running on a Web server for example.
Fig. 4 schematically shows the structure of a device of dynamically adjusting resources according to a second embodiment of the present invention. The device 400 of dynamically adjusting resources is used for dynamically adjusting resources for each Web service in the case that only the resource consumption of a plurality of Web services can be known. The device 400 of dynamically adjusting resources may be directly integrated into a Web server, or serve as an external separate device of a Web server .
The device 400 of dynamically adjusting resources as shown in Fig. 4 comprises: model building means 401, first detecting means 402, second detecting means 403, model learning means 404, resource consumption ratio checking means 405, analyzing means 406 and metric data storage means 407.
The model building means 401 is used for building a relation model on the relationship between the resource consumption of n Web services co-existing on the Web server and the request number of each Web service, i.e. an RN-RC relation model. However, it is to be understood that various RN-RC relation models can be built according to needs, such as a linear statistic model, a non-linear statistic model of the neural network and a generic algorithm model . Selection of a concrete model type does not constitute limitation to the present invention.
In a linear statistic model, different resource consumption ratios are used to represent the proportion of resources consumed by each Web service, which is as illustrated by expression (5) :
As2*Rs2+...+Asi*Rsi+...+Asn*Rsπ+Ao (5)
In expression (5) , Q represents the resource consumption of n Web services on the Web server;* represents multiplying relation; Asi is the resource consumption ratio of the (i)th Web service, which represents the ratio of resources on the Web server consumed by the (i)th Web service with respect to each request of the (i)th Web service, i = 1, 2, ..., n, wherein n represents the number of Web services running on the Web server; Rsi represents the request number for the (i)th Web service; and A0 represents a resource consumption constant of the Web server.
The first detecting means 402 detects the request number to each Web service at multiple moments in a determined period of time. The second detecting means 403 detects the corresponding resource consumption situation for the Web server at multiple moments in the above determined period of time. Those skilled in the art can appreciate that the functions of the first detecting means 402 and the second detecting means 403 can also be performed by one means.
In the present embodiment, since there are n Web services co-existing on the Web server, the request number to each Web service is respectively detected at (n+1) sampling moments, i.e. ti, t2, ..., tn, tlntl), and the total resource consumption of the n Web services is detected at each sampling moment. The detailed structure of the first detecting means 402 will be depicted with reference to Fig. 5.
Those of ordinary skill in the art should understand that, the aforesaid request number to each Web service and the corresponding resource consumption of the Web server can also be obtained by reading dedicated learning sequence data, for example, reading pre-stored learning data from the metric data storage means 407. According to the statistic model built by the model building means 401, the model learning means 404 substitutes the request number to each Web service at each sampling moment detected in the first detecting means 402 and the total resource consumption of the n Web services running on the Web server detected in the second detecting means 403 into expression (5) to obtain (n+1) equations corresponding to the (n+1) sampling moments, i.e. ti, t2, ••■, tn, t(ntl) :
Qti=ASi*RSiti+ AS2*Rs2ti+..ASi*RSiti...+Asn*Rsnti+Ao
Figure imgf000015_0001
AS2*RS2t2+..ASi*RSit2...+Asn*RSnt2+Ao
Qt 1n+D =AS^RSIt1n+I, + AS2*Rs2t,nti) +..ASi*RSi t ,nti,...+Asn*Rsnt(n+i>+Ao ( 6 )
where, * represents multiplying relation; Qti in the first expression represents the resource consumption of the n Web services at sampling moment ti; Qt2 in the second expression represents the resource consumption of the n Web services at sampling moment t2/ ..., and so on. Asi represents the resource consumption ratio of the (i)th Web service; Rsiti represents the request number to the (i)th Web service at sampling moment tj.; Rsit2 represents the request number to the (i)th Web service at sampling moment t2, ..., and so on.
Using the above expression (6), the resource consumption ratio, i.e. Asi of each Web service can be calculated, wherein i = 1, 2, ..., n, so that resources consumed by each Web service and the resource consumption of the Web server are obtained as follows:
Qt=As1*RSit+As2*Rs2t+...+Asi*RSit+...+ASn*Rsnt+A0 (7 )
where, Qsi represents resources consumed by the (i)th Web service; Qt represents the resource consumption of the Web server consumed by the n Web services at sampling moment t; * represents multiplying relation; As3. represents the resource consumption ratio of the (i)th Web service; and RSit represents the request number to the (i)th Web service at sampling moment t. As for the resource consumption represented by expression (7), the resource consumption ratio As1 (i=l ... n) of each Web service is a known number obtained after calculation according to expression (6) . The calculated resource consumption ratio As1 of these Web services can be checked in the resource consumption ratio checking means 405. Since the resource consumption ratio As1 represents the ratio of resources consumed by each Web service on the Web server, it should meet:
O≤Asi≤l (8)
If the resource consumption ratio As1 does not meet the above-stated condition, the resource consumption ratio checking means 405 will send a notification signal to the first detecting means 402 and the second detecting means 403 to cause the first detecting means 402 and the second detecting means 403 to re-detect the request number to each Web service and the resource consumption of the n Web services running on a Web server at each sampling moment in a period of learning, so that the resource consumption ratio of each Web service can be re-calculated.
If the resource consumption ratio Asi meets the above-stated condition, the first detecting means 402 will detect the current request number to each Web service, the model learning means 404 will calculate the resource consumption of the n Web services currently co-existing on the Web server, and the analyzing means 406 will analyze whether the current resource amount of the Web server meets the requirements of all the currently co-existing Web services, and then corresponding treatment will be taken.
According to the request number to each Web service at the current moment t(n+2) detected by the first detecting means 402 and the calculated resource consumption ratio of each Web service, the model learning means 404 calculates the resource consumption of the n Web services currently co-existing on the Web server. Specifically, the request number to each Web service at moment t(n+2) is substituted into expression (7) so that the resource consumption of each Web service at moment t (n+2 ) and the resource consumption Qt (n+2) of the n Web services can be calculated.
The analyzing means 406 comprises a judging means 4061 and an adjusting means 4062. The judging means 4061 decides whether the current resource amount of the Web server meets the requirements of all the currently Co-existing Web services according to the resource consumption of the currently co-existing n Web services calculated by the model learning means 404.
In the present embodiment, the resource consumption of the n Web services co-existing currently is compared with a predetermined threshold. If the resource consumption exceeds the predetermined threshold, it means that the resource consumption of the n Web services co-existing currently will go beyond a predetermined resource use range. Thus, the adjusting means 4062 re-adjusts resources, that is, to re-adjust the resource consumption ratio corresponding to each Web service. For example, the resource consumption ratio of the Web service with more requests may be adjusted to be smaller while the resource consumption ratio of the Web service with fewer requests is adjusted to be larger, so that the resource consumption of these Web services will not go beyond the predetermined resource use range. If the resource consumption of the n Web services co-existing currently does not exceed the predetermined threshold, it means that current allocation of resources can meet requirements of each Web service.
A predetermined threshold may be set for the resource consumption of n Web services, e.g. 100%; or a predetermined threshold may be set for the resource consumption of a certain Web service, e.g. 30%; or thresholds may be set for both of them at the same time. It is to be understood that selection of the predetermined threshold does not constitute limitation to the present invention.
Fig. 5 schematically shows the inner structure of the first detecting means in Fig. 4. The first detecting means 402 comprises intercepting means 4021, parsing means 4022 and counting means 4023.
The intercepting means 4021 intercepts at each sampling moment each SOAP message reaching the Web server. Specifically, SOAP routing handler is added to the engine receiving a SOAP message. Next, an intercepted SOAP message is parsed by the parsing means 4022. That is to say, according to the SOAP standard, the aforesaid SOAP routing handler parses part of the SOAP protocol (mainly the SOAP header) to obtain the end corresponding to SOAP, which represents the name and accessed address of a concrete Web service. Then, the counting means 4023 obtains the request number to each Web service by respectively counting the requests to each Web service.
The preferred embodiments of the present invention have been described above by taking a plurality of Web services co-existing on a Web server for example. However, the present invention is not limited to this but is also suitable for the circumstance in which a plurality of applications are co-existing on one electronic equipment. It is only an example that a plurality of applications are co-existing in a process as described above. As a matter of fact, the present invention is even suitable for the circumstance in which there is only one task running on an electronic equipment and the resource consumption of a plurality of applications running on the task is already known.
The present invention is of special practical value in a computer system which Grid technology supports. Since Grid technology allows multiple computers or servers to share their resources and services, adjusting resources for each application among different computers or servers is an extremely effective measure to improve the network performance.
The above-mentioned resources include, but are not limited to, CPU resources and memory resources.
The device of dynamically adjusting resources according to an embodiment of the present invention can be easily realized in an existing Web server without interrupting Web services. Moreover, the device of dynamically adjusting resources can provide more information for an administrator of Web services, and the method and device of dynamically adjusting resources are suitable for all kinds of Web services and platforms. The implementation of the method and device of dynamically adjusting resources of the present invention is irrelevant to the implementation of platforms, programming languages, middleware and applications.
The present invention is described specifically on the basis of the circumstances of Web server, however, the subject matter that the invention seeks to protect can be implemented in any of information technology systems. Those ordinary skilled in the field of computing may understand that there is correlation between the disclosed embodiments and the various computing circumstances as described. Further, the disclosed method can be implemented by software, hardware or the combination thereof. Thus, hardware can be implemented with dedicated logic; software, which might be stored in memory, can be performed by a proper instructions performing system, such as microprocessor, personal computer or mainframe.
As the present invention has been described with reference to the embodiments devised currently, it is to be understood that the present invention is not limited to the disclosed embodiments. Instead, the present invention is intended to cover various alterations and equivalent arrangements within the scope of the appended claims. The scope of the following claims conforms to the broadest interpretation, so as to include all such alterations, equivalent structures and functions.

Claims

CU-XMS
1. A method for dynamically adjusting resources among a plurality of co-existing applications, the method including the steps of:
a) building a relation model between request numbers and resource consumptions of said plurality of applications;
b) obtaining at multiple sampling moments the request numbers and resource consumptions of each of said plurality of applications;
c) calculating resource consumption ratio of each of said plurality of applications; and
d) analyzing resource consumptions of a plurality of co-existing applications .
2. The method as claimed in claim 1, wherein said relation model between the request numbers and the resource consumptions is a linear statistic model expressed as follows:
Q= ASi*RSi+ AS2*RS2+...+ASi*RSi+.M+Asn*RSn+Ao
where Q represents the resource consumption of said plurality of applications; * represents multiplying relation; n represents the number of said plurality of co-existing applications; ASi is the resource consumption ratio of the (i)th application, i = 1, 2, ..., n; Rsi represents the request number for the (i)th application; and A0 represents a resource consumption constant.
3. The method as claimed in claim 1, wherein step b) includes the steps of:
intercepting each simple object access protocol message;
parsing said intercepted message so as to obtain the name of an application corresponding to said intercepted message; and
obtaining the request number for each of said applications by respectively counting said message.
4. The method as claimed in claim 1, wherein in step b) , the respective request number and resource consumption of each of said plurality of applications are obtained by reading dedicated learning sequence data.
5. The method as claimed in claim 2, wherein in step c) , the respective request number and resource consumption of each of said plurality of applications are respectively substituted into said linear statistic model to calculate said resource consumption ratio.
6. The method as claimed in claim 1, wherein step d) includes the steps of:
obtaining the respective request number of each of said plurality of applications at the current moment or a moment in future;
calculating the resource consumption of said plurality of applications;
comparing said resource consumption with a predetermined threshold; and
adjusting the resource consumption ratios of at least part of applications of said plurality of applications when said resource consumption is larger than said predetermined threshold.
7. The method as claimed in claim 1, further including the step of checking whether said resource consumption ratio meets the following condition:
0 ≤ resource consumption ratio ≤ 1.
8. The method as claimed in claim 1, wherein said plurality of applications are co-existing on one electronic equipment.
9. The method as claimed in claim I1 wherein said plurality of applications are co-existing on one Web server.
10. The method as claimed in claim 9, wherein said applications are Web services .
11. A device (400) for dynamically adjusting resources among a plurality of co-existing applications, the device comprising means for carrying out the steps o£ any of claims 1 to 10.
12. A computer program for dynamically adjusting resources among a plurality of co-existing applications comprising program code executable in a computer to perform the steps of any of claims 1 to 10.
PCT/EP2006/066298 2005-09-19 2006-09-12 Method and device for dynamically adjusting resources WO2007033922A2 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/067,454 US7949756B2 (en) 2005-09-19 2006-09-12 Method and apparatus for monitoring web services resource utilization

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN200510104803.2 2005-09-19
CNA2005101048032A CN1936849A (en) 2005-09-19 2005-09-19 Resource dynamic regulation method and apparatus

Publications (3)

Publication Number Publication Date
WO2007033922A2 true WO2007033922A2 (en) 2007-03-29
WO2007033922A3 WO2007033922A3 (en) 2007-05-10
WO2007033922B1 WO2007033922B1 (en) 2007-06-21

Family

ID=37728286

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/EP2006/066298 WO2007033922A2 (en) 2005-09-19 2006-09-12 Method and device for dynamically adjusting resources

Country Status (3)

Country Link
US (1) US7949756B2 (en)
CN (1) CN1936849A (en)
WO (1) WO2007033922A2 (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2008119650A3 (en) * 2007-03-29 2008-11-27 Ibm Dynamic learning in redesigning a composition of web services
EP2115581A1 (en) * 2007-03-02 2009-11-11 Pegasystems Inc. Proactive performance management for multi-user enterprise software systems
US8479157B2 (en) 2004-05-26 2013-07-02 Pegasystems Inc. Methods and apparatus for integration of declarative rule-based processing with procedural programming in a digital data-processing evironment
US8880487B1 (en) 2011-02-18 2014-11-04 Pegasystems Inc. Systems and methods for distributed rules processing
US8924335B1 (en) 2006-03-30 2014-12-30 Pegasystems Inc. Rule-based user interface conformance methods
US9003416B2 (en) 2010-09-29 2015-04-07 International Business Machines Corporation Predicting resource requirements for a computer application
US9098799B2 (en) 2007-03-29 2015-08-04 International Business Machines Corporation Tooling for implementing business processes using web services
US9195936B1 (en) 2011-12-30 2015-11-24 Pegasystems Inc. System and method for updating or modifying an application without manual coding
US9678719B1 (en) 2009-03-30 2017-06-13 Pegasystems Inc. System and software for creation and modification of software
US10467200B1 (en) 2009-03-12 2019-11-05 Pegasystems, Inc. Techniques for dynamic data processing
US10469396B2 (en) 2014-10-10 2019-11-05 Pegasystems, Inc. Event processing with enhanced throughput
US10698647B2 (en) 2016-07-11 2020-06-30 Pegasystems Inc. Selective sharing for collaborative application usage
US10698599B2 (en) 2016-06-03 2020-06-30 Pegasystems, Inc. Connecting graphical shapes using gestures
US11048488B2 (en) 2018-08-14 2021-06-29 Pegasystems, Inc. Software code optimizer and method
US11567945B1 (en) 2020-08-27 2023-01-31 Pegasystems Inc. Customized digital content generation systems and methods

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7836092B2 (en) * 2006-05-12 2010-11-16 At&T Intellectual Property I, L.P. Systems, methods, and computer-readable media for providing information regarding communication systems
US20090070335A1 (en) * 2007-09-06 2009-03-12 Cozianu Costin V System and method for web processing monitoring
US8904394B2 (en) 2009-06-04 2014-12-02 International Business Machines Corporation System and method for controlling heat dissipation through service level agreement analysis by modifying scheduled processing jobs
US9354852B2 (en) * 2010-12-23 2016-05-31 Microsoft Technology Licensing, Llc Satisfying application dependencies
GB2494406A (en) * 2011-09-06 2013-03-13 Skype System to detect protocol discrimination by network provider in the event of communication problems
WO2013091159A1 (en) * 2011-12-19 2013-06-27 北京瑞星信息技术有限公司 Frequency conversion anti-virus technology
KR101590288B1 (en) * 2012-01-31 2016-01-29 지티이 (유에스에이) 인크. Method and system for managing multimedia quality of experience in a transport-independent fashion
US9374314B2 (en) * 2012-02-26 2016-06-21 Palo Alto Research Center Incorporated QoS aware balancing in data centers
CN102693395B (en) * 2012-06-07 2015-02-11 北京奇虎科技有限公司 Method and device for intercepting calling of application program for service
CN104102541B (en) * 2013-04-08 2018-02-13 鸿富锦精密工业(深圳)有限公司 Sign-off dynamic adjusting method and system
US9026652B1 (en) * 2014-07-09 2015-05-05 Fmr Llc Web service asset management and web service information storage
CN106776041B (en) * 2016-12-30 2020-01-14 Oppo广东移动通信有限公司 Data processing method and device
JP6710711B2 (en) * 2018-02-06 2020-06-17 日本電信電話株式会社 Estimating device, estimating method, and program
US10616136B2 (en) * 2018-04-19 2020-04-07 Microsoft Technology Licensing, Llc Utilization based dynamic resource allocation
CN109412829B (en) * 2018-08-30 2020-11-17 华为技术有限公司 Resource allocation prediction method and equipment
CN109614282B (en) * 2018-09-28 2022-06-03 东软集团股份有限公司 Method and device for determining system resource use information of application
CN111858015B (en) * 2019-04-25 2024-01-12 中国移动通信集团河北有限公司 Method, device and gateway for configuring running resources of application program
CN112991628B (en) * 2021-02-04 2022-07-26 福建乐摩物联科技有限公司 Method, system and medium for dynamically adjusting seat occupation time of shared massage chair

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040210871A1 (en) * 2003-04-16 2004-10-21 Fujitsu Limited Apparatus for adjusting use resources of system and method thereof
US20050107997A1 (en) * 2002-03-14 2005-05-19 Julian Watts System and method for resource usage estimation
US20050177635A1 (en) * 2003-12-18 2005-08-11 Roland Schmidt System and method for allocating server resources

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5805599A (en) * 1996-12-04 1998-09-08 At&T Corp. Adaptive channel allocation system for communication network
US6282570B1 (en) * 1998-12-07 2001-08-28 International Business Machines Corporation Monitoring a large parallel database through dynamic grouping and sequential sampling
AU2002311585A1 (en) * 2002-06-06 2003-12-22 Crescendo Networks Ltd. System and method for connecting multiple slow connections to multiple fast connections
US7519173B2 (en) * 2003-06-24 2009-04-14 International Business Machines Corporation Method for managing resources in a multi-channeled customer service environment
US7523041B2 (en) * 2003-09-18 2009-04-21 International Business Machines Corporation Method of displaying real-time service level performance, breach, and guaranteed uniformity with automatic alerts and proactive rebating for utility computing environment
US7650344B2 (en) * 2004-02-09 2010-01-19 Coremetrics, Inc. System and method of managing software product-line customizations
TWI276322B (en) * 2005-12-28 2007-03-11 Ind Tech Res Inst A system and a method of request scheduling using request classification and allowable server resource for the differentiated quality of service
JP2007183883A (en) * 2006-01-10 2007-07-19 Fujitsu Ltd Resource plan preparation program, recording medium with this program recorded thereon, and apparatus and method for preparing resource plan

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050107997A1 (en) * 2002-03-14 2005-05-19 Julian Watts System and method for resource usage estimation
US20040210871A1 (en) * 2003-04-16 2004-10-21 Fujitsu Limited Apparatus for adjusting use resources of system and method thereof
US20050177635A1 (en) * 2003-12-18 2005-08-11 Roland Schmidt System and method for allocating server resources

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
"Solaris 9 Resource Manager Software, A Technical White Paper" INTERNET CITATION, [Online] 3 June 2002 (2002-06-03), XP002291080 Retrieved from the Internet: URL:http://wwws.sun.com/software/whitepape rs/solaris9/srm.pdf> [retrieved on 2004-08-03] *

Cited By (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8479157B2 (en) 2004-05-26 2013-07-02 Pegasystems Inc. Methods and apparatus for integration of declarative rule-based processing with procedural programming in a digital data-processing evironment
US8959480B2 (en) 2004-05-26 2015-02-17 Pegasystems Inc. Methods and apparatus for integration of declarative rule-based processing with procedural programming in a digital data-processing environment
US9658735B2 (en) 2006-03-30 2017-05-23 Pegasystems Inc. Methods and apparatus for user interface optimization
US10838569B2 (en) 2006-03-30 2020-11-17 Pegasystems Inc. Method and apparatus for user interface non-conformance detection and correction
US8924335B1 (en) 2006-03-30 2014-12-30 Pegasystems Inc. Rule-based user interface conformance methods
US9189361B2 (en) 2007-03-02 2015-11-17 Pegasystems Inc. Proactive performance management for multi-user enterprise software systems
EP2115581A1 (en) * 2007-03-02 2009-11-11 Pegasystems Inc. Proactive performance management for multi-user enterprise software systems
EP2115581A4 (en) * 2007-03-02 2012-11-28 Pegasystems Inc Proactive performance management for multi-user enterprise software systems
WO2008119650A3 (en) * 2007-03-29 2008-11-27 Ibm Dynamic learning in redesigning a composition of web services
US9098799B2 (en) 2007-03-29 2015-08-04 International Business Machines Corporation Tooling for implementing business processes using web services
US9384445B2 (en) 2007-03-29 2016-07-05 International Business Machines Corporation Tooling for implementing business processes using web services
US7596583B2 (en) 2007-03-29 2009-09-29 International Business Machines Corporation Dynamic learning in redesigning a composition of web services
US10467200B1 (en) 2009-03-12 2019-11-05 Pegasystems, Inc. Techniques for dynamic data processing
US9678719B1 (en) 2009-03-30 2017-06-13 Pegasystems Inc. System and software for creation and modification of software
US9052954B2 (en) 2010-09-29 2015-06-09 International Business Machines Corporation Predicting resource requirements for a computer application
US9003416B2 (en) 2010-09-29 2015-04-07 International Business Machines Corporation Predicting resource requirements for a computer application
US8880487B1 (en) 2011-02-18 2014-11-04 Pegasystems Inc. Systems and methods for distributed rules processing
US9270743B2 (en) 2011-02-18 2016-02-23 Pegasystems Inc. Systems and methods for distributed rules processing
US9195936B1 (en) 2011-12-30 2015-11-24 Pegasystems Inc. System and method for updating or modifying an application without manual coding
US10572236B2 (en) 2011-12-30 2020-02-25 Pegasystems, Inc. System and method for updating or modifying an application without manual coding
US10469396B2 (en) 2014-10-10 2019-11-05 Pegasystems, Inc. Event processing with enhanced throughput
US11057313B2 (en) 2014-10-10 2021-07-06 Pegasystems Inc. Event processing with enhanced throughput
US10698599B2 (en) 2016-06-03 2020-06-30 Pegasystems, Inc. Connecting graphical shapes using gestures
US10698647B2 (en) 2016-07-11 2020-06-30 Pegasystems Inc. Selective sharing for collaborative application usage
US11048488B2 (en) 2018-08-14 2021-06-29 Pegasystems, Inc. Software code optimizer and method
US11567945B1 (en) 2020-08-27 2023-01-31 Pegasystems Inc. Customized digital content generation systems and methods

Also Published As

Publication number Publication date
US20090100172A1 (en) 2009-04-16
WO2007033922B1 (en) 2007-06-21
US7949756B2 (en) 2011-05-24
CN1936849A (en) 2007-03-28
WO2007033922A3 (en) 2007-05-10

Similar Documents

Publication Publication Date Title
US7949756B2 (en) Method and apparatus for monitoring web services resource utilization
Yu et al. Stochastic load balancing for virtual resource management in datacenters
US7349340B2 (en) System and method of monitoring e-service Quality of Service at a transaction level
US8015289B2 (en) System and method predicting and managing network capacity requirements
Ali-Eldin et al. An adaptive hybrid elasticity controller for cloud infrastructures
Rolia et al. Statistical service assurances for applications in utility grid environments
RU2526711C2 (en) Service performance manager with obligation-bound service level agreements and patterns for mitigation and autoprotection
CN114930293A (en) Predictive auto-expansion and resource optimization
Herbst et al. Ready for rain? A view from SPEC research on the future of cloud metrics
US8775624B2 (en) Load-balancing and technology sharing using Lempel-Ziv complexity to select optimal client-sets
US7668096B2 (en) Apparatus for modeling queueing systems with highly variable traffic arrival rates
Andreolini et al. Load prediction models in web-based systems
Andreolini et al. Models and framework for supporting runtime decisions in web-based systems
US20070250629A1 (en) Method and a system that enables the calculation of resource requirements for a composite application
Bacigalupo et al. Managing dynamic enterprise and urgent workloads on clouds using layered queuing and historical performance models
Ataie et al. Power-aware performance analysis of self-adaptive resource management in IaaS clouds
Goiri et al. Resource-level QoS metric for CPU-based guarantees in cloud providers
Kumar et al. Identifying and estimating technical debt for service composition in SaaS cloud
Bacigalupo et al. An investigation into the application of different performance prediction methods to distributed enterprise applications
Xiong et al. Sla-based service composition in enterprise computing
He et al. A WSLA-based monitoring system for grid service-GSMon
Breitgand et al. Derivation of response time service level objectives for business services
Diao et al. Control of large scale computing systems
Cope et al. Adapting grid services for urgent computing environments
Hühn et al. Performance modeling for SLA-compliant but cost-effective IT-service provisioning

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 12067454

Country of ref document: US

122 Ep: pct application non-entry in european phase

Ref document number: 06793465

Country of ref document: EP

Kind code of ref document: A2