WO2001039003A1 - Method for operating an integrated point of presence server network - Google Patents

Method for operating an integrated point of presence server network Download PDF

Info

Publication number
WO2001039003A1
WO2001039003A1 PCT/US2000/031939 US0031939W WO0139003A1 WO 2001039003 A1 WO2001039003 A1 WO 2001039003A1 US 0031939 W US0031939 W US 0031939W WO 0139003 A1 WO0139003 A1 WO 0139003A1
Authority
WO
WIPO (PCT)
Prior art keywords
web
server
servers
customer
network
Prior art date
Application number
PCT/US2000/031939
Other languages
French (fr)
Other versions
WO2001039003A9 (en
Inventor
Richard David Day
Eric Sven-Johnan Swildens
Ajit Gupta
Original Assignee
Speedera Networks, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from US09/645,067 external-priority patent/US6694358B1/en
Priority claimed from US09/648,420 external-priority patent/US7925713B1/en
Priority claimed from US09/644,927 external-priority patent/US6405252B1/en
Application filed by Speedera Networks, Inc. filed Critical Speedera Networks, Inc.
Priority to AU19243/01A priority Critical patent/AU1924301A/en
Publication of WO2001039003A1 publication Critical patent/WO2001039003A1/en
Publication of WO2001039003A9 publication Critical patent/WO2001039003A9/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/50Network service management, e.g. ensuring proper service fulfilment according to agreements
    • H04L41/5061Network service management, e.g. ensuring proper service fulfilment according to agreements characterised by the interaction between service providers and their network customers, e.g. customer relationship management
    • H04L41/5067Customer-centric QoS measurements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/50Network service management, e.g. ensuring proper service fulfilment according to agreements
    • H04L41/508Network service management, e.g. ensuring proper service fulfilment according to agreements based on type of value added network service under agreement
    • H04L41/5083Network service management, e.g. ensuring proper service fulfilment according to agreements based on type of value added network service under agreement wherein the managed service relates to web hosting
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/50Network service management, e.g. ensuring proper service fulfilment according to agreements
    • H04L41/508Network service management, e.g. ensuring proper service fulfilment according to agreements based on type of value added network service under agreement
    • H04L41/509Network service management, e.g. ensuring proper service fulfilment according to agreements based on type of value added network service under agreement wherein the managed service relates to media content delivery, e.g. audio, video or TV
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L43/00Arrangements for monitoring or testing data switching networks
    • H04L43/08Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
    • H04L43/0876Network utilisation, e.g. volume of load or congestion level
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L61/00Network arrangements, protocols or services for addressing or naming
    • H04L61/45Network directories; Name-to-address mapping
    • H04L61/4505Network directories; Name-to-address mapping using standardised directories; using standardised directory access protocols
    • H04L61/4511Network directories; Name-to-address mapping using standardised directories; using standardised directory access protocols using domain name system [DNS]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L61/00Network arrangements, protocols or services for addressing or naming
    • H04L61/45Network directories; Name-to-address mapping
    • H04L61/4541Directories for service discovery
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/06Protocols specially adapted for file transfer, e.g. file transfer protocol [FTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004Server selection for load balancing
    • H04L67/1008Server selection for load balancing based on parameters of servers, e.g. available memory or workload
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/1004Server selection for load balancing
    • H04L67/101Server selection for load balancing based on network conditions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1095Replication or mirroring of data, e.g. scheduling or transport for data synchronisation between network nodes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/56Provisioning of proxy services
    • H04L67/568Storing data temporarily at an intermediate stage, e.g. caching
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L69/00Network arrangements, protocols or services independent of the application payload and not provided for in the other groups of this subclass
    • H04L69/30Definitions, standards or architectural aspects of layered protocol stacks
    • H04L69/32Architecture of open systems interconnection [OSI] 7-layer type protocol stacks, e.g. the interfaces between the data link level and the physical level
    • H04L69/322Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions
    • H04L69/329Intralayer communication protocols among peer entities or protocol data unit [PDU] definitions in the application layer [OSI layer 7]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/02Protocols based on web technology, e.g. hypertext transfer protocol [HTTP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1001Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
    • H04L67/10015Access to distributed or replicated servers, e.g. using brokers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/14Session management
    • H04L67/142Managing session states for stateless protocols; Signalling session states; State transitions; Keeping-state mechanisms

Definitions

  • the present invention relates to world wide area networking. More particularly, the invention provides a technique including a method and system for using a global traffic management system coupled to a plurality of content servers for a service fee. But it would be recognized that the invention has a much broader range of applicability. For example, the invention can also be applied to operation on a private network, and the like.
  • the Internet is a world wide "super-network" which connects together millions of individual computer networks and computers.
  • the Internet is generally not a single entity. It is an extremely diffuse and complex system over where no single entity has complete authority or control.
  • the Internet is widely know for one of its ways of presenting information through the World Wide Web (herein "Web"), there are many other services currently available based upon the general Internet protocols and infrastructure.
  • Web World Wide Web
  • the Web is often easy to use for people inexperienced with computers.
  • Information on the Web often is presented on "pages" of graphics and text that contain "links" to other pages either within the same set of data files (i.e., Web site) or within data files located on other computer networks.
  • Users often access information on the Web using a "browser” program such as one made by Netscape Communications Corporation (now America Online, Inc.) of Mountain View, California or ExplorerTM from Microsoft Corporation of Redmond, Washington. Browser programs can process information from Web sites and display the information using graphics, text, sound, and animation.
  • the Web has become a popular medium for advertising goods and services directly to consumers.
  • Traffic generally refers to the transfer of information from a Web site at a server computer to a user at a client computer.
  • the traffic generally travels through the world wide network of computers using a packetized communication protocol, such as TCP/IP. Tiny packets of information travel from the server computer through the network to the client computer. Like automobiles during "rush hour" on Highway 101 in Silicon Valley, the tiny packets of information traveling through the Internet become congested. Here, traffic jams that cause a delay in the information from the server to the client occur during high usage hours on the Internet. These traffic jams lead to long wait times at the client location. Here, a user of the client computer may wait for a long time for a graphical object to load onto his/her computer.
  • the present invention relates to a computer server network. More particularly, the present invention relates to an integrated load balancing and content delivery computer network.
  • Embodiments of the present invention include a number of Linux machines (on other operating machines) running software available from the current assignee.
  • This software currently includes eight separate software components that are delivered as a single product.
  • the software When deployed across a large number of machines, the software creates a network of point of presence (POP) computers that provides a solution for integrated content hosting and traffic management.
  • POP point of presence
  • Customers using the POP network can store content such as HTML, images, video, sound, software, or the like on the network for fast and highly available access by clients (end users).
  • the network also provides load balancing and high availability for servers outside the network.
  • Customers with generated content such as search engines, auctions and shopping carts, can use the latter feature to add their own content servers to the network.
  • the computer network may be monitored using a standard web browser in some embodiments.
  • HTML interfaces are provided that display the computer networks current status as well as historical statistics.
  • a method for operating a network of point of presence servers sharing a hostname includes a user making a request for a web page with a domain name the POP network is authoritative for.
  • the user's computer sends a request for name resolution of the domain name they are requesting content from to the configured local DNS server (LDNS) for that users computer.
  • the LDNS is typically configured in close proximity to the user's computer.
  • the LDNS resolves the IP address of the domain name the user's computer has queried for by sending a query to the Authoritative DNS server (ADNS) for that domain name.
  • the POP network DNS servers (SPDNS) is designated as an ADNS for all domains the POP network provides service for.
  • the SPDNS On being asked to resolve the domain name from the LDNS on behalf of the user, the SPDNS first determines from the plurality of POP network servers and customer web servers the subset of servers addressable by the requested domain name.
  • addressable servers are servers that have been configured by either the customer or the POP network to satisfy user requests for content within the requested domain name. These servers have been defined in the POP network configuration as such.
  • the POP network selects the optimal server/servers for the user based on rules defined in the POP network configuration. These rules define how criteria such as POP/customer server health, latency from
  • POP/customer servers to the user making the request, packet loss experienced at each POP, cost associated with each POP/customer server, and the like. These factors are then combined to define the optimal server for a given user making a request for a given domain name.
  • persistence is defined in embodiments of the present invention for a domain name so that the user is returned to the same POP/customer server on subsequent requests within a definable time.
  • These rules for defining the optimal server are created on a per domain name basis allowing each domain name to be configured independently.
  • the next step includes the SPDNS returning the IP address of the optimal servers to the LDNS making the request, and the LDNS resolving the domain name for the user computer making the request for content with the IPs provided by the SPDNS.
  • the POP network servers are defined in the present embodiment.
  • POP network configuration to be addressable by a requested domain names that represent a customer's static or cacheable content, a static domain name.
  • the customer's servers are defined as addressable for domain names that represent dynamic or non-cacheable content, a dynamic domain name.
  • Resolution for either domain name type is performed by the same set of servers controlled and operated by the POP network, the SPDNS.
  • both domain name types leverage the same infrastructure for determining network and service load as well as configuration.
  • This technique allows the customer to organize their web site by functions where functions are mapped into domain names.
  • the functions have requirements that define what is needed by a server to satisfy the function. Some functions may only be limited by the ability to access a collection of static content while others require a proprietary application and access to a customer's database.
  • the domain names are then mapped into resources or servers that can be addressed by the domain name and that satisfy requests for the function that domain name has been defined to support.
  • This method of organization allows individual elements of a web page to be retrieved from different resources. Further, such embodiments provides the SPDNS the control to direct users to an available and optimal server for a given resource. This greatly improves the user experience by reducing connection time, download time, errors, and the like.
  • a method for operating a network of point of presence servers sharing a hostname includes receiving a request from a user for a web page at a first web address, the first web address including the hostname, determining traffic loads of a plurality of customer web servers, each of the customer web servers storing the web page, and determining a customer web server from the plurality of customer web servers that is appropriate for the request, the customer web server having a traffic load and/or latency lower than traffic loads of remaining customer web servers from the plurality of customer web servers.
  • the technique also includes determining an IP address of the customer web server, and providing the IP address of the customer web server to the user.
  • the method may thereafter include receiving a request from the user for static content on the web page at a second web address, the second web address including the hostname, and determining latency of point of presence servers in the network of point of presence servers. Determining the point of presence server from the network of point of presence servers that is appropriate for the request, the point of presence server having a latency and/or server load lower than latency of remaining point of presence servers from the network of point of presence servers, retrieving the static content from the point of presence server, and providing the static content to the user may also be performed.
  • a method for operating a network of point of presence servers includes receiving a first request from a client DNS server to resolve a first domain name, the client DNS server receiving a request from a user of a web page address that includes the first domain name, and determining load measurements of a plurality of customer web servers, each of the customer web servers addressable by the first domain name, and each of the customer web servers configured to service the request from the user.
  • the technique may also include determining a customer web server from the plurality of customer web servers, the customer web server having load measurements lower than load measurements (also including latency and other factors) of other customer web servers from the plurality of customer web servers latency, determining an IP address of the customer web server, and providing the IP address of the customer web server to the client DNS server.
  • the method typically includes receiving a second request from the client DNS server to resolve a second domain name, the client DNS server receiving a request from the user of a uniform resource locator that includes the second domain name, and sending the request to the authoritative DNS for this domain name (SPDNS) who determines performance metric measurement of point of presence servers in the network of point of presence servers, each of the point of presence servers addressable by the second domain name.
  • SPDNS domain name
  • determining a point of presence server from the network of point of presence servers the point of presence server having performance metric measurements lower than performance metric measurements of other point of presence servers from the network of point of presence servers, providing the IP address of the point of presence server to the client DNS server, retrieving data from the point of presence server in response to the uniform resource locator, and providing the data to the user are included.
  • FIG. 1 is an overview block diagram of an embodiment of the present invention
  • Fig. 2 illustrates a block diagram of a typical server machine according to an embodiment of the present invention
  • Figs. 3A-3C illustrate a flow chart according to an embodiment of the present invention
  • Fig. 4 illustrates a flow chart according to another embodiment of the present invention.
  • Fig. 5 illustrates a more detailed block diagram according to one embodiment.
  • Fig. 1 is an overview block diagram of an embodiment of the present invention.
  • Fig. 1 illustrates a series of "point of presence” (POP) servers 10, 20, and 30, customer web servers (POP servers) 40 and 45, a monitoring POP server 50, and an ISP cluster 60.
  • POP point of presence
  • POP servers customer web servers
  • monitoring POP server 50 monitoring POP server 50
  • ISP cluster 60 an ISP cluster 60.
  • the servers 10-50, and ISP 60 are typically coupled via a computer network 70, such as the Internet.
  • computer network 70 such as the Internet.
  • other types of computer networks may include other wide area networks, local area networks, such as intranets, and the like.
  • POP servers 10-30 are typically distributed around the world and together they form a POP network.
  • each POP server will include a plurality of dedicated servers, providing functionality as will be described below.
  • POP server 20 includes servers 65, 80-100.
  • server 65 operates as a probing server according to NetProbe software described below;
  • server 80 and 90 operate as a cache according to WebCache software described below;
  • server 100 operates as a content server according to Webserver software described below.
  • server 10 and 20 may include a server 110 and 120 that includes Domain Name Server (DNS) software, described below.
  • DNS Domain Name Server
  • NetProbe server software performs probes across computer network 70 to determine server load and availability (metrics), to perform content checks to ensure servers are delivering correct content, to determine latency between customer web sites and POP servers, and the like.
  • NetProbe software may operate continuously and typically sends results to
  • POP servers running NameServer software such as POP server 10 and 30.
  • measurements of server metrics may include response times for HTTP service, FTP service, customer web server CPU load, customer web server memory load and/or utilization, or the like. In other embodiment, other types of measurements are contemplated.
  • the WebCache server software responds to requests for static web content from clients, such as web browsers from memory.
  • clients such as web browsers from memory.
  • the WebCache software will generate a request to a Webserver.
  • Webserver sends the requested content, the
  • WebCache software caches the content in memory.
  • the Webserver software responds to requests for content from WebCache servers. If the content requested does not exist on disk, the Webserver software responds to requests for content from WebCache servers. If the content requested does not exist on disk, the Webserver software responds to requests for content from WebCache servers. If the content requested does not exist on disk, the Webserver software responds to requests for content from WebCache servers. If the content requested does not exist on disk, the Webserver software responds to requests for content from WebCache servers. If the content requested does not exist on disk, the
  • Webserver software will generate a request to a web server, that is known to include the content, for example, web server 130. Once the content is provided by web server 130, the Webserver software stores the content to disk and also returns the content to the
  • this server only gets request for WebCache "misses" so it does not require caching functionality.
  • DNS software performs a mapping from domain name to IP address.
  • the DNS software is queried to resolve a hostname, or the like, and returns an IP address of a web server that has the bandwidth to respond to the request. Further, in response to a query, the DNS software returns an IP address of a web server that can quickly provide web content.
  • the dedicated servers are typically coupled to the computer network 70 via a router.
  • fewer number of physical servers may be used to implement functionality of servers 65, 80-100, or the like.
  • a physical server may have multiple software packages operating upon it at one time.
  • WebCache software and Webserver software may operate as two different processes on a single server.
  • each POP server will include NetProbe, WebCache, and Webserver software.
  • only a limited number of POP servers will include DNS software, however, in other embodiments, the number of servers with DNS software may be increased.
  • customer web servers 40 and 45 typically include web servers 130 and 135 and servers 140 and 145.
  • web servers 130 and 135 provide static content, such as text and graphics
  • further servers 140 and 145 provide non-static content, or content that changes in response to a user query, or the like.
  • servers 140 and 145 may be search engines, databases, or the like.
  • customer POP web servers 40 and 45 are mirrors of each other, and are designed to improve the perceived responsiveness of a web site with respect to a user.
  • ISP 60 is illustrated including users 150-160, and a client DNS server 170.
  • ISP 60 may be a conventional ISP such as Earthlink, AOL, or the like. Accordingly, users 150 and 160 may be home users using dial-up lines, cable modems, DSL, or the like, to connect with ISP 60.
  • client DNS server 170 is typically provided within ISP 60. In the present embodiment, client DNS server 170 is typically used to resolve a host name into an IP address. If client DNS server 170 does not know the IP address, client DNS server 170 may pass the request to other DNS servers, such as DNS servers 110-120, or the like.
  • a monitoring POP server 50 may include a plurality of dedicated servers, providing functionality described herein.
  • monitoring POP server 50 includes servers 180-210.
  • server 180 operates as an monitoring display server according to NetView software described below;
  • server 190 operates as an administration server according to tools, together termed AdminTool software, described below;
  • servers 200 and 210 operate as logging servers according to LogServer software described below.
  • POP server 10 and 20 may include a server 110 and 120 that includes Domain Name Server (DNS) software, described below.
  • DNS Domain Name Server
  • NetView software provides an HTML interface to current and historical statistics for customers and network administrators.
  • the statistics about the POP network, and the like are typically derived from servers running LogServer software, such as servers 200 and 210, described below.
  • NetView software includes web server CGI programs to provide the HTML output.
  • the NetView software also provides an interface that allows customers to flush content from the POP network as they update the content on their web servers.
  • LogServer software receives log file data from servers running NameServer, WebCache, Webserver and NetProbes software, and the like.
  • LogServer software receives the log file data, and stores the data onto disk.
  • the LogServer software separates the data into appropriate categories and processes the data to generate statistics. As mentioned above, the data is thus used for monitoring the state of POP servers on the network.
  • AdminTool software various administrator tools, termed AdminTool software are also typically provided. These tools are used by the administrator to configure and administer the network of POP servers.
  • tools provides tools that "spider" a customer web site and load WebServers and caches with content.
  • monitoring POP server 50 In the present embodiment, a single monitoring POP server 50 is contemplated, however, multiple monitoring servers may be provided in alternative embodiments of the present invention. In the present embodiment, POP servers 10-30, and monitoring POP server
  • This set of client and server software programs includes a server that receives a global configuration file that specifies the architecture of the POP servers, and stores it locally.
  • POP servers will include multiple machines, each with a single purpose (DNS name server, for instance).
  • DNS name server for instance
  • POP servers 10-30 and customer web server
  • the POP network illustrated in Fig. 1 provides content hosting for static content, such as images, video, software, etc.
  • the POP network also provides for load balancing and traffic management for services, and dynamic content, that cannot be stored. Further detail regarding the process will be described below.
  • the POP server network may simultaneously provide traffic management, as well as content hosting to more than one specific customer.
  • customerl and customer2 share the POP server network.
  • POP server network may provide traffic load management by directing traffic to either customerl web server 40 or customerl web server 45.
  • POP server network may be used to host static content for customer2.
  • Still further POP server may provide both traffic management and content hosting, as will be illustrated and discussed below. Accordingly, the POP server network may be termed a "shared" server network.
  • Fig. 2 illustrates a block diagram of a typical server machine 300 according to an embodiment of the present invention.
  • Server machine 300 typically includes a monitor 330, a computer 340, a keyboard 350, an input device 360, and a network interface 370.
  • Computer 340 includes familiar computer components such as a processor 310, and memory storage devices, such as a random access memory (RAM)
  • Network interface 370 is illustrated coupled to a network 395, or the like
  • a mouse is but one example of an input device
  • network interface 370 may be an Ethernet card, a modem, or the like.
  • RAM 320 and disk drive 380 are examples of tangible media for storage of data, files, log data, computer programs, drivers, embodiments of the herein described methods, and the like.
  • Other types of tangible media include floppy disks, removable hard disks, optical storage media such as CD-ROMS and bar codes, and semiconductor memories such as flash memories, read-only-memories (ROMS), battery-backed memories, and the like.
  • Server 300 includes a PC compatible computer having '586 or '686 class based microprocessors, such as the AthlonTM microprocessor from AMD. Further, in the present embodiment, Server 300 utilizes LINUX, and runs software provided by Speedera Networks, Inc., the assignee of the present invention.
  • Fig. 2 is representative of but one type of system for embodying the present invention. It will be readily apparent to one of ordinary skill in the art that many types of hardware and software configurations are suitable for use in conjunction with the present invention. For example, processors such as the G4 from Motorola, Pentiumlll, Pentium4, Itanium from Intel, and the like may be used; further any computer communications bus may be used in alternative embodiments of the present invention.
  • server 300 may operate under the WindowsNT operating system, may operate under MAC OS from Apple Computer, BeOS from Be, Incorporated, and the like.
  • Figs. 3A-3C illustrate a flow chart according to an embodiment of the present invention.
  • Figs. 3A-3C illustrate a method for a network of POP servers to service a request from a user.
  • NetProbe servers 670 and 675 determine the amount of latency between itself and customer web server 40 and customer web server 45, step 390. Latency may also be determined between NetProbe servers 670 and 675 and the user's (client's local) DNS). Latency between a local DNS and NetProbe servers 670 and
  • the latencies may include a measurement of the amount of traffic and the load on the customer web servers at that moment in time. Further, the latency may include a measurement of response times between the particular NetProbe server 670 and the different customer web servers. In embodiments of the present invention, measurements of latency may include response times for HTTP service, FTP service, streaming media service. Additionally, factors such as customer web server CPU load, customer web server memory load and/or utilization, or the like may also be considered. In other embodiments, other types of measurements are contemplated, for example performance metrics, availability, persistence data, client weighting factors, and the like.
  • NetProbe servers 670 and 675, and the like run virtually independently from other servers within a POP server, such as within POP server 10, this step may be repeated at regular intervals, for example, every hour or the like. In embodiments of the present invention, it is envisioned that some customers may have many more than two web servers, thus NetProbe servers 670 and 675 may take additional traffic load measurements. NetProbe servers 670 and 675, and the like also determine the performance of other POP servers, such as POP server 20, 30, or the like, step 400. This type of data is termed "service metrics" of the POP network. In the present embodiment, the metrics may include a measurement of the amount of traffic and the load on a POP server at a recent period in time.
  • service metrics may also consider response times for HTTP service, FTP service, POP server CPU load, POP server memory load and/or utilization, or the like.
  • other types of measurements are contemplated, for example availability, persistence data, client weighting factors, monetary costs, and the like. Because NetProbe servers 670 and 675, and the like run virtually independently from other servers within a POP server, such as POP server 10, this step may be repeated at regular intervals, for example, every hour or the like.
  • a user such as user 150 enters the name of a customer's web site, step 410.
  • a user can type-in the name of a customer's web site via keyboard, and in another embodiment, the user can select a hyperlink to the customer's web site from a page the user is viewing, or the like.
  • the customer's web site is "customerl.com.”
  • the name of the customer's web site is passed (220) to a local client DNS (LDNS) 170, step 420.
  • LDNS local client DNS
  • domain name servers typically translate textual names, or the like into IP addresses. If the LDNS does not know how to resolve the name, the DNS may route the request to another DNS.
  • client DNS 170 passes (230) the name to an authoritative NameServer server (ANDS) 110, step 430.
  • NameServer server 110 typically uses the latency and traffic load information associated with the customer's web sites that was determined in step 390, to select an appropriate customer web server to return to the user, step 440.
  • NameServer server 110 determines that the west coast customer web server 45 is very heavily loaded with requests, but the east coast customer web server 40 has light traffic. Thus, in response to the user's request, NameServer server 110 determines that the user should be directed to customer web server 40. NameServer server 110 then determines the IP address of customer web server 40, step 450, and then sends the IP address to the user, step 460.
  • NameServer server 110 may uses latency information from multiple NetProbe servers such as
  • NetProbe server 670 and 675 may be used by NameServer server 110 in addition to latencies between the NameServer server and NetProbe servers 670 and 675 in step 440.
  • NetProbe server 670 reports the latency of customer web server 45 to NameServer 110 because NetProbe server 670 may be the closest NetProbe server; further NetProbe server 675 reports the latency of customer web server 40 to
  • NameServer 110 because NetProbe server 675 may be the closest NetProbe server. In an example, if the reported latencies are the same, NameServer 110 would resolve the domain name to customer web server 45 because the additional latency between NetProbe server 670 and NameServer 110 is smaller than the latency between NetProbe server 675 and NameServer 110. In other words, the sum of the latencies are used to determine how to resolve the IP address.
  • user 150 receives the IP address, and then sends requests for a web page, or the like, at the received IP address, step 470.
  • the web page request typically includes static content and dynamic content.
  • static content may include graphics and text that typically do not change between web page accesses, for example, corporate logos, archival images, documents, and the like.
  • dynamic content may include graphics and text that do change between web page accesses, for example, stock quotes, search results, live web cameras, and the like.
  • the web page requested from customer web server 40 includes static data, such as HTML text and URL addresses of static content, such as images, and dynamic data, such as stock quotes.
  • static data typically resides on web server 130
  • dynamic data is resides within server 140.
  • this data is also typically mirrored on web server 135 and server 145 in customer web server 45.
  • the customer does not want web server 130 to have the burden of providing static content to many different users.
  • the POP network provides the content hosting of the static content.
  • the URL addresses of the static content typically include textual domain names specifying the POP server network.
  • web server 130 In response to the request from user 150, web server 130 sends the HTML text, or the like, and the URL address of images to user 150, step 480. Additionally, server 140 may process the request from user 150, determine the responsive data dynamically, and send the dynamic data back to user 150, step 490.
  • the user's 150 client software receives the HTML text, dynamic data, URL address of static content, and the like from customer web server 40. Similar to above, the URL address is passed (220) to client DNS (LDNS) 170 for resolution, step 500. In this example, in response, client DNS 170 passes (230) the URL address to NameServer server (ADNS) 110, step 510.
  • ADNS NameServer server
  • NameServer server 110 uses the service metrics information associated with the other POP servers that was determined in step 400, to select an appropriate POP server that includes the static content, step 520.
  • NameServer server 110 determines that the west coast POP server 10 is very heavily loaded with content requests, but the east coast POP server 30 has light few requests.
  • NameServer server 110 determines that the user should retrieve data from POP server 30. NameServer server 110 then recalls the
  • IP address of POP server 40 step 530, and then sends the IP address to the user, step 540.
  • user 150 receives the IP address of POP server 30, and then sends requests (250) for the static content from POP server 30, step 550.
  • POP server 30 routes the request to WebCache server 280.
  • WebCache server 280 retrieves the static content from memory, and sends the requested static content back to user 150, step 560.
  • the customer's web server may also be considered for providing the static content.
  • the latency and traffic load information for the customer web server may be considered at the same time as the other POP servers . If the customer's web server is lightly loaded compared to the POP servers, the IP of the customer's web server may be retrieved in step 530, and passed to the user in step 540. Accordingly, in step 560, the customer's web server provides the static content back to the user.
  • the client receives the static content, the user can view the HTML text and dynamic data from customer web server 40, as well as view the static content from POP server 30, step 570.
  • the source of the static content may be specified as being provided at a specific server or a limited set of servers.
  • Fig. 4 illustrates a flow chart according to another embodiment of the present invention.
  • Fig. 4 illustrates a more detailed block diagram of step 560, above.
  • WebCache server 280 determines whether it has the requested static content in memory, step 600. In one embodiment of the present invention, this step may include determining whether the file requested is resident in memory. If the requested static content is not cached within WebCache server 280, WebCache server 280 sends a request (260) for the static content to WebSever server 290, step 610.
  • Webserver server 290 determines whether it has the requested static content on disk, or the like, 620. In one embodiment of the present invention, this step may include determining whether the file requested is resident in on a disk or database. If the requested static content is not stored within Webserver server 290, Webserver server 290 determines sends a request (270) for the static content to a customer web server, step 630.
  • the appropriate customer web server selected is also determined based upon the traffic load and latency of the multiple customer web servers to POP server 30. As an example, since POP server 30 is on the east coast and close to customer web server 40. As a result, the request for the static content is directed to customer web server 40. In an alternative embodiment, the customer web server from which static content is retrieved is predetermined. In this example, customer web server 40 provides the static content to
  • Webserver server 290 which in turn stores the static content to disk, or the like, step 640.
  • Fig. 5 illustrates a more detailed block diagram according to one embodiment. In particular, Fig. 5 illustrates are more detailed description of the flow of data between software components on the POP network.
  • Fig. 5 includes a reduced number of software components on servers on the POP network, including a NameServer server 700, a WebCache server 710, a Webserver server 720, NetProbes 730, LogServer server 740, and NetView server 750.
  • a NameServer server 700 a WebCache server 710
  • a Webserver server 720 NetProbes 730
  • LogServer server 740 NetView server 750.
  • Fig. 5 also illustrates a customer web site 760.
  • the configuration of the POP network is maintained in a single global configuration file.
  • This file is known as the "global configuration" file.
  • every POP server in the POP network determines configuration information by the copy of the current Speedera configuration file stored locally.
  • the configuration file includes a list of servers allowed to change the configuration; a list of domains the network is responsible for (i.e. customers' web sites); list of software components and services available in each POP server; a list of probes that perform latency checks at each POP server; and the like.
  • a new configuration file can be pushed to all POP server by using the AdminTools software, mentioned above.
  • the configuration file in the present embodiment is in a "registry tree” type format. Typically few statistics, status or extended information is kept in the configuration file. Such data is typically stored in server 740, as will be described below.
  • the configuration file is kept small to reduce the amount of time required for updating a configuration file across the POP network.
  • a tree-type format is used to allow the configuration to be easily extended without having to change its structure.
  • the configuration file is not necessarily sent to all POP servers in the POP network. Instead, only POP servers including NameServers,
  • POP servers only including a WebCache or a WebServers are not updated, because WebCache and
  • WebServers access NameServers to find services on the POP network. For example, when a WebCache needs to send information to a log server, it will make a request to: "log.name.speedera.com.” In response, a request is generated to a NameServer and in response the NameServer returns the appropriate IP address. Because the configuration file does not need to be sent to all the POP servers in the network, any configuration file update is performed efficiently.
  • the domain name uniquely identifies the POP server.
  • the name is "name.pop.speedera.net”, "dpl.westusa.speedera.net”, or the like.
  • the NameServer software is embodied as a DNS server that performs name to IP address mapping.
  • the software receives requests containing textual names and returns numeric IP addresses typically in the form of xxx.xxx.xxx.xxx, associated with those names.
  • the software is also responsible for receiving and storing latency, POP server status information, and the like from the NetProbe servers.
  • Other types of data includes the source IP address, the availability of the POP server, the traffic load, and the like are also stored. In the present embodiment, approximately 10 servers including the
  • NameServer software will operate on the POP network. Typically, the NameServer software is installed onto its own machine to increase the performance of the software. In one embodiment, the DNS servers should be designed to handle upwards of 1000 hits per second. It is contemplated that the 10 DNS servers will handle all the DNS requests for all users on the Internet that are serviced by the POP network. For example, users that require the POP network to determine the least loaded customer web site, and users that require content stored on the POP network both require services from the DNS servers. In alternative embodiments, a greater number of lesser number of DNS servers may be used according to specific application.
  • the NameServer replies to DNS queries and does not return any other information.
  • the DNS server may uses the following information: the configuration file to determine what machines have the content corresponding to the request; a table containing latency information for each POP to each client DNS (traffic load); a table containing service metrics (load/availability) information for each POP server, and the like.
  • the data received from NetProbe servers comprise UDP traffic. Further the NameServer software transmits log information to the data received from NetProbe servers.
  • LogServer software also via UDP traffic.
  • other schemes for transferring packetized data is contemplated.
  • the NameServer receives the service and latency information data from the NetProbe servers. In addition, in this embodiment, timestamp is also received so the Name Server software knows how stale the performance information is. The NameServer software stores this information to disk, to RAM, or the like. In this embodiment, the NameServer data is typically stored on disk to reduce the effects of a crash or a reboot. In this way, the latency and probe information is maintained. Further, in some cases, data such as persistence information, should be maintained even if the system is rebooted. In this embodiment, the NameServers log the requests they get and the replies they give, i.e. IP addresses. In operation, at regular intervals (specified by the global configuration file), the NameServer sends its latest statistics (e.g. number of hits it has taken per domain name, etc.) to the various NetProbe servers and appropriate log analysis servers via UDP. In the global configuration file, log analysis servers are assigned based on the domain name served. In the present embodiment, the log analysis is a distributed process.
  • the DNS software is proprietary and supports all features of BIND.
  • the WebCache software is typically embodied as a caching web server.
  • the WebCache server accepts HTTP or URL requests and checks memory for the requested content.
  • the content is typically static content from a customer's web page, or the like. If the content is not in the memory, the WebCache server contacts a Webserver server that may have the web page stored using HTTP.
  • WebCache is a customized caching server.
  • WebCache may include hardware cache boxes available from companies such as CacheFlow, Inktomi, or the like.
  • the WebCache server compresses and sends the logs of the content it stores, and sends the logs to log analysis servers. This transfer is typically via UDP traffic. This information provided by the
  • WebCache server is typically used for billing purpose as well as by customers for auditing and log analysis. In the case where a hardware cache box is used, the box must also be configured to send the logs to the log analysis servers.
  • the WebCache software is proprietary but could be integrated with off the shelf caching products.
  • a WebCache server when a WebCache server cannot provide the content requested, it forwards the request to a server running Webserver software it believes has the content. As discussed above, the Webserver software processes the request by looking on its local disk for the content. If the content exists on the local disk, the Webserver will pass the data to the WebCache. Further, if the content in not on the disk, the Webserver contacts the customers web site to retrieve the content.
  • the Webserver typically does not provide log data. Instead, the WebCache provides the cache miss information to the log analysis servers.
  • Webserver along with many of the other software modules do report error conditions to the log server.
  • the Webserver and WebCache may exist as one program. In alternative embodiments, these pieces of software may be separate and even reside on different machines.
  • the NetProbes software performs service availability/metric and latency probes and sends the results via UDP to NameServer servers.
  • service probes There are two distinct types of probes in the present embodiment, service probes and latency probes.
  • service probes determine service availability and load
  • Content delivery machines includes the POP servers including WebCache and Webserver modules, and may also include the customer's web site(s).
  • Service probes monitor data such as HTTP total response time, FTP total response time, and the like.
  • service probes run constantly, and send current metric and availability information to all DNS servers in the POP network.
  • the timing of the probe intervals and configuration of the service probes is typically speechified in the global configuration file.
  • Latency probes determine latency from the POP server hosting the
  • the Speedera DNS servers typically drive the latency probes.
  • a DNS server determines that it needs latency information from a probe, it sends a request to the probe.
  • the latency probe probes the client DNS server and returns with the result.
  • the DNS servers send a number of requests in a single UDP packet to a latency probe.
  • the NetProbe servers typically do not store the results of the probes, but simply sends them to other servers via UDP. Each piece of probe information has a timestamp of when the probe occurred so the receiving server can determine how stale the probe information is.
  • LogServer servers receives data from machines running NameServer, NetProbes and WebCache software.
  • the data is typically transmitted to one or more log analysis servers where the data is stored and analyzed.
  • the data is typically stored by customer name to facilitate billing and preparation of audit and historical statistics.
  • the LogServer software also gets requests for information from the NetView software. Such a request may be made when an administrator, or the like wants to graphically view or view the table of data for current or historical status of a web site (hits, open connections, etc.).
  • the configuration file will have a list of log servers and the customer domains they manage, however, LogServers will only receive data for the customer domains they manage, not for other domains.
  • the NetView software referred to above is typically an HTML user- interface providing customers and administrators to use a web browser to view real-time and historical information.
  • the NetView software includes a web server and CGI or web server plug-in modules written in Perl or C.
  • NetView servers contact LogServers or the LogServer database to get the underlying data.
  • the raw data from LogServers and formats and displays it to customers, or the like in the form of tables and/or graphs.
  • the NetView software also provides an interface that allows a customer to flush content from the POP network when they publish new content to their servers. To perform the action of flushing content, the NetView software executes the AdminTool software that has the ability to flush content from the system.
  • the AdminTools software includes tools used to administer the Speedera Network. It contains two main tools, one to push a new configuration file to servers in the network and another to cause content to be flushed from the network.
  • the configuration administration tool sends the global configuration file to servers in the network in a hierarchical fashion.
  • a list of servers will get the configuration file and send it to another list of servers and so on.
  • the list of servers to receive the configuration file and what servers they control is typically specified by the configuration file itself.
  • the configuration system is written such that it does not allow configuration files to get out of sync, when machines are down or if a socket connection breaks while transmission is in process, or the like.
  • the administration tool opens sockets to all the master configuration servers, sends the configuration file and waits for confirmation that they and everyone they send the file to receives it and then send a command to tell everyone to switch to use that configuration file.
  • the configuration administration tool uses TCP and not UDP because of the increased security. Additionally, password protection and encryption may also be used.
  • the Shared software includes software that will exist on all servers in the network. This includes a configuration daemon that listens for the transmission of a new configuration file, stores it locally and sends a SIGHUP to local processes to notify them the configuration has changed.
  • steps 460 and 470 are not explicitly performed. Instead, after the DNS server resolves the IP address of the customer web server in step 450, the IP address is automatically used to address the customer web server. Similarly, in some embodiments, steps 540 and 550 are not explicitly performed. Instead, after the DNS server resolves the IP address of the POP server in step 530, the IP address is automatically used to address the POP server.
  • the static content stored on the web caches may include movies, audio, images, text, plug-in program data, programs, and the like.
  • Other types of data can be stored within the web caches, for example, streaming audio data, video data, and the like.
  • the embodiments have discussed providing data across wide area networks, such as the Internet.
  • embodiments of the above described invention may be applied to other types of networks including wireless networks, local area networks, and the like.

Abstract

A method for operating a network point of presence servers sharing a hostname includes receiving a request from a user for a web page at a first web address, the web address including hostname, determining the traffic load of a plurality of customer web servers (390) each of the customer web servers storing the web page, determining a customer web server from the plurality of customer web servers that is appropriate for the request (440), the customer web server having a traffic load lower than the loads of remaining customer web servers from the plurality of customer web servers, determining an IP address of the customer web server (450), directing the request from the user to the customer web server, thereafter receiving a request from the user for static content on the web page at a second web address, the second web address including a hostname, determining the service metrics of point of presence servers in the network of point of presence.

Description

METHOD FOR OPERATING AN INTEGRATED
POINT OF PRESENCE SERVER NETWORK
CROSS-REFERENCES TO RELATED APPLICATIONS The present invention disclosure claims priority to and incorporates by reference for all purposes, Provisional U.S. Patent Application Number 60/166,906, Attorney Docket No. 22119-04581 (Case 4851 US), filed November 22, 1999; U.S. Patent Application Number 09/648,420 filed August 23, 2000 (Attorney Docket No. 20533-000210US); U.S. Patent Application Number 09/644,927 filed August 23, 2000 (Attorney Docket No. 20533-000220US); and U.S. Patent Application Number 09/645,067 filed August 23, 2000 (Attorney Docket No. 20533-000270US).
BACKGROUND OF THE INVENTION The present invention relates to world wide area networking. More particularly, the invention provides a technique including a method and system for using a global traffic management system coupled to a plurality of content servers for a service fee. But it would be recognized that the invention has a much broader range of applicability. For example, the invention can also be applied to operation on a private network, and the like. The Internet is a world wide "super-network" which connects together millions of individual computer networks and computers. The Internet is generally not a single entity. It is an extremely diffuse and complex system over where no single entity has complete authority or control. Although the Internet is widely know for one of its ways of presenting information through the World Wide Web (herein "Web"), there are many other services currently available based upon the general Internet protocols and infrastructure.
The Web is often easy to use for people inexperienced with computers. Information on the Web often is presented on "pages" of graphics and text that contain "links" to other pages either within the same set of data files (i.e., Web site) or within data files located on other computer networks. Users often access information on the Web using a "browser" program such as one made by Netscape Communications Corporation (now America Online, Inc.) of Mountain View, California or Explorer™ from Microsoft Corporation of Redmond, Washington. Browser programs can process information from Web sites and display the information using graphics, text, sound, and animation.
Accordingly, the Web has become a popular medium for advertising goods and services directly to consumers.
As time progressed, usage of the Internet has exploded. There are literally millions of users on the Internet. Usage of the Internet is increasing daily and will eventually be in the billions of users. As usage increases so does traffic on the Internet.
Traffic generally refers to the transfer of information from a Web site at a server computer to a user at a client computer. The traffic generally travels through the world wide network of computers using a packetized communication protocol, such as TCP/IP. Tiny packets of information travel from the server computer through the network to the client computer. Like automobiles during "rush hour" on Highway 101 in Silicon Valley, the tiny packets of information traveling through the Internet become congested. Here, traffic jams that cause a delay in the information from the server to the client occur during high usage hours on the Internet. These traffic jams lead to long wait times at the client location. Here, a user of the client computer may wait for a long time for a graphical object to load onto his/her computer.
From the above, it is seen that an improved way to transfer information over a network is highly desirable.
SUMMARY OF THE INVENTION
The present invention relates to a computer server network. More particularly, the present invention relates to an integrated load balancing and content delivery computer network.
Embodiments of the present invention include a number of Linux machines (on other operating machines) running software available from the current assignee. This software currently includes eight separate software components that are delivered as a single product. When deployed across a large number of machines, the software creates a network of point of presence (POP) computers that provides a solution for integrated content hosting and traffic management. Customers using the POP network, according to one embodiment, can store content such as HTML, images, video, sound, software, or the like on the network for fast and highly available access by clients (end users). The network also provides load balancing and high availability for servers outside the network. Customers with generated content, such as search engines, auctions and shopping carts, can use the latter feature to add their own content servers to the network.
The computer network may be monitored using a standard web browser in some embodiments. In particular, HTML interfaces are provided that display the computer networks current status as well as historical statistics.
According to one embodiment of the invention a method for operating a network of point of presence servers sharing a hostname is disclosed. The method includes a user making a request for a web page with a domain name the POP network is authoritative for. The user's computer sends a request for name resolution of the domain name they are requesting content from to the configured local DNS server (LDNS) for that users computer. The LDNS is typically configured in close proximity to the user's computer. The LDNS resolves the IP address of the domain name the user's computer has queried for by sending a query to the Authoritative DNS server (ADNS) for that domain name. The POP network DNS servers (SPDNS) is designated as an ADNS for all domains the POP network provides service for. On being asked to resolve the domain name from the LDNS on behalf of the user, the SPDNS first determines from the plurality of POP network servers and customer web servers the subset of servers addressable by the requested domain name. In this embodiment, addressable servers are servers that have been configured by either the customer or the POP network to satisfy user requests for content within the requested domain name. These servers have been defined in the POP network configuration as such.
From this subset of servers, the POP network selects the optimal server/servers for the user based on rules defined in the POP network configuration. These rules define how criteria such as POP/customer server health, latency from
POP/customer servers to the user making the request, packet loss experienced at each POP, cost associated with each POP/customer server, and the like. These factors are then combined to define the optimal server for a given user making a request for a given domain name. In addition, persistence is defined in embodiments of the present invention for a domain name so that the user is returned to the same POP/customer server on subsequent requests within a definable time. These rules for defining the optimal server are created on a per domain name basis allowing each domain name to be configured independently. The next step includes the SPDNS returning the IP address of the optimal servers to the LDNS making the request, and the LDNS resolving the domain name for the user computer making the request for content with the IPs provided by the SPDNS.
In the present embodiment, the POP network servers are defined in the
POP network configuration to be addressable by a requested domain names that represent a customer's static or cacheable content, a static domain name. The customer's servers are defined as addressable for domain names that represent dynamic or non-cacheable content, a dynamic domain name. Resolution for either domain name type is performed by the same set of servers controlled and operated by the POP network, the SPDNS. In addition, both domain name types leverage the same infrastructure for determining network and service load as well as configuration.
This technique allows the customer to organize their web site by functions where functions are mapped into domain names. The functions have requirements that define what is needed by a server to satisfy the function. Some functions may only be limited by the ability to access a collection of static content while others require a proprietary application and access to a customer's database. The domain names are then mapped into resources or servers that can be addressed by the domain name and that satisfy requests for the function that domain name has been defined to support. This method of organization allows individual elements of a web page to be retrieved from different resources. Further, such embodiments provides the SPDNS the control to direct users to an available and optimal server for a given resource. This greatly improves the user experience by reducing connection time, download time, errors, and the like.
According to one aspect of the invention a method for operating a network of point of presence servers sharing a hostname is disclosed. The method includes receiving a request from a user for a web page at a first web address, the first web address including the hostname, determining traffic loads of a plurality of customer web servers, each of the customer web servers storing the web page, and determining a customer web server from the plurality of customer web servers that is appropriate for the request, the customer web server having a traffic load and/or latency lower than traffic loads of remaining customer web servers from the plurality of customer web servers. The technique also includes determining an IP address of the customer web server, and providing the IP address of the customer web server to the user. The method may thereafter include receiving a request from the user for static content on the web page at a second web address, the second web address including the hostname, and determining latency of point of presence servers in the network of point of presence servers. Determining the point of presence server from the network of point of presence servers that is appropriate for the request, the point of presence server having a latency and/or server load lower than latency of remaining point of presence servers from the network of point of presence servers, retrieving the static content from the point of presence server, and providing the static content to the user may also be performed.
According to another aspect of the invention, a method for operating a network of point of presence servers is disclosed. The method includes receiving a first request from a client DNS server to resolve a first domain name, the client DNS server receiving a request from a user of a web page address that includes the first domain name, and determining load measurements of a plurality of customer web servers, each of the customer web servers addressable by the first domain name, and each of the customer web servers configured to service the request from the user. The technique may also include determining a customer web server from the plurality of customer web servers, the customer web server having load measurements lower than load measurements (also including latency and other factors) of other customer web servers from the plurality of customer web servers latency, determining an IP address of the customer web server, and providing the IP address of the customer web server to the client DNS server.
Thereafter, the method typically includes receiving a second request from the client DNS server to resolve a second domain name, the client DNS server receiving a request from the user of a uniform resource locator that includes the second domain name, and sending the request to the authoritative DNS for this domain name (SPDNS) who determines performance metric measurement of point of presence servers in the network of point of presence servers, each of the point of presence servers addressable by the second domain name. Further, determining a point of presence server from the network of point of presence servers, the point of presence server having performance metric measurements lower than performance metric measurements of other point of presence servers from the network of point of presence servers, providing the IP address of the point of presence server to the client DNS server, retrieving data from the point of presence server in response to the uniform resource locator, and providing the data to the user are included.
BRIEF DESCRIPTION OF THE DRAWINGS In order to more fully understand the present invention, reference is made to the accompanying drawings. Understanding that these drawings are not to be considered limitations in the scope of the invention, the presently preferred embodiments and the presently understood best mode of the invention are described with additional detail through use of the accompanying drawings in which:
Fig. 1 is an overview block diagram of an embodiment of the present invention; Fig. 2 illustrates a block diagram of a typical server machine according to an embodiment of the present invention;
Figs. 3A-3C illustrate a flow chart according to an embodiment of the present invention;
Fig. 4 illustrates a flow chart according to another embodiment of the present invention; and
Fig. 5 illustrates a more detailed block diagram according to one embodiment.
DESCRIPTION OF THE SPECIFIC EMBODIMENTS Fig. 1 is an overview block diagram of an embodiment of the present invention. Fig. 1 illustrates a series of "point of presence" (POP) servers 10, 20, and 30, customer web servers (POP servers) 40 and 45, a monitoring POP server 50, and an ISP cluster 60. In Fig. 1, the servers 10-50, and ISP 60 are typically coupled via a computer network 70, such as the Internet. In other embodiments, other types of computer networks may include other wide area networks, local area networks, such as intranets, and the like.
In the present embodiment, POP servers 10-30 are typically distributed around the world and together they form a POP network. In a typical embodiment, each POP server will include a plurality of dedicated servers, providing functionality as will be described below. For example, POP server 20 includes servers 65, 80-100. In this example, server 65 operates as a probing server according to NetProbe software described below; server 80 and 90 operate as a cache according to WebCache software described below; and server 100 operates as a content server according to Webserver software described below. As illustrated in Fig. 1, server 10 and 20 may include a server 110 and 120 that includes Domain Name Server (DNS) software, described below.
In the present embodiment, NetProbe server software performs probes across computer network 70 to determine server load and availability (metrics), to perform content checks to ensure servers are delivering correct content, to determine latency between customer web sites and POP servers, and the like. As will be described further below, NetProbe software may operate continuously and typically sends results to
POP servers running NameServer software, such as POP server 10 and 30.
In embodiments of the present invention, measurements of server metrics may include response times for HTTP service, FTP service, customer web server CPU load, customer web server memory load and/or utilization, or the like. In other embodiment, other types of measurements are contemplated.
In this embodiment, the WebCache server software responds to requests for static web content from clients, such as web browsers from memory. In this example, if the requested content does not exist in server memory, the WebCache software will generate a request to a Webserver. When Webserver sends the requested content, the
WebCache software caches the content in memory.
As described above, the Webserver software responds to requests for content from WebCache servers. If the content requested does not exist on disk, the
Webserver software will generate a request to a web server, that is known to include the content, for example, web server 130. Once the content is provided by web server 130, the Webserver software stores the content to disk and also returns the content to the
WebCache. In this embodiment, this server only gets request for WebCache "misses" so it does not require caching functionality.
In the present embodiment, DNS software performs a mapping from domain name to IP address. As an example, the DNS software is queried to resolve a hostname, or the like, and returns an IP address of a web server that has the bandwidth to respond to the request. Further, in response to a query, the DNS software returns an IP address of a web server that can quickly provide web content.
In Fig. 1, the dedicated servers are typically coupled to the computer network 70 via a router. In alternative embodiments, fewer number of physical servers may be used to implement functionality of servers 65, 80-100, or the like. For example, a physical server may have multiple software packages operating upon it at one time. As an example, WebCache software and Webserver software may operate as two different processes on a single server. Generally, each POP server will include NetProbe, WebCache, and Webserver software. In one embodiment, only a limited number of POP servers will include DNS software, however, in other embodiments, the number of servers with DNS software may be increased.
As shown in Fig. 1, customer web servers 40 and 45 typically include web servers 130 and 135 and servers 140 and 145. In this embodiment, web servers 130 and 135 provide static content, such as text and graphics, further servers 140 and 145 provide non-static content, or content that changes in response to a user query, or the like. In some embodiments, servers 140 and 145 may be search engines, databases, or the like.
Typically, customer POP web servers 40 and 45 are mirrors of each other, and are designed to improve the perceived responsiveness of a web site with respect to a user.
ISP 60 is illustrated including users 150-160, and a client DNS server 170.
In this embodiment, ISP 60 may be a conventional ISP such as Earthlink, AOL, or the like. Accordingly, users 150 and 160 may be home users using dial-up lines, cable modems, DSL, or the like, to connect with ISP 60. As illustrated, client DNS server 170 is typically provided within ISP 60. In the present embodiment, client DNS server 170 is typically used to resolve a host name into an IP address. If client DNS server 170 does not know the IP address, client DNS server 170 may pass the request to other DNS servers, such as DNS servers 110-120, or the like.
In the example in Fig. 1, a monitoring POP server 50 may include a plurality of dedicated servers, providing functionality described herein. For example, monitoring POP server 50 includes servers 180-210. In this example, server 180 operates as an monitoring display server according to NetView software described below; server 190 operates as an administration server according to tools, together termed AdminTool software, described below; and servers 200 and 210 operate as logging servers according to LogServer software described below. As illustrated in Fig. 1, POP server 10 and 20 may include a server 110 and 120 that includes Domain Name Server (DNS) software, described below.
In the present embodiment, NetView software provides an HTML interface to current and historical statistics for customers and network administrators. The statistics about the POP network, and the like are typically derived from servers running LogServer software, such as servers 200 and 210, described below. In this embodiment, NetView software includes web server CGI programs to provide the HTML output. The NetView software also provides an interface that allows customers to flush content from the POP network as they update the content on their web servers. LogServer software, in this embodiment, receives log file data from servers running NameServer, WebCache, Webserver and NetProbes software, and the like. In this example, LogServer software, receives the log file data, and stores the data onto disk. Next, in response to requests for current and historical information from NetView software servers, the LogServer software separates the data into appropriate categories and processes the data to generate statistics. As mentioned above, the data is thus used for monitoring the state of POP servers on the network.
In the present embodiment, various administrator tools, termed AdminTool software are also typically provided. These tools are used by the administrator to configure and administer the network of POP servers. In one example, tools provides tools that "spider" a customer web site and load WebServers and caches with content.
In the present embodiment, a single monitoring POP server 50 is contemplated, however, multiple monitoring servers may be provided in alternative embodiments of the present invention. In the present embodiment, POP servers 10-30, and monitoring POP server
50 includes shared software. This set of client and server software programs includes a server that receives a global configuration file that specifies the architecture of the POP servers, and stores it locally.
In the present embodiment, in a typical deployment, POP servers will include multiple machines, each with a single purpose (DNS name server, for instance). However, it should be understood that virtually any combination of the software components described above can be installed on a single machine in alternative embodiments of the present invention. Further details regarding the software components will be given below. In the present embodiment, POP servers 10-30 and customer web server
40 and 45 work together to provide static content and dynamic content to a user, such as user 150 coupled to ISP 60. The POP network illustrated in Fig. 1 provides content hosting for static content, such as images, video, software, etc. The POP network also provides for load balancing and traffic management for services, and dynamic content, that cannot be stored. Further detail regarding the process will be described below.
In the embodiment illustrated in Fig. 1 , it should be noted that the POP server network may simultaneously provide traffic management, as well as content hosting to more than one specific customer. For example, customerl and customer2 share the POP server network. In one example, POP server network may provide traffic load management by directing traffic to either customerl web server 40 or customerl web server 45. At the same time, POP server network may be used to host static content for customer2. Still further POP server may provide both traffic management and content hosting, as will be illustrated and discussed below. Accordingly, the POP server network may be termed a "shared" server network. Fig. 2 illustrates a block diagram of a typical server machine 300 according to an embodiment of the present invention. Server machine 300 typically includes a monitor 330, a computer 340, a keyboard 350, an input device 360, and a network interface 370. Computer 340 includes familiar computer components such as a processor 310, and memory storage devices, such as a random access memory (RAM)
320, a disk drive 380, and a system bus 390 interconnecting the above components.
Network interface 370 is illustrated coupled to a network 395, or the like
In the present embodiment, a mouse is but one example of an input device
360, also known as a pointing device. Other types of input devices may include trackballs, drawing tablets, microphones (for voice activated input), and the like. In the present embodiment, network interface 370 may be an Ethernet card, a modem, or the like.
RAM 320 and disk drive 380 are examples of tangible media for storage of data, files, log data, computer programs, drivers, embodiments of the herein described methods, and the like. Other types of tangible media include floppy disks, removable hard disks, optical storage media such as CD-ROMS and bar codes, and semiconductor memories such as flash memories, read-only-memories (ROMS), battery-backed memories, and the like.
In a preferred embodiment, Server 300 includes a PC compatible computer having '586 or '686 class based microprocessors, such as the Athlon™ microprocessor from AMD. Further, in the present embodiment, Server 300 utilizes LINUX, and runs software provided by Speedera Networks, Inc., the assignee of the present invention.
Fig. 2 is representative of but one type of system for embodying the present invention. It will be readily apparent to one of ordinary skill in the art that many types of hardware and software configurations are suitable for use in conjunction with the present invention. For example, processors such as the G4 from Motorola, Pentiumlll, Pentium4, Itanium from Intel, and the like may be used; further any computer communications bus may be used in alternative embodiments of the present invention. In other embodiments, server 300 may operate under the WindowsNT operating system, may operate under MAC OS from Apple Computer, BeOS from Be, Incorporated, and the like.
Figs. 3A-3C illustrate a flow chart according to an embodiment of the present invention. In particular, Figs. 3A-3C illustrate a method for a network of POP servers to service a request from a user. In the flow chart below, reference are made to the system in Fig. 1 for convenience.
Initially, NetProbe servers 670 and 675, and the like determine the amount of latency between itself and customer web server 40 and customer web server 45, step 390. Latency may also be determined between NetProbe servers 670 and 675 and the user's (client's local) DNS). Latency between a local DNS and NetProbe servers 670 and
675 are measured because the latency information should be as a user would experience it. Further, a NetProbe server should be in close proximity to the Customers' origin sites.
In the present embodiment, the latencies may include a measurement of the amount of traffic and the load on the customer web servers at that moment in time. Further, the latency may include a measurement of response times between the particular NetProbe server 670 and the different customer web servers. In embodiments of the present invention, measurements of latency may include response times for HTTP service, FTP service, streaming media service. Additionally, factors such as customer web server CPU load, customer web server memory load and/or utilization, or the like may also be considered. In other embodiments, other types of measurements are contemplated, for example performance metrics, availability, persistence data, client weighting factors, and the like.
Because NetProbe servers 670 and 675, and the like run virtually independently from other servers within a POP server, such as within POP server 10, this step may be repeated at regular intervals, for example, every hour or the like. In embodiments of the present invention, it is envisioned that some customers may have many more than two web servers, thus NetProbe servers 670 and 675 may take additional traffic load measurements. NetProbe servers 670 and 675, and the like also determine the performance of other POP servers, such as POP server 20, 30, or the like, step 400. This type of data is termed "service metrics" of the POP network. In the present embodiment, the metrics may include a measurement of the amount of traffic and the load on a POP server at a recent period in time. In embodiments of the present invention, service metrics may also consider response times for HTTP service, FTP service, POP server CPU load, POP server memory load and/or utilization, or the like. In other embodiment, other types of measurements are contemplated, for example availability, persistence data, client weighting factors, monetary costs, and the like. Because NetProbe servers 670 and 675, and the like run virtually independently from other servers within a POP server, such as POP server 10, this step may be repeated at regular intervals, for example, every hour or the like.
Next a user, such as user 150 enters the name of a customer's web site, step 410. In one embodiment, a user can type-in the name of a customer's web site via keyboard, and in another embodiment, the user can select a hyperlink to the customer's web site from a page the user is viewing, or the like. In the example, the customer's web site is "customerl.com."
In response to the user entry, the name of the customer's web site is passed (220) to a local client DNS (LDNS) 170, step 420. As discussed above, domain name servers typically translate textual names, or the like into IP addresses. If the LDNS does not know how to resolve the name, the DNS may route the request to another DNS. In this example, client DNS 170 passes (230) the name to an authoritative NameServer server (ANDS) 110, step 430. In the present embodiment, NameServer server 110 typically uses the latency and traffic load information associated with the customer's web sites that was determined in step 390, to select an appropriate customer web server to return to the user, step 440. In this example, NameServer server 110 determines that the west coast customer web server 45 is very heavily loaded with requests, but the east coast customer web server 40 has light traffic. Thus, in response to the user's request, NameServer server 110 determines that the user should be directed to customer web server 40. NameServer server 110 then determines the IP address of customer web server 40, step 450, and then sends the IP address to the user, step 460.
In an alternative embodiment of the present embodiment, NameServer server 110 may uses latency information from multiple NetProbe servers such as
NetProbe server 670 and 675. Further, NameServer server 110 may uses this information in addition to latencies between the NameServer server and NetProbe servers 670 and 675 in step 440. As an example, NetProbe server 670 reports the latency of customer web server 45 to NameServer 110 because NetProbe server 670 may be the closest NetProbe server; further NetProbe server 675 reports the latency of customer web server 40 to
NameServer 110 because NetProbe server 675 may be the closest NetProbe server. In an example, if the reported latencies are the same, NameServer 110 would resolve the domain name to customer web server 45 because the additional latency between NetProbe server 670 and NameServer 110 is smaller than the latency between NetProbe server 675 and NameServer 110. In other words, the sum of the latencies are used to determine how to resolve the IP address. In this example, user 150 receives the IP address, and then sends requests for a web page, or the like, at the received IP address, step 470. The web page request typically includes static content and dynamic content. In this example, static content may include graphics and text that typically do not change between web page accesses, for example, corporate logos, archival images, documents, and the like. In this example, dynamic content may include graphics and text that do change between web page accesses, for example, stock quotes, search results, live web cameras, and the like.
In the present example, the web page requested from customer web server 40 includes static data, such as HTML text and URL addresses of static content, such as images, and dynamic data, such as stock quotes. In Fig. 1, the static data typically resides on web server 130, and the dynamic data is resides within server 140. As discussed above, this data is also typically mirrored on web server 135 and server 145 in customer web server 45. In the present embodiment, the customer does not want web server 130 to have the burden of providing static content to many different users. In this embodiment, the POP network provides the content hosting of the static content. Accordingly, the URL addresses of the static content typically include textual domain names specifying the POP server network. In response to the request from user 150, web server 130 sends the HTML text, or the like, and the URL address of images to user 150, step 480. Additionally, server 140 may process the request from user 150, determine the responsive data dynamically, and send the dynamic data back to user 150, step 490.
In the present embodiment, the user's 150 client software receives the HTML text, dynamic data, URL address of static content, and the like from customer web server 40. Similar to above, the URL address is passed (220) to client DNS (LDNS) 170 for resolution, step 500. In this example, in response, client DNS 170 passes (230) the URL address to NameServer server (ADNS) 110, step 510.
In the present embodiment, NameServer server 110 uses the service metrics information associated with the other POP servers that was determined in step 400, to select an appropriate POP server that includes the static content, step 520. In this example, NameServer server 110 determines that the west coast POP server 10 is very heavily loaded with content requests, but the east coast POP server 30 has light few requests. Thus, in response to the user's request, NameServer server 110 determines that the user should retrieve data from POP server 30. NameServer server 110 then recalls the
IP address of POP server 40, step 530, and then sends the IP address to the user, step 540.
Next, user 150 receives the IP address of POP server 30, and then sends requests (250) for the static content from POP server 30, step 550. In response to the request, POP server 30 routes the request to WebCache server 280. Next, WebCache server 280 retrieves the static content from memory, and sends the requested static content back to user 150, step 560.
In an important embodiment of the present invention, the customer's web server may also be considered for providing the static content. In such an embodiment, in step 520, above, the latency and traffic load information for the customer web server may be considered at the same time as the other POP servers . If the customer's web server is lightly loaded compared to the POP servers, the IP of the customer's web server may be retrieved in step 530, and passed to the user in step 540. Accordingly, in step 560, the customer's web server provides the static content back to the user. Once the client receives the static content, the user can view the HTML text and dynamic data from customer web server 40, as well as view the static content from POP server 30, step 570. In one embodiment, the source of the static content may be specified as being provided at a specific server or a limited set of servers.
Fig. 4 illustrates a flow chart according to another embodiment of the present invention. In particular, Fig. 4 illustrates a more detailed block diagram of step 560, above.
Initially, WebCache server 280 determines whether it has the requested static content in memory, step 600. In one embodiment of the present invention, this step may include determining whether the file requested is resident in memory. If the requested static content is not cached within WebCache server 280, WebCache server 280 sends a request (260) for the static content to WebSever server 290, step 610.
In response, Webserver server 290 determines whether it has the requested static content on disk, or the like, 620. In one embodiment of the present invention, this step may include determining whether the file requested is resident in on a disk or database. If the requested static content is not stored within Webserver server 290, Webserver server 290 determines sends a request (270) for the static content to a customer web server, step 630.
In embodiments of the present invention, the appropriate customer web server selected is also determined based upon the traffic load and latency of the multiple customer web servers to POP server 30. As an example, since POP server 30 is on the east coast and close to customer web server 40. As a result, the request for the static content is directed to customer web server 40. In an alternative embodiment, the customer web server from which static content is retrieved is predetermined. In this example, customer web server 40 provides the static content to
Webserver server 290, which in turn stores the static content to disk, or the like, step 640.
Once the data is stored within Webserver server 290, the static content is copied into the memory of WebCache server 280, step 650. Next, WebCache server 280 returns the static content to the user, step 660. Fig. 5 illustrates a more detailed block diagram according to one embodiment. In particular, Fig. 5 illustrates are more detailed description of the flow of data between software components on the POP network.
Fig. 5 includes a reduced number of software components on servers on the POP network, including a NameServer server 700, a WebCache server 710, a Webserver server 720, NetProbes 730, LogServer server 740, and NetView server 750.
As illustrated, in the present embodiment, communication between the components is performed using a packetized format such as TCP, UDP, or the like. Fig. 5 also illustrates a customer web site 760.
In the present embodiment, the configuration of the POP network is maintained in a single global configuration file. This file is known as the "global configuration" file. In this example, every POP server in the POP network determines configuration information by the copy of the current Speedera configuration file stored locally.
In this embodiment, the configuration file includes a list of servers allowed to change the configuration; a list of domains the network is responsible for (i.e. customers' web sites); list of software components and services available in each POP server; a list of probes that perform latency checks at each POP server; and the like.
Other similar types of information is also included.
In the present example, a new configuration file can be pushed to all POP server by using the AdminTools software, mentioned above.
The configuration file in the present embodiment, is in a "registry tree" type format. Typically few statistics, status or extended information is kept in the configuration file. Such data is typically stored in server 740, as will be described below.
In the present example, the configuration file is kept small to reduce the amount of time required for updating a configuration file across the POP network. Further, a tree-type format is used to allow the configuration to be easily extended without having to change its structure.
In this embodiment, the configuration file is not necessarily sent to all POP servers in the POP network. Instead, only POP servers including NameServers,
NetProbes and LogServers are updated. In the present embodiment, POP servers only including a WebCache or a WebServers are not updated, because WebCache and
WebServers access NameServers to find services on the POP network. For example, when a WebCache needs to send information to a log server, it will make a request to: "log.name.speedera.com." In response, a request is generated to a NameServer and in response the NameServer returns the appropriate IP address. Because the configuration file does not need to be sent to all the POP servers in the network, any configuration file update is performed efficiently.
With this configuration file scheme, to install a new POP server, an administrator uses the AdminTools software tools, to register the unique domain name and the services or software components installed on the new POP server. These data are added to the configuration file. Subsequently, the configuration file is copied across the POP network.
In the present embodiment, as mentioned above, the domain name uniquely identifies the POP server. In one example, the name is "name.pop.speedera.net", "dpl.westusa.speedera.net", or the like.
As mentioned above, the NameServer software is embodied as a DNS server that performs name to IP address mapping. In particular, the software receives requests containing textual names and returns numeric IP addresses typically in the form of xxx.xxx.xxx.xxx, associated with those names. In the present embodiment, the software is also responsible for receiving and storing latency, POP server status information, and the like from the NetProbe servers. Other types of data includes the source IP address, the availability of the POP server, the traffic load, and the like are also stored. In the present embodiment, approximately 10 servers including the
NameServer software will operate on the POP network. Typically, the NameServer software is installed onto its own machine to increase the performance of the software. In one embodiment, the DNS servers should be designed to handle upwards of 1000 hits per second. It is contemplated that the 10 DNS servers will handle all the DNS requests for all users on the Internet that are serviced by the POP network. For example, users that require the POP network to determine the least loaded customer web site, and users that require content stored on the POP network both require services from the DNS servers. In alternative embodiments, a greater number of lesser number of DNS servers may be used according to specific application.
In this embodiment, in operation, the NameServer replies to DNS queries and does not return any other information. Specifically, to generate a reply to a DNS query, the DNS server may uses the following information: the configuration file to determine what machines have the content corresponding to the request; a table containing latency information for each POP to each client DNS (traffic load); a table containing service metrics (load/availability) information for each POP server, and the like.
In this embodiment, the data received from NetProbe servers comprise UDP traffic. Further the NameServer software transmits log information to the
LogServer software, also via UDP traffic. In other embodiments, other schemes for transferring packetized data is contemplated.
The NameServer receives the service and latency information data from the NetProbe servers. In addition, in this embodiment, timestamp is also received so the Name Server software knows how stale the performance information is. The NameServer software stores this information to disk, to RAM, or the like. In this embodiment, the NameServer data is typically stored on disk to reduce the effects of a crash or a reboot. In this way, the latency and probe information is maintained. Further, in some cases, data such as persistence information, should be maintained even if the system is rebooted. In this embodiment, the NameServers log the requests they get and the replies they give, i.e. IP addresses. In operation, at regular intervals (specified by the global configuration file), the NameServer sends its latest statistics (e.g. number of hits it has taken per domain name, etc.) to the various NetProbe servers and appropriate log analysis servers via UDP. In the global configuration file, log analysis servers are assigned based on the domain name served. In the present embodiment, the log analysis is a distributed process.
In the present embodiment, the DNS software is proprietary and supports all features of BIND. The WebCache software is typically embodied as a caching web server.
As described above, the WebCache server accepts HTTP or URL requests and checks memory for the requested content. As discussed in the flow diagram above, the content is typically static content from a customer's web page, or the like. If the content is not in the memory, the WebCache server contacts a Webserver server that may have the web page stored using HTTP.
In one embodiment, WebCache is a customized caching server. In other embodiments, WebCache may include hardware cache boxes available from companies such as CacheFlow, Inktomi, or the like. In this example, at a regular fixed intervals, the WebCache server compresses and sends the logs of the content it stores, and sends the logs to log analysis servers. This transfer is typically via UDP traffic. This information provided by the
WebCache server is typically used for billing purpose as well as by customers for auditing and log analysis. In the case where a hardware cache box is used, the box must also be configured to send the logs to the log analysis servers.
In the present embodiment, the WebCache software is proprietary but could be integrated with off the shelf caching products.
In the present embodiment, when a WebCache server cannot provide the content requested, it forwards the request to a server running Webserver software it believes has the content. As discussed above, the Webserver software processes the request by looking on its local disk for the content. If the content exists on the local disk, the Webserver will pass the data to the WebCache. Further, if the content in not on the disk, the Webserver contacts the customers web site to retrieve the content.
In this embodiment, the Webserver typically does not provide log data. Instead, the WebCache provides the cache miss information to the log analysis servers.
Webserver, along with many of the other software modules do report error conditions to the log server.
In the present embodiment, the Webserver and WebCache may exist as one program. In alternative embodiments, these pieces of software may be separate and even reside on different machines.
The NetProbes software performs service availability/metric and latency probes and sends the results via UDP to NameServer servers. There are two distinct types of probes in the present embodiment, service probes and latency probes. In this example, service probes determine service availability and load
(metrics) for each content delivery machine in the network. Content delivery machines includes the POP servers including WebCache and Webserver modules, and may also include the customer's web site(s). Service probes monitor data such as HTTP total response time, FTP total response time, and the like.
In the present embodiment, service probes run constantly, and send current metric and availability information to all DNS servers in the POP network. The timing of the probe intervals and configuration of the service probes is typically speechified in the global configuration file. Latency probes determine latency from the POP server hosting the
NetProbe to the client DNS servers that sent the requests to Speedera DNS servers. The Speedera DNS servers typically drive the latency probes. In particular, when a DNS server determines that it needs latency information from a probe, it sends a request to the probe. In response, the latency probe probes the client DNS server and returns with the result. In one embodiment, the DNS servers send a number of requests in a single UDP packet to a latency probe.
The NetProbe servers typically do not store the results of the probes, but simply sends them to other servers via UDP. Each piece of probe information has a timestamp of when the probe occurred so the receiving server can determine how stale the probe information is.
In the present embodiment, LogServer servers receives data from machines running NameServer, NetProbes and WebCache software. The data is typically transmitted to one or more log analysis servers where the data is stored and analyzed. As mentioned above, the data is typically stored by customer name to facilitate billing and preparation of audit and historical statistics. Additionally, The LogServer software also gets requests for information from the NetView software. Such a request may be made when an administrator, or the like wants to graphically view or view the table of data for current or historical status of a web site (hits, open connections, etc.).
The configuration file will have a list of log servers and the customer domains they manage, however, LogServers will only receive data for the customer domains they manage, not for other domains.
The NetView software referred to above is typically an HTML user- interface providing customers and administrators to use a web browser to view real-time and historical information. The NetView software includes a web server and CGI or web server plug-in modules written in Perl or C.
In operation, NetView servers contact LogServers or the LogServer database to get the underlying data. The raw data from LogServers and formats and displays it to customers, or the like in the form of tables and/or graphs.
The NetView software also provides an interface that allows a customer to flush content from the POP network when they publish new content to their servers. To perform the action of flushing content, the NetView software executes the AdminTool software that has the ability to flush content from the system. In the present embodiment, the AdminTools software includes tools used to administer the Speedera Network. It contains two main tools, one to push a new configuration file to servers in the network and another to cause content to be flushed from the network.
The configuration administration tool sends the global configuration file to servers in the network in a hierarchical fashion. A list of servers will get the configuration file and send it to another list of servers and so on. The list of servers to receive the configuration file and what servers they control is typically specified by the configuration file itself.
In this embodiment, the configuration system is written such that it does not allow configuration files to get out of sync, when machines are down or if a socket connection breaks while transmission is in process, or the like. For example, in one embodiment, the administration tool opens sockets to all the master configuration servers, sends the configuration file and waits for confirmation that they and everyone they send the file to receives it and then send a command to tell everyone to switch to use that configuration file.
In this embodiment, the configuration administration tool uses TCP and not UDP because of the increased security. Additionally, password protection and encryption may also be used.
The Shared software includes software that will exist on all servers in the network. This includes a configuration daemon that listens for the transmission of a new configuration file, stores it locally and sends a SIGHUP to local processes to notify them the configuration has changed. In the foregoing specification, the invention has been described with reference to specific exemplary embodiments thereof. Many changes or modifications are readily envisioned. For example, in some embodiments, steps 460 and 470 are not explicitly performed. Instead, after the DNS server resolves the IP address of the customer web server in step 450, the IP address is automatically used to address the customer web server. Similarly, in some embodiments, steps 540 and 550 are not explicitly performed. Instead, after the DNS server resolves the IP address of the POP server in step 530, the IP address is automatically used to address the POP server.
Details of additional embodiments of the present invention may be found in the documents in the attached appendix.
In embodiments of the present invention, the static content stored on the web caches may include movies, audio, images, text, plug-in program data, programs, and the like. Other types of data can be stored within the web caches, for example, streaming audio data, video data, and the like. Further, the embodiments have discussed providing data across wide area networks, such as the Internet. In other embodiments, embodiments of the above described invention may be applied to other types of networks including wireless networks, local area networks, and the like.
The block diagrams of the architecture and flow charts are grouped for ease of understanding. However it should be understood that combinations of blocks, additions of new blocks, re-arrangement of blocks, and the like are contemplated in alternative embodiments of the present invention. Further, the above names of servers, software modules, and the like are arbitrary, and may be changed.
The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense. It will, however, be evident that various modifications and changes may be made thereunto without departing from the broader spirit and scope of the invention as set forth in the claims.

Claims

WHAT IS CLAIMED IS:
1. A method for operating a network of point of presence servers . sharing a hostname, the method comprises: receiving a request from a user for a web page at a first web address, the first web address including the hostname; determining traffic loads of a plurality of customer web servers, each of the customer web servers storing the web page; determining a customer web server from the plurality of customer web servers that is appropriate for the request, the customer web server having a traffic load lower than traffic loads of remaining customer web servers from the plurality of customer web servers; determining an IP address of the customer web server; directing the request from the user to the customer web server; thereafter receiving a request from the user for static content on the web page at a second web address, the second web address including the hostname; determining service metrics of point of presence servers in the network of point of presence servers; determining the point of presence server from the network of point of presence servers that is appropriate for the request, the point of presence server having service metrics better than service metrics of remaining point of presence servers from the network of point of presence servers; retrieving the static content from the point of presence server; and providing the static content to the user.
2. The method of claim 1 further comprising determining load of point of presence servers in the network of point of presence servers; wherein determining the point of presence server from the network of point of presence servers that is appropriate for the request, the point of presence server having a latency and a load lower than latency or load of the remaining point of presence servers from the network of point of presence servers.
3. The method of claim 1 further comprising: determining whether the point of presence server includes the static content; determining a web server that includes the static content when the point of presence server does not include the static content; retrieving the static content from the web server that includes the static content; and storing the static content from the web server in the point of presence server.
4. The method of claim 3 wherein determining the web server comprises: determining traffic loads of the plurality of customer web servers, each of the customer web servers storing the static content; and determining another customer web server from the plurality of customer web servers that is appropriate for the request, the another customer web server having a traffic load lower than traffic loads of remaining customer web servers from the plurality of customer web servers.
5. The method of claim 4 wherein retrieving the static content from the web server comprises: determining another IP address of the another customer web server; and requesting the static content from the another customer web server at the another IP address.
6. The method of claim 1 wherein the network of point of presence servers comprises a domain name server.
7. The method of claim 1 wherein the request from the user for the web page is transferred from a first domain name server; wherein the network of point of presence servers comprises a second domain name server; and wherein the second domain name server determines the customer web server from the plurality of customer web servers.
8. A network of point of presence servers sharing a hostname comprises: a first network probe server configured to determine traffic loads of a plurality of customer web servers, and configured to determine latency of web cache servers in the network of point of presence servers, each of the customer web servers storing a web page; a first domain name server coupled to the first network probe server, configured to receive a request from a user for the web page at a first web address, the first web address including the hostname, configured to determine a customer web server from the plurality of customer web servers that is appropriate for the request, the customer web server having a traffic load lower than traffic loads of remaining customer web servers from the plurality of customer web servers, configured to determine an IP address of the customer web server, configured to return the IP address of the customer web server, configured to receive a request from the user for static content on the web page at a second web address, the second web address including the hostname, and configured to determine a web cache server from the network of point of presence servers that is appropriate for the request, the web cache server having a latency lower than latency of remaining web cache servers within the network network of point of presence servers; and the web cache server configured to retrieve the static content, and to provide the static content to the user.
9. The network of claim 8 wherein the first network probe server is also configured to determine traffic load of web cache servers in the network of point of presence servers; and wherein the web cache server also has a traffic load lower than traffic loads of remaining web cache servers within the network network of point of presence servers.
10. The network of claim 8 wherein the web cache server is also configured to determine whether it includes the static content, and configured to generate a web cache miss signal, if the static content is not included; and the network further comprising a web server coupled to the web cache configured to determine whether it includes the static content in response to the web cache miss signal, configured to request data from a customer web server that includes the static content, if the static content is not included, and configured to store the static content; wherein the web cache server is also configured to receive the static content from the web server.
11. The network of claim 10 wherein the first domain name server is also configured to determining traffic loads of a plurality of customer web servers, each of the customer web servers storing the static content, and is configured to determine another customer web server from the plurality of customer web servers that is appropriate for the request, the another customer web server having a traffic load lower than traffic loads of remaining customer web servers from the plurality of customer web servers.
12. The network of claim 11 wherein the first domain name server is also configured to determining another IP address of the another customer web server, and configured to request the static content from the another customer web server at the another IP address.
13. The network of claim 8 wherein the first domain name server and the network probe server reside on a common point of presence server.
14. The network of claim 8 wherein the request from the user for the web page is transferred from a second domain name server.
15. A method for a computer network comprises: using a web client to enter a first web address of a web page; sending a first request from the web client to a client DNS server to resolve the first web address; sending the first request from the client DNS server to a POP DNS server within a POP server network; using a probe server in the POP server network to determine latencies between the probe server to a plurality of customer web servers, each of the customer web servers having the web page; using the POP DNS server to determine a customer web server from the plurality of customer web servers in response to the latencies, the customer web server having a latency lower than latencies of other customer web servers in the plurality of customer web servers; requesting the web page from the customer web server, the web page including an embedded URL representing static content; sending the web page from the customer web server to the web client; sending a second request from the web client to the client DNS server for resolving the URL; sending the second request from the client DNS server to the POP DNS server within the POP server network; using the probe server to determine loads of a plurality of web caches within the POP server network; using the POP DNS server to determine a web cache from the plurality of web caches in response to the loads, the web cache having a load lower than loads of other web caches in the plurality of web caches; retrieving the static content from the web cache; sending the static content to the web client; and outputting the static content with the web client.
16. The method of claim 15 wherein the loads comprise measurements selected from: CPU load, memory load, HTTP response time, FTP response time.
17. The method of claim 15 wherein the latencies comprise data selected from: CPU load, memory load, HTTP response time, FTP response time.
18. The method of claim 15 determining whether the web cache includes the static content; determining a web server that includes the static content when the web cache does not include the static content; retrieving the static content from the web server; and storing the static content from the web server in the web cache.
19. The method of claim 18 wherein determining the web server comprises: using the probe server to determine latencies of a plurality of customer web servers that stores the static content; and using the POP DNS server to determine another customer web server from the plurality of customer web servers that stores the static content, the other customer web server having a latency lower than latencies of other customer web servers in the plurality of customer web servers that includes the static content.
20. The method of claim 15 further comprising: using another web client to enter a second web address of another web page; sending a first request from the other web client to another client DNS server to resolve the second web address; sending the first request from the other client DNS server to the POP DNS server; using the probe server in the POP server network to determine latencies between the probe server to a plurality of second customer web servers, each of the second customer web servers having the other web page; using the POP DNS server to determine a second customer web server from the plurality of second customer web servers in response to the latencies, the second customer web server having a latency lower than latencies of other second customer web servers in the plurality of second customer web servers; requesting the other web page from the second customer web server, the other web page including a web address associated with static data; and sending the other web page from the second customer web server to the other web client.
21. The method of claim 20 further comprising: sending a second request from the other web client to the client DNS server for resolving the web address; sending the second request from the other client DNS server to the POP DNS server within the POP server network; using the probe server to determine loads of a plurality of web caches within the POP server network; using the POP DNS server to determine a web cache from the plurality of web caches in response to the loads, the web cache having a load lower than loads of other web caches in the plurality of web caches; retrieving the static data from the web cache; sending the static data to the other web client; and outputting the static data with the other web client.
PCT/US2000/031939 1999-11-22 2000-11-21 Method for operating an integrated point of presence server network WO2001039003A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU19243/01A AU1924301A (en) 1999-11-22 2000-11-21 Method for operating an integrated point of presence server network

Applications Claiming Priority (8)

Application Number Priority Date Filing Date Title
US16690699P 1999-11-22 1999-11-22
US60/166,906 1999-11-22
US09/644,927 2000-08-23
US09/645,067 US6694358B1 (en) 1999-11-22 2000-08-23 Performance computer network method
US09/648,420 US7925713B1 (en) 1999-11-22 2000-08-23 Method for operating an integrated point of presence server network
US09/645,067 2000-08-23
US09/644,927 US6405252B1 (en) 1999-11-22 2000-08-23 Integrated point of presence server network
US09/648,420 2000-08-23

Publications (2)

Publication Number Publication Date
WO2001039003A1 true WO2001039003A1 (en) 2001-05-31
WO2001039003A9 WO2001039003A9 (en) 2002-07-25

Family

ID=27496727

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2000/031939 WO2001039003A1 (en) 1999-11-22 2000-11-21 Method for operating an integrated point of presence server network

Country Status (2)

Country Link
AU (1) AU1924301A (en)
WO (1) WO2001039003A1 (en)

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1473907A2 (en) * 2003-04-30 2004-11-03 Avaya Technology Corp. Dynamic load balancing for enterprise IP traffic
EP1969479A2 (en) * 2005-11-21 2008-09-17 Limelight Networks, Inc. Domain name resolution resource allocation
US8504721B2 (en) 2000-09-26 2013-08-06 Brocade Communications Systems, Inc. Global server load balancing
CN103562884A (en) * 2011-06-29 2014-02-05 国际商业机器公司 Transparent cache for mobile users
EP2290916A3 (en) * 2001-09-28 2014-04-09 Level 3 CDN International, Inc. Configurable adaptive global traffic control and management
US8862740B2 (en) 2004-05-06 2014-10-14 Brocade Communications Systems, Inc. Host-level policies for global server load balancing
US8949850B2 (en) 2002-08-01 2015-02-03 Brocade Communications Systems, Inc. Statistical tracking for global server load balancing
US9015323B2 (en) 2000-09-26 2015-04-21 Brocade Communications Systems, Inc. Global server load balancing
US9130954B2 (en) 2000-09-26 2015-09-08 Brocade Communications Systems, Inc. Distributed health check for global server load balancing
US9338182B2 (en) 2010-10-15 2016-05-10 Brocade Communications Systems, Inc. Domain name system security extensions (DNSSEC) for global server load balancing
US9584360B2 (en) 2003-09-29 2017-02-28 Foundry Networks, Llc Global server load balancing support for private VIP addresses
US10193852B2 (en) 2002-08-07 2019-01-29 Avago Technologies International Sales Pte. Limited Canonical name (CNAME) handling for global server load balancing
CN111711716A (en) * 2020-08-20 2020-09-25 苏州浪潮智能科技有限公司 Domain name resolution method, device and equipment and readable storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5459837A (en) * 1993-04-21 1995-10-17 Digital Equipment Corporation System to facilitate efficient utilization of network resources in a computer network
US5664106A (en) * 1993-06-04 1997-09-02 Digital Equipment Corporation Phase-space surface representation of server computer performance in a computer network
US6038601A (en) * 1997-07-21 2000-03-14 Tibco, Inc. Method and apparatus for storing and delivering documents on the internet
US6108703A (en) * 1998-07-14 2000-08-22 Massachusetts Institute Of Technology Global hosting system
US6128279A (en) * 1997-10-06 2000-10-03 Web Balance, Inc. System for balancing loads among network servers
US6167438A (en) * 1997-05-22 2000-12-26 Trustees Of Boston University Method and system for distributed caching, prefetching and replication

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5459837A (en) * 1993-04-21 1995-10-17 Digital Equipment Corporation System to facilitate efficient utilization of network resources in a computer network
US5664106A (en) * 1993-06-04 1997-09-02 Digital Equipment Corporation Phase-space surface representation of server computer performance in a computer network
US6167438A (en) * 1997-05-22 2000-12-26 Trustees Of Boston University Method and system for distributed caching, prefetching and replication
US6038601A (en) * 1997-07-21 2000-03-14 Tibco, Inc. Method and apparatus for storing and delivering documents on the internet
US6128279A (en) * 1997-10-06 2000-10-03 Web Balance, Inc. System for balancing loads among network servers
US6108703A (en) * 1998-07-14 2000-08-22 Massachusetts Institute Of Technology Global hosting system

Cited By (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8504721B2 (en) 2000-09-26 2013-08-06 Brocade Communications Systems, Inc. Global server load balancing
US9479574B2 (en) 2000-09-26 2016-10-25 Brocade Communications Systems, Inc. Global server load balancing
US9225775B2 (en) 2000-09-26 2015-12-29 Brocade Communications Systems, Inc. Global server load balancing
US9130954B2 (en) 2000-09-26 2015-09-08 Brocade Communications Systems, Inc. Distributed health check for global server load balancing
US9015323B2 (en) 2000-09-26 2015-04-21 Brocade Communications Systems, Inc. Global server load balancing
US10116738B2 (en) 2001-09-28 2018-10-30 Level 3 Communications, Llc Detecting anomalous conditions in a name server network
US9203636B2 (en) 2001-09-28 2015-12-01 Level 3 Communications, Llc Distributing requests across multiple content delivery networks based on subscriber policy
EP2290916A3 (en) * 2001-09-28 2014-04-09 Level 3 CDN International, Inc. Configurable adaptive global traffic control and management
US10911531B2 (en) 2001-09-28 2021-02-02 Level 3 Communications, Llc Multi-tiered server network
US8949850B2 (en) 2002-08-01 2015-02-03 Brocade Communications Systems, Inc. Statistical tracking for global server load balancing
US11095603B2 (en) 2002-08-07 2021-08-17 Avago Technologies International Sales Pte. Limited Canonical name (CNAME) handling for global server load balancing
US10193852B2 (en) 2002-08-07 2019-01-29 Avago Technologies International Sales Pte. Limited Canonical name (CNAME) handling for global server load balancing
US7308499B2 (en) 2003-04-30 2007-12-11 Avaya Technology Corp. Dynamic load balancing for enterprise IP traffic
EP1473907A3 (en) * 2003-04-30 2004-11-24 Avaya Technology Corp. Dynamic load balancing for enterprise IP traffic
EP1473907A2 (en) * 2003-04-30 2004-11-03 Avaya Technology Corp. Dynamic load balancing for enterprise IP traffic
US9584360B2 (en) 2003-09-29 2017-02-28 Foundry Networks, Llc Global server load balancing support for private VIP addresses
US8862740B2 (en) 2004-05-06 2014-10-14 Brocade Communications Systems, Inc. Host-level policies for global server load balancing
EP1969479A2 (en) * 2005-11-21 2008-09-17 Limelight Networks, Inc. Domain name resolution resource allocation
US8645539B2 (en) 2005-11-21 2014-02-04 Limelight Networks, Inc. Domain name resolution resource allocation
EP1969479A4 (en) * 2005-11-21 2013-07-10 Limelight Networks Inc Domain name resolution resource allocation
US9338182B2 (en) 2010-10-15 2016-05-10 Brocade Communications Systems, Inc. Domain name system security extensions (DNSSEC) for global server load balancing
CN103562884A (en) * 2011-06-29 2014-02-05 国际商业机器公司 Transparent cache for mobile users
CN111711716A (en) * 2020-08-20 2020-09-25 苏州浪潮智能科技有限公司 Domain name resolution method, device and equipment and readable storage medium
CN111711716B (en) * 2020-08-20 2020-12-04 苏州浪潮智能科技有限公司 Domain name resolution method, device and equipment and readable storage medium

Also Published As

Publication number Publication date
AU1924301A (en) 2001-06-04
WO2001039003A9 (en) 2002-07-25

Similar Documents

Publication Publication Date Title
US6405252B1 (en) Integrated point of presence server network
US6694358B1 (en) Performance computer network method
US7925713B1 (en) Method for operating an integrated point of presence server network
US10476984B2 (en) Content request routing and load balancing for content distribution networks
US8806008B2 (en) HTML delivery from edge-of-network servers in a content delivery network (CDN)
EP1493094B1 (en) Method and system for tiered distribution in a content delivery network
Dilley et al. Globally distributed content delivery
JP2002503001A (en) Optimized network resource location
AU2001268140A1 (en) Viewer object proxy
WO2001080004A2 (en) Http redirector
WO2001039003A1 (en) Method for operating an integrated point of presence server network
WO1999060459A2 (en) Method and apparatus for effective traffic localization through domain name system
KR100375121B1 (en) Web contents transmission method
CA2413886A1 (en) Client side holistic health check
WO2002039699A1 (en) Domain name system extensions to support reverse proxy operations and layer-7 redirection
AU2001272931A1 (en) Client side address routing analysis
CA2410866C (en) Client side deterministic routing and transparent redirection
Beaumont Meeting worldwide demand for your content

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
AK Designated states

Kind code of ref document: C2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: C2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

COP Corrected version of pamphlet

Free format text: PAGES 1/7-7/7, DRAWINGS, REPLACED BY NEW PAGES 1/7-7/7; DUE TO LATE TRANSMITTAL BY THE RECEIVING OFFICE

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase