USRE39184E1 - Identifying, processing and caching object fragments in a web environment - Google Patents

Identifying, processing and caching object fragments in a web environment Download PDF

Info

Publication number
USRE39184E1
USRE39184E1 US10/601,267 US60126703A USRE39184E US RE39184 E1 USRE39184 E1 US RE39184E1 US 60126703 A US60126703 A US 60126703A US RE39184 E USRE39184 E US RE39184E
Authority
US
United States
Prior art keywords
fragment
description
client
persistent
storage device
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US10/601,267
Inventor
Robert Schloss
Philip Shi-lung Yu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by International Business Machines Corp filed Critical International Business Machines Corp
Priority to US10/601,267 priority Critical patent/USRE39184E1/en
Application granted granted Critical
Publication of USRE39184E1 publication Critical patent/USRE39184E1/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation
    • G06F16/9574Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching

Definitions

  • the present invention relates generally to the analysis of the content of a digital document and in particular to the creation and maintenance of persistent fragment identities to facilitate caching.
  • HTML hypertext markup language
  • XML Extensible Markup Languages
  • PRISM from Spyglass
  • a means for performing on-demand data type-specific lossy compression on semantically typed data and tailoring content to the specific constraints of the clients is described in “Adapting to Newark and Client Variability via On-Demand Dynamic Distiflation,” by A. Fox, et al., Proc. 7th Intl. Conference on Architectural Support for Programming Languages and Operating Systems, Oct. 1996.
  • markup language Using formal descriptors, such as a markup language, to describe a digital document provides tremendous flexibility.
  • more powerful markup languages such as XML, or a subset of the Standard Generalized Markup Language (SGML) (see e.g., ISO 8879/1986; and Designing XML Internet Applications, by M. Laventhal, et al., Prentice Hall, 1998), arc being defined to augment HTML.
  • the markup language description can provide rich information on the document structure and the final document to be generated.
  • XML is a language that allows users to define their own language.
  • chemists can define a chemical markup language to describe a molecular structure.
  • proxy or Web servers and client browsers today do not interpret the markup language to decompose a document or object into components, provide persistent identities and tracking mechanisms to facilitate caching and recognition of repeated occurrences of components of a named object. They mainly provide caching or processing service for named objects as a whole.
  • the text documents and images which are separated out from the text documents by the authors
  • Another problem is that if a document includes dynamic content caching is not meaningful as the next reference to the same document URL can result in a different version of the document. Thus a document is not cached even if only a small fraction of its content is dynamic. This is an issue for HTML documents today and is expected to become more severe for XML documents, which are more flexible and make it easier to incorporate various types of dynamic information, such as data from a database.
  • the need remains for a system and method for identifying and creating one or more persistent object fragments from named object, for example to facilitate caching.
  • the present invention addresses this need.
  • the present invention is directed to a method and apparatus for identifying and creating persistent object fragments from a named object.
  • the present invention is directed to a method and apparatus for dynamically parsing a digital content description of a named digital object, creating and maintaining fragment identities to facilitate caching.
  • named digital objects include but are not limited to: Web pages described in XML, SGML, and HTML.
  • the present invention has features which can parse/analyze the object description, identify object fragments and create persistent object fragment identities, and revise the object description by replacing each object fragment with its newly created persistent identity and send the revised object description to the requesting node.
  • this can either enable the fragment to be cacheable (which can be at the content/proxy server and the client device in the Web environment), or make the revised object description cacheable at the server and client device.
  • the object description of a purchase order which contains a dynamic part to retrieve the current price of a product from the database. This dynamic part may be a small portion of the purchase order, but would prevent the object from being cached.
  • the revised document becomes static and therefore cacheable.
  • fragments can be nested.
  • a method is also provided to determine which part/segment of a named object to recognize as a fragment identity, based on its properties, which can include its size, processing cost to generate that segment of the object from its description, and other properties such as static vs. dynamic.
  • the present invention has yet other features to determine which fragments to cache and replace.
  • the cache manager takes into account the fragment size and processing cost to generate the fragment.
  • the present invention has still other features which allow different versions to be generated for a fragment upon request.
  • the version created can be determined by the property of the requesting devices and the fragment description. Different generators can be maintained for each type of descriptors or markup tags to generate different versions for different types of devices.
  • An example of a method for identifying object fragments in an object having features of the present invention comprises the steps of: analyzing an object description to identify one or more persistent object fragments associated with the object; creating the one or more persistent object fragments, in response to mid analyzing; and creating a persistent object fragment identity for a persistent object fragment, based on one or more of formal descriptors or an object fragment property.
  • the object description is revised by replacing at least one object fragment with an associated persistent object fragment identity to enable the fragment to be cacheable at one or more of a server and a client; and the revised object description is sent to the client.
  • the client receives the revised object description; and processes and/or caches the revised object description.
  • the client can also receive a version of the one a more object fragments associated with the fragment identity, wherein the version is generated at the server and is based on the capability of the client (e.g., whether it is a handheld device, a set top box, or an Internet appliance.
  • FIG. 1 is a diagram of an Internet environment having features of the present invention
  • FIG. 2 is a more detailed example of a network environment having features of the present invention.
  • FIG. 3 depicts an example of a digital document using a markup language
  • FIG. 4 depicts an example of a modified document
  • FIG. 5 depicts the data structure of the fragment description table
  • FIG. 6 is an example of the server logic of FIG. 2 ;
  • FIG. 7 is an example of the object request handler
  • FIG. 8 is an example of the object parser
  • FIG. 9 is an example of the next segment locator
  • FIG. 10 is as example of the persistent name creator
  • FIG. 11 is an example of the fragment request handler
  • FIG. 12 is an example of the fragment cache manager
  • FIG. 13 depicts an example of the client logic
  • FIG. 1 depicts an example of an Internet environment adaptable to the present invention.
  • a client 60 . . . 63
  • the proxy servers and Web servers can provide caching of frequently access Web objects to improve client access time.
  • the client may also have its own cache.
  • a proxy server can be replaced by a hierarchy of proxy servers.
  • a client node ( 60 ) can also run a proxy server.
  • FIG. 2 depicts an example of an overall architecture of a computing node having features of the present invention.
  • the node can be a Web server, a proxy server.
  • the computing node can include: a CPU ( 250 ); a scratch pad or main memory ( 245 ) such as RAM; and persistent storage devices ( 260 ) such as direct access storage devices (DASD).
  • the memory ( 245 ) stores the server logic 240 (with details depicted in FIG. 6 ) preferably embodied as computer executable code which may be loaded from DASD ( 260 ) into memory ( 245 ) for execution by CPU ( 250 ).
  • the server logic ( 240 ) includes an object request handler ( 205 ) (with details depicted in FIG. 7 ), and a fragment request handler ( 210 ) (with details depicted in FIG. 11 ). It also maintains a fragment cache ( 270 ), an object cache ( 275 ) and a fragment description table ( 280 ) (with detailed depicted in FIG. 5 ). This information can either reside in persistent storage ( 260 ) or in main memory ( 245 ).
  • an XML-like document will be used as an example of a document described using some formal language, such as a markup language.
  • FIG. 3 shows an example of an XML-like document.
  • the document includes multiple segments ( 330 ), where each segment ( 330 ) is enclosed between a “start-tag” ( 310 ) and an “end-tag” ( 320 ).
  • start-tag 310
  • end-tag 320
  • start-tag 310
  • end-tag 320
  • ⁇ cml: molecule> (similarly, “ ⁇ m: order>” and “ ⁇ db: price>”) is a start-tag ( 310 ) and its corresponding “end-tag” ( 320 ) is “ ⁇ /cml: molecule>” (“ ⁇ /m: order>” and “ ⁇ db: price>”, respectively).
  • the segments may be nested.
  • the segment with the ⁇ price> start-tag is included within the segment with the ⁇ m: order> start-tag.
  • parsing the document to recognize the segments can be done by matching each “end-tag” with the corresponding “start-tag”, which is the first preceding “start-tag” of the same type at the same nested level.
  • each segment can have a DTD (document type definition) to describe the semantics of the markup.
  • DTD document type definition
  • Fragment creation eligibility criterion will be introduced next to determine when an object fragment should be created. In the preferred embodiment, two sets of creation eligibility criterion are considered. For each persistent object fragment, a persistent identity or name is assigned and tracked so that if the object fragment appears in multiple objects or multiple times in the same object, it will be recognized as the same fragment.
  • the first fragment creation eligibility criterion is to recognize and separate out a segment as an object fragment so as to make the remaining document cacheable at the server or client device and/or processable/interpretable at the client device.
  • An example is to recognize a dynamic segment as an object fragment.
  • a segment can not be rendered from the markup language description by a simple client device such as WINDOWS CETTM-based Internet appliances.
  • the client can process and/or cache the remaining document and let the proxy server interpret the markup language describing the fragment and generate an appropriate version for the client.
  • This limitation on the client devices can be either due to limitation on the processing power or storage capacity of the client device to interpret the markup language and generate the object fragment, the limitation on the bandwidth available to the client device to retrieve the DTD of the fragment or other limitations.
  • the second criterion is based on the tradeoffs of processing and storage or bandwidth requirements to recognize and separate out a segment as an object fragment so it can be cached separately and reused to avoid going through interpreting the markup language description of the object to generate it again. This will improve response time and reduce server load on fragment re-references.
  • a segment or group of segments that meet a certain threshold on the processing requirements of interpreting the markup language description to generate the object segment is recognized as a fragment.
  • Another consideration is the additional storage requirement to store the rendered segment. For example, consider two cases.
  • the processing time is 100 second of CPU time to generate the segment from the description, and the size of the rendered segment is 10K bytes.
  • the processing time is 1 second of CPU time to generate the segment from the description, and the size of the rendered segment is 1000 K bytes.
  • the savings on CPU time is substantial while the additional storage cost is minimal.
  • the second case only in the first case is it worthwhile to recognize the segment as a separate fragment for caching.
  • P(O) be its processing cost to generate a segment from its description
  • S(O) be the additional storage requirement to store the segment.
  • a value function, F(P(O), S(O)), based on processing costs and storage requirements is used to determine the value of recognizing a fragment.
  • An example of a value function (F) will be processing cost (in seconds) divided by the square root of the additional storage requirement (in 100 Kbytes increments).
  • a given threshold say 5
  • FIG. 3 depicts an example of a document with 3 segments.
  • the first segment ( 330 1 ) begins with a start-tag, ⁇ cml: molecule>, and ends with an end-tag, ⁇ /cml: molecule> and the second segment begins with a start-tag, ⁇ m: order>, and ends with an end-tag, ⁇ /m: order>.
  • the second segment ( 330 2 ) includes a third segment ( 330 3 ) nested within it.
  • the third segment begins with a start-tag, ⁇ db: price>, and finishes with an end-tag, ⁇ db: price>. Assume the semantics of the three segments as follows.
  • the first segment provides an image of a molecule structure of a chemical compound.
  • the second segment contains a formula to generate an order table showing the price at different quantities.
  • the third segment retrieves the price information from the product database. Hence it is a segment with dynamic information.
  • FIG. 4 depicts an example of a modified Web document after the persistent fragments have been recognized and extracted.
  • first segment ( 330 1 ) is quite complex, whereas the computation of the order table is straightforward.
  • first segment ( 330 1 ′) and the third segments ( 330 3 ′) as recognized as persistent fragments with the identities, “ 125 . 1 ” and “ 28 . 3 ”, respectively.
  • FIG. 5 depicts an example of a fragment description table for tracking the object fragment identity and its description.
  • the table ( 505 ) includes a plurality of entries ( 507 ), where each table entry ( 507 ) points to a fragment description list ( 510 ) (only one shown for ease of description).
  • the list ( 510 ) includes one or more description elements ( 520 and 525 ).
  • Each fragment that maps to a given entry in the fragment description table ( 510 ) has a unique description element ( 520 ) on the fragment description list ( 510 ) of the entry.
  • the description element includes several fields: Nlink ( 530 ); Fname ( 535 ); and Fdescription ( 540 ).
  • the Fname ( 535 ) is the persistent name of the fragment.
  • the Fdescription ( 540 ) is the fragment description.
  • the Nlink ( 530 ) points to the next description element ( 525 ) which maps to the same fragment description table entry ( 507 ).
  • FIG. 6 depicts an example of the server logic ( 240 ).
  • the server waits for input. Depending upon the type of input, the appropriate routine will be invoked. If at step 610 , the input is an object request, the object request handler is invoked, in step 615 (with details described with reference to FIG. 7 ). Otherwise, in step 620 it is checked if the input is a fragment request. For example, in a Web environment, an object request can be identified on the basis that an object name will have as the server part of its URL, the name of a proxy server. If yes, in step 625 the fragment request handler (with details described with reference to FIG. 11 ) is invoked. Otherwise, in step 630 a miscellaneous routing is invoked to handle other types of input such as FTP requests which are orthogonal to the current invention and thus will not be described further.
  • FTP requests which are orthogonal to the current invention and thus will not be described further.
  • FIG. 7 depicts an example of the object request handler.
  • step 705 it is first checked whether the requested object is cached in the object cache maintained by this computing node. If the object is cached, in step 710 , the cached object is returned to the requesting node. Otherwise, in step 715 , the request is forwarded to the control server (or another proxy server).
  • step 720 the computing node waits for the object requested.
  • step 725 after receiving the object, the object parser (with details described with reference to FIG. 8 ) is invoked to analyze the object description and create fragments.
  • step 730 the object description, which may have been modified by the object parser, is sent back to the requestor.
  • the object cache manager is invoked to determine whether the object description (which may have been modified by the object parser) should be cached in the object cache.
  • the object cache manager is similar to a conventional Web cache manager that caches the Web objects. Any standard cache management policy, such as LRU (least recently used), or its variants can take into consideration on tradeoffs between object size, update frequency, and time since last reference (i.e., the reference frequency) can be used. See for example, C. Aggarwal, et al., “On Caching Policies for Web Objects”, IBM Research Report, RC 20619, Mar. 5, 1997, which is hereby incorporated by reference in its entirety, wherein variants of LRU caching algorithms on Web objects are described.
  • LRU least recently used
  • FIG. 8 depicts an example of the object parser depicted in FIG. 7 .
  • the object parser maintains two stacks—a “tag_stack” and a “segment_stack”—during its processing to identify persistent fragments.
  • the tag_stack includes the “start-tag”s scanned, but whose matching “end-tag”s have not yet been encountered during scanning of the object description.
  • the segment_stack includes segments recognized that are not qualified as fragments, but have the potential to be combined with segments recognized subsequently to form a fragment.
  • the two stacks are initialized to null.
  • a variable, txt is set equal to the object description.
  • a next segment locator is invoked (with details described with reference to FIG. 9 ) to identify the next segment, Nsegment, in txt.
  • Nsegment is null. If so, the processing of txt is completed. Otherwise, in step 830 , it will delete segments in the segment_stack that are included in Nsegment, if any.
  • step 835 it is checked whether Nsegment satisfies the fragment creation eligibility criterion. If so, in step 840 a persistent name creator routine (with details depicted in FIG. 10 ) is invoked to create a persistent fragment identity for the segment.
  • step 845 the txt is modified to replace the fragment description with an ⁇ include> statement to reference the persistent fragment name followed by an ⁇ include> as described in FIG. 4 .
  • step 855 the Nsegment is combined with its adjacent peer segments on the segment_stack, if any, where a peer segment is a segment at the same level (i.e., with the same parent) of the Nsegment in a nested markup language description.
  • step 860 it is checked if the combined segment satisfies the fragment creation eligibility criterion. If so, if step 865 , these adjacent peer segments are removed from the segment_stack. Otherwise, in step 870 , the N segment is added to segment_stack.
  • FIG. 9 depicts a more detailed example of the next segment locator ( FIG. 8 , step 820 ).
  • step 910 it is checked if the next token is null, where a token is a consecutive string of characters delimited between blanks (or some other delimiters defined by the markup language). If so, it step 915 , the Nsegment is set to null. Otherwise, in step 920 , it is checked if the next token is a “start-tag” type token. If so, the token is inserted into the tag_stack with an associated “token position value” set to its starting position in the txt variable. In step 930 , it is checked if the next token is an “end-tag” type token.
  • step 940 the Nsegment is set to the substring in txt starting from the token position value indicated by the top element of the tag_stack to the “end-tag” token.
  • step 945 the top element in the tag 13 stack is removed.
  • FIG. 10 depicts a more detailed example of the persistent name creator ( FIG. 8 , step 840 ).
  • the fragment description is obtained from txt.
  • the fragment description is mapped into a number which corresponds to an entry of the fragment description table.
  • mapping functions For example, this can be done by performing an exclusive—or of all the characters in the fragment description and then treating the result as an integer to divide it by the number of entries in the fragment description table. The remainder will serve as the index to the fragment description table.
  • step 1020 it is checked if the segment description already appeared in the fragment description list of the said entry in the fragment description table.
  • step 1040 the fragment name of the matching fragment description will be returned. Otherwise, in step 1025 , a new persistent name is created for the fragment.
  • One way is to maintain a counter for each entry of the fragment description table to track the number of distinct fragment descriptions that have been mapped to this entry.
  • the name given to the new fragment will be the value of its entry to the fragment description table augmented with the current value of the counter associated with the said entry. For example, if a fragment description is mapped to the 26th entry of the fragment description table and there already have 5 distinct fragments previously mapped to this entry, the persistent name for the new fragment will be “ 26 . 6 ”.
  • step 1030 the fragment name and its description is added to the fragment description list of the corresponding entry in the fragment description table.
  • step 1035 the persistent name created is returned.
  • FIG. 11 depicts an example of the fragment request handler ( FIG. 6 , step 625 ).
  • step 1105 it is determined which versions of the fragment needs to be generated and returned to the requesting client, if multiple versions are available. A degenerate case is that only one version is available e.g., a proxy server only has code to generate one version of a fragment.
  • step 1110 it is checked whether the requested version is cached in the fragment cache. If so, in step 1150 , the requested version is returned to the requesting node.
  • the fragment cache manager updates the reference statistics. In the preferred embodiment, an LRU cache management policy is used where the requested fragment will be moved to the top of the LRU chain.
  • step 1120 for the case where the fragment is not in the fragment cache, it obtains the fragment description from the fragment description table.
  • the fragment is generated based on the fragment description and the client requirement.
  • each type of markup language describing the fragment can have its own DTD to provide its semantic.
  • For each type of DTD there can be different ways of generating/rendering the fragment based on the characteristics of the requesting devices, such as processing power, storage capacity, and communication bandwidth. This can be described in a GTD (Generator Table Definition) on how to generate a different version for a given DTD to satisfy the requirement of a specific receiving device.
  • the GTD is separate from the DTD. It can be provided by a third party such as the Internet appliance manufacturer or other software manufacturer.
  • the request fragment version is returned to the requester.
  • the fragment cache manager (with details described with reference to FIG. 12 ) is invoked.
  • FIG. 12 depicts an example of the fragment cache manager.
  • the fragment cache manager uses an LRU type replacement policy.
  • step 1205 it is checked whether there is enough free space in the fragment cache to cache the requested fragment (O c ). If so, fragment O c is cached in the fragment cache. Otherwise in step 1215 , it determines the minimum k value such that the bottom k fragments, O bk in the LRU stack of the fragment cache will have a total size larger than that of fragment O c , Ind step 1220 , it is checked based on the value function (f) whether it is more desirable to cache O c or ⁇ O b1 , . . . , O bk ⁇ .
  • the total processing cost to generate ⁇ O b1 , . . . , O bk ⁇ is the sum of the processing cost of each O b1 , 1 ⁇ i ⁇ k, and the additional storage requirement to store ⁇ O b1 , . . . , O bk ⁇ is the sum of the size of each O b1 , 1 ⁇ i ⁇ k. If O c is more valuable with a large F function value, in step 1225 , ⁇ O b1 , . . . , O bk ⁇ is deleted to make room to cache O c . In step 1230 , the reference statistics for the fragment version is updated for the fragment cache manager to manage its LRU cache.
  • an object-fragment table can be maintained which tracks the fragment created for each object and an fragment-object table to track all objects containing a common fragment.
  • the object parser may detect that the object now contains some new fragments and some fragments previously contained in the object are no longer in it. It will then check for each fragment no longer in use by the object whether there is any other object containing it based on the fragment-object table. If so, the fragment description element in FIG. 5 will be deleted from the fragment description table. Finally, the object parser will update the object-fragment table and fragment-object table accordingly. For each fragment deleted from the fragment description table, the fragment cache manager will be invoked to check if any of its fragment version is in the fragment cache and delete it.
  • FIG. 13 depicts an example of the client logic.
  • the client waits for input (request from a user or a response from the server). Depending upon the type of input, the appropriate routine will be invoked. If in step 1310 , the input is an object request from the user, the request is sent to the server in step 1315 (see FIG. 6 ) where persistent object fragments in the object are identified and the object revised as necessary.
  • step 1320 if the input is an object (e.g., a server response from a previous object request), the object is rendered and displayed to the user in step 1330 .
  • object fragments since persistent, object fragments have been recognized to make the revised object document cacheable at the server or client device and/or processable/interpretable at the client device.
  • a segment can not be rendered from the markup language description by a simple client device such as WINDOWS CETM-based Internet appliances.
  • the client can process and/or cache the revised document and allow the server to interpret the markup language describing the fragment and generate an appropriate version for the client.
  • Examples of the limitations on the client device include but are not limited to the processing power or storage capacity of the client device to interpret the markup language and generate the object fragment; and/or the bandwidth available to the client device to retrieve the description of the fragment.
  • the recognition and revision of an object to remove segments qualifying as object fragments enable the object fragment to be cached separately and reused to avoid going through interpreting the markup language description of the object to generate it again. This will improve response time and reduce server load on fragment re-references.
  • only a segment or group of segments that meet a certain threshold on the processing requirements of interpreting the markup language description to generate the object segment were recognized as a fragment by the server.
  • the client determines whether the object is cacheable. Recall that any dynamic object or object exceeding a certain size will be deemed not cacheable at the client device, which often has limited caching capacity.
  • the server uses persistent object fragment identifiers to replace persistent object fragments (such as dynamic objects or large segments) in a Web object.
  • the revised object is thus more cacheable at the client device, since the server has removed the dynamic or large objects from the object and reduced the size of the object.
  • an object description for a purchase order that includes a dynamic part for retrieving the current price of a product from the database. This dynamic part may be a small portion of the purchase order, but would prevent the object from being cached.
  • the revised document becomes static and therefore cacheable.
  • the object is cacheable, the object is cached at the local client cache.
  • a miscellaneous routine is invoked to handle other types of input, such as a pager message.
  • a preferred embodiment of the present invention includes features implemented as software tangibly embodied on a computer program product or program storage device for execution on a processor (not shown) provided with the client ( 60 . . . 63 ) and/or server ( 30 . . . 33 ).
  • software implemented in a popular object-oriented computer executable code such as JAVA provides portability across different platforms.
  • object-oriented computer executable code such as JAVA provides portability across different platforms.
  • methods of the present invention may be implemented as software for execution on a computer or other processor-based device.
  • the software may be embodied on a magnetic, electrical, optical, or other persistent program and/or data storage device, including but not limited to: magnetic disks, DASD, bubble memory; tape; optical disks such as CD-ROMs; and other persistent (also called nonvolatile) storage devices such as core, ROM, PROM, flash memory, or battery backed RAM.
  • a magnetic, electrical, optical, or other persistent program and/or data storage device including but not limited to: magnetic disks, DASD, bubble memory; tape; optical disks such as CD-ROMs; and other persistent (also called nonvolatile) storage devices such as core, ROM, PROM, flash memory, or battery backed RAM.
  • one or more of the components instantiated in the memory of the clients ( 60 . . . 63 ) or server ( 30 . . . 33 ) could be accessed and maintained directly via disk ( 260 ), the network 25

Abstract

A method, apparatus and computer program product for identifying and creating persistent object fragments from a named object. For example, a digital content description of a named digital object can be dynamically parsed, and persistent fragment identities created and maintained to facilitate caching. Named digital objects include but are not limited to: Web pages described in XML, SGML, and HTML. The object description is revised by replacing each object fragment with its newly created persistent identity. The revised object description is then sent to the requesting node. Depending upon the properties of a fragment, this can either enable the fragment or the revised object description to be cacheable at the server and/or client device. For example, the object description can include a dynamic part which would otherwise prevent the object from being cached. The dynamic part can be recognized and treated as a separate fragment from the object description. Thus the revised document becomes static mad therefore cacheable. Furthermore, fragments can be nested. Other features determine which part/segment of a named object to recognize as a fragment identity, based on its properties including: size; processing cost; and static vs. dynamic. Yet other features can determine which fragments to cache and replace, for example based on the fragment size and processing cost. Still other features allow different versions to be generated for a fragment upon request. The version created can be determined by the property of the requesting devices (e.g., handheld device or Internet appliance) and the fragment description.

Description

FIELD OF THE INVENTION
The present invention relates generally to the analysis of the content of a digital document and in particular to the creation and maintenance of persistent fragment identities to facilitate caching.
BACKGROUND
With the rapid growth of the Internet, the need for efficient document exchange becomes increasingly important. In additional to the hypertext markup language (HTML), Extensible Markup Languages (XML) are becoming available that provide a meta-language for authors to design their own markup language.
On the other hand, the proliferation of various non-PC computing devices, including: handheld devices; palmtop devices; and various other Microsoft WINDOWS CE™-based devices; set-top boxes; WEB TV; smart phones; and so-called Internet appliances, (hereinafter all referred to as Internet appliances) further complicates the presentation of a Web document to a client device. In a Web document based on HTML, images are treated as separate objects pointed to by the Web document. A proxy/Web server may generate a lower resolution version or a black and white version of a color image to accommodate the limited capability of the Internet appliance. Nonetheless, these images are named persistent objects (i.e., they have separate identities which are their URLs). The proxy or Web server is merely trying to provide different versions of a named entity based on the capability of a receiving device. This is independent of any caching issues at the proxy or Web server to improve object access time.
Various work exists to provide different versions of a named object in the Web environment to support Internet appliances access to the Web. For example, PRISM from Spyglass (see e.g., http://www.spyglass.com) provides different versions of images to the Internet appliance. It can also dynamically translate richly formatted Web documents into simplified Web pages to accommodate the requirements of the receiving devices. A means for performing on-demand data type-specific lossy compression on semantically typed data and tailoring content to the specific constraints of the clients is described in “Adapting to Newark and Client Variability via On-Demand Dynamic Distiflation,” by A. Fox, et al., Proc. 7th Intl. Conference on Architectural Support for Programming Languages and Operating Systems, Oct. 1996.
Using formal descriptors, such as a markup language, to describe a digital document provides tremendous flexibility. In the Internet environment, more powerful markup languages such as XML, or a subset of the Standard Generalized Markup Language (SGML) (see e.g., ISO 8879/1986; and Designing XML Internet Applications, by M. Laventhal, et al., Prentice Hall, 1998), arc being defined to augment HTML. The markup language description can provide rich information on the document structure and the final document to be generated. In fact, XML is a language that allows users to define their own language. For example, chemists can define a chemical markup language to describe a molecular structure. Mathematicians or scientists can define a math markup language to describe complex mathematical formulas. The interpretation of the markup language description and generation of the object can thus be complex. It is desirable to avoid regeneration of the same description repeatedly. Since Web pages, objects or documents on a common subject, or from the same company/division/department or authors often have parts in common, there is a need to go beyond recognizing just the repeated references to named entities (i.e., subject already has a name, e.g., URL) to subparts of named entities.
However, proxy or Web servers and client browsers today do not interpret the markup language to decompose a document or object into components, provide persistent identities and tracking mechanisms to facilitate caching and recognition of repeated occurrences of components of a named object. They mainly provide caching or processing service for named objects as a whole. For example, as mentioned previously, in HTML the text documents and images (which are separated out from the text documents by the authors) are all named objects and hence cacheable entities. Another problem is that if a document includes dynamic content caching is not meaningful as the next reference to the same document URL can result in a different version of the document. Thus a document is not cached even if only a small fraction of its content is dynamic. This is an issue for HTML documents today and is expected to become more severe for XML documents, which are more flexible and make it easier to incorporate various types of dynamic information, such as data from a database.
Thus, the need remains for a system and method for identifying and creating one or more persistent object fragments from named object, for example to facilitate caching. The present invention addresses this need.
SUMMARY
In accordance with the aforementioned needs, the present invention is directed to a method and apparatus for identifying and creating persistent object fragments from a named object. In one example, the present invention is directed to a method and apparatus for dynamically parsing a digital content description of a named digital object, creating and maintaining fragment identities to facilitate caching. Examples of named digital objects include but are not limited to: Web pages described in XML, SGML, and HTML.
The present invention has features which can parse/analyze the object description, identify object fragments and create persistent object fragment identities, and revise the object description by replacing each object fragment with its newly created persistent identity and send the revised object description to the requesting node. Depending upon the properties of a fragment, this can either enable the fragment to be cacheable (which can be at the content/proxy server and the client device in the Web environment), or make the revised object description cacheable at the server and client device. For example, consider the object description of a purchase order which contains a dynamic part to retrieve the current price of a product from the database. This dynamic part may be a small portion of the purchase order, but would prevent the object from being cached. According to one feature of the present invention for recognizing and treating the dynamic part as a separate fragment from the object description, the revised document becomes static and therefore cacheable. Furthermore, fragments can be nested.
A method is also provided to determine which part/segment of a named object to recognize as a fragment identity, based on its properties, which can include its size, processing cost to generate that segment of the object from its description, and other properties such as static vs. dynamic.
The present invention has yet other features to determine which fragments to cache and replace. The cache manager takes into account the fragment size and processing cost to generate the fragment.
The present invention has still other features which allow different versions to be generated for a fragment upon request. The version created can be determined by the property of the requesting devices and the fragment description. Different generators can be maintained for each type of descriptors or markup tags to generate different versions for different types of devices.
An example of a method for identifying object fragments in an object having features of the present invention comprises the steps of: analyzing an object description to identify one or more persistent object fragments associated with the object; creating the one or more persistent object fragments, in response to mid analyzing; and creating a persistent object fragment identity for a persistent object fragment, based on one or more of formal descriptors or an object fragment property. In one embodiment the object description is revised by replacing at least one object fragment with an associated persistent object fragment identity to enable the fragment to be cacheable at one or more of a server and a client; and the revised object description is sent to the client. The client receives the revised object description; and processes and/or caches the revised object description. The client can also receive a version of the one a more object fragments associated with the fragment identity, wherein the version is generated at the server and is based on the capability of the client (e.g., whether it is a handheld device, a set top box, or an Internet appliance.
BRIEF DESCRIPTION OF THE DRAWINGS
These, and further, objects, advantages, and features of the invention will be mere apparent from the following detailed description of a preferred embodiment and the appended drawings wherein:
FIG. 1 is a diagram of an Internet environment having features of the present invention;
FIG. 2 is a more detailed example of a network environment having features of the present invention;
FIG. 3 depicts an example of a digital document using a markup language;
FIG. 4 depicts an example of a modified document;
FIG. 5 depicts the data structure of the fragment description table;
FIG. 6 is an example of the server logic of FIG. 2;
FIG. 7 is an example of the object request handler;
FIG. 8 is an example of the object parser;
FIG. 9 is an example of the next segment locator;
FIG. 10 is as example of the persistent name creator;
FIG. 11 is an example of the fragment request handler;
FIG. 12 is an example of the fragment cache manager; and
FIG. 13 depicts an example of the client logic
DETAILED DESCRIPTION
FIG. 1 depicts an example of an Internet environment adaptable to the present invention. As depicted, a client (60 . . . 63 ) may be connected through a network (25) to access proxy servers (30 . . . 33) or Web content servers (40 . . . 43). The proxy servers and Web servers can provide caching of frequently access Web objects to improve client access time. The client may also have its own cache. Those skilled in the art will realize that a proxy server can be replaced by a hierarchy of proxy servers. A client node (60) can also run a proxy server.
FIG. 2 depicts an example of an overall architecture of a computing node having features of the present invention. In the example of an Internet or intranet environment, the node can be a Web server, a proxy server. As depicted, the computing node can include: a CPU (250); a scratch pad or main memory (245) such as RAM; and persistent storage devices (260) such as direct access storage devices (DASD). The memory (245) stores the server logic 240 (with details depicted in FIG. 6) preferably embodied as computer executable code which may be loaded from DASD (260) into memory (245) for execution by CPU (250). The server logic (240) includes an object request handler (205) (with details depicted in FIG. 7), and a fragment request handler (210) (with details depicted in FIG. 11). It also maintains a fragment cache (270), an object cache (275) and a fragment description table (280) (with detailed depicted in FIG. 5). This information can either reside in persistent storage (260) or in main memory (245).
In a preferred embodiment, an XML-like document will be used as an example of a document described using some formal language, such as a markup language. FIG. 3 shows an example of an XML-like document. The key point here is that the document includes multiple segments (330), where each segment (330) is enclosed between a “start-tag” (310) and an “end-tag” (320). For example, “<cml: molecule>”(similarly, “<m: order>” and “<db: price>”) is a start-tag (310) and its corresponding “end-tag” (320) is “</cml: molecule>” (“</m: order>” and “<db: price>”, respectively). As depicted, the segments may be nested. For example, the segment with the <price> start-tag is included within the segment with the <m: order> start-tag. Thus parsing the document to recognize the segments can be done by matching each “end-tag” with the corresponding “start-tag”, which is the first preceding “start-tag” of the same type at the same nested level. In markup languages such a XML, each segment can have a DTD (document type definition) to describe the semantics of the markup. It is an object of the present invention to select a subset of the segments contained in a document and recognize them as persistent object fragments. Fragment creation eligibility criterion will be introduced next to determine when an object fragment should be created. In the preferred embodiment, two sets of creation eligibility criterion are considered. For each persistent object fragment, a persistent identity or name is assigned and tracked so that if the object fragment appears in multiple objects or multiple times in the same object, it will be recognized as the same fragment.
The first fragment creation eligibility criterion is to recognize and separate out a segment as an object fragment so as to make the remaining document cacheable at the server or client device and/or processable/interpretable at the client device. An example is to recognize a dynamic segment as an object fragment. Consider another example where a segment can not be rendered from the markup language description by a simple client device such as WINDOWS CET™-based Internet appliances. By recognizing the segment as a separate object fragment, the client can process and/or cache the remaining document and let the proxy server interpret the markup language describing the fragment and generate an appropriate version for the client. This limitation on the client devices can be either due to limitation on the processing power or storage capacity of the client device to interpret the markup language and generate the object fragment, the limitation on the bandwidth available to the client device to retrieve the DTD of the fragment or other limitations.
The second criterion is based on the tradeoffs of processing and storage or bandwidth requirements to recognize and separate out a segment as an object fragment so it can be cached separately and reused to avoid going through interpreting the markup language description of the object to generate it again. This will improve response time and reduce server load on fragment re-references. Each fragment—once separated out—may need to be requested separately with additional requests from the client. Thus, preferably, only a segment or group of segments that meet a certain threshold on the processing requirements of interpreting the markup language description to generate the object segment is recognized as a fragment. Another consideration is the additional storage requirement to store the rendered segment. For example, consider two cases. In a first case, the processing time is 100 second of CPU time to generate the segment from the description, and the size of the rendered segment is 10K bytes. In a second case, the processing time is 1 second of CPU time to generate the segment from the description, and the size of the rendered segment is 1000 K bytes. In case 1, the savings on CPU time is substantial while the additional storage cost is minimal. The opposite is true for the second case. In other words, only in the first case is it worthwhile to recognize the segment as a separate fragment for caching. In the preferred embodiment, for an object O, let P(O) be its processing cost to generate a segment from its description and S(O) be the additional storage requirement to store the segment. A value function, F(P(O), S(O)), based on processing costs and storage requirements is used to determine the value of recognizing a fragment. An example of a value function (F) will be processing cost (in seconds) divided by the square root of the additional storage requirement (in 100 Kbytes increments). When the value function exceeds a given threshold (say 5), the segment will be recognized as a fragment.
FIG. 3 depicts an example of a document with 3 segments. As discussed, the first segment (330 1) begins with a start-tag, <cml: molecule>, and ends with an end-tag, </cml: molecule> and the second segment begins with a start-tag, <m: order>, and ends with an end-tag, </m: order>. The second segment (330 2) includes a third segment (330 3) nested within it. The third segment begins with a start-tag, <db: price>, and finishes with an end-tag, <db: price>. Assume the semantics of the three segments as follows. Assume the first segment provides an image of a molecule structure of a chemical compound. Assume also the second segment contains a formula to generate an order table showing the price at different quantities. Assume further, the third segment retrieves the price information from the product database. Hence it is a segment with dynamic information.
FIG. 4 depicts an example of a modified Web document after the persistent fragments have been recognized and extracted. Here it is assumed that generating the molecular structure of the chemical compound in the first segment (330 1) is quite complex, whereas the computation of the order table is straightforward. Hence, only the first (330 1′) and the third segments (330 3′) as recognized as persistent fragments with the identities, “125.1” and “28.3”, respectively. In the preferred embodiment, each of the persistent fragments is replaced with an “include” statement referring to the name of the fragment, e.g. <include HREF=“125.1”>, indicating the reference to the fragment “125.1” and followed by a <include> statement.
FIG. 5 depicts an example of a fragment description table for tracking the object fragment identity and its description. As depicted the table (505) includes a plurality of entries (507), where each table entry (507) points to a fragment description list (510) (only one shown for ease of description). The list (510) includes one or more description elements (520 and 525). Each fragment that maps to a given entry in the fragment description table (510) has a unique description element (520) on the fragment description list (510) of the entry. The description element includes several fields: Nlink (530); Fname (535); and Fdescription (540). The Fname (535) is the persistent name of the fragment. This name is given by the persistent name creator routine (with details depicted in FIG. 10). The Fdescription (540) is the fragment description. The Nlink (530) points to the next description element (525) which maps to the same fragment description table entry (507).
FIG. 6 depicts an example of the server logic (240). In step 605, the server waits for input. Depending upon the type of input, the appropriate routine will be invoked. If at step 610, the input is an object request, the object request handler is invoked, in step 615 (with details described with reference to FIG. 7). Otherwise, in step 620 it is checked if the input is a fragment request. For example, in a Web environment, an object request can be identified on the basis that an object name will have as the server part of its URL, the name of a proxy server. If yes, in step 625 the fragment request handler (with details described with reference to FIG. 11) is invoked. Otherwise, in step 630 a miscellaneous routing is invoked to handle other types of input such as FTP requests which are orthogonal to the current invention and thus will not be described further.
FIG. 7 depicts an example of the object request handler. In step 705, it is first checked whether the requested object is cached in the object cache maintained by this computing node. If the object is cached, in step 710, the cached object is returned to the requesting node. Otherwise, in step 715, the request is forwarded to the control server (or another proxy server). In step 720, the computing node waits for the object requested. In step 725, after receiving the object, the object parser (with details described with reference to FIG. 8) is invoked to analyze the object description and create fragments. In step 730, the object description, which may have been modified by the object parser, is sent back to the requestor. In step 735, the object cache manager is invoked to determine whether the object description (which may have been modified by the object parser) should be cached in the object cache. The object cache manager is similar to a conventional Web cache manager that caches the Web objects. Any standard cache management policy, such as LRU (least recently used), or its variants can take into consideration on tradeoffs between object size, update frequency, and time since last reference (i.e., the reference frequency) can be used. See for example, C. Aggarwal, et al., “On Caching Policies for Web Objects”, IBM Research Report, RC 20619, Mar. 5, 1997, which is hereby incorporated by reference in its entirety, wherein variants of LRU caching algorithms on Web objects are described.
FIG. 8 depicts an example of the object parser depicted in FIG. 7. By way of overview, the object parser maintains two stacks—a “tag_stack” and a “segment_stack”—during its processing to identify persistent fragments. The tag_stack includes the “start-tag”s scanned, but whose matching “end-tag”s have not yet been encountered during scanning of the object description. The segment_stack includes segments recognized that are not qualified as fragments, but have the potential to be combined with segments recognized subsequently to form a fragment. As depicted, in step 805, the two stacks are initialized to null. In step 810, a variable, txt, is set equal to the object description. In step 820, a next segment locator is invoked (with details described with reference to FIG. 9) to identify the next segment, Nsegment, in txt. In step 825, it is checked if Nsegment is null. If so, the processing of txt is completed. Otherwise, in step 830, it will delete segments in the segment_stack that are included in Nsegment, if any. In step 835, it is checked whether Nsegment satisfies the fragment creation eligibility criterion. If so, in step 840 a persistent name creator routine (with details depicted in FIG. 10) is invoked to create a persistent fragment identity for the segment. In step 845, the txt is modified to replace the fragment description with an <include> statement to reference the persistent fragment name followed by an <include> as described in FIG. 4. In step 855, the Nsegment is combined with its adjacent peer segments on the segment_stack, if any, where a peer segment is a segment at the same level (i.e., with the same parent) of the Nsegment in a nested markup language description. In step 860, it is checked if the combined segment satisfies the fragment creation eligibility criterion. If so, if step 865, these adjacent peer segments are removed from the segment_stack. Otherwise, in step 870, the N segment is added to segment_stack.
FIG. 9 depicts a more detailed example of the next segment locator (FIG. 8, step 820). As depicted, in step 910, it is checked if the next token is null, where a token is a consecutive string of characters delimited between blanks (or some other delimiters defined by the markup language). If so, it step 915, the Nsegment is set to null. Otherwise, in step 920, it is checked if the next token is a “start-tag” type token. If so, the token is inserted into the tag_stack with an associated “token position value” set to its starting position in the txt variable. In step 930, it is checked if the next token is an “end-tag” type token. If so, in step 940, the Nsegment is set to the substring in txt starting from the token position value indicated by the top element of the tag_stack to the “end-tag” token. In step 945, the top element in the tag13 stack is removed.
FIG. 10 depicts a more detailed example of the persistent name creator (FIG. 8, step 840). As depicted, in step 1005, the fragment description is obtained from txt. In step 1010, the fragment description is mapped into a number which corresponds to an entry of the fragment description table. Those skilled in the art will appreciate that there are many alternative mapping functions. For example, this can be done by performing an exclusive—or of all the characters in the fragment description and then treating the result as an integer to divide it by the number of entries in the fragment description table. The remainder will serve as the index to the fragment description table. In step 1020, it is checked if the segment description already appeared in the fragment description list of the said entry in the fragment description table. If so, in step 1040, the fragment name of the matching fragment description will be returned. Otherwise, in step 1025, a new persistent name is created for the fragment. There are many ways to create a unique name for the fragment. One way is to maintain a counter for each entry of the fragment description table to track the number of distinct fragment descriptions that have been mapped to this entry. The name given to the new fragment will be the value of its entry to the fragment description table augmented with the current value of the counter associated with the said entry. For example, if a fragment description is mapped to the 26th entry of the fragment description table and there already have 5 distinct fragments previously mapped to this entry, the persistent name for the new fragment will be “26.6”. In step 1030, the fragment name and its description is added to the fragment description list of the corresponding entry in the fragment description table. In step 1035, the persistent name created is returned.
FIG. 11 depicts an example of the fragment request handler (FIG. 6, step 625). As depicted, in step 1105, it is determined which versions of the fragment needs to be generated and returned to the requesting client, if multiple versions are available. A degenerate case is that only one version is available e.g., a proxy server only has code to generate one version of a fragment. In step 1110, it is checked whether the requested version is cached in the fragment cache. If so, in step 1150, the requested version is returned to the requesting node. In step 1160, the fragment cache manager updates the reference statistics. In the preferred embodiment, an LRU cache management policy is used where the requested fragment will be moved to the top of the LRU chain. In step 1120, for the case where the fragment is not in the fragment cache, it obtains the fragment description from the fragment description table. In step 1125, the fragment is generated based on the fragment description and the client requirement. In the preferred embodiment, each type of markup language describing the fragment can have its own DTD to provide its semantic. For each type of DTD, there can be different ways of generating/rendering the fragment based on the characteristics of the requesting devices, such as processing power, storage capacity, and communication bandwidth. This can be described in a GTD (Generator Table Definition) on how to generate a different version for a given DTD to satisfy the requirement of a specific receiving device. The GTD is separate from the DTD. It can be provided by a third party such as the Internet appliance manufacturer or other software manufacturer. In step 1135, the request fragment version is returned to the requester. In step 1140, the fragment cache manager (with details described with reference to FIG. 12) is invoked.
FIG. 12 depicts an example of the fragment cache manager. In the preferred embodiment, the fragment cache manager uses an LRU type replacement policy. As depicted, in step 1205, it is checked whether there is enough free space in the fragment cache to cache the requested fragment (Oc). If so, fragment Oc is cached in the fragment cache. Otherwise in step 1215, it determines the minimum k value such that the bottom k fragments, Obk in the LRU stack of the fragment cache will have a total size larger than that of fragment Oc, Ind step 1220, it is checked based on the value function (f) whether it is more desirable to cache Oc or {Ob1, . . . , Obk}. The total processing cost to generate {Ob1, . . . , Obk} is the sum of the processing cost of each Ob1, 1<i<k, and the additional storage requirement to store {Ob1, . . . , Obk} is the sum of the size of each Ob1, 1<i<k. If Oc is more valuable with a large F function value, in step 1225, {Ob1, . . . , Obk} is deleted to make room to cache Oc. In step 1230, the reference statistics for the fragment version is updated for the fragment cache manager to manage its LRU cache.
To facilitate garbage collection of fragment descriptions that are no longer in use, an object-fragment table can be maintained which tracks the fragment created for each object and an fragment-object table to track all objects containing a common fragment. After an object is updated, on its next reference, the object parser may detect that the object now contains some new fragments and some fragments previously contained in the object are no longer in it. It will then check for each fragment no longer in use by the object whether there is any other object containing it based on the fragment-object table. If so, the fragment description element in FIG. 5 will be deleted from the fragment description table. Finally, the object parser will update the object-fragment table and fragment-object table accordingly. For each fragment deleted from the fragment description table, the fragment cache manager will be invoked to check if any of its fragment version is in the fragment cache and delete it.
FIG. 13 depicts an example of the client logic. In step 1305, the client waits for input (request from a user or a response from the server). Depending upon the type of input, the appropriate routine will be invoked. If in step 1310, the input is an object request from the user, the request is sent to the server in step 1315 (see FIG. 6) where persistent object fragments in the object are identified and the object revised as necessary.
In step 1320, if the input is an object (e.g., a server response from a previous object request), the object is rendered and displayed to the user in step 1330. Recall that since persistent, object fragments have been recognized to make the revised object document cacheable at the server or client device and/or processable/interpretable at the client device. Consider the example where a segment can not be rendered from the markup language description by a simple client device such as WINDOWS CE™-based Internet appliances. According to the present invention, by recognizing the segment as a separate object fragment, the client can process and/or cache the revised document and allow the server to interpret the markup language describing the fragment and generate an appropriate version for the client. Examples of the limitations on the client device include but are not limited to the processing power or storage capacity of the client device to interpret the markup language and generate the object fragment; and/or the bandwidth available to the client device to retrieve the description of the fragment. Recall also that the recognition and revision of an object to remove segments qualifying as object fragments enable the object fragment to be cached separately and reused to avoid going through interpreting the markup language description of the object to generate it again. This will improve response time and reduce server load on fragment re-references. Each fragment—once removed—may need to be requested separately with additional requests from the client. Thus, preferably, only a segment or group of segments that meet a certain threshold on the processing requirements of interpreting the markup language description to generate the object segment were recognized as a fragment by the server.
In step 1335, the client determines whether the object is cacheable. Recall that any dynamic object or object exceeding a certain size will be deemed not cacheable at the client device, which often has limited caching capacity. According to the present invention, the server uses persistent object fragment identifiers to replace persistent object fragments (such as dynamic objects or large segments) in a Web object. The revised object is thus more cacheable at the client device, since the server has removed the dynamic or large objects from the object and reduced the size of the object. For example, recall the example of an object description for a purchase order that includes a dynamic part for retrieving the current price of a product from the database. This dynamic part may be a small portion of the purchase order, but would prevent the object from being cached. According to one feature of the present invention for recognizing and treating the dynamic part as a separate fragment from the object description, the revised document becomes static and therefore cacheable. In step 1340, if the object is cacheable, the object is cached at the local client cache. In step 1325, a miscellaneous routine is invoked to handle other types of input, such as a pager message.
A preferred embodiment of the present invention includes features implemented as software tangibly embodied on a computer program product or program storage device for execution on a processor (not shown) provided with the client (60 . . . 63) and/or server (30 . . . 33). For example, software implemented in a popular object-oriented computer executable code such as JAVA provides portability across different platforms. Those skilled in the art will appreciate that other procedure-oriented and object-oriented (OO) programming environments, including but not limited to C++ and Smalltalk can also be employed.
Those skilled in the art will also appreciate that methods of the present invention may be implemented as software for execution on a computer or other processor-based device. The software may be embodied on a magnetic, electrical, optical, or other persistent program and/or data storage device, including but not limited to: magnetic disks, DASD, bubble memory; tape; optical disks such as CD-ROMs; and other persistent (also called nonvolatile) storage devices such as core, ROM, PROM, flash memory, or battery backed RAM. Those skilled in the art will appreciate that within the spirit and scope of the present invention, one or more of the components instantiated in the memory of the clients (60 . . . 63) or server (30 . . . 33) could be accessed and maintained directly via disk (260), the network 25, another server, or could be distributed across a plurality of servers.
Now that a preferred embodiment of the present invention has been described, with alternatives, various modifications and improvements will occur to those skill in the art. Thus, the detailed description should be understood as an example and not as a limitation. The proper scope of the invention is defined by the appended claims.

Claims (66)

1. A method for identifying object fragments in an object, said method comprising the steps of:
analyzing an object description to identify one or more persistent object fragments associated with the object;
creating the one or more persistent object fragments, in response to said analyzing; and
creating a persistent object fragment identity for a persistent object fragment, based on one or more of: formal descriptors; and an object fragment properly.
2. The method of claim 1, wherein the object description is based on the formal descriptors, said method comprising the further steps of:
maintaining and tracking the persistent object fragment identity and associated formal descriptors; and generating a cacheable object fragment.
3. The method of claim 1, comprising the further steps of revising the object description by replacing at least one object fragment with an associated persistent object fragment identity to enable one or more of: the object fragment; and a revised object description to be cacheable at one a more of: a server; and a client; and
sending a revised object description to the client.
4. The method of claim 3, wherein comprising the further steps of:
the client receiving and caching the revised object description; and
the client receiving a version of the one or more object fragments associated with the fragment identity, wherein the version is generated at the server and is based on the capability of the client.
5. The method of claim 1, further comprising the steps of:
receiving a request for an object fragment;
determining whether the fragment is cached, based on the object fragment identity; and
if the fragment is not cached, dynamically generating the fragment.
6. The method of claim 1, further comprising the step of caching the object fragment based on one or more of: a reference frequency; a cache size; and a processing cost.
7. The method of claim 1, further comprising the step of:
generating one or more different versions of the fragment; wherein a version can be determined by one or more of: a requesting device property and the fragment description.
8. The method of claim 7, further comprising the step of: determining the version of the persistent fragment based on the requesting device property and the fragment property.
9. The method of claim 1, wherein the fragment property includes a processing requirement.
10. The method of claim 1, wherein the fragment property includes one or more of a storage requirement and a bandwidth requirement.
11. The method of claim 1, further comprising the steps of:
identifying an object fragment as a dynamic object fragment; and
transforming the dynamic object to a static object by revising the object description and replacing one or more dynamic object fragments with its object identity.
12. The method of claim 1, wherein the fragment property includes whether the fragment can be generated efficiently by various client devices.
13. The method of claim 1, wherein the formal descriptors are markup tags in the object description and wherein the object is described using a markup language.
14. The method of claim 1, wherein the object is a Web page described using a markup language selected from the group consisting of XML, SGML, or HTML.
15. The method of claim 1, wherein the object fragment can be nested or hierarchical.
16. The method of claim 15, further comprising the steps of:
recognizing a nested object fragment as including a dynamic fragment or a frequently changed fragment; and
making an outer fragment cacheable at one or more of a server and a client.
17. The method of claim 1, further comprising the steps of:
identifying one or more of the object fragments requiring invalidation; and
garbage collecting invalid object fragments.
18. The method of claim 1, wherein the object fragment property comprises the property selected from the group consisting of: a dynamic property; a static property; how frequently the object is going to change; size; or processing cost to generate that fragment from its description.
19. The method of claim 1, further comprising the step of caching the object based on one or more object fragment properties.
20. The method of claim 1, further comprising the steps of:
selecting a subset of the segments contained in the object; and
recognizing the subset as persistent object fragments.
21. The method of claim 1, wherein said step of creating a persistent object fragment further comprises the steps of:
recognizing and separating a segment as an object fragment so it can be cached separately and reused to avoid going through interpreting a markup language description of the object to generate it again; wherein the segment will only be recognized as the object fragment only if the segment or group of segments satisfies a threshold for interpreting the markup language description based on one or more of: a processing requirement; and a storage requirement.
22. The method of claim 1, wherein the persistent object fragment will have a consistent identity regardless of whether it appears in one or more of: multiple objects; and multiple times in the same object.
23. A method for caching objects including object fragments, said method comprising the steps of:
a client receiving from a server an object including a revised object description wherein at least one object fragment has been replaced with an associated persistent object fragment identity based on one or more of: formal descriptors; and an object fragment property, in response to a request for the object; and
the client processing the revised object description.
24. The method of claim 23, further comprising the step of:
the client receiving a version of the one or more object fragments associated with the fragment identity, wherein the version is generated at the server and is based on the capability of the client.
25. The method of claim 24, wherein the version is generated at the server and is based on the capability of the client.
26. The method of claim 23, wherein the persistent object fragment will have a consistent identity regardless of whether it appears is one or more of: multiple objects; and multiple times in the same object.
27. The method of claim 23, wherein the formal descriptors are markup tags in the object description and wherein the object is described using a markup language.
28. The method of claim 23, wherein the object is a Web page described using a markup language selected from the group consisting of XML, SGML, or HTML.
29. The method of claim 23, wherein said processing step includes one or more of caching a revised object and rendering the object.
30. The method of claim 23, further comprising the step of: the client receiving from the server a version of the object fragment interpret and generated at the server, wherein the version generated is based on one or more of: the processing power of the client; the storage capacity of the client; and the bandwidth available to the client to retrieve a description of the fragment.
31. The method of claim 23, wherein the persistent object fragment identifier represents a dynamic object.
32. The method of claim 23, wherein the client is selected from a group consisting of; a handheld device; a palmtop device; a set-top box; a smart phone; or an Internet appliance.
33. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform method steps for identifying object fragments in an object, said method steps comprising:
analyzing an object description to identify one or more persistent object fragments associated with the object;
creating the one or more persistent object fragments, in response to said analyzing; and
creating a persistent object fragment identity for a persistent object fragment, based on one or more of: formal descriptors; and an object fragment property.
34. The program storage device of claim 33, wherein the object description is based on the formal descriptors, said method comprising the further steps of maintaining and tracking the persistent object fragment identity and associated formal descriptors; and generating a cacheable object fragment.
35. The program storage device of claim 33, comprising the further steps of:
revising the object description by replacing at least one object fragment with an associated persistent object fragment identity to enable one or more of: the object fragment; and a revised object description to be cacheable at one or more of: a server; and a client; and
sending a revised object description to the client.
36. The program storage device of claim 35, wherein comprising the further steps of:
the client receiving and caching the revised object description; and
the client receiving a version of the one or more object fragments associated with the fragment identity, wherein the version is generated at the server and is based on the capability of the client.
37. The program storage device of claim 33, further comprising the steps of:
receiving a request for an object fragment;
determining whether the fragment is cached, based on the object fragment identity; and
if the fragment is not cached, dynamically generating the fragment.
38. The program storage device of claim 33, further comprising the step of caching the object fragment based on one or more of: a reference frequency; a cache size; and a processing cost.
39. The program storage device of claim 33, further comprising the step of:
generating one or more different versions of the fragment; wherein a version can be determined by one or more of: a requesting device property and the fragment description.
40. The program storage device of claim 39, further comprising the step of: determining the version of the persistent fragment based on the requesting device property and the fragment property.
41. The program storage device of claim 33, wherein the fragment property includes a processing requirement.
42. The program storage device of claim 33, wherein the fragment property includes one or more of a storage requirement and a bandwidth requirement.
43. The program storage device of claim 33, further comprising the steps of:
identifying an object fragment in a dynamic object fragment; and
transforming the dynamic object to a static object by revising the object description and replacing one or more dynamic object fragments with is object identity.
44. The program storage device of claim 33, wherein the fragment property includes whether the fragment can be generated efficiently by various client devices.
45. The program storage device of claim 33, wherein the formal descriptors are markup tags in the object description and wherein the object is described using a markup language.
46. The program storage device of claim 33, wherein the object is a Web page described using a markup language selected from the group consisting of XML, SGML, or HTML.
47. The program storage device of claim 33, wherein the object fragment can be nested or hierarchical.
48. The program storage device of claim 15, further comprising the steps of:
recognizing a nested object fragment as including a dynamic fragment or a frequently changed fragment; and
making an outer fragment cacheable at one or more of a server and a client.
49. The program storage device of claim 33, further comprising the steps of:
identifying one or more of the object fragments requiring invalidation; and
garbage collecting invalid object fragments.
50. The program storage device of claim 33, wherein the object fragment property comprises the property selected from the group consisting of: a dynamic property; a static property; how frequently the object is going to change; size; or processing cost to generate that fragment from its description.
51. The program storage device of claim 33, further comprising the step of caching the object based on one or more object fragment properties.
52. The program storage device of claim 33, further comprising the steps of:
selecting a subset of the segments contained in the object; and
recognizing the subset as persistent object fragments.
53. The program storage device of claim 33, wherein said step of creating a persistent object fragment further comprises the steps of:
recognizing and separating a segment as an object fragment so it can be cached separately and reused to avoid going through interpreting a markup language description of the object to generate it again; wherein the segment will only be recognized as the object fragment only if the segment or group of segments satisfies a threshold for interpreting the markup language description based on one or more of: a processing requirement; and a storage requirement.
54. The program storage device of claim 33, wherein the persistent object fragment will have a consistent identity regardless of whether it appears in one or more of: multiple objects; and multiple times in the same object.
55. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform method steps for processing objects including object fragments, said method steps comprising:
a client receiving from a server an object including a revised object description wherein at least one object fragment has been replaced with an associated persistent object fragment identity based on one or one of: formal descriptors; and an object fragment property, in response to a request from the object; and
the client processing the revised object description.
56. The program storage device of claim 55, further comprising the step of:
the client receiving a version of the one or more object fragments associated with the fragment identity, wherein the version is generated at the server and is based on the capability of the client.
57. The program storage device of claim 56, wherein the version is generated at the server and is based on the capability of the client.
58. The program storage device of claim 55, wherein the persistent object fragment will have a consistent identity regardless of whether it appears in one or more of: multiple objects; and multiple times in the same object.
59. The program storage device of claim 55, wherein the formal descriptors are markup tags in the object description and wherein the object is described using a markup language.
60. The program storage device of claim 55, wherein the object is a Web page described using a markup language selected from the group consisting of XML, SGML, or HTML.
61. The program storage device of claim 55, wherein said processing step includes one or more of caching a revised object and rendering the object.
62. The program storage device of claim 55, further comprising the step of: the client receiving from the server a version of the object fragment interpret and generated at the server, wherein the version generated is based on one or more of: the processing power of the client; the storage capacity of the client; and the bandwidth available to the client to retrieve a description of the fragment.
63. The program storage device of claim 55, wherein the persistent object fragment identifier represents a dynamic object.
64. The program storage device of claim 55, wherein the client is selected from a group consisting of: a handheld device; a palmtop device; a set-top box; a smart phone; or an Internet appliance.
65. A method for caching objects including object fragments, and method comprising the steps of:
a client receiving from a server an object including an object description wherein at least one object fragment is referenced by an associated persistent object fragment identity based on one or more of: formal descriptors; and an object fragment property, in response to a request for the object; and
the client processing the object description including the associated persistent object fragment identity.
66. A program storage device readable by a machine, tangibly embodying a program of instructions executable by the machine to perform method steps for processing objects including object fragments, said method steps comprising:
a client receiving from a server an object including an object description wherein at least one object fragment is referenced by an associated persistent object fragment identity based on one or more of: formal descriptors; and the client processing the object description including the associated persistent object fragment identity.
US10/601,267 1998-11-13 2003-06-19 Identifying, processing and caching object fragments in a web environment Expired - Lifetime USRE39184E1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US10/601,267 USRE39184E1 (en) 1998-11-13 2003-06-19 Identifying, processing and caching object fragments in a web environment

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US09/192,010 US6249844B1 (en) 1998-11-13 1998-11-13 Identifying, processing and caching object fragments in a web environment
US10/601,267 USRE39184E1 (en) 1998-11-13 2003-06-19 Identifying, processing and caching object fragments in a web environment

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US09/192,010 Reissue US6249844B1 (en) 1998-11-13 1998-11-13 Identifying, processing and caching object fragments in a web environment

Publications (1)

Publication Number Publication Date
USRE39184E1 true USRE39184E1 (en) 2006-07-11

Family

ID=22707861

Family Applications (2)

Application Number Title Priority Date Filing Date
US09/192,010 Ceased US6249844B1 (en) 1998-11-13 1998-11-13 Identifying, processing and caching object fragments in a web environment
US10/601,267 Expired - Lifetime USRE39184E1 (en) 1998-11-13 2003-06-19 Identifying, processing and caching object fragments in a web environment

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US09/192,010 Ceased US6249844B1 (en) 1998-11-13 1998-11-13 Identifying, processing and caching object fragments in a web environment

Country Status (1)

Country Link
US (2) US6249844B1 (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030236906A1 (en) * 2002-06-24 2003-12-25 Klemets Anders E. Client-side caching of streaming media content
US20040264489A1 (en) * 2003-06-30 2004-12-30 Klemets Anders E. Streaming of variable bit rate multimedia content
US20050044166A1 (en) * 2001-06-28 2005-02-24 Microsoft Corporation Startup methods and apparatuses for use in streaming content
US20050100014A1 (en) * 2000-08-09 2005-05-12 Microsoft Corporation Fast dynamic measurement of bandwidth in a TCP network environment
US20060092822A1 (en) * 2004-04-30 2006-05-04 Microsoft Corporation Session Description Message Extensions
US20060168295A1 (en) * 2003-06-27 2006-07-27 Microsoft Corporation Midstream Determination of Varying Bandwidth Availability
US20070055786A1 (en) * 2005-09-08 2007-03-08 Nokia Corporation Method to determine the completeness of a service guide
US20080288722A1 (en) * 2005-04-19 2008-11-20 Streamezzo Method for Optimization of the Management of a Server Cache Which May be Consulted by Client Terminals with Differing Characteristics
US7650421B2 (en) 2002-12-30 2010-01-19 Microsoft Corporation Adaptable accelerated content streaming
WO2012009191A1 (en) * 2010-07-12 2012-01-19 Alibaba Group Holding Limited Method and apparatus of processing nested fragment caching of a web page
US20130238970A1 (en) * 2012-03-07 2013-09-12 Google Inc. Uniquely Identifying Script Files

Families Citing this family (130)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6417873B1 (en) * 1998-12-11 2002-07-09 International Business Machines Corporation Systems, methods and computer program products for identifying computer file characteristics that can hinder display via hand-held computing devices
US6668354B1 (en) * 1999-01-05 2003-12-23 International Business Machines Corporation Automatic display script and style sheet generation
US6446110B1 (en) * 1999-04-05 2002-09-03 International Business Machines Corporation Method and apparatus for representing host datastream screen image information using markup languages
US6476828B1 (en) * 1999-05-28 2002-11-05 International Business Machines Corporation Systems, methods and computer program products for building and displaying dynamic graphical user interfaces
US7017159B1 (en) * 1999-06-15 2006-03-21 Sun Microsystems, Inc. Smart bookmarks for small footprint device applications
US6405211B1 (en) * 1999-07-08 2002-06-11 Cohesia Corporation Object-oriented representation of technical content and management, filtering, and synthesis of technical content using object-oriented representations
US6615235B1 (en) * 1999-07-22 2003-09-02 International Business Machines Corporation Method and apparatus for cache coordination for multiple address spaces
US6763499B1 (en) * 1999-07-26 2004-07-13 Microsoft Corporation Methods and apparatus for parsing extensible markup language (XML) data streams
US6675353B1 (en) * 1999-07-26 2004-01-06 Microsoft Corporation Methods and systems for generating XML documents
US6996770B1 (en) * 1999-07-26 2006-02-07 Microsoft Corporation Methods and systems for preparing extensible markup language (XML) documents and for responding to XML requests
EP1074925B8 (en) * 1999-08-06 2011-09-14 Ricoh Company, Ltd. Document management system, information processing apparatus, document management method and computer-readable recording medium
US6356933B2 (en) * 1999-09-07 2002-03-12 Citrix Systems, Inc. Methods and apparatus for efficiently transmitting interactive application data between a client and a server using markup language
US6766349B1 (en) 1999-09-24 2004-07-20 Sun Microsystems, Inc. Mechanism for obtaining a thread from, and returning a thread to, a thread pool without attaching and detaching
US6701367B1 (en) 1999-09-24 2004-03-02 Sun Microsystems, Inc. Mechanism for enabling customized session managers to interact with a network server
EP1224568A4 (en) 1999-09-24 2006-11-08 Sun Microsystems Inc Mechanism for enabling session information to be shared across multiple processes
FI112427B (en) * 1999-11-05 2003-11-28 Nokia Corp A method for determining the capabilities of a wireless terminal in a multimedia messaging service, a multimedia messaging service, and a multimedia terminal
US6721780B1 (en) * 1999-11-09 2004-04-13 Fireclick, Inc. Predictive pre-download of network objects
US6591260B1 (en) * 2000-01-28 2003-07-08 Commerce One Operations, Inc. Method of retrieving schemas for interpreting documents in an electronic commerce system
US7096418B1 (en) * 2000-02-02 2006-08-22 Persistence Software, Inc. Dynamic web page cache
US20010014899A1 (en) * 2000-02-04 2001-08-16 Yasuyuki Fujikawa Structural documentation system
US6757708B1 (en) * 2000-03-03 2004-06-29 International Business Machines Corporation Caching dynamic content
US6854018B1 (en) * 2000-03-20 2005-02-08 Nec Corporation System and method for intelligent web content fetch and delivery of any whole and partial undelivered objects in ascending order of object size
US7509397B1 (en) * 2000-04-06 2009-03-24 Yahoo! Inc. Web portholes: using web proxies to capture and enhance display real estate
US6996616B1 (en) * 2000-04-17 2006-02-07 Akamai Technologies, Inc. HTML delivery from edge-of-network servers in a content delivery network (CDN)
US7523158B1 (en) * 2000-05-12 2009-04-21 Oracle International Corporation System and method for partial page updates using a proxy element
US6681298B1 (en) * 2000-07-12 2004-01-20 Powertv, Inc. Hypertext markup language cache system and method
FI112307B (en) * 2000-08-02 2003-11-14 Nokia Corp communication Server
US7047281B1 (en) * 2000-08-08 2006-05-16 Fineground Networks Method and system for accelerating the delivery of content in a networked environment
AU2001281402A1 (en) * 2000-08-08 2002-02-18 Fineground Networks Method and system for parameterized web documents
AU2001290546A1 (en) * 2000-08-22 2002-03-04 Akamai Technologies, Inc. Dynamic content assembly on edge-of-network servers in a content delivery network
US8122236B2 (en) 2001-10-24 2012-02-21 Aol Inc. Method of disseminating advertisements using an embedded media player page
AU2002243448A1 (en) 2000-10-24 2002-06-24 Singingfish.Com, Inc. Method of sizing an embedded media player page
FR2816157A1 (en) * 2000-10-31 2002-05-03 Thomson Multimedia Sa PROCESS FOR PROCESSING DISTRIBUTED VIDEO DATA TO BE VIEWED ON SCREEN AND DEVICE IMPLEMENTING THE METHOD
US7925967B2 (en) * 2000-11-21 2011-04-12 Aol Inc. Metadata quality improvement
US20040030681A1 (en) * 2000-11-21 2004-02-12 Shannon Paul Thurmond System and process for network site fragmented search
US20020105548A1 (en) * 2000-12-12 2002-08-08 Richard Hayton Methods and apparatus for creating a user interface using property paths
US7380250B2 (en) * 2001-03-16 2008-05-27 Microsoft Corporation Method and system for interacting with devices having different capabilities
US7310687B2 (en) * 2001-03-23 2007-12-18 Cisco Technology, Inc. Methods and systems for managing class-based condensation
US7171443B2 (en) * 2001-04-04 2007-01-30 Prodigy Communications, Lp Method, system, and software for transmission of information
US20020147849A1 (en) * 2001-04-05 2002-10-10 Chung-Kei Wong Delta encoding using canonical reference files
WO2002086744A1 (en) * 2001-04-23 2002-10-31 Schwegman, Lundberg, Woessner & Kluth, P.A. Methods, systems and emails to link emails to matters and organizations
US6950837B2 (en) * 2001-06-19 2005-09-27 Intel Corporation Method for using non-temporal streaming to improve garbage collection algorithm
US7185063B1 (en) * 2001-06-22 2007-02-27 Digital River, Inc. Content delivery network using differential caching
US20020198956A1 (en) * 2001-06-25 2002-12-26 International Business Machines Corporation Method and apparatus for managing a cache
DE10132038A1 (en) * 2001-07-03 2003-01-23 Siemens Ag Automation system and process for plant visualization
US7594001B1 (en) * 2001-07-06 2009-09-22 Microsoft Corporation Partial page output caching
US7092997B1 (en) * 2001-08-06 2006-08-15 Digital River, Inc. Template identification with differential caching
US8234412B2 (en) * 2001-09-10 2012-07-31 International Business Machines Corporation Method and system for transmitting compacted text data
US6775743B2 (en) * 2001-09-12 2004-08-10 International Business Machines Corporation Content caching with special handling of multiple identical requests for content
US7155499B2 (en) * 2001-09-28 2006-12-26 Siemens Building Technologies, Inc. System controller for controlling a control network having an open communication protocol via proprietary communication
US20030074459A1 (en) * 2001-09-28 2003-04-17 Michael Soemo Proprietary protocol for a system controller for controlling device controllers on a network having an open communication protocol
US6996600B2 (en) * 2001-09-28 2006-02-07 Siemens Building Technologies Inc. System and method for servicing messages between device controller nodes and via a lon network
US7769823B2 (en) 2001-09-28 2010-08-03 F5 Networks, Inc. Method and system for distributing requests for content
US20030074460A1 (en) * 2001-09-28 2003-04-17 Michael Soemo Proprietary protocol for communicating network variables on a control network
US7072879B2 (en) * 2001-10-22 2006-07-04 Siemens Building Technologies, Inc. Partially embedded database and an embedded database manager for a control system
US7797376B1 (en) * 2001-11-13 2010-09-14 Cisco Technology, Inc. Arrangement for providing content operation identifiers with a specified HTTP object for acceleration of relevant content operations
US7428725B2 (en) 2001-11-20 2008-09-23 Microsoft Corporation Inserting devices specific content
US20040064500A1 (en) * 2001-11-20 2004-04-01 Kolar Jennifer Lynn System and method for unified extraction of media objects
US7117069B2 (en) * 2001-11-28 2006-10-03 Siemens Building Technologies, Inc. Apparatus and method for executing block programs
US7370120B2 (en) * 2001-12-07 2008-05-06 Propel Software Corporation Method and system for reducing network latency in data communication
US20030110272A1 (en) * 2001-12-11 2003-06-12 Du Castel Bertrand System and method for filtering content
US7426534B2 (en) * 2001-12-19 2008-09-16 International Business Machines Corporation Method and system for caching message fragments using an expansion attribute in a fragment link tag
US7412535B2 (en) * 2001-12-19 2008-08-12 International Business Machines Corporation Method and system for caching fragments while avoiding parsing of pages that do not contain fragments
US7509393B2 (en) * 2001-12-19 2009-03-24 International Business Machines Corporation Method and system for caching role-specific fragments
US7730154B2 (en) * 2001-12-19 2010-06-01 International Business Machines Corporation Method and system for fragment linking and fragment caching
US7587515B2 (en) * 2001-12-19 2009-09-08 International Business Machines Corporation Method and system for restrictive caching of user-specific fragments limited to a fragment cache closest to a user
US8484282B2 (en) * 2002-03-21 2013-07-09 International Business Machines Corporation High-speed content transformation engine
US7680875B1 (en) * 2002-04-01 2010-03-16 Novell, Inc. Markers for cached objects
GB0211897D0 (en) * 2002-05-23 2002-07-03 Koninkl Philips Electronics Nv Dynamic markup language
US20030236825A1 (en) * 2002-06-20 2003-12-25 Kulkarni Suhas Sudhakar System, method and computer readable medium for transferring and rendering a web page
US7937704B2 (en) * 2002-06-20 2011-05-03 British Telecommunications Public Limited Company Distributed computer
US20030236813A1 (en) * 2002-06-24 2003-12-25 Abjanic John B. Method and apparatus for off-load processing of a message stream
US7363340B2 (en) * 2002-07-18 2008-04-22 International Business Machines Corporation Method and system for generating auxiliary-server cache identifiers
JP4231261B2 (en) * 2002-09-12 2009-02-25 株式会社エヌ・ティ・ティ・ドコモ Identity determination device
EP1406183A3 (en) * 2002-10-01 2004-04-14 Sap Ag Method and system for refreshing browser pages
US7574653B2 (en) 2002-10-11 2009-08-11 Microsoft Corporation Adaptive image formatting control
US20040199400A1 (en) * 2002-12-17 2004-10-07 Lundberg Steven W. Internet-based patent and trademark application management system
GB0229892D0 (en) * 2002-12-21 2003-01-29 Ibm Method and apparatus for caching documents
GB0230331D0 (en) * 2002-12-31 2003-02-05 British Telecomm Method and apparatus for operating a computer network
US7870100B2 (en) * 2003-06-10 2011-01-11 International Business Machines Corporation Methods and systems for publishing electronic documents with automatic fragment detection
WO2005008480A2 (en) * 2003-07-10 2005-01-27 Computer Associates Think, Inc. System and method for generating a web-enabled graphical user interface plug-in
US7761842B2 (en) * 2003-07-11 2010-07-20 Computer Associates Think, Inc. System and method for generating a graphical user interface (GUI) element
US7089354B2 (en) * 2003-07-30 2006-08-08 International Business Machines Corporation Disk fragmentation test system
US9098475B2 (en) * 2003-08-20 2015-08-04 Xerox Corporation Apparatus and method for generating reusable composite components during dynamic document construction
US7519574B2 (en) * 2003-08-25 2009-04-14 International Business Machines Corporation Associating information related to components in structured documents stored in their native format in a database
US7792866B2 (en) * 2003-08-25 2010-09-07 International Business Machines Corporation Method and system for querying structured documents stored in their native format in a database
US8150818B2 (en) * 2003-08-25 2012-04-03 International Business Machines Corporation Method and system for storing structured documents in their native format in a database
US8250093B2 (en) * 2003-08-25 2012-08-21 International Business Machines Corporation Method and system for utilizing a cache for path-level access control to structured documents stored in a database
US8775468B2 (en) * 2003-08-29 2014-07-08 International Business Machines Corporation Method and system for providing path-level access control for structured documents stored in a database
US7542476B2 (en) * 2003-08-29 2009-06-02 Flash Networks Ltd Method and system for manipulating IP packets in virtual private networks
US7290015B1 (en) * 2003-10-02 2007-10-30 Progress Software Corporation High availability via data services
US7877390B2 (en) * 2003-10-20 2011-01-25 International Business Machines Corporation Systems and methods for providing autonomous persistent storage systems
US20050097128A1 (en) * 2003-10-31 2005-05-05 Ryan Joseph D. Method for scalable, fast normalization of XML documents for insertion of data into a relational database
US20050131962A1 (en) * 2003-12-16 2005-06-16 Deshpande Sachin G. Systems and methods for implementing a cache model
US7826600B2 (en) * 2003-12-22 2010-11-02 International Business Machines Corporation Method and procedure for compiling and caching VoiceXML documents in a voice XML interpreter
US8756487B2 (en) * 2004-01-06 2014-06-17 International Business Machines Corporation System and method for context sensitive content management
US7590704B2 (en) * 2004-01-20 2009-09-15 Microsoft Corporation Systems and methods for processing dynamic content
US8606876B2 (en) * 2004-02-11 2013-12-10 Flash Networks Ltd. Method and system for accelerating data communication that is using multipart
US20050192922A1 (en) * 2004-02-27 2005-09-01 Edlund Stefan B. Client-server computing system capable of validating cached data based on data transformation
US7930479B2 (en) * 2004-04-29 2011-04-19 Sap Ag System and method for caching and retrieving from cache transaction content elements
US7890604B2 (en) 2004-05-07 2011-02-15 Microsoft Corproation Client-side callbacks to server events
US9026578B2 (en) 2004-05-14 2015-05-05 Microsoft Corporation Systems and methods for persisting data between web pages
US7849412B2 (en) * 2004-05-21 2010-12-07 Computer Associates Think, Inc. System and method for generating a web control in a Windows development environment
US20060031751A1 (en) * 2004-05-26 2006-02-09 Shai Ehud Method for creating editable web sites with increased performance & stability
GB0412655D0 (en) * 2004-06-07 2004-07-07 British Telecomm Distributed storage network
US7853676B1 (en) * 2004-06-10 2010-12-14 Cisco Technology, Inc. Protocol for efficient exchange of XML documents with a network device
US7577587B2 (en) * 2004-07-07 2009-08-18 Sap Ag Purchase order and purchase order response interactive forms
US8156429B2 (en) 2004-10-22 2012-04-10 Flash Networks. Ltd Method and system for accelerating downloading of web pages
US20060117257A1 (en) * 2004-11-30 2006-06-01 International Business Machines Corporation System and method for processing JavaScript resource files
WO2006085314A2 (en) * 2005-02-11 2006-08-17 Flash Networks Ltd Method and system for improving user experience while browsing
US7843938B1 (en) 2005-02-25 2010-11-30 Citrix Systems, Inc. QoS optimization with compression
US20070101061A1 (en) * 2005-10-27 2007-05-03 Guruprasad Baskaran Customized content loading mechanism for portions of a web page in real time environments
US7650353B2 (en) * 2005-12-16 2010-01-19 Microsoft Corporation XML specification for electronic data interchange (EDI)
US20070226292A1 (en) * 2006-03-22 2007-09-27 Chetuparambil Madhu K Method and apparatus for preserving updates to execution context when a request is fragmented and executed across process boundaries
US8166198B2 (en) 2006-03-28 2012-04-24 Flahs Networks, Ltd. Method and system for accelerating browsing sessions
US9633356B2 (en) 2006-07-20 2017-04-25 Aol Inc. Targeted advertising for playlists based upon search queries
US8429634B2 (en) * 2006-07-26 2013-04-23 Semiconductor Energy Laboratory Co., Ltd. Semiconductor device, memory circuit, and machine language program generation device, and method for operating semiconductor device and memory circuit
US9602613B2 (en) * 2006-08-03 2017-03-21 Flash Networks, Ltd Method and system for accelerating browsing sessions
US9336323B2 (en) 2006-08-04 2016-05-10 Flash Networks, Inc. Method and system for accelerating surfing the internet
US7962502B2 (en) * 2008-11-18 2011-06-14 Yahoo! Inc. Efficient caching for dynamic webservice queries using cachable fragments
US8700803B2 (en) * 2009-06-03 2014-04-15 Netcordant, Inc. Web page optimization
US9323582B2 (en) * 2009-08-12 2016-04-26 Schlumberger Technology Corporation Node to node collaboration
CN101763437B (en) * 2010-02-10 2013-03-27 华为数字技术(成都)有限公司 Method and device for realizing high-speed buffer storage
CN104346345B (en) * 2013-07-24 2019-03-26 上海中兴软件有限责任公司 The storage method and device of data
US9648125B2 (en) 2013-10-04 2017-05-09 Akamai Technologies, Inc. Systems and methods for caching content with notification-based invalidation
US9813515B2 (en) 2013-10-04 2017-11-07 Akamai Technologies, Inc. Systems and methods for caching content with notification-based invalidation with extension to clients
US9641640B2 (en) 2013-10-04 2017-05-02 Akamai Technologies, Inc. Systems and methods for controlling cacheability and privacy of objects
US20150205884A1 (en) * 2014-01-22 2015-07-23 AI Squared Emphasizing a portion of the visible content elements of a markup language document
US9740793B2 (en) 2014-09-16 2017-08-22 International Business Machines Corporation Exposing fragment identifiers
US11271949B1 (en) * 2019-06-25 2022-03-08 Amazon Technologies, Inc. Application-based scanning

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5924116A (en) * 1997-04-02 1999-07-13 International Business Machines Corporation Collaborative caching of a requested object by a lower level node as a function of the caching status of the object at a higher level node
US5946697A (en) * 1997-04-22 1999-08-31 Microsoft Corporation Rapid transfer of HTML files
US6012126A (en) * 1996-10-29 2000-01-04 International Business Machines Corporation System and method for caching objects of non-uniform size using multiple LRU stacks partitions into a range of sizes
US6026413A (en) * 1997-08-01 2000-02-15 International Business Machines Corporation Determining how changes to underlying data affect cached objects
US6065058A (en) * 1997-05-09 2000-05-16 International Business Machines Corp. Dynamic push filtering based on information exchanged among nodes in a proxy hierarchy
US6122666A (en) * 1998-02-23 2000-09-19 International Business Machines Corporation Method for collaborative transformation and caching of web objects in a proxy network
US6128627A (en) * 1998-04-15 2000-10-03 Inktomi Corporation Consistent data storage in an object cache
US6138141A (en) * 1996-10-18 2000-10-24 At&T Corp Server to client cache protocol for improved web performance
US6178461B1 (en) * 1998-12-08 2001-01-23 Lucent Technologies Inc. Cache-based compaction technique for internet browsing using similar objects in client cache as reference objects

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6138141A (en) * 1996-10-18 2000-10-24 At&T Corp Server to client cache protocol for improved web performance
US6012126A (en) * 1996-10-29 2000-01-04 International Business Machines Corporation System and method for caching objects of non-uniform size using multiple LRU stacks partitions into a range of sizes
US5924116A (en) * 1997-04-02 1999-07-13 International Business Machines Corporation Collaborative caching of a requested object by a lower level node as a function of the caching status of the object at a higher level node
US5946697A (en) * 1997-04-22 1999-08-31 Microsoft Corporation Rapid transfer of HTML files
US6065058A (en) * 1997-05-09 2000-05-16 International Business Machines Corp. Dynamic push filtering based on information exchanged among nodes in a proxy hierarchy
US6026413A (en) * 1997-08-01 2000-02-15 International Business Machines Corporation Determining how changes to underlying data affect cached objects
US6122666A (en) * 1998-02-23 2000-09-19 International Business Machines Corporation Method for collaborative transformation and caching of web objects in a proxy network
US6128627A (en) * 1998-04-15 2000-10-03 Inktomi Corporation Consistent data storage in an object cache
US6178461B1 (en) * 1998-12-08 2001-01-23 Lucent Technologies Inc. Cache-based compaction technique for internet browsing using similar objects in client cache as reference objects

Non-Patent Citations (9)

* Cited by examiner, † Cited by third party
Title
"Spyglass Prism Allow Non-PC Devices to Display Content Up to Four Times faster", http://www.spyglass.com/newsflash/releases/091697 prismperf.html, 3 pages, printed Sep. 19, 1997. *
"Spyglass: Making Devices Work With The Web", Products and Services, http://www.spyglass.com/product/wp, 7 pages, printed Sep. 19, 1997. *
Armando Fox et al., "Adapting to Network and Client Variability via On-Demand Dynamic Distillation", University of California at Berkeley, 11 pages, published in Proc. 7<SUP>th </SUP>Intl. conference on Architectural Support for Programming Language and Operating System, (Oct. 1996). *
Benoit Marchal from Pineapplesoft sprl, "An Introduction to SGML", http://www.pineapplesoft.com/reports/sgml/preface.html, 4 pages, (last modified Sep. 25, 1997). *
Charu Aggarwal et al., "Caching on the World Wide Web", IEEE Transactions on Knowledge and Data Engineering, vol. 11, No. 1, pp. 94-107, Jan./Feb. 1999. *
ISO 8879:1986, http://www.iso.ch/cate/d16387.html, Table of Contents, 1 page, (Last updated on May 8, 1999). *
Jadau et al. "Caching of Large Database Objects in Web Server", IEEE Jun. 1007, pp 10-19. *
Michael Leventhal et al., "Designing XML Internet Applications", Prentice Hall PTR, Table of Contents, 18 pages, (1998). *
Spyglass Ships Spyglass Prism 1.0 Dynamic Content Conversion Solution; Revolutionary Product Delives Existing Web Content to Non-PC Devices, http://www.spyglass.com/newsflash/releases/091697 prismships.html, 3 pages, printed Sep. 19, 1997. *

Cited By (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050100014A1 (en) * 2000-08-09 2005-05-12 Microsoft Corporation Fast dynamic measurement of bandwidth in a TCP network environment
US20050108420A1 (en) * 2000-08-09 2005-05-19 Microsoft Corporation Fast dynamic measurement of bandwidth in a TCP network environment
US7349977B2 (en) 2000-08-09 2008-03-25 Microsoft Corporation Fast dynamic measurement of bandwidth in a TCP network environment
US7353286B2 (en) 2000-08-09 2008-04-01 Microsoft Corporation Fast dynamic measurement of bandwidth in a TCP network environment
US20050044166A1 (en) * 2001-06-28 2005-02-24 Microsoft Corporation Startup methods and apparatuses for use in streaming content
US7594025B2 (en) 2001-06-28 2009-09-22 Microsoft Corporation Startup methods and apparatuses for use in streaming content
US7548948B2 (en) * 2002-06-24 2009-06-16 Microsoft Corporation Client-side caching of streaming media content
US7725557B2 (en) 2002-06-24 2010-05-25 Microsoft Corporation Client-side caching of streaming media content
US20060059223A1 (en) * 2002-06-24 2006-03-16 Microsoft Corporation Client-side caching of streaming media content
US20030236906A1 (en) * 2002-06-24 2003-12-25 Klemets Anders E. Client-side caching of streaming media content
US7650421B2 (en) 2002-12-30 2010-01-19 Microsoft Corporation Adaptable accelerated content streaming
US20060168295A1 (en) * 2003-06-27 2006-07-27 Microsoft Corporation Midstream Determination of Varying Bandwidth Availability
US7634373B2 (en) 2003-06-27 2009-12-15 Microsoft Corporation Midstream determination of varying bandwidth availability
US7391717B2 (en) 2003-06-30 2008-06-24 Microsoft Corporation Streaming of variable bit rate multimedia content
US20040264489A1 (en) * 2003-06-30 2004-12-30 Klemets Anders E. Streaming of variable bit rate multimedia content
US20060092822A1 (en) * 2004-04-30 2006-05-04 Microsoft Corporation Session Description Message Extensions
US7783772B2 (en) 2004-04-30 2010-08-24 Microsoft Corporation Session description message extensions
US7809851B2 (en) 2004-04-30 2010-10-05 Microsoft Corporation Session description message extensions
US20080288722A1 (en) * 2005-04-19 2008-11-20 Streamezzo Method for Optimization of the Management of a Server Cache Which May be Consulted by Client Terminals with Differing Characteristics
US8275940B2 (en) * 2005-04-19 2012-09-25 Streamezzo Method and device for optimisation of the management of a server cache which may be consulted by client terminals with differing characteristics
US8316132B2 (en) * 2005-09-08 2012-11-20 Nokia Corporation Method to determine the completeness of a service guide
US20070055786A1 (en) * 2005-09-08 2007-03-08 Nokia Corporation Method to determine the completeness of a service guide
WO2012009191A1 (en) * 2010-07-12 2012-01-19 Alibaba Group Holding Limited Method and apparatus of processing nested fragment caching of a web page
US9195638B2 (en) 2010-07-12 2015-11-24 Alibaba Group Holding Limited Method and apparatus of processing nested fragment caching of a web page
US20150363369A1 (en) * 2010-07-12 2015-12-17 Alibaba Group Holding Limited Method and Apparatus of Processing Nested Fragment Caching of a Web Page
EP2593882A4 (en) * 2010-07-12 2016-12-14 Alibaba Group Holding Ltd Method and apparatus of processing nested fragment caching of a web page
US20130238970A1 (en) * 2012-03-07 2013-09-12 Google Inc. Uniquely Identifying Script Files
US9053199B2 (en) * 2012-03-07 2015-06-09 Google Inc. Uniquely identifying script files by appending a unique identifier to a URL

Also Published As

Publication number Publication date
US6249844B1 (en) 2001-06-19

Similar Documents

Publication Publication Date Title
USRE39184E1 (en) Identifying, processing and caching object fragments in a web environment
US5933832A (en) Retrieval system for frequently updated data distributed on network
US5991713A (en) Efficient method for compressing, storing, searching and transmitting natural language text
US6886130B1 (en) Compiled structure for efficient operation of distributed hypertext
US6230168B1 (en) Method for automatically constructing contexts in a hypertext collection
US6487566B1 (en) Transforming documents using pattern matching and a replacement language
US8332422B2 (en) Using text search engine for parametric search
US6449620B1 (en) Method and apparatus for generating information pages using semi-structured data stored in a structured manner
CA2522686C (en) Progressive relaxation of search criteria
US9703885B2 (en) Systems and methods for managing content variations in content delivery cache
US7334087B2 (en) Context-sensitive caching
US6564251B2 (en) Scalable computing system for presenting customized aggregation of information
US7130872B2 (en) Multi-tiered caching mechanism for the storage and retrieval of content multiple versions
US7124358B2 (en) Method for dynamically generating reference identifiers in structured information
US20020078041A1 (en) System and method of translating a universal query language to SQL
US20020016828A1 (en) Web page rendering architecture
US20150286744A1 (en) Using document templates to assemble a collection of documents
US20020078094A1 (en) Method and apparatus for XML visualization of a relational database and universal resource identifiers to database data and metadata
US7457812B2 (en) System and method for managing structured document
US6823492B1 (en) Method and apparatus for creating an index for a structured document based on a stylesheet
KR100456022B1 (en) An XML-based method of supplying Web-pages and its system for non-PC information terminals
US7793220B1 (en) Scalable derivative services
US6735594B1 (en) Transparent parameter marker support for a relational database over a network
Yoshida MOWS: distributed Web and cache server in Java
US20090164515A1 (en) Method and system for bit streaming for data centric applications

Legal Events

Date Code Title Description
FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 8

FPAY Fee payment

Year of fee payment: 12