WO2009158669A2 - Annotating webpage content - Google Patents

Annotating webpage content Download PDF

Info

Publication number
WO2009158669A2
WO2009158669A2 PCT/US2009/048944 US2009048944W WO2009158669A2 WO 2009158669 A2 WO2009158669 A2 WO 2009158669A2 US 2009048944 W US2009048944 W US 2009048944W WO 2009158669 A2 WO2009158669 A2 WO 2009158669A2
Authority
WO
WIPO (PCT)
Prior art keywords
annotations
webpage
annotation
collection
collections
Prior art date
Application number
PCT/US2009/048944
Other languages
French (fr)
Other versions
WO2009158669A3 (en
Inventor
Sang Van Le
Original Assignee
Google Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Google Inc. filed Critical Google Inc.
Priority to BRPI0914770A priority Critical patent/BRPI0914770A2/en
Priority to CA2729142A priority patent/CA2729142C/en
Priority to JP2011516734A priority patent/JP5931440B2/en
Priority to AU2009261977A priority patent/AU2009261977B2/en
Priority to CN200980132813.3A priority patent/CN102132273B/en
Priority to EP09771200.4A priority patent/EP2316081B1/en
Publication of WO2009158669A2 publication Critical patent/WO2009158669A2/en
Publication of WO2009158669A3 publication Critical patent/WO2009158669A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/166Editing, e.g. inserting or deleting
    • G06F40/169Annotation, e.g. comment data or footnotes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9558Details of hyperlinks; Management of linked annotations

Definitions

  • This specification relates to annotating webpage content.
  • Highlighted text can be produced on printed text, in a journal or a textbook. Highlighted text is also produced in electronic documents, using common word processing editors or pdf viewers. Typically, a reader marks text using highlighting means in order to display the text in a way that stands out on a paper or on a computer screen. One may add an annotation to the highlighted text to insert extra information related with the marked text. Also, an annotation can be inserted at a particular point in a document without marking the text at or near the annotation insertion point.
  • a computer-implemented method for annotating webpage content includes accessing a webpage in a browser, the webpage under control of a third party.
  • a collection of annotations stored at a storage location is retrieved, the annotations collection associated with the webpage and an annotations author.
  • the webpage is displayed with the retrieved annotations collection overlaid on the accessed webpage. Implementations can optionally include the following features. If the accessed webpage was modified by the third party, an alert is displayed, followed by removing from the annotations collection an annotation associated with the webpage modification. Also, when hovering a cursor over a highlighted text of the annotations collection, supplementary information associated with the highlighted text is displayed, including an address of a cross-referenced webpage. The user may access the cross-referenced webpage.
  • New annotations can be created and saved within the annotations collection.
  • a new annotation can be created by highlighting a text block of interest or by importing a stored annotation. Verification information associated with the text block, including text offsets and a hash mask is recorded. Then the user can input supplementary information associated with the highlighted text. The supplementary information includes, but is not limited to, notes and addresses of cross-referenced webpages.
  • the user can also edit or update displayed annotations.
  • the user can remove at least one annotation from the annotations collection overlaid on the webpage.
  • the annotations collection is updated to account for the at least one modified annotation. Additional annotations collections, available at the storage location, can be requested, retrieved and displayed alongside the initially retrieved annotations collection overlaid on the webpage. The user is alerted if a permission to access the additional annotations collections is denied.
  • a system for providing annotation information associated with web content includes a computerized device including a browser configured to access a webpage under control of a third party.
  • the system also includes an annotation information repository configured to store collections of annotations associated with the webpage accessed by the browser.
  • a browser plug-in is configured to retrieve a stored annotations collection. The browser displays the retrieved annotations collection overlaid on the accessed webpage.
  • the annotation information repository includes a database configured to store annotations collections.
  • the annotations collections are stored based on a set of identifiers which include a webpage address and an annotation author.
  • the annotation information repository also includes an authentication module configured to verify permissions to the annotations collections.
  • the annotations collection include verification records.
  • the verification records contain permissions and text mapping information.
  • the text mapping information further includes text offsets and a hash mask.
  • the annotations collection also includes supplementary records.
  • the supplementary records contain notes and addresses of cross-referenced webpages.
  • the browser plug-in is configured to update the annotations collection upon creation or removal of an annotation on the accessed webpage. Also the browser plug-in is configured to retrieve multiple annotations collections.
  • a system for annotating webpage content includes a computerized device which contains a browser configured to access a webpage under control of a third party.
  • the system also includes a collection of one or additional annotations made by an annotation author other than the third party associated with the webpage.
  • the annotations collection are stored at a storage location.
  • the system also contains a browser plug-in configured to retrieve a stored annotations collection. The browser displays the retrieved annotations collection overlaid on the accessed webpage.
  • System implementations can optionally include the following features.
  • the browser plug-in is configured to update the annotations collection upon creation or removal of an annotation on the accessed webpage. Also the browser plug-in is configured to give retrieve multiple annotations collections. The browser displays the retrieved multiple annotations collections overlaid on the accessed webpage, where the multiple annotations collections can be created by other annotation authors.
  • users can annotate the content of a webpage under control of a third party, by highlighting the information.
  • the annotated content can be retrieved and displayed in a browser from anywhere and at later time.
  • the appearance of the highlighted text is saved, as if the user has highlighted the information on paper, in an actual book.
  • the subject matter described in this specification can be implemented as a method or as a system or using computer program products, tangibly embodied in information carriers, such as a CD-ROM, a DVD-ROM, a HD-DVD-ROM, a Blue-Ray drive, a semiconductor memory, and a hard disk.
  • Such computer program products may cause a data processing apparatus to conduct one or more operations described in this specification.
  • the subject matter described in this specification can also be implemented as a system including a processor and a memory coupled to the processor.
  • the memory may encode one or more programs that cause the processor to perform one or more of the method acts described in this specification.
  • the subject matter described in this specification can be implemented using various data processing machines.
  • FIG. 1 is a schematic of an exemplary webpage content annotation system.
  • FIG. 2 is a schematic of an exemplary system configured to provide webpage content annotation information.
  • FIG. 3 shows an exemplary method for displaying annotations overlaid on a webpage under control of a third party.
  • FIG. 4 shows an exemplary method for displaying annotations overlaid on a webpage under control of a third party.
  • FIG. 5 shows an exemplary method for creating or removing annotations overlaid on a webpage under control of a third party.
  • FIG. 6 shows an exemplary method for creating or removing annotations overlaid on a webpage under control of a third party.
  • FIG. 7 shows an exemplary method for displaying annotations overlaid on a webpage under control of a third party.
  • FIG. 8 shows an exemplary method for displaying annotations overlaid on a webpage under control of a third party.
  • FIG. 9 is a schematic of an exemplary webpage content annotation system.
  • FIG. 10 shows an exemplary method for displaying annotations overlaid on a webpage under control of a third party.
  • FIG. 11 is a schematic diagram of a computerized electronic device. DETAILED DESCRIPTION
  • This document describes systems and techniques for annotating webpage content, the webpages under control of third parties.
  • a user can access a webpage about planet Jupiter, say at http:// ⁇ ; vi ⁇ ' ' w.nasa.gov/woiidbook ' 'jupiter__worldbook.html , and may highlight on the page the paragraph related to the time it takes Jupiter to spin around once around its axis.
  • the webpage resides on a remote server (at a NASA data center) on the internet and the user cannot modify the code of the webpage as the webpage is under control of a third party. Therefore, when the webpage is accessed either later in time, or from another point of access, the webpage does not feature the highlighted text, as prepared during the previous page visit.
  • a user accesses a webpage 25 under control of a third party, the webpage file located on a remote server 45 on the internet.
  • the webpage content annotation system 10 of FIG. 1 the user views the webpage 25 in a browser 20, including text highlights 27 or other annotations authored by the user during a previous access of the webpage. Therefore, the webpage content annotation system 10 allows the user to annotate the content of a webpage 25 under control of a third party, by highlighting the information.
  • the annotated content may be retrieved and displayed in a browser from anywhere and at later time.
  • the appearance of the highlighted text is saved, as if the user has highlighted the information on paper, in an actual book.
  • browser includes any HTML-enabled application that runs on a computerized electronic device.
  • the webpage 25 is accessed using a computerized electronic device 15 equipped with a browser 20.
  • the computerized electronic device 15 may be a PC, laptop, web-enabled PDA, mobile phone, etc.
  • the functionality of the browser 20 is extended with an annotation plug-in 30.
  • the annotation plug-in 30 is an add-on the GOOGLE toolbar.
  • the annotation plug-in 30 is added to the GMAIL toolbar.
  • the annotation plug-in 30 can work with any browser 20 or any third party web-based products or applications, not just the Google toolbar or the GMAIL toolbar.
  • the annotation plug-in 30 receives from the browser access to the viewing content. Furthermore, the browser displays content retrieved by the annotation plug-in.
  • the functionality of the annotation plug-in 30 may be incorporated in the browser 20.
  • the annotation plug-in 30 requests from an annotation information repository 50 annotated data 27 and other supplementary information associated with the accessed webpage 25.
  • the annotation information repository 50 is a local or a remote target that provides annotated data upon request.
  • a local annotation information repository may be a PC's hard disk or random access memory (RAM), or a PDA's flash memory/RAM.
  • the annotation plug-in 30 reads the annotated data 27 directly from the hard disk or RAM.
  • a remote annotation information repository 50 is located at a Google data center.
  • the annotated data 27 is stored in a database 60 on a data center server.
  • the annotation plug-in 30 makes a network request, such an HTTP/HTTPS request.
  • the browser 20 displays the accessed webpage 25 with the annotated data, if any, overlaid on the webpage 25.
  • the annotation plug-in 30 can manage the annotations in various ways including adding annotations, removing annotations, sorting annotations by dates, by webpage address, by number of annotations per webpage 25, etc.
  • FIG. 2 is a box diagram of an exemplary implementation of the annotation information repository 50.
  • a relational database 60 contains tables 62, each table corresponding to an annotation author.
  • the records in each table 62 contain collections of annotated data 200 grouped by an address of an annotated webpage. Therefore, an annotations collection 200 is a record associated with a user and a webpage address.
  • each table 62 of the relational database 60 corresponds to the address of the annotated webpage, and the records in each table 62 contain collections of annotated data 200 grouped by author.
  • the annotations collection (I, J) 200 may correspond to annotation author "I", and webpage address "J".
  • An annotations collection identifier 205 contains the name of the annotation author and the webpage address.
  • the name of the annotation author may be Igor
  • the address of the webpage containing information on Jupiter is http : //www.nasa. gov/ worldbook/j upiter worldbook.html .
  • the address of the webpage can be stored in the database as a text field.
  • the address of the webpage can be stored by its hash code. Database tables containing records stored in hash code format can by searched very efficiently.
  • a request for access 35 to specific annotations collections 200 is handled by an authentication module 70.
  • a user is given access only to the annotations collections 200 for which he is also identified as an annotation author.
  • the permissions to access an annotations collection 200 are set by the annotation author at the time the annotation collection 200 is made.
  • Each annotations collection (I, J) 200 contains two types of records: verification records 210, and supplementary records 235.
  • the verification records 210 include a permissions record 220, and a text mapping record 230.
  • the permissions record 220 contains authentication information.
  • the permissions record 220 may contain permissions given to other users, for example friends. In one implementation, a friend may be identified by the friend's email address.
  • the annotations author may give the friend access to an annotations collection or a subset of the annotations collection. For example, annotation author Igor can grant a classmate, Kate, access to Igor's annotation collection. Permissions may be only partially reciprocated. For example, Kate may choose not to grant Igor access to Kate's entire annotation collection, but only to a subset of her annotations collection.
  • the text mapping record 230 contains the text offsets and the hash mask of the highlighted text 27, thus the text mapping information is encrypted for storage.
  • the annotations collection 200 does not included the text itself, thus avoiding issues related to intellectual property rights.
  • the hash code of the highlighted text 27 is computed to detect differences between the highlighted text and the text in the currently accessed webpage.
  • the supplementary information may be optionally stored as part of an annotations collection.
  • the supplementary information 235 may include notes 240 about the highlighted text 27.
  • the supplementary information 235 may also include other relevant webpage addresses, cross-correlated 250 with the highlighted text.
  • a supplemental information place holder 260 is stored with the annotations collection 200.
  • FIG. 3 is an exemplary method for accessing a previously annotated webpages.
  • the method includes steps that can be implemented as instructions and executed in part by a processor in the computerized electronic device 15 and in part by a processor in the system located at the annotation information repository 50.
  • the method can begin with step 300 where the user accesses a webpage 25, the webpage under control of a third party.
  • the user may have accessed the webpage previously, and the user may have annotated the webpage on a previous visit.
  • the user may also be accessing the webpage for the first time.
  • the request for access is made from a web-browsing supported client application 20 on a computerized electronic device 15.
  • the request is done by typing in the webpage address, for example http://w ⁇ v'w.nasa.gov/worldbook/j upitcr_worldbook.html .
  • the annotations collection 200 associated with the webpage 25 is retrieved.
  • the annotations collection 200 being retrieved upon accessing the webpage 25 is a default annotations collection.
  • the default annotations collection for the accessed webpage 25 may contain only annotations authored by the user accessing the webpage 25.
  • the annotations contained in the default annotations collections were authored by the user during a previous visit of the currently accessed webpage 25.
  • the annotations collection 200 is being retrieved by an annotation plug-in 30.
  • the annotation plug-in 30 places a request 35 for annotation information to the annotation information repository 50 as shown in FIG. 2, the request 35 based on the identity of the user and the address of the webpage.
  • step 320 the browser 20 displays the retrieved annotations collection 200 overlaid on the webpage.
  • the user views the accessed webpage 25 including the highlighted text 27 and other annotations exactly as the annotations collection 200 overlaid on the webpage 25 appeared during his previous visit.
  • the user may peruse the webpage 25.
  • supplementary information 235 may be displayed 440.
  • the highlighted text 27 containing information on the " ancient astronomers naming Jupiter after the king of the Roman gods” may be cross-referenced 250 to a webpage related to Roman mythology, http://en.wikipedia.org/wiki/Roman_Mythology .
  • the cross-referenced webpage may be displayed in another browser window or another tab of the same browser window.
  • the annotations collection 200 associated with the highlighted text 27 on the " Egyptiant astronomers naming Jupiter after the king of the Roman gods” may include notes 240 on planet naming conventions, etc., authored by the user during previous accesses of the webpage 25.
  • the user may continue to study the webpage 25.
  • the user may create 530 another annotation. For example, using a mouse or a keyboard, the user may highlight the text containing information about the names of Jupiter's moons: "Jupiter's four largest satellites, in order of their distance from Jupiter, are Io, Europa, Ganymede, and Callisto.”
  • the user may also cross-reference 250 to the newly highlighted text the webpage on Roman mythology mentioned above.
  • the newly added annotation is saved 540 as part of the annotation collection 200 associated with the currently accessed webpage 25 and current annotation author.
  • annotations collection (I, J) 200 associated with annotations made by Igor on the webpage about Jupiter, now includes two highlighted text blocks 27, a cross-reference 250 webpage associated with both highlighted text blocks 27, and notes 240 associated with one of the highlighted text blocks 27.
  • the newly saved annotations collection 200 is stored in the database 60 at the annotation information repository 50.
  • an annotation author can make changes to the supplemental information 235 that is associated to an existing highlighted text block 27.
  • the annotations author may add additional notes 240 and cross-references 250.
  • the annotations author may also remove existing supplemental information 235 associated with an existing highlighted text block 27, including notes 240 and cross-references 250.
  • the modifications made in regard to the supplemental information 235 associated with the highlighted text block 27 is saved as part of the annotations collection 200 associated with accessed webpage 25.
  • the user can remove the unwanted annotation and save the annotations collection 200 to reflect the removal of the unwanted annotation.
  • the user may remove an annotation entirely, by de -highlighting a text block 27 using a mouse or keyboard.
  • the supplementary information 235 associated with the highlighted text 27 is also discarded.
  • the highlighted text 27 of an annotation may be preserved, but the supplementary information 235 associated with the highlighted text 27 may be modified (by adding to it, or partially/entirely removing it.)
  • the newly modified annotations collection 200 is saved and stored in the database 60 on a server of the annotation information repository 50.
  • the step 530 of creating a new annotation, and the step 540 of saving the annotation as part of the annotations collection 200 associated with the accessed page 25 can be repeated as many times as necessary.
  • the step 535 of removing an unwanted annotation, and the step 540 of saving the annotations collection 200 associated with the accessed page 25 can also be repeated as many times as necessary.
  • FIG. 6 Another exemplary method of annotating an accessed webpage 25 are described in FIG. 6.
  • the user may decide to mark a portion of the webpage by highlighting 610 a block of text.
  • the offsets of where the text starts and ends within the web document 25 are captured by the annotation plug-in 30, and the hash code of the text is computed 620.
  • the information collected in step 620 is stored as part of the text mapping record 230 included in an annotations collection 200.
  • the annotation plug-in 30 modifies the accessed webpage 25, which is currently loaded in the memory of the computerized electronic device 15. The modification includes adding appropriate HTML tags to display the highlighted text 27.
  • the user may highlight 610 another portion of the text of the accessed webpage 25 and not “loose” the highlight-property of the previously highlighted text.
  • the user may input notes 240 associated with the highlighted text 27.
  • the user may also add the address of cross- referenced 250 webpages associated with the highlighted text 27.
  • the information input in step 630 is stored as part of the supplementary record 235 included in the annotations collection 200.
  • the newly added annotation is saved as part of the annotation collection 200 associated with the currently accessed webpage 25 and current annotation author.
  • an annotation author can highlight a paragraph that already contains a previously highlighted text block 27.
  • the annotation plug-in 30 prompts the user with the choice of overwriting the previously highlighted text block 27.
  • the annotation author also has the choice of retaining the previously entered supplementary information 235 as part of the newly created annotations collection 200. Therefore, if the annotation author chooses so, only one annotation is associated with the highlighted paragraph, instead of more than one annotations.
  • the annotation plug-in 30 may inform 730 the user that other annotations collections 200' associated with the accessed webpage 25 are available at the annotation information repository 50.
  • other annotations collections 200' associated with the accessed page 25 about Jupiter may have been generated by the method described in FIGs 5-6 by a different annotations author, Kate, a member of Igor's study group.
  • Kate collected information on the physical properties of Jupiter, thus the annotations collection 200' created by Kate includes a highlighted text 74 stating that "Jupiter is a giant ball of gas and liquid with little, if any, solid surface.”
  • a request 740 for annotation information is placed by the annotation plug-in 30 to the annotation information repository 50.
  • the annotation plug-in 30 may retrieve the additional annotations collection (K, J) 200' on the webpage 25 about Jupiter, associated with annotation author Kate. The retrieving is possible only if Igor, the viewer of the accessed webpage 25 has the necessary permissions 220 to retrieve the additional annotations collection (K, J) 200'.
  • the browser 20 displays the retrieved additional annotations collection 200' overlaid on the accessed webpage 25.
  • the accessed webpage 25 showcases at least two annotations collections 200 and 200' on the webpage 25 about Jupiter, one annotations collection (I, J) 200 authored by Igor (the current viewer) and another annotations collection (K, J) 200' authored by Kate (a member of Igor's study group).
  • the retrieved additional annotations collection 200' may be saved as annotations collection 200 associated with the accessed webpage 25.
  • the webpage content annotation system 10 provides a convenient environment to share information collaboratively.
  • the retrieving 750 of other available annotations collection may be occurring locally.
  • the available annotations collection 200' may be stored on a flash memory (or memory stick) part of (or attached to) the computerized electronic device 15 used by the user of the webpage content annotation system 10.
  • the step of accessing 320 the webpage 25 causes the annotation plug-in 30 to verify 810 with the annotation information repository 50 if a default annotations collection exists for the accessed webpage 25.
  • Such default annotations collection for the accessed webpage 25 may contain only annotations authored by the user accessing the webpage 25.
  • the annotations contained in the default annotations collections were authored by the user during a previous visit of the currently accessed webpage 25. If the annotations information repository 50 does not store such default annotations collection, the browser 20 displays 820 the accessed webpage 25 as is. If a default annotations collection is stored by the annotations information repository 50, then the annotation plug-in 30 retrieves and displays 320 the default annotations collections.
  • the method continues by informing 730 the viewer of other available annotations collections 200', followed by the user requesting 740 other available annotations collections 200'.
  • the annotation plug-in 30 verifies if Igor, the current viewer of the webpage 25 about Jupiter, has permission to view the available annotations collection (K, J) 200', associated with the webpage 25 about Jupiter and authored by Kate.
  • the permissions information 220 is part of the verification record 210 of an annotations collection 200', as shown in FIG. 2.
  • permission to retrieve an annotations collection 200 is verified via the authentication module 70 of the annotation information repository 50.
  • the permission information may also be part of the supplementary information. For example, some of the supplementary notes can be shared/exported between users, but some other supplementary notes are restricted for the author's use.
  • the annotation plug-in 30 displays 820 an alert message for user "I” on the client computerized electronic device 15. If on the other hand, user "I" has permission to retrieve the available annotations collection (K, J) 200', the annotation plug-in 30 retrieves the available annotations collection 200', in step 750. Then, the browser displays the retrieved additional annotations collection 200' overlaid on the accessed webpage 25.
  • the accessed webpage 25 includes for viewing at least two annotations collections 200 and 200' on the webpage 25 about Jupiter, one annotations collection (I, J) 200 authored by Igor (the current viewer) and another annotations collection (K, J) 200' authored by Kate (a member of Igor's study group).
  • the annotations collections 200 can be shared via permissions.
  • An annotations author for example Kate, may choose to share her annotation collections at the granularity of webpage by using a web-based sharing tool to grant or revoke permissions on accessing the annotation collection.
  • Another user may want to retrieve an annotations collections 200 shared by Kate. For example, while viewing a webpage 25 at address "J", Igor may request the annotation information repository 50 for access to the annotations collection 200 authored by Kate in association with webpage address "J". In one implementation the request is an HTTP request.
  • both Igor and Kate may be, for example, Google toolbar users or GMAIL users. In this example the annotation information repository 50 would be Google.
  • Igor may be a GMAIL user and Kate a Yahoo! Email user. Then Igor requests for Kate's annotations collection 200 from Yahoo!, because in this example Yahoo! would be the annotation information repository 50.
  • annotation information repositories 50, Yahoo! and Google would have a common interface 35 for requesting annotations collections 200.
  • the common interface for requesting 35 annotations collections 200 may be implemented using SOAP.
  • the annotations collections may also be shared via exporting and importing on the local computerized electronic device or on a local network folder.
  • Kate exports and saves her annotations collections to a file.
  • the file is then transferred to Igor via email or some other type of file transfer, for example, by using memory stick, a local network folder, etc.
  • the annotations collection saved on the local electronic device may contain no permissions information record.
  • Igor retrieves the file onto his computerized electronic device, the annotations plug-in added on to Igor's browser can import the annotation collection.
  • the use of the exemplary method described above is illustrated schematically in FIG. 9.
  • one user accesses a webpage 25 about Jupiter, the webpage 25 about Jupiter being controlled by a third part (NASA) 45.
  • the webpage content annotation system 10 used by Igor allows him to display annotations overlaid on the accessed webpage 25.
  • the annotations were authored in part by Igor during a previous visit of the accessed webpage, and in part by a member of his study group, Kate.
  • the annotations are stored by an annotation information repository 50 and retrieved via an annotation plug-in 30.
  • the annotations are stored in a database 60 on the annotation information repository 50 server, and are recorded as annotations collections 200 and 200' associated with the webpage address J, for example and also associated with an annotation author, Igor or Kate, respectively.
  • the annotations collections associated with a webpage address may be accessed by the annotations collection author. For example, Igor's highlights at the webpage address J, are only accessible by Igor. Kate's highlights at that same webpage address J are only accessible by Kate. Therefore, a webpage 25 under control of a third party 45 becomes a virtual book for users who have access to the web, with the added benefit that the virtual book can accommodate personalized highlights and annotations.
  • the step 1010 of retrieving an annotations collection 200 for overlay with the accessed webpage 25 is followed by a verification step 1020.
  • the webpage 25 is inherently controlled by a third party 45, therefore the accessed webpage 25 may have been modified since the time the user had accessed it.
  • the annotations collection 200 includes a text-mapping record 230 to verify possible changes in the webpage content.
  • the annotation plug-in 30 compares each pair of starting and ending text offsets and hash masks of the highlighted text 27 included in the retrieved annotations collections 200, with the corresponding text properties of the accessed webpage 25.
  • the annotation plug-in 30 modifies the accessed webpage 25, which is now loaded in the memory of the computerized electronic device 15, the modification includes adding appropriate HTML tags to display the highlighted text 27, as shown in step 320. [0055] If during the matching step 1030, the annotation plug-in 30 determines that the hash codes do not match, the web-contented annotation plug-in 30 displays 1040 an alert to inform the user of the un-matching, or partially un-matching highlighted text 27. The annotations collection 200, or part of it, is rendered orphan. In one implementation, the annotation plug-in 30 discards 1050 from the annotation information repository database 60 the annotations collection 200 associated with the missing highlighted text.
  • the annotation plug-in 30 updates the annotations collection 200 to account for the missing highlighted text.
  • the annotation plug-in 30 offers the annotation author the choice to retain the supplementary information 235 associated with the "orphan" annotation.
  • the retained supplementary information 235 is saved as part of the annotation collection 200 and may be associated with another highlighted text block 27 or with a newly created annotation.
  • FIG. 11 is a schematic diagram of a computer system 1100 representing the computerized electronic device 15. Also the computer system 1100 can represent a server at the annotation information repository 50.
  • the system 1100 can be used for the operations described in association with any of the computer-implement methods described previously, according to one implementation.
  • the system 1100 is intended to include various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers.
  • the system 1100 can also include mobile devices, such as personal digital assistants, cellular telephones, smartphones, and other similar computing devices.
  • the system can include portable storage media, such as, Universal Serial Bus (USB) flash drives.
  • USB flash drives may store operating systems and other applications.
  • the USB flash drives can include input/output components, such as a wireless transmitter or USB connector that may be inserted into a USB port of another computing device.
  • the system 1100 includes a processor 1110, a memory 1120, a storage device 1130, and an input/output device 1140. Each of the components 1110, 1120, 1130, and 1140 are interconnected using a system bus 1150.
  • the processor 1110 is capable of processing instructions for execution within the system 1100. In one implementation, the processor 1110 is a single-threaded processor. In another implementation, the processor 1110 is a multithreaded processor.
  • the processor 1110 is capable of processing instructions stored in the memory 1120 or on the storage device 1130 to display graphical information for a user interface on the input/output device 1140.
  • the memory 1120 stores information within the system 1100.
  • the memory 1120 is a computer-readable medium.
  • the memory 1120 is a volatile memory unit.
  • the memory 1120 is a non- volatile memory unit.
  • the storage device 1130 is capable of providing mass storage for the system 1100.
  • the storage device 1130 is a computer-readable medium.
  • the storage device 1130 may be a floppy disk device, a hard disk device, an optical disk device, or a tape device.
  • the input/output device 1140 provides input/output operations for the system 1100.
  • the input/output device 1140 includes a keyboard and/or pointing device.
  • the input/output device 1140 includes a display unit for displaying graphical user interfaces.
  • the features described can be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations of them.
  • the apparatus can be implemented in a computer program product tangibly embodied in an information carrier, e.g., in a machine-readable storage device or in a propagated signal, for execution by a programmable processor; and method steps can be performed by a programmable processor executing a program of instructions to perform functions of the described implementations by operating on input data and generating output.
  • the described features can be implemented advantageously in one or more computer programs that are executable on a programmable system including at least one programmable processor coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system, at least one input device, and at least one output device.
  • a computer program is a set of instructions that can be used, directly or indirectly, in a computer to perform a certain activity or bring about a certain result.
  • a computer program can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment.
  • Suitable processors for the execution of a program of instructions include, by way of example, both general and special purpose microprocessors, and the sole processor or one of multiple processors of any kind of computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both.
  • the essential elements of a computer are a processor for executing instructions and one or more memories for storing instructions and data.
  • a computer will also include, or be operatively coupled to communicate with, one or more mass storage devices for storing data files; such devices include magnetic disks, such as internal hard disks and removable disks; magnetooptical disks; and optical disks.
  • Storage devices suitable for tangibly embodying computer program instructions and data include all forms of nonvolatile memory, including by way of example semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM and DVD- ROM disks.
  • the processor and the memory can be supplemented by, or incorporated in, ASICs (application- specific integrated circuits).
  • the features can be implemented on a computer having a display device such as a CRT (cathode ray tube) or LCD (liquid crystal display) monitor for displaying information to the user and a keyboard and a pointing device such as a mouse or a trackball by which the user can provide input to the computer.
  • a display device such as a CRT (cathode ray tube) or LCD (liquid crystal display) monitor for displaying information to the user and a keyboard and a pointing device such as a mouse or a trackball by which the user can provide input to the computer.
  • the features can be implemented in a computer system that includes a back-end component, such as a data server, or that includes a middleware component, such as an application server or a web server, or that includes a front-end component, such as a client computer having a graphical user interface or an Internet browser, or any combination of them.
  • the components of the system can be connected by any form or medium of digital data communication such as a communication network. Examples of communication networks include a local area network (“LAN”), a wide area network (“WAN”), peer-to-peer networks (having ad-hoc or static members), grid computing infrastructures, and the Internet.
  • LAN local area network
  • WAN wide area network
  • peer-to-peer networks having ad-hoc or static members
  • grid computing infrastructures and the Internet.
  • the computer system can include clients and servers.
  • a client and server are generally remote from each other and typically interact through a network, such as the described one.
  • the relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
  • webpage content annotation system may be integrated with a targeted advertising system.
  • Ads can be shown/displayed to specifically match the annotated/highlighted text of a viewer's annotations collections stored at an annotation information repository. For example, a user who reads about the history of shoes and highlights information on shoes, may indicate affinity to seeing ads about shoes on sale, etc.
  • annotation information system may be used on internets or extranets for collaborative work to improve work/productivity: For example, employees of a company may use the webpage content annotation system and methods for reading wiki pages or online user guides, making observations upon the content of the online user guides and sharing their observations collaboratively.

Abstract

Among other things, a computer-implemented method for annotating webpage content includes accessing a webpage in a browser, the webpage under control of a third party. A collection of annotations stored at a storage location is retrieved, the annotations collection associated with the webpage and an annotations author. The webpage is displayed with the retrieved annotations collection overlaid on the accessed webpage.

Description

ANNOTATING WEBPAGE CONTENT
TECHNICAL FIELD
[0001] This specification relates to annotating webpage content.
BACKGROUND
[0002] Highlighted text can be produced on printed text, in a journal or a textbook. Highlighted text is also produced in electronic documents, using common word processing editors or pdf viewers. Typically, a reader marks text using highlighting means in order to display the text in a way that stands out on a paper or on a computer screen. One may add an annotation to the highlighted text to insert extra information related with the marked text. Also, an annotation can be inserted at a particular point in a document without marking the text at or near the annotation insertion point.
SUMMARY
[0003] This specification describes methods and systems for annotating webpage content, the webpages under control of third parties.
[0004] In one aspect, a computer-implemented method for annotating webpage content includes accessing a webpage in a browser, the webpage under control of a third party. A collection of annotations stored at a storage location is retrieved, the annotations collection associated with the webpage and an annotations author. The webpage is displayed with the retrieved annotations collection overlaid on the accessed webpage. Implementations can optionally include the following features. If the accessed webpage was modified by the third party, an alert is displayed, followed by removing from the annotations collection an annotation associated with the webpage modification. Also, when hovering a cursor over a highlighted text of the annotations collection, supplementary information associated with the highlighted text is displayed, including an address of a cross-referenced webpage. The user may access the cross-referenced webpage.
[0005] Further implementations can optionally include the following features. New annotations can be created and saved within the annotations collection. A new annotation can be created by highlighting a text block of interest or by importing a stored annotation. Verification information associated with the text block, including text offsets and a hash mask is recorded. Then the user can input supplementary information associated with the highlighted text. The supplementary information includes, but is not limited to, notes and addresses of cross-referenced webpages. The user can also edit or update displayed annotations. Furthermore, the user can remove at least one annotation from the annotations collection overlaid on the webpage. The annotations collection is updated to account for the at least one modified annotation. Additional annotations collections, available at the storage location, can be requested, retrieved and displayed alongside the initially retrieved annotations collection overlaid on the webpage. The user is alerted if a permission to access the additional annotations collections is denied.
[0006] In another aspect, a system for providing annotation information associated with web content includes a computerized device including a browser configured to access a webpage under control of a third party. The system also includes an annotation information repository configured to store collections of annotations associated with the webpage accessed by the browser. A browser plug-in is configured to retrieve a stored annotations collection. The browser displays the retrieved annotations collection overlaid on the accessed webpage.
[0007] System implementations can optionally include the following features. The annotation information repository includes a database configured to store annotations collections. The annotations collections are stored based on a set of identifiers which include a webpage address and an annotation author. The annotation information repository also includes an authentication module configured to verify permissions to the annotations collections. The annotations collection include verification records. The verification records contain permissions and text mapping information. The text mapping information further includes text offsets and a hash mask. The annotations collection also includes supplementary records. The supplementary records contain notes and addresses of cross-referenced webpages. The browser plug-in is configured to update the annotations collection upon creation or removal of an annotation on the accessed webpage. Also the browser plug-in is configured to retrieve multiple annotations collections. The browser displays the retrieved multiple annotations collections overlaid on the accessed webpage, where the multiple annotations collections can be created by other annotation authors. [0008] In yet another aspect, a system for annotating webpage content includes a computerized device which contains a browser configured to access a webpage under control of a third party. The system also includes a collection of one or additional annotations made by an annotation author other than the third party associated with the webpage. The annotations collection are stored at a storage location. The system also contains a browser plug-in configured to retrieve a stored annotations collection. The browser displays the retrieved annotations collection overlaid on the accessed webpage.
[0009] System implementations can optionally include the following features. The browser plug-in is configured to update the annotations collection upon creation or removal of an annotation on the accessed webpage. Also the browser plug-in is configured to give retrieve multiple annotations collections. The browser displays the retrieved multiple annotations collections overlaid on the accessed webpage, where the multiple annotations collections can be created by other annotation authors.
[0010] The subject matter described in this specification potentially can provide various advantages. For example, users can annotate the content of a webpage under control of a third party, by highlighting the information. The annotated content can be retrieved and displayed in a browser from anywhere and at later time. The appearance of the highlighted text is saved, as if the user has highlighted the information on paper, in an actual book.
[0011] The subject matter described in this specification can be implemented as a method or as a system or using computer program products, tangibly embodied in information carriers, such as a CD-ROM, a DVD-ROM, a HD-DVD-ROM, a Blue-Ray drive, a semiconductor memory, and a hard disk. Such computer program products may cause a data processing apparatus to conduct one or more operations described in this specification.
[0012] In addition, the subject matter described in this specification can also be implemented as a system including a processor and a memory coupled to the processor. The memory may encode one or more programs that cause the processor to perform one or more of the method acts described in this specification. Further the subject matter described in this specification can be implemented using various data processing machines. [0013] Other features, objects, and advantages of the subject matter of this specification will be apparent from the description and drawings, and from the claims.
DESCRIPTION OF DRAWINGS
[0014] FIG. 1 is a schematic of an exemplary webpage content annotation system.
[0015] FIG. 2 is a schematic of an exemplary system configured to provide webpage content annotation information.
[0016] FIG. 3 shows an exemplary method for displaying annotations overlaid on a webpage under control of a third party.
[0017] FIG. 4 shows an exemplary method for displaying annotations overlaid on a webpage under control of a third party.
[0018] FIG. 5 shows an exemplary method for creating or removing annotations overlaid on a webpage under control of a third party.
[0019] FIG. 6 shows an exemplary method for creating or removing annotations overlaid on a webpage under control of a third party.
[0020] FIG. 7 shows an exemplary method for displaying annotations overlaid on a webpage under control of a third party.
[0021] FIG. 8 shows an exemplary method for displaying annotations overlaid on a webpage under control of a third party.
[0022] FIG. 9 is a schematic of an exemplary webpage content annotation system.
[0023] FIG. 10 shows an exemplary method for displaying annotations overlaid on a webpage under control of a third party.
[0024] FIG. 11 is a schematic diagram of a computerized electronic device. DETAILED DESCRIPTION
[0025] This document describes systems and techniques for annotating webpage content, the webpages under control of third parties. For example, a user can access a webpage about planet Jupiter, say at http://Λ;viΛ' 'w.nasa.gov/woiidbook''jupiter__worldbook.html , and may highlight on the page the paragraph related to the time it takes Jupiter to spin around once around its axis. The webpage resides on a remote server (at a NASA data center) on the internet and the user cannot modify the code of the webpage as the webpage is under control of a third party. Therefore, when the webpage is accessed either later in time, or from another point of access, the webpage does not feature the highlighted text, as prepared during the previous page visit.
[0026] In the exemplary system shown in FIG. 1 , a user accesses a webpage 25 under control of a third party, the webpage file located on a remote server 45 on the internet. By using the webpage content annotation system 10 of FIG. 1, the user views the webpage 25 in a browser 20, including text highlights 27 or other annotations authored by the user during a previous access of the webpage. Therefore, the webpage content annotation system 10 allows the user to annotate the content of a webpage 25 under control of a third party, by highlighting the information. The annotated content may be retrieved and displayed in a browser from anywhere and at later time. The appearance of the highlighted text is saved, as if the user has highlighted the information on paper, in an actual book. In this specification, browser includes any HTML-enabled application that runs on a computerized electronic device.
[0027] The webpage 25 is accessed using a computerized electronic device 15 equipped with a browser 20. The computerized electronic device 15 may be a PC, laptop, web-enabled PDA, mobile phone, etc. The functionality of the browser 20 is extended with an annotation plug-in 30. In one implementation, the annotation plug-in 30 is an add-on the GOOGLE toolbar. In another implementation, the annotation plug-in 30 is added to the GMAIL toolbar. In another aspect, the annotation plug-in 30 can work with any browser 20 or any third party web-based products or applications, not just the Google toolbar or the GMAIL toolbar. The annotation plug-in 30 receives from the browser access to the viewing content. Furthermore, the browser displays content retrieved by the annotation plug-in. In yet another implementation, the functionality of the annotation plug-in 30 may be incorporated in the browser 20. [0028] In an exemplary implementation, the annotation plug-in 30 requests from an annotation information repository 50 annotated data 27 and other supplementary information associated with the accessed webpage 25. The annotation information repository 50 is a local or a remote target that provides annotated data upon request. In one example, a local annotation information repository may be a PC's hard disk or random access memory (RAM), or a PDA's flash memory/RAM. For local targets, the annotation plug-in 30 reads the annotated data 27 directly from the hard disk or RAM. In the example sketched in FIG. 1, a remote annotation information repository 50 is located at a Google data center. The annotated data 27 is stored in a database 60 on a data center server. For remote targets, the annotation plug-in 30 makes a network request, such an HTTP/HTTPS request.
[0029] Once the annotation plug-in 30 has retrieved the available annotated data 27 associated with the accessed webpage, the browser 20 displays the accessed webpage 25 with the annotated data, if any, overlaid on the webpage 25. The annotation plug-in 30 can manage the annotations in various ways including adding annotations, removing annotations, sorting annotations by dates, by webpage address, by number of annotations per webpage 25, etc.
[0030] FIG. 2 is a box diagram of an exemplary implementation of the annotation information repository 50. In this implementation, a relational database 60 contains tables 62, each table corresponding to an annotation author. The records in each table 62 contain collections of annotated data 200 grouped by an address of an annotated webpage. Therefore, an annotations collection 200 is a record associated with a user and a webpage address. In another implementation, each table 62 of the relational database 60 corresponds to the address of the annotated webpage, and the records in each table 62 contain collections of annotated data 200 grouped by author. In the exemplary implementation sketched in FIG. 2, the annotations collection (I, J) 200 may correspond to annotation author "I", and webpage address "J". An annotations collection identifier 205 contains the name of the annotation author and the webpage address. Referring to the example illustrated in FIG. 1, the name of the annotation author may be Igor, and the address of the webpage containing information on Jupiter is http : //www.nasa. gov/ worldbook/j upiter worldbook.html . In this example, the address of the webpage can be stored in the database as a text field. In another aspect, the address of the webpage can be stored by its hash code. Database tables containing records stored in hash code format can by searched very efficiently. [0031] Returning to FIG. 2, a request for access 35 to specific annotations collections 200 is handled by an authentication module 70. In an exemplary implementation, a user is given access only to the annotations collections 200 for which he is also identified as an annotation author. In one implementation, the permissions to access an annotations collection 200 are set by the annotation author at the time the annotation collection 200 is made.
[0032] Each annotations collection (I, J) 200 contains two types of records: verification records 210, and supplementary records 235. The verification records 210 include a permissions record 220, and a text mapping record 230. The permissions record 220 contains authentication information. Furthermore, the permissions record 220 may contain permissions given to other users, for example friends. In one implementation, a friend may be identified by the friend's email address. Moreover, the annotations author may give the friend access to an annotations collection or a subset of the annotations collection. For example, annotation author Igor can grant a classmate, Kate, access to Igor's annotation collection. Permissions may be only partially reciprocated. For example, Kate may choose not to grant Igor access to Kate's entire annotation collection, but only to a subset of her annotations collection.
[0033] The text mapping record 230 contains the text offsets and the hash mask of the highlighted text 27, thus the text mapping information is encrypted for storage. By storing only the offsets of where the highlighted text 27 starts and where it ends within the web document 25, the annotations collection 200 does not included the text itself, thus avoiding issues related to intellectual property rights. Additionally, the hash code of the highlighted text 27 is computed to detect differences between the highlighted text and the text in the currently accessed webpage. The supplementary information may be optionally stored as part of an annotations collection. The supplementary information 235 may include notes 240 about the highlighted text 27. The supplementary information 235 may also include other relevant webpage addresses, cross-correlated 250 with the highlighted text. In case the annotations author does not choose to include supplemental information235, a supplemental information place holder 260 is stored with the annotations collection 200.
[0034] FIG. 3 is an exemplary method for accessing a previously annotated webpages. In one implementation the method includes steps that can be implemented as instructions and executed in part by a processor in the computerized electronic device 15 and in part by a processor in the system located at the annotation information repository 50. The method can begin with step 300 where the user accesses a webpage 25, the webpage under control of a third party. The user may have accessed the webpage previously, and the user may have annotated the webpage on a previous visit. The user may also be accessing the webpage for the first time. The request for access is made from a web-browsing supported client application 20 on a computerized electronic device 15. The request is done by typing in the webpage address, for example http://w\v'w.nasa.gov/worldbook/j upitcr_worldbook.html .
[0035] During the second step of the method, the annotations collection 200 associated with the webpage 25 is retrieved. In one implementation, the annotations collection 200 being retrieved upon accessing the webpage 25 is a default annotations collection. The default annotations collection for the accessed webpage 25 may contain only annotations authored by the user accessing the webpage 25. The annotations contained in the default annotations collections were authored by the user during a previous visit of the currently accessed webpage 25. The annotations collection 200 is being retrieved by an annotation plug-in 30. The annotation plug-in 30 places a request 35 for annotation information to the annotation information repository 50 as shown in FIG. 2, the request 35 based on the identity of the user and the address of the webpage.
[0036] Returning to FIG. 3, in step 320, the browser 20 displays the retrieved annotations collection 200 overlaid on the webpage. In one implementation, if the default annotations collection has been retrieved, the user views the accessed webpage 25 including the highlighted text 27 and other annotations exactly as the annotations collection 200 overlaid on the webpage 25 appeared during his previous visit.
[0037] Referring to FIG. 4, once the browser 20 displays the retrieved annotations collection 200 overlaid on the accessed webpage 25, the user may peruse the webpage 25. As the mouse hovers 430 over the highlighted text 27, supplementary information 235 may be displayed 440. For example, the highlighted text 27 containing information on the "ancient astronomers naming Jupiter after the king of the Roman gods" may be cross-referenced 250 to a webpage related to Roman mythology, http://en.wikipedia.org/wiki/Roman_Mythology . As shown in step 450, by right-clicking on the cross-referenced 250 address, the cross-referenced webpage may be displayed in another browser window or another tab of the same browser window. The annotations collection 200 associated with the highlighted text 27 on the "ancient astronomers naming Jupiter after the king of the Roman gods" may include notes 240 on planet naming conventions, etc., authored by the user during previous accesses of the webpage 25.
[0038] Referring to FIG. 5, once the browser 20 displays 320 the retrieved annotations collection 200 overlaid on the accessed webpage 25, the user may continue to study the webpage 25. When the user finds additional information on the webpage 25, say information relevant for his assignment, the user may create 530 another annotation. For example, using a mouse or a keyboard, the user may highlight the text containing information about the names of Jupiter's moons: "Jupiter's four largest satellites, in order of their distance from Jupiter, are Io, Europa, Ganymede, and Callisto." The user may also cross-reference 250 to the newly highlighted text the webpage on Roman mythology mentioned above. The newly added annotation is saved 540 as part of the annotation collection 200 associated with the currently accessed webpage 25 and current annotation author. For example, the annotations collection (I, J) 200, associated with annotations made by Igor on the webpage about Jupiter, now includes two highlighted text blocks 27, a cross-reference 250 webpage associated with both highlighted text blocks 27, and notes 240 associated with one of the highlighted text blocks 27. The newly saved annotations collection 200 is stored in the database 60 at the annotation information repository 50.
[0039] In another aspect, an annotation author can make changes to the supplemental information 235 that is associated to an existing highlighted text block 27. The annotations author may add additional notes 240 and cross-references 250. The annotations author may also remove existing supplemental information 235 associated with an existing highlighted text block 27, including notes 240 and cross-references 250. The modifications made in regard to the supplemental information 235 associated with the highlighted text block 27 is saved as part of the annotations collection 200 associated with accessed webpage 25.
[0040] Conversely, if the annotations collection 200 contains an annotation that is no longer of interest to the user, the user can remove the unwanted annotation and save the annotations collection 200 to reflect the removal of the unwanted annotation. In one implementation, the user may remove an annotation entirely, by de -highlighting a text block 27 using a mouse or keyboard. The supplementary information 235 associated with the highlighted text 27 is also discarded. In another implementation, the highlighted text 27 of an annotation may be preserved, but the supplementary information 235 associated with the highlighted text 27 may be modified (by adding to it, or partially/entirely removing it.) The newly modified annotations collection 200 is saved and stored in the database 60 on a server of the annotation information repository 50. In the exemplary implementation illustrated schematically in FIG. 5, the step 530 of creating a new annotation, and the step 540 of saving the annotation as part of the annotations collection 200 associated with the accessed page 25 can be repeated as many times as necessary. Moreover, the step 535 of removing an unwanted annotation, and the step 540 of saving the annotations collection 200 associated with the accessed page 25 can also be repeated as many times as necessary.
[0041] Another exemplary method of annotating an accessed webpage 25 are described in FIG. 6. Once a desired webpage 25 is accessed 300, the user may decide to mark a portion of the webpage by highlighting 610 a block of text. The offsets of where the text starts and ends within the web document 25 are captured by the annotation plug-in 30, and the hash code of the text is computed 620. The information collected in step 620 is stored as part of the text mapping record 230 included in an annotations collection 200. Furthermore, after the text highlighting step 610, the annotation plug-in 30 modifies the accessed webpage 25, which is currently loaded in the memory of the computerized electronic device 15. The modification includes adding appropriate HTML tags to display the highlighted text 27. By "fixing" the recently highlighted text 27 in this manner, the user may highlight 610 another portion of the text of the accessed webpage 25 and not "loose" the highlight-property of the previously highlighted text. In step 630 of the exemplary method shown in FIG. 6, the user may input notes 240 associated with the highlighted text 27. The user may also add the address of cross- referenced 250 webpages associated with the highlighted text 27. The information input in step 630 is stored as part of the supplementary record 235 included in the annotations collection 200. In step 540, the newly added annotation is saved as part of the annotation collection 200 associated with the currently accessed webpage 25 and current annotation author.
[0042] In another aspect, an annotation author can highlight a paragraph that already contains a previously highlighted text block 27. The annotation plug-in 30 prompts the user with the choice of overwriting the previously highlighted text block 27. At this time, the annotation author also has the choice of retaining the previously entered supplementary information 235 as part of the newly created annotations collection 200. Therefore, if the annotation author chooses so, only one annotation is associated with the highlighted paragraph, instead of more than one annotations.
[0043] In another implementation illustrated in FIG. 7, once the browser 20 displays 320 the retrieved annotations collection 200 overlaid on the accessed webpage 25, the annotation plug-in 30 may inform 730 the user that other annotations collections 200' associated with the accessed webpage 25 are available at the annotation information repository 50. For example, other annotations collections 200' associated with the accessed page 25 about Jupiter may have been generated by the method described in FIGs 5-6 by a different annotations author, Kate, a member of Igor's study group. Kate collected information on the physical properties of Jupiter, thus the annotations collection 200' created by Kate includes a highlighted text 74 stating that "Jupiter is a giant ball of gas and liquid with little, if any, solid surface." Kate added to the annotations collection 200' the address of a cross-referenced 250 webpage, http ://en. wikipcdia .org/wiki/ Alternative biochemistry , concerning the possibility of non-carbon based life.
[0044] Returning to FIG. 7, a request 740 for annotation information is placed by the annotation plug-in 30 to the annotation information repository 50. The annotation plug-in 30 may retrieve the additional annotations collection (K, J) 200' on the webpage 25 about Jupiter, associated with annotation author Kate. The retrieving is possible only if Igor, the viewer of the accessed webpage 25 has the necessary permissions 220 to retrieve the additional annotations collection (K, J) 200'. In step 760, the browser 20 displays the retrieved additional annotations collection 200' overlaid on the accessed webpage 25. In this example, the accessed webpage 25 showcases at least two annotations collections 200 and 200' on the webpage 25 about Jupiter, one annotations collection (I, J) 200 authored by Igor (the current viewer) and another annotations collection (K, J) 200' authored by Kate (a member of Igor's study group).
[0045] In another implementation, the retrieved additional annotations collection 200' may be saved as annotations collection 200 associated with the accessed webpage 25. Thus, the webpage content annotation system 10 provides a convenient environment to share information collaboratively. [0046] In yet another implementation, the retrieving 750 of other available annotations collection may be occurring locally. In this implementation, the available annotations collection 200' may be stored on a flash memory (or memory stick) part of (or attached to) the computerized electronic device 15 used by the user of the webpage content annotation system 10.
[0047] Referring to the exemplary method illustrated schematically in FIG. 8, the step of accessing 320 the webpage 25 causes the annotation plug-in 30 to verify 810 with the annotation information repository 50 if a default annotations collection exists for the accessed webpage 25. Such default annotations collection for the accessed webpage 25 may contain only annotations authored by the user accessing the webpage 25. The annotations contained in the default annotations collections were authored by the user during a previous visit of the currently accessed webpage 25. If the annotations information repository 50 does not store such default annotations collection, the browser 20 displays 820 the accessed webpage 25 as is. If a default annotations collection is stored by the annotations information repository 50, then the annotation plug-in 30 retrieves and displays 320 the default annotations collections.
[0048] In the exemplary method presented schematically in FIG. 8, the method continues by informing 730 the viewer of other available annotations collections 200', followed by the user requesting 740 other available annotations collections 200'. As part of the conditional step 830, the annotation plug-in 30 verifies if Igor, the current viewer of the webpage 25 about Jupiter, has permission to view the available annotations collection (K, J) 200', associated with the webpage 25 about Jupiter and authored by Kate. The permissions information 220 is part of the verification record 210 of an annotations collection 200', as shown in FIG. 2. Furthermore, permission to retrieve an annotations collection 200 is verified via the authentication module 70 of the annotation information repository 50. In another implementation, the permission information may also be part of the supplementary information. For example, some of the supplementary notes can be shared/exported between users, but some other supplementary notes are restricted for the author's use.
[0049] Returning to the exemplary method of FIG. 8, if, for example, user "I" does not have permission to retrieve the available annotations collection (K, J) 200', the annotation plug-in 30 displays 820 an alert message for user "I" on the client computerized electronic device 15. If on the other hand, user "I" has permission to retrieve the available annotations collection (K, J) 200', the annotation plug-in 30 retrieves the available annotations collection 200', in step 750. Then, the browser displays the retrieved additional annotations collection 200' overlaid on the accessed webpage 25. In this example, the accessed webpage 25 includes for viewing at least two annotations collections 200 and 200' on the webpage 25 about Jupiter, one annotations collection (I, J) 200 authored by Igor (the current viewer) and another annotations collection (K, J) 200' authored by Kate (a member of Igor's study group).
[0050] In one aspect, the annotations collections 200 can be shared via permissions. An annotations author, for example Kate, may choose to share her annotation collections at the granularity of webpage by using a web-based sharing tool to grant or revoke permissions on accessing the annotation collection. Another user may want to retrieve an annotations collections 200 shared by Kate. For example, while viewing a webpage 25 at address "J", Igor may request the annotation information repository 50 for access to the annotations collection 200 authored by Kate in association with webpage address "J". In one implementation the request is an HTTP request. Furthermore, both Igor and Kate may be, for example, Google toolbar users or GMAIL users. In this example the annotation information repository 50 would be Google. In another example, Igor may be a GMAIL user and Kate a Yahoo! Email user. Then Igor requests for Kate's annotations collection 200 from Yahoo!, because in this example Yahoo! would be the annotation information repository 50. Typically both annotation information repositories 50, Yahoo! and Google would have a common interface 35 for requesting annotations collections 200. The common interface for requesting 35 annotations collections 200 may be implemented using SOAP.
[0051] In another exemplary implementation, the annotations collections may also be shared via exporting and importing on the local computerized electronic device or on a local network folder. For example, Kate exports and saves her annotations collections to a file. The file is then transferred to Igor via email or some other type of file transfer, for example, by using memory stick, a local network folder, etc. In this implementation, there may be no permissions record associated with the content of this file. In this case the annotations collection saved on the local electronic device may contain no permissions information record. Once Igor retrieves the file onto his computerized electronic device, the annotations plug-in added on to Igor's browser can import the annotation collection. [0052] The use of the exemplary method described above is illustrated schematically in FIG. 9. For example, one user, Igor, accesses a webpage 25 about Jupiter, the webpage 25 about Jupiter being controlled by a third part (NASA) 45. The webpage content annotation system 10 used by Igor allows him to display annotations overlaid on the accessed webpage 25. The annotations were authored in part by Igor during a previous visit of the accessed webpage, and in part by a member of his study group, Kate. The annotations are stored by an annotation information repository 50 and retrieved via an annotation plug-in 30. The annotations are stored in a database 60 on the annotation information repository 50 server, and are recorded as annotations collections 200 and 200' associated with the webpage address J, for example
Figure imgf000015_0001
and also associated with an annotation author, Igor or Kate, respectively.
[0053] In another implementation of the webpage content annotation system 10, the annotations collections associated with a webpage address may be accessed by the annotations collection author. For example, Igor's highlights at the webpage address J, are only accessible by Igor. Kate's highlights at that same webpage address J are only accessible by Kate. Therefore, a webpage 25 under control of a third party 45 becomes a virtual book for users who have access to the web, with the added benefit that the virtual book can accommodate personalized highlights and annotations.
[0054] In the exemplary method described in FIG. 10, the step 1010 of retrieving an annotations collection 200 for overlay with the accessed webpage 25 is followed by a verification step 1020. The webpage 25 is inherently controlled by a third party 45, therefore the accessed webpage 25 may have been modified since the time the user had accessed it. The annotations collection 200 includes a text-mapping record 230 to verify possible changes in the webpage content. In step 1020, the annotation plug-in 30 compares each pair of starting and ending text offsets and hash masks of the highlighted text 27 included in the retrieved annotations collections 200, with the corresponding text properties of the accessed webpage 25. If there is a match between the text-mapping record 230 of the annotations collection 200 and the currently accessed webpage 25, then the annotation plug-in 30 modifies the accessed webpage 25, which is now loaded in the memory of the computerized electronic device 15, the modification includes adding appropriate HTML tags to display the highlighted text 27, as shown in step 320. [0055] If during the matching step 1030, the annotation plug-in 30 determines that the hash codes do not match, the web-contented annotation plug-in 30 displays 1040 an alert to inform the user of the un-matching, or partially un-matching highlighted text 27. The annotations collection 200, or part of it, is rendered orphan. In one implementation, the annotation plug-in 30 discards 1050 from the annotation information repository database 60 the annotations collection 200 associated with the missing highlighted text. In another implementation, the annotation plug-in 30 updates the annotations collection 200 to account for the missing highlighted text. Specifically, the annotation plug-in 30 offers the annotation author the choice to retain the supplementary information 235 associated with the "orphan" annotation. The retained supplementary information 235 is saved as part of the annotation collection 200 and may be associated with another highlighted text block 27 or with a newly created annotation.
[0056] FIG. 11 is a schematic diagram of a computer system 1100 representing the computerized electronic device 15. Also the computer system 1100 can represent a server at the annotation information repository 50. The system 1100 can be used for the operations described in association with any of the computer-implement methods described previously, according to one implementation. The system 1100 is intended to include various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The system 1100 can also include mobile devices, such as personal digital assistants, cellular telephones, smartphones, and other similar computing devices. Additionally the system can include portable storage media, such as, Universal Serial Bus (USB) flash drives. For example, the USB flash drives may store operating systems and other applications. The USB flash drives can include input/output components, such as a wireless transmitter or USB connector that may be inserted into a USB port of another computing device.
[0057] The system 1100 includes a processor 1110, a memory 1120, a storage device 1130, and an input/output device 1140. Each of the components 1110, 1120, 1130, and 1140 are interconnected using a system bus 1150. The processor 1110 is capable of processing instructions for execution within the system 1100. In one implementation, the processor 1110 is a single-threaded processor. In another implementation, the processor 1110 is a multithreaded processor. The processor 1110 is capable of processing instructions stored in the memory 1120 or on the storage device 1130 to display graphical information for a user interface on the input/output device 1140.
[0058] The memory 1120 stores information within the system 1100. In one implementation, the memory 1120 is a computer-readable medium. In one implementation, the memory 1120 is a volatile memory unit. In another implementation, the memory 1120 is a non- volatile memory unit.
[0059] The storage device 1130 is capable of providing mass storage for the system 1100. In one implementation, the storage device 1130 is a computer-readable medium. In various different implementations, the storage device 1130 may be a floppy disk device, a hard disk device, an optical disk device, or a tape device.
[0060] The input/output device 1140 provides input/output operations for the system 1100. In one implementation, the input/output device 1140 includes a keyboard and/or pointing device. In another implementation, the input/output device 1140 includes a display unit for displaying graphical user interfaces.
[0061] The features described can be implemented in digital electronic circuitry, or in computer hardware, firmware, software, or in combinations of them. The apparatus can be implemented in a computer program product tangibly embodied in an information carrier, e.g., in a machine-readable storage device or in a propagated signal, for execution by a programmable processor; and method steps can be performed by a programmable processor executing a program of instructions to perform functions of the described implementations by operating on input data and generating output. The described features can be implemented advantageously in one or more computer programs that are executable on a programmable system including at least one programmable processor coupled to receive data and instructions from, and to transmit data and instructions to, a data storage system, at least one input device, and at least one output device. A computer program is a set of instructions that can be used, directly or indirectly, in a computer to perform a certain activity or bring about a certain result. A computer program can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand-alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. [0062] Suitable processors for the execution of a program of instructions include, by way of example, both general and special purpose microprocessors, and the sole processor or one of multiple processors of any kind of computer. Generally, a processor will receive instructions and data from a read-only memory or a random access memory or both. The essential elements of a computer are a processor for executing instructions and one or more memories for storing instructions and data. Generally, a computer will also include, or be operatively coupled to communicate with, one or more mass storage devices for storing data files; such devices include magnetic disks, such as internal hard disks and removable disks; magnetooptical disks; and optical disks. Storage devices suitable for tangibly embodying computer program instructions and data include all forms of nonvolatile memory, including by way of example semiconductor memory devices, such as EPROM, EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable disks; magneto-optical disks; and CD-ROM and DVD- ROM disks. The processor and the memory can be supplemented by, or incorporated in, ASICs (application- specific integrated circuits).
[0063] To provide for interaction with a user, the features can be implemented on a computer having a display device such as a CRT (cathode ray tube) or LCD (liquid crystal display) monitor for displaying information to the user and a keyboard and a pointing device such as a mouse or a trackball by which the user can provide input to the computer.
[0064] The features can be implemented in a computer system that includes a back-end component, such as a data server, or that includes a middleware component, such as an application server or a web server, or that includes a front-end component, such as a client computer having a graphical user interface or an Internet browser, or any combination of them. The components of the system can be connected by any form or medium of digital data communication such as a communication network. Examples of communication networks include a local area network ("LAN"), a wide area network ("WAN"), peer-to-peer networks (having ad-hoc or static members), grid computing infrastructures, and the Internet.
[0065] The computer system can include clients and servers. A client and server are generally remote from each other and typically interact through a network, such as the described one. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
[0066] A number of implementations of a webpage content annotation system have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the subject matter of this specification. For example, the webpage content annotation system may be integrated with a targeted advertising system. Ads can be shown/displayed to specifically match the annotated/highlighted text of a viewer's annotations collections stored at an annotation information repository. For example, a user who reads about the history of shoes and highlights information on shoes, may indicate affinity to seeing ads about shoes on sale, etc.
[0067] In another implementation, the annotation information system may be used on internets or extranets for collaborative work to improve work/productivity: For example, employees of a company may use the webpage content annotation system and methods for reading wiki pages or online user guides, making observations upon the content of the online user guides and sharing their observations collaboratively.
[0068] In the academic world, students may be able to carry computerized electronic devices instead of backpacks loaded with books. By using such web-enabled reading devices students can read online textbooks, thus having access to their own textbook to highlight and annotate as they wish. For example, instead of printed US history textbooks, there may be a web site with the US history textbook. The students can read their "own copies" of the textbook, and they can annotate as they please. Furthermore, if they choose to, the students may only see their own annotations.
[0069] Accordingly, other embodiments are within the scope of the following claims.

Claims

WHAT IS CLAIMED IS:
1. A computer-implemented method for annotating webpage content, the method comprising: accessing a webpage in a browser, the webpage under control of a third party; retrieving a collection of annotations stored at a storage location, the annotations collection associated with the webpage and an annotations author; and displaying the webpage with the retrieved annotations collection overlaid on the accessed webpage.
2. The method of claim 1 , wherein the retrieving further comprises: if the accessed webpage was modified by the third party, displaying an alert; and removing from the annotations collection an annotation associated with the webpage modification.
3. The method of claim 1 further comprising: hovering a cursor over a highlighted text of the annotations collection; displaying supplementary information associated with the highlighted text, including an address of a cross-referenced webpage; and accessing the cross-referenced webpage.
4. The method of claim 1 further comprising: creating at least one annotation on the webpage; and saving the at least one annotation within the annotations collection.
5. The method of claim 4, wherein the creating of at least one annotations further comprising: highlighting a text block of interest; recording verification information associated with the text block, including text offsets and a hash mask; and inputting supplementary information associated with the highlighted text, the supplementary information including notes and addresses of cross-referenced webpages.
6. The method of claim 1 further comprising: modifying at least one annotation from the annotations collection overlaid on the webpage, wherein the modifying includes removing, editing and updating; and updating the annotations collection to account for the at least one modified annotation.
7. The method of claim 1 further comprising: informing a viewer of additional annotations collections being available at the storage location; requesting the additional annotations collections; retrieving the additional annotations collections; and displaying the additional annotations collections alongside the initially retrieved annotations collection overlaid on the webpage.
8. The method of claim 7 wherein the retrieving the additional annotations collections further comprises: displaying an alert if a permission to access the additional annotations collections is denied, the permission being included in the annotations collection.
9. A system for providing annotation information associated with web content, the system comprising: a computerized device including a browser configured to access a webpage under control of a third party; an annotation information repository configured to store collections of annotations associated with the webpage accessed by the browser; and a browser plug-in configured to display a stored annotations collection overlaid on the accessed webpage.
10. The system of claim 9 wherein the annotation information repository comprises: a database configured to store annotations collections, the storing based on a set of identifiers including a webpage address and an annotation author; and an authentication module configured to verify permissions to the annotations collections.
11. The system of claim 9 wherein the annotations collection comprising: verification records including permissions and text mapping information; and supplementary records including notes and addresses of cross-referenced webpages.
12. The system of claim 11 wherein the text mapping information includes text offsets and a hash mask.
13. The system of claims 9 wherein the browser plug-in is configured to update the annotations collection upon creation or removal of an annotation on the accessed webpage.
14. The system of claims 9 wherein the browser plug-in is configured to display multiple annotations collections overlaid on the accessed webpage, the multiple annotations collections created by more than one annotations author.
15. A system for annotating webpage content, the system comprising: a computerized device including a browser configured to access a webpage under control of a third party; a collection of one or additional annotations made by an annotation author other than the third party associated with the webpage, the annotations collection stored at a storage location; and a browser plug-in configured to display the webpage with the annotations collection overlaid on the webpage.
16. The system of claim 15 wherein the annotations collection comprising: verification records including permissions and text mapping information; and supplementary records including notes and addresses of cross-referenced webpages.
17. The system of claim 15 wherein the text mapping information includes text offsets and a hash mask.
18. The system of claims 15 wherein the browser plug-in is configured to update the annotations collection upon creation or removal of an annotation on the accessed webpage.
19. The system of claims 15 wherein the browser plug-in is configured to display multiple annotations collections overlaid on the accessed webpage, the multiple annotations collections created by more than one annotations author.
PCT/US2009/048944 2008-06-27 2009-06-26 Annotating webpage content WO2009158669A2 (en)

Priority Applications (6)

Application Number Priority Date Filing Date Title
BRPI0914770A BRPI0914770A2 (en) 2008-06-27 2009-06-26 webpage content annotation
CA2729142A CA2729142C (en) 2008-06-27 2009-06-26 Annotating webpage content
JP2011516734A JP5931440B2 (en) 2008-06-27 2009-06-26 Web page content annotation
AU2009261977A AU2009261977B2 (en) 2008-06-27 2009-06-26 Annotating webpage content
CN200980132813.3A CN102132273B (en) 2008-06-27 2009-06-26 Annotating webpage content
EP09771200.4A EP2316081B1 (en) 2008-06-27 2009-06-26 Annotating webpage content

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US12/163,778 2008-06-27
US12/163,778 US8190990B2 (en) 2008-06-27 2008-06-27 Annotating webpage content

Publications (2)

Publication Number Publication Date
WO2009158669A2 true WO2009158669A2 (en) 2009-12-30
WO2009158669A3 WO2009158669A3 (en) 2010-04-08

Family

ID=41445366

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2009/048944 WO2009158669A2 (en) 2008-06-27 2009-06-26 Annotating webpage content

Country Status (9)

Country Link
US (1) US8190990B2 (en)
EP (1) EP2316081B1 (en)
JP (2) JP5931440B2 (en)
KR (1) KR101614178B1 (en)
CN (1) CN102132273B (en)
AU (1) AU2009261977B2 (en)
BR (1) BRPI0914770A2 (en)
CA (1) CA2729142C (en)
WO (1) WO2009158669A2 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014056599A1 (en) * 2012-10-12 2014-04-17 Redpeppix. Gmbh & Co. Kg Tagging system and method for providing a communication platform in a network
US11386184B2 (en) 2013-09-06 2022-07-12 Mars Flag Corporation Information processing apparatus, search server, web server, and non-transitory computer readable storage medium

Families Citing this family (55)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11429685B2 (en) 2006-06-22 2022-08-30 Rohit Chandra Sharing only a part of a web page—the part selected by a user
US11301532B2 (en) 2006-06-22 2022-04-12 Rohit Chandra Searching for user selected portions of content
US11288686B2 (en) * 2006-06-22 2022-03-29 Rohit Chandra Identifying micro users interests: at a finer level of granularity
US11853374B2 (en) 2006-06-22 2023-12-26 Rohit Chandra Directly, automatically embedding a content portion
US20090265607A1 (en) * 2008-04-17 2009-10-22 Razoss Ltd. Method, system and computer readable product for management, personalization and sharing of web content
US20100325557A1 (en) * 2009-06-17 2010-12-23 Agostino Sibillo Annotation of aggregated content, systems and methods
US8799754B2 (en) * 2009-12-07 2014-08-05 At&T Intellectual Property I, L.P. Verification of data stream computations using third-party-supplied annotations
US9032289B1 (en) * 2010-03-26 2015-05-12 Google Inc. Providing suggestions to users to write comments
BR122013025248A2 (en) * 2010-08-04 2019-08-06 Copia Interactive, Llc METHOD OF ADDING NOTES TO DIGITAL CONTENT AND LEGAL READING BY NON-TRANSITIONAL COMPUTER CONTAINING SOFTWARE FOR IMPLEMENTATION
AU2013204046B2 (en) * 2010-08-04 2016-07-21 Copia Interactive, Llc System for and method of collaborative annotation of digital content
US9251267B2 (en) * 2010-09-30 2016-02-02 Google Inc. Generating behavior information for a link
US8732181B2 (en) 2010-11-04 2014-05-20 Litera Technology Llc Systems and methods for the comparison of annotations within files
US20120116871A1 (en) * 2010-11-05 2012-05-10 Google Inc. Social overlays on ads
AU2011352131A1 (en) 2010-12-28 2013-07-11 Google Inc. Targeting based on social updates
US9977800B2 (en) * 2011-03-14 2018-05-22 Newsplug, Inc. Systems and methods for enabling a user to operate on displayed web content via a web browser plug-in
CN102682025B (en) * 2011-03-16 2014-03-19 中兴通讯股份有限公司 Device and method for achieving adding and displaying of web image comments
US20130031455A1 (en) * 2011-07-28 2013-01-31 Peter Griffiths System for Linking to Documents with Associated Annotations
US8909708B2 (en) * 2011-09-08 2014-12-09 Google Inc. System and method for confirming authorship of documents
US9483454B2 (en) 2011-10-07 2016-11-01 D2L Corporation Systems and methods for context specific annotation of electronic files
US20130097522A1 (en) * 2011-10-15 2013-04-18 Derek A. Devries Method and system of generating composite web page elements with an annotating proxy server
US8977952B2 (en) * 2011-11-07 2015-03-10 Thomson Reuters Global Resources Electronic book version and annotation maintenance
CN103136206B (en) * 2011-11-23 2016-07-06 阿里巴巴集团控股有限公司 Information displaying method and equipment
KR101984823B1 (en) * 2012-04-26 2019-05-31 삼성전자주식회사 Method and Device for annotating a web page
US8522130B1 (en) * 2012-07-12 2013-08-27 Chegg, Inc. Creating notes in a multilayered HTML document
US9160605B1 (en) 2012-09-20 2015-10-13 Amazon Technologies, Inc. Distributing data to groups of user devices
KR101527039B1 (en) * 2012-11-01 2015-06-08 엘지전자 주식회사 Mobile terminal and controlling method thereof
US10033537B2 (en) * 2012-11-30 2018-07-24 Stephen B. Heppe Promoting learned discourse in online media with consideration of sources and provenance
US20140215050A1 (en) * 2013-01-29 2014-07-31 Array Networks, Inc. Method and system for web analytics using a proxy
JP5920247B2 (en) * 2013-03-01 2016-05-18 富士ゼロックス株式会社 Image processing system, registration device, image processing device, and program
CN104516919B (en) * 2013-09-30 2018-01-30 北大方正集团有限公司 One kind quotes annotation process method and system
US9854026B1 (en) 2014-06-30 2017-12-26 Emc Corporation Service to invoke companion applications
CN104657451B (en) * 2015-02-05 2019-01-15 百度在线网络技术(北京)有限公司 The processing method and processing device of the page
US10642929B2 (en) * 2015-04-30 2020-05-05 Rakuten, Inc. Information display device, information display method and information display program
US9971753B2 (en) * 2015-06-04 2018-05-15 University Of Central Florida Research Foundation, Inc. Computer system providing collaborative learning features and related methods
US11308264B2 (en) * 2015-07-07 2022-04-19 International Business Machines Corporation Managing document annotations in a publish/subscribe system
US10447626B2 (en) 2015-07-07 2019-10-15 International Business Machines Corporation Control of messages in publish/subscribe system
US9916296B2 (en) 2015-09-24 2018-03-13 International Business Machines Corporation Expanding entity and relationship patterns to a collection of document annotators using run traces
CN106598557A (en) * 2015-10-15 2017-04-26 中兴通讯股份有限公司 Information processing method and apparatus
KR20180072661A (en) * 2015-11-17 2018-06-29 점보 테크놀로지 컴퍼니 리미티드 Web page annotation and community sharing system
CN108369404B (en) 2015-12-10 2019-05-17 西门子股份公司 The distributed embedded data and Knowledge Management System of integrated PLC historical record
US10546029B2 (en) * 2016-01-13 2020-01-28 Derek A. Devries Method and system of recursive search process of selectable web-page elements of composite web page elements with an annotating proxy server
US10102194B2 (en) * 2016-12-14 2018-10-16 Microsoft Technology Licensing, Llc Shared knowledge about contents
JP6639438B2 (en) * 2017-04-20 2020-02-05 キヤノン株式会社 Information processing apparatus, information processing method, and program
KR101865906B1 (en) * 2017-06-07 2018-06-08 (주)오투팜 Method and program for providing updated information
US10417310B2 (en) 2017-06-09 2019-09-17 Microsoft Technology Licensing, Llc Content inker
CN107357781B (en) * 2017-06-29 2020-12-29 上海财兔信息科技有限公司 System and method for identifying relevance between webpage title and text
US10360257B2 (en) * 2017-08-08 2019-07-23 TuSimple System and method for image annotation
US10613844B2 (en) * 2017-11-10 2020-04-07 International Business Machines Corporation Using comments of a program to provide optimizations
KR102046922B1 (en) 2018-02-19 2019-11-20 동서대학교 산학협력단 System for Changing Webpage adding Highlight and Memo at Contents of Webpage and Method thereof
US11080358B2 (en) * 2019-05-03 2021-08-03 Microsoft Technology Licensing, Llc Collaboration and sharing of curated web data from an integrated browser experience
CN111027281B (en) * 2019-11-20 2023-06-06 北京字节跳动网络技术有限公司 Word segmentation method, device, equipment and storage medium
CN111541949B (en) * 2020-04-30 2021-09-14 上海哔哩哔哩科技有限公司 Interaction method and system for barrage colored eggs
CN111597467A (en) * 2020-05-08 2020-08-28 北京字节跳动网络技术有限公司 Display method and device and electronic equipment
US11270062B2 (en) 2020-07-20 2022-03-08 Labelbox, Inc. System and method for automated content annotation workflow
US11361152B2 (en) * 2020-07-20 2022-06-14 Labelbox, Inc. System and method for automated content labeling

Family Cites Families (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0359759A (en) * 1989-07-28 1991-03-14 Casio Comput Co Ltd Document data processor
US6877137B1 (en) * 1998-04-09 2005-04-05 Rose Blush Software Llc System, method and computer program product for mediating notes and note sub-notes linked or otherwise associated with stored or networked web pages
JP3865946B2 (en) * 1998-08-06 2007-01-10 富士通株式会社 CHARACTER MESSAGE COMMUNICATION SYSTEM, CHARACTER MESSAGE COMMUNICATION DEVICE, CHARACTER MESSAGE COMMUNICATION SERVER, COMPUTER-READABLE RECORDING MEDIUM CONTAINING CHARACTER MESSAGE COMMUNICATION PROGRAM, COMPUTER-READABLE RECORDING MEDIUM RECORDING CHARACTER MESSAGE COMMUNICATION MANAGEMENT PROGRAM Message communication management method
JP2000105731A (en) * 1998-09-29 2000-04-11 Fuji Xerox Co Ltd Joint work supporting device
US6297819B1 (en) * 1998-11-16 2001-10-02 Essential Surfing Gear, Inc. Parallel web sites
US6687877B1 (en) * 1999-02-17 2004-02-03 Siemens Corp. Research Inc. Web-based call center system with web document annotation
US6516340B2 (en) * 1999-07-08 2003-02-04 Central Coast Patent Agency, Inc. Method and apparatus for creating and executing internet based lectures using public domain web page
JP2002015005A (en) * 2000-06-29 2002-01-18 Fujitsu Ltd Method and system for providing information
US6721921B1 (en) * 2000-07-26 2004-04-13 Itm Associates Method and system for annotating documents using an independent annotation repository
US20020152064A1 (en) * 2001-04-12 2002-10-17 International Business Machines Corporation Method, apparatus, and program for annotating documents to expand terms in a talking browser
TW556103B (en) * 2001-05-04 2003-10-01 Ibm Web page annotation systems
JP3771831B2 (en) * 2001-11-01 2006-04-26 インターナショナル・ビジネス・マシーンズ・コーポレーション Computer system and program for sharing annotation information added to digital content
JP3858686B2 (en) * 2001-12-19 2006-12-20 株式会社日立製作所 Learning support program, learning support method, and learning support system
US7305557B2 (en) * 2003-06-20 2007-12-04 International Business Machines Corporation Management and recovery of data object annotations using digital fingerprinting
JP2005063118A (en) * 2003-08-11 2005-03-10 Hitachi Koukiyou Syst Eng Kk Method for providing service of convenience adding function of web page
US8195761B2 (en) * 2004-01-13 2012-06-05 International Business Machines Corporation Prefetching web resources based on proxy triggers
US7254593B2 (en) * 2004-01-16 2007-08-07 International Business Machines Corporation System and method for tracking annotations of data sources
US7181681B2 (en) * 2004-01-28 2007-02-20 Microsoft Corporation Realtime web page scrapping and visual representation of associated clickthrough and impression data architecture
JP2005267021A (en) * 2004-03-17 2005-09-29 Ricoh Co Ltd Document preparation method and device, program, storage medium and data structure of document
JP2005275821A (en) * 2004-03-25 2005-10-06 Canon Inc Document management system and document management method
JP4547990B2 (en) * 2004-05-25 2010-09-22 富士ゼロックス株式会社 Information processing apparatus and information processing program
US7299407B2 (en) * 2004-08-24 2007-11-20 International Business Machines Corporation Marking and annotating electronic documents
WO2006026578A2 (en) * 2004-08-27 2006-03-09 Peng Tao Online annotation management system and method
JP4891571B2 (en) * 2005-06-14 2012-03-07 中部電力株式会社 Related information sharing apparatus and related information sharing method
US7945653B2 (en) * 2006-10-11 2011-05-17 Facebook, Inc. Tagging digital media
US20070226058A1 (en) * 2006-03-21 2007-09-27 Myware, Inc. Time based electronic advertisement
JP4876734B2 (en) * 2006-06-22 2012-02-15 富士ゼロックス株式会社 Document use management system and method, document management server and program thereof
JP4616800B2 (en) * 2006-06-26 2011-01-19 日本電信電話株式会社 Information display device, information display method, program implementing the method, and medium storing the program
US20080065649A1 (en) * 2006-09-08 2008-03-13 Barry Smiler Method of associating independently-provided content with webpages
US20100278453A1 (en) * 2006-09-15 2010-11-04 King Martin T Capture and display of annotations in paper and electronic documents
JP5233220B2 (en) * 2006-10-11 2013-07-10 株式会社リコー Page additional information sharing management method
JP4520450B2 (en) * 2006-12-05 2010-08-04 富士通株式会社 Annotation management program, annotation management apparatus, annotation management method, and annotation display program
US7715448B2 (en) * 2007-06-06 2010-05-11 Red Aril, Inc. Network device for embedding data in a data packet sequence
US20080306815A1 (en) * 2007-06-06 2008-12-11 Nebuad, Inc. Method and system for inserting targeted data in available spaces of a webpage
US8219902B2 (en) * 2007-10-26 2012-07-10 T-Mobile Usa, Inc. System and method for remote update of display pages

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of EP2316081A4 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014056599A1 (en) * 2012-10-12 2014-04-17 Redpeppix. Gmbh & Co. Kg Tagging system and method for providing a communication platform in a network
US11386184B2 (en) 2013-09-06 2022-07-12 Mars Flag Corporation Information processing apparatus, search server, web server, and non-transitory computer readable storage medium

Also Published As

Publication number Publication date
CA2729142A1 (en) 2009-12-30
JP2015195046A (en) 2015-11-05
WO2009158669A3 (en) 2010-04-08
US8190990B2 (en) 2012-05-29
CA2729142C (en) 2018-01-16
EP2316081B1 (en) 2014-06-11
EP2316081A4 (en) 2012-12-19
JP6040293B2 (en) 2016-12-07
KR20110027795A (en) 2011-03-16
JP2011526394A (en) 2011-10-06
BRPI0914770A2 (en) 2016-07-26
US20090327855A1 (en) 2009-12-31
CN102132273A (en) 2011-07-20
AU2009261977B2 (en) 2014-01-23
KR101614178B1 (en) 2016-04-29
JP5931440B2 (en) 2016-06-08
AU2009261977A1 (en) 2009-12-30
EP2316081A2 (en) 2011-05-04
CN102132273B (en) 2015-04-08

Similar Documents

Publication Publication Date Title
EP2316081B1 (en) Annotating webpage content
US11308032B2 (en) Suggesting content items to be accessed by a user
US9954964B2 (en) Content suggestion for posting on communication network
JP4782683B2 (en) Personalized searchable library with emphasis capability and access to electronic images of text based on user ownership of corresponding physical text
CN109074383B (en) Document search with visualization within the context of a document
US20100083102A1 (en) Online Content Editing of Dynamic Websites
WO2014172218A1 (en) Method, apparatus and computer readable medium for collaborative authoring with clipping functionality
US8001154B2 (en) Library description of the user interface for federated search results
Saxena et al. Referencing made easy: reference management softwares
Steele Bibliographic citation management software as a tool for building knowledge
US20110087764A1 (en) Engine for generating and managing communications concerning on-line collaboration among a plurality of users in consideration with a computerized application
KR101660640B1 (en) Lab Meeting System and the Method for Sharing Experiment Note
WO2013149110A1 (en) User configurable electronic textbook
JP2007535281A (en) Virtual private network system
Wink Cloud computing
Agrawal et al. Reference management software programs
US9361285B2 (en) Method and apparatus for storing notes while maintaining document context
US9471615B2 (en) Enhancing content mediated engagement
Gorton et al. ETANA-GIS: GIS for archaeological digital libraries
Sandefur Young investigator perspectives. Blogging for electronic record keeping and collaborative research
Westphal Announcing Exciting New Developments at ACMQ
Palmirani et al. A multi-layer digital library for mediaeval legal manuscripts
Ba et al. Linear Panel Data Analysis in Nonprofit Studies: Embracing the Potential of Longitudinal Data
PIE-J Working Group NISO RP-16: PIE-J: The presentation & identification of e-journals
Paolo The web 2.0 revolution in library and information services

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200980132813.3

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09771200

Country of ref document: EP

Kind code of ref document: A2

ENP Entry into the national phase

Ref document number: 2729142

Country of ref document: CA

WWE Wipo information: entry into national phase

Ref document number: 2009261977

Country of ref document: AU

ENP Entry into the national phase

Ref document number: 2011516734

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

ENP Entry into the national phase

Ref document number: 20117001653

Country of ref document: KR

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 2009771200

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2009261977

Country of ref document: AU

Date of ref document: 20090626

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: PI0914770

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20101227