US20110047152A1 - Method and apparatus for facilitating use of hypertext links on the World Wide Web - Google Patents

Method and apparatus for facilitating use of hypertext links on the World Wide Web Download PDF

Info

Publication number
US20110047152A1
US20110047152A1 US12/925,809 US92580910A US2011047152A1 US 20110047152 A1 US20110047152 A1 US 20110047152A1 US 92580910 A US92580910 A US 92580910A US 2011047152 A1 US2011047152 A1 US 2011047152A1
Authority
US
United States
Prior art keywords
link
phrase
context
level
links
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/925,809
Inventor
Julius O. Smith, III
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Intellectual Ventures Assets 186 LLC
Gula Consulting LLC
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to US12/925,809 priority Critical patent/US20110047152A1/en
Publication of US20110047152A1 publication Critical patent/US20110047152A1/en
Assigned to PRESCOTT NETWORKS, LLC reassignment PRESCOTT NETWORKS, LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SMITH, JULIUS O., III
Assigned to INTELLECTUAL VENTURES ASSETS 186 LLC reassignment INTELLECTUAL VENTURES ASSETS 186 LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: GULA CONSULTING LIMITED LIABILITY COMPANY
Assigned to INTELLECTUAL VENTURES ASSETS 191 LLC, INTELLECTUAL VENTURES ASSETS 186 LLC reassignment INTELLECTUAL VENTURES ASSETS 191 LLC SECURITY INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: MIND FUSION, LLC
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/74Browsing; Visualisation therefor
    • G06F16/748Hypervideo
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/955Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
    • G06F16/9558Details of hyperlinks; Management of linked annotations
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • G06F16/986Document structures and storage, e.g. HTML extensions
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/953Organization of data
    • Y10S707/956Hierarchical
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99942Manipulating data structure, e.g. compression, compaction, compilation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99943Generating database or data structure, e.g. via user interface
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99944Object-oriented database structure
    • Y10S707/99945Object-oriented database structure processing

Definitions

  • the present invention relates to facilitating access to information over a computer network such as the Internet. More particularly, the present invention relates to technology for partially automating the linking of documents on the World Wide Web by authors of Web content. Such techniques are particularly useful for more easily creating richly interconnected information on the Web.
  • the World Wide Web provides an enormous distributed database of information interconnected physically by the Internet.
  • One of the main difficulties for users of the Web is finding needed information out of the tremendous quantity of information that is available.
  • Various mechanisms have been developed to address this problem.
  • index website is typically a server computer connected to the World Wide Web which maintains an index of Web content that can be searched in various ways by users (clients) connected to the server over the Internet. Indexes are often updated automatically by means of “spiders” which systematically explore the Web looking for new or updated content. Most search engines also provide means for users to install information to be indexed, so that such information may be indexed immediately without waiting for a spider to find it.
  • An example of a premier search engine is the “Alta Vista” website, accessible on the Web at the Universal Resource Locator (URL) address ⁇ http://www.altavista.com>>>.
  • search results typically contain too much undesired information as well as the desired information. This occurs because the information content of the Web is vast, and because it is difficult for users to construct search parameters in such a way as to pass most desired content while rejecting most undesired content. As a result, users typically must spend a lot of time sifting through search-engine results and/or refining their searches with additional restrictions in the search parameters. Additionally, the information stored in the index is not organized in a form suitable for browsing in a logical order.
  • Another mechanism developed to facilitate access to information on the World Wide Web is the directory website which presents a hierarchical directory of information that can be browsed by the user.
  • Premier sites of this nature include Yahoo ( ⁇ http://www.yahoo.com>>>), Netscape ( ⁇ http://www.netscape.com>>>), and Excite ( ⁇ http://www.excite.com>>>).
  • a visitor to such a site is first presented with a top-level list of topics. Choosing a topic by clicking on a topic's hypertext link with the mouse produces a list of subtopics, and so on, until a final level is reached at which useful information is displayed about the topic, or else a remote website pertaining to that topic is visited.
  • Directory companies such as Yahoo typically have teams of editors who explore the Web looking for content suitable for reference at their site, and these workers perform a function analogous to the automatic “spiders” used by automated index websites.
  • directory websites normally support searching within the directory site, thus producing search results of generally higher quality and less “clutter” than typically encountered on an index site.
  • directory websites typically allow submission of content for reference, subject to editorial consideration.
  • directory websites improve over index websites by providing editorial selection, logical organization, and browsing capability, all of which are absent in typical index websites.
  • a first difficulty, however, with directory websites is that they cannot reasonably keep up with the vastness of the information on the World Wide Web by means of manual editorial selection. As a result, directory websites tend to offer far less information relative to index websites.
  • a second difficulty with directory servers is that their content is proprietary and controlled by a team of editors at one company. This editorial control, while ensuring consistently high quality on the site, makes it difficult and sometimes even infeasible for an information provider to obtain a desired listing in the hierarchical directory.
  • One directory site that addresses this difficulty is the Open Directory project ( ⁇ http://dmoz.org/>>>); The Open Directory allows any user on the Internet to become an “editor” for a particular topic at the site.
  • a third difficulty related to the first is that typical directory sites are extremely broad in scope, contributing to the absence of specialized information that is not of interest to a wide general audience.
  • a difficulty with both index and directory websites is that information is presented without regard to the user's level of education. It is therefore often possible for a high-school senior working on a book report, for example, to encounter information understandable only by a graduate student in a specialized field. There is similarly normally no means for selecting information according to its type or source or other potentially desirable criteria.
  • a user rating system (or “scoring system”) to which any user may contribute.
  • An example of this mechanism is seen in the online book-store website ⁇ http://www.amazon.com/>>>.
  • Amazon allows any user to contribute a “book review” and an overall rating on a five-star scale. The average rating is displayed for each book, and books which match the user's search criteria are displayed sorted according to decreasing score (and possibly other criteria such as the number sold).
  • An interesting feature of the Amazon rating system is that it is democratic, allowing the vast quantity of World Wide Web users to jointly develop a ranking of the information sources (in this case books).
  • a difficulty with rating systems is that they are generally used only at the site where the ratings are collected, and no mechanism is provided for making use of the ratings elsewhere, such as in other documents on the Web linking to the same information.
  • HTML HyperText Markup Language
  • XML XML
  • HTML provides for the specification of hypertext links in Web-page text displayed by the browser.
  • a hypertext link consists of text to be displayed by the browser and a link target which is usually not displayed.
  • HTML code a hypertext link consists of text to be displayed by the browser and a link target which is usually not displayed.
  • a difficulty with hypertext links is that they must be laboriously added by Web content providers. Typical HTML editors merely provide a data-entry form in which the URL for the link target can be typed.
  • a second shortcoming of HTML and Web browsers is that there is no standard mechanism for specifying link properties such as educational level, type of resource, information source, or the like, which could be supported by Web browsers to give the user finer control of link display based on link properties. After the links are typed in, they must be maintained as their URLs change, and as new and better link-targets become available. There is therefore a need for automated assistance with entering, maintaining, and improving hypertext links in documents intended for a hypertext document environment such as the Web.
  • hypertext links also called “hyperlinks,” “links,” or “definitions”
  • the present invention is designed to provide a link installation service which automatically installs hyperlinks within information submitted to the service by hypertext authors.
  • submissions may be in HTML format, plain ASCII format, LaTeX source format, or a variety of additional formats to be added in the future.
  • the output returned to the user may be in either HTML or LaTeX source format (which may be compiled into HTML format). Criteria can optionally be specified which govern the installation of hyperlinks.
  • the invention further provides selectable databases of hyperlinks, organized by category (or “context”), which can be optionally selected for automatic link installation. It is further provided that content developers may add their own links to the existing link databases, and they may additionally create new link databases and specify their relation to the existing link databases. Contributing users are preferably required to have a known, verified email address. A user with a verified email address is called a “known user”.
  • the invention further provides means for browsing the link databases in a logically organized, hierarchical tree structure, wherein higher-level nodes correspond to more general contexts, and lower-level nodes correspond to more specialized contexts.
  • the link databases can additionally be searched for keyword matches within component fields. Users may provide ratings and/or reviews for individual links in the link databases.
  • the hyperlink databases of the present invention support various optional “properties” associated with each hyperlink.
  • One such property, useful in the development of educational content, is a level designation which indicates the educational level required for best understanding of the link-target information. Additional optional properties include the language of the content (such as English), a viewer suitability rating such as exists for movies (PG-13, R, etc.), and properties defined by the user.
  • Link properties can be specified by users to control the automatic installation of links, and/or to control what is displayed while browsing the link databases.
  • Another feature of the present invention is the ability for users to rate (or score) the quality of any link in the database and/or to submit a written review of any link.
  • the quality ratings may be averaged together and used to determine the relative ordering of the links when there are multiple link targets for the same word or phrase (“competing definitions”).
  • “competing definitions” In the typical case of HTML format, features of the JavaScript scripting language may be used to provide convenient access to multiple link targets, ranked according to score.
  • the latest ranked list of competing definitions may be maintained on a central server on the Web, with the installed link pointing there, instead of containing only a snapshot at the time of link installation, which may rapidly go out of date.
  • the currently highest rated link may be installed in the user's Web document for each recognized topic.
  • FIG. 1 shows an example initial Web page seen by a visitor using a Web browser to access the online version of the service.
  • FIG. 2 shows a Web page giving an overview of the capabilities of the online service.
  • FIG. 3 shows an example Web top-level page seen while browsing the hyperlink databases.
  • FIG. 4 shows an example lower-level page seen while browsing the hyperlink databases, in which the context has been narrowed considerably.
  • FIG. 5 shows an example browsing view at the level of a key phrase in which all displayed links are interpreted as “definitions” for the key phrase.
  • FIG. 6 shows an example form for adding a new link (definition) to the link database for the current key phrase.
  • FIG. 7 shows a Web page for submitting text for link installation.
  • FIG. 8 depicts the tree structure of the hierarchical link database.
  • FIG. 1 illustrates how a World Wide Web “home page” might appear on a website embodying the principles of the present invention.
  • the user has several choices of where to “navigate” next:
  • the first choice 101 is a hypertext link entitled “Learn about the W3K,” where in this example, “W3K” is an acronym standing for the “World Wide Web of Knowledge.” If this choice is selected by clicking the mouse on the underlined text, the visitor “navigates” to the Web page shown in FIG. 2 which provides an overview of the online service provided by the W3K. In particular, it is explained how submitting plain text 110 to the W3K server will result in hyperlinked text 111 being returned to the user. A summary 115 of high-level functions is also provided in FIG. 2 .
  • the second choice in FIG. 1 is a hypertext link 102 entitled “Browse the W3K.” If this choice is selected by clicking the mouse on the underlined text, the visitor “navigates” to the Web page shown in FIG. 3 supporting browsing of the hyperlink databases, as described further below.
  • the third choice in FIG. 1 is a hypertext link 103 in which the text displayed by the Web browser is “Install W3K links in a Web document.” If this choice is selected, the visitor is taken to the Web page of FIG. 7 where the user can submit text for link installation in a variety of formats. The text is returned to the user by the server with hypertext links installed according to the user's specifications. Link databases to be searched can be collected into a list during the browsing operation.
  • the link databases are organized hierarchically according to category, somewhat like the Dewey decimal system for library organization. Each category (or “directory”) is interpreted as a context analogous to a field of study. Each context may itself contain any number of contexts (“subcontexts,” or “subdirectories”), and it may additionally contain a database of information pertaining to that context (which may be a implemented in a file in that directory).
  • Every context may be identified by the directory path that reaches it from the top level.
  • the set of all contexts form a “tree structure” analogous to the hierarchical file systems used by all major computer operating systems at the present time.
  • a link database preferably comprises a list of (key, URL) pairs.
  • a key (or “key phrase” or sometimes “word”) identifies a topic or concept, and the URL points to information about that topic on the Internet.
  • the key is the “word being looked up”, and the URL points to its “definition”.
  • the (key, URL) pairs in the link database are interpreted within the particular context associated with the directory containing that dictionary.
  • the key corresponds to the noun phrase identifying a technical topic for which an article exists in the encyclopedia
  • the URL may correspond to the page number on which the article begins
  • the context may correspond to the technical field for which the encyclopedia was written.
  • dictionaries are interpreted in a particular context, alternate definitions are not allowed.
  • a context is preferably sufficiently narrow such that all terms (words or key phrases) in that context have a unique meaning.
  • Ordinary “flat” dictionaries must accommodate alternate definitions for a single word, while “hierarchical dictionaries” need not.
  • a single URL can provide only one “definition”. However, a single URL can be used to “define” any number of key phrases, which are then regarded as synonyms. Often the title of the addressed HTML page on the Web is the “key phrase” that is “defined” by the URL. When there are several (key, URL) pairs having the same URL, the different keys are treated as alternate phrasings for the same concept or topic, and are said to form a synonym group.
  • the following example dictionary entries provide an example of a synonym group:
  • Order is important in the dictionary because “the first match wins” during automatic link installation. For example, with the above ordering, the key phrase “Taylor Series Expansion” will match before checking for “Taylor Expansion” or “Taylor Series”. Ordering equivalent key phrases from longest to shortest ensures that the longest possible match will occur in documents submitted for link installation.
  • Synonyms can be listed in a link's properties, or they can simply be entered as additional link entries pointing to the same link target (URL), since links take up relatively little space.
  • a “key phrase” may be understood as a bottom-level subdirectory of the context tree.
  • a key-phrase directory holds a dictionary (link database) containing at least one link. This database may be implemented as a file residing in a directory having a name derived from the key phrase. Preferably, however, all key phrases in a particular context (together with their links), plus perhaps additional contexts, are implemented in a single larger database file in the parent context directory. For simplicity, however, a key phrase will nevertheless be considered logically to be a bottom-level directory (leaf node) in the hierarchical context directory, irrespective of implementation details associated with the use of a hierarchical file system.
  • All of the links in the key-phrase directory are interpreted as competing sources of information on the one topic identified by the key phrase.
  • the tangible difference between a key-phrase directory and a context directory is that the key-phrase directory has no subcontexts, only links.
  • a bottom-level directory in the context tree hierarchy corresponds to a single concept or topic, and all of the (key, URL) pairs in its dictionary pertain to that one topic.
  • the number of distinct URLs present is the number of competing sources of information.
  • the context-tree implementation must provide a means for ordering sub-directories, at least when those sub-directories correspond to key phrases.
  • an ordered link database file may reside in the context directory containing the key phrase, and include all other key phrases in that context as well (including synonyms); the key phrase(s) corresponding to each link may be stored as link properties.
  • synonyms are not handled as separate key-phrase directories. Instead, a single representative is selected (usually the most descriptive or canonical), and all other equivalent phrasings (synonyms) are listed in a separate synonym file in the key-phrase directory. (Order is carefully preserved.) During browsing, synonyms are displayed at the bottom of the key-phrase page.
  • Context synonyms are presently implemented using symbolic links in a UNIX file system implementation of the context tree.
  • the context hierarchy below illustrates two different paths to the subcontext (directory) “Sound_Synthesis”, where the notation “->” indicates a symbolic link, as is typically done when listing files in a UNIX file system:
  • the “true parent” of the node sound_synthesis is Computer_Music
  • the parent Signal_Processing is a “linked parent”.
  • Symbolic links provide a means for reaching multidisciplinary fields by browsing the constituent fields in a top-down way.
  • a symbolic link may be deleted and replaced with a copy of some or all of the directory which was formerly linked (possibly utilizing symbolic links at a lower level).
  • closely related contexts may start out as identical, but later may evolve into separate collections, as the maintainers see fit.
  • the dictionary corresponding to a particular context is defined as the union of all key-phrase dictionaries in that context.
  • the dictionaries belonging to all key-phrase subcategories in the particular context are logically concatenated together into one large dictionary, with additional dictionary entries added for all synonyms.
  • This dictionary is referred to as the “context dictionary”. The order in which the key phrases are concatenated is prescribed.
  • a typical aggregate dictionary consists of the context dictionary for the “current context” (established, e.g., by browsing), followed by the context-dictionaries of all subcontexts (usually not in any particular order, unless explicitly listed by the user), then followed by the context dictionary of the “true parent” context, followed by the context dictionary of the true parent's true parent, and so on, until the context dictionary of the top-level directory is appended (which contains extremely generic terms).
  • Linked parent dictionaries may also be added in where desired.
  • Link database browsing support on the server may be implemented in a variety of ways.
  • scripts which implement directory websites, such as the links-2.0 scripts from Gossamer Threads, Inc., and such scripts can be adapted to implement the hierarchical dictionary of the present invention.
  • FIGS. 3 and 5 illustrate the appearance of such a browsing system.
  • HTML SELECT pop-up menus which are dynamically generated from the current directory contents.
  • static HTML pages are preferable over dynamic HTML generated by the server, when feasible.
  • the database directory structure may additionally be displayed in a fixed HTML frame on the left, as is currently done on many websites. For example, the way directory trees are displayed on the left in Microsoft Windows Explorer is a good model.
  • FIG. 3 illustrates a Web page display at the start of browsing.
  • the top-level context is displayed.
  • only four top-level subcontexts are offered (Computing 131 , Education 132 , Legal 133 , and Music 134 ).
  • Each of these words is a hypertext link, which can be clicked with the mouse to navigate to the associated sub-context. For example, clicking on Music 134 , then on “Computer Music” (which is available in the Music context), then on “Signal Processing”, then finally on “Sound Synthesis” produces the page shown in FIG. 4 .
  • Hypertext links 130 Near the top of each page during browsing is a set of hypertext links 130 separated by a vertical bar ‘
  • the “W3K Home” link in the standard menu 130 takes the browser back to the initial W3K home page illustrated in FIG. 1 , as does clicking on the W3K logo.
  • the “Browse from Top” link in the standard menu 130 navigates to the top-level browsing page shown in FIG. 3 .
  • the “Select Hierarchy” link navigates to a page where a different context hierarchy can be selected for browsing.
  • There is only one primary public context hierarchy the one reached from the second choice 102 in FIG. 1 ).
  • individual users and groups of users can set up context hierarchies for their own purposes, without having to worry about fitting into the ever-expanding primary public context hierarchy.
  • language localization methods are not available, as preferred, to provide alternate language selection for each Web page in the public hierarchy
  • alternate hierarchies can be used to support alternate languages.
  • Alternate hierarchies can be designated by their creators as public (anyone can add to it), restricted (anyone can read it, but only the owner(s) can write it), or private (only the owner(s) can read or write it).
  • the owners include the creator and members of any groups listed by the creator as being co-owners.
  • the “Add Subcontext” link navigates to the form provided for creating a new subcontext within the current context. Since FIG. 3 is at the top-level context, this operation is only allowed in a public hierarchy for a select group of “trusted” users.
  • the “Add Key-Phrase” link leads to the form for adding a new key phrase in the current context. At least one definition link is required when adding a new key phrase.
  • this operation is restricted to trusted users since any key phrases appearing at the top of the hierarchy are “generic terms” having definitions which are independent of context. Truly global key phrases such as domain names and trademarks are appropriate at the top level.
  • the “Add Definition” link is for adding a new definition for a key phrase. This entails supplying a URL which points to information about the URL and some other information, as will be later described. Thus, the number of distinct URLs in the set of URLs associated with a particular key phrase can be increased from 1 (its usual initial value) to any number by adding more definitions.
  • the “Add Synonym” link in the standard menu 130 allows the addition of a key phrase to a list of “synonyms” for an existing key phrase. A synonym can also be constructed for a context. Synonyms will be described further below.
  • “Submit Dictionary File” provides convenient submission of a large number of links (key phrases and definitions) as well as the ability to specify a context path for each one, as will be described. It is additionally possible to display specified contexts and contexts selected for link installation in the form of a dictionary file. For example, a user can perform a search in order to collect all links contributed by that user, display the results as a dictionary file, save the dictionary file on his or her local computer, perform any desired editing operations, and submit the edited dictionary file back to the server to update his or her links on the server.
  • Modify Additions allows the user to edit (modify or delete) any information he or she submitted to the W3K site.
  • link properties delete a link, delete a context or key-phrase directory wholly owned by the user or user's group, and so on.
  • a user belonging to one or more groups may edit any information submitted by anyone in any of those groups.
  • a set of records to be edited can be created by means of the search facility.
  • a record may hold the information associated with a link, key-phrase directory, or context directory.
  • “Select Context” selects the “dictionary” associated with the current context for inclusion in subsequent “link installation”.
  • the context dictionary normally includes each key phrase in the current context together with at least one definition for each key phrase. It may also include similar information from parent contexts and subcontexts, as will be discussed.
  • the aggregate dictionary used in link installation is like a kind of “shopping cart” that can be filled with component dictionaries found while browsing around the context hierarchy; in this analogy, “items to be purchased” correspond to the dictionaries to be used in link installation.
  • Selection configuration information lower in the hierarchy is not modified when excluding a context, so that re-selecting the context allows the contained selection configuration to become active once again.
  • link-installation FIG. 7
  • Browser “cookies” are very useful for storing the context search preferences for the user across sessions; since many tend to work in one or a few fields, it is often the case that the contexts used for link installation do not change very often.
  • Browser cookies are simply information stored on the user's computer (the client computer) by the server; cookie files are supported by the major Web browsers such as Netscape Navigator and Microsoft Internet Explorer. If cookie files are not available for any reason (they can be disabled by the user), preference information can be stored on the server indexed by the user's email address, which is unique among users.
  • Edit All Selected is similar to “Modify Additions” except that instead of determining the list of database elements to be edited by using a search (or direct specification), it is initialized from the set of selected links owned by the user and/or groups to which the user belongs.
  • Topic Rated creates a list of highest ranked links in the database. These are generally excellent home pages, tutorials and the like on various topics.
  • Random Link takes the user to a randomly chosen definition link.
  • the “context path” 140 ( FIG. 4 ) is displayed just below the standard menu 130 , with each path element separated by a colon ‘:’.
  • the context path is displayed as “Top Music: Computer Music: Signal Processing: Sound Synthesis.” Clicking on the “Lagrange Interpolation” key-phrase 144 in this context takes the browser to the definition page for Lagrange Interpolation shown in FIG. 5 .
  • search form Below the horizontal line in FIG. 3 is a search form. Typing text into the field and clicking on the “Search!” button results in a dynamically generated web page listing all links (in all contexts) matching the search criteria. More refined searches can be carried out by first selecting the “More search options” link. Since links have quite a few properties (to be discussed), searches can be honed rather finely without relying entirely on typical means for selecting a subset of all names and phrases within contexts, key-phrases, and definitions.
  • FIG. 4 displays the contents of the context-path
  • key phrases are browser categories with no subcategories, only links
  • contexts are browser categories containing subcategories (either subcontexts or key phrases).
  • the links under a key-phrase are treated as “competing definitions” for that key phrase.
  • FIG. 8 illustrates the relationships among contexts, key phrases, and definitions.
  • the top level context 180 is the root node of the tree structure defined by the hierarchical link database. There can be any number of subcontexts or key phrases under the top level context 180 . In the example of FIG. 8 , there are two subcontexts, “Intermediate Context 1 ” 181 and “Intermediate Context 2 ” 182 . Since these are context directories, they each may contain any number of subcontexts and/or key phrases. In the present example, there are two key phrases 183 and 184 in the first subcontext 181 and one synonym group 185 (two equivalent key phrases) in the second context 182 . A key phrase must have at least one definition (link) associated with it.
  • “Key Phrase 1 ” 183 contains three competing definitions 186
  • “Key Phrase 2 ” 184 contains four competing definitions 187
  • the synonym group 185 consisting of “Key Phrase 3 ” and “Key Phrase 4 ” contains two definitions 188 to choose from for that synonym group. Since order is important, we may choose a consistent ordering convention for tree diagrams in which the ordering of all subnodes of a node is defined as left to right in a diagram as in FIG. 8 .
  • the “Sound Synthesis” 141 context includes one synonym 143 . This is a context synonym identified by the path
  • FIG. 5 shows a display of two “competing definitions” for the phrase “Lagrange Interpolation” 151 .
  • Either of the two links 152 or 153 may be installed in a document containing the phrase “Lagrange Interpolation”. They are both named “Lagrange_Interpolation” because that happens to be the title of both Web documents. However, the links point to two different targets on the Web written by two different authors.
  • the “popular” superscipt for a link 152 or 153 means it has been receiving relatively frequent visits (or “hits”) via the W3K site.
  • the number of hits displayed in this case is 0.
  • each link 152 or 153 is the date 155 the link was submitted, the number of hits 156 (number of times anyone has clicked on the link at this site), a rating 157 for each link (which is 0 since the links were just added), and the number of votes included in each rating (also 0 at the moment). Available elsewhere on the website also is the number of times a link has been installed in Web documents.
  • the rating display is a hyperlink which a user can select in order to contribute a rating or a review of the link.
  • the “Add a Definition” link 158 navigates to the form shown in FIG. 6 for adding another definition link for Lagrange Interpolation.
  • the current key phrase “Lagrange Interpolation” is filled into the “Topic” field 161 , and the context path leading to the key phrase is filled into the “Context” field 160 . This makes it convenient to enter a new source of information (definition) on a topic (key phrase) while browsing.
  • the “current context” field of the form becomes instead a pop-up HTML “select” list containing all of the contexts presently in the database, making it convenient to quickly select any context in which a new subcontext or key-phrase is to be added.
  • the only required fields on the add-definition form ( FIG. 6 ) are the URL 162 and user's email address 169 . All others are optional.
  • the URL is the new definition, and it is tested by the server to make sure it is responding. If the Site Title field 163 was left blank, the title of the Web page addressed by the URL, which is automatically retrieved by the server (using the Perl LWP module), is filled in automatically as the link title.
  • the contributor's email address is required because all submissions to the server in the preferred embodiment are associated with the contributor's email address.
  • there are alternative means for identifying users known in the art such as a more conventional registration procedure in which the user chooses a login name and password.
  • the preferred embodiment ensures that the email address given really reaches the user. If the user is new, an authorization process, described in ⁇ 5.1.9, is initiated which tests the user's email address.
  • the link contributor is invited to write a short description 164 of the website, specify the minimum 165 and maximum 166 educational level covered at the site (usually done by the author of the site), and specify the type of resource 167 (home page, conference paper, book chapter, or the like). The user may also type in his or her name 168 .
  • the context path leading to a link is also one of its properties, as is its URL, title, description, and so on.
  • a link can have more properties than these, some of which will be described below.
  • the “Specify Additional Properties” link 61 takes the user to a larger form where the additional properties can be specified.
  • the submit button 60 can be pressed to send the form to the link database server (a computer at w3k.org in this case).
  • the server tests the URL by retrieving the first page, checks that the user's email address is known and that the user's IP address and cookie information match information previously stored on the server (otherwise authorization is carried out), checks for duplication of the key phrase and URL in the given context, possibly checks the URL target for “inappropriate content”, assigns an automatic educational level if none was provided (unless automatic level assignment is already scheduled at regular intervals), and adds the new definition to the link database for the current key phrase (and context path, if the database file holds links for multiple key-phrase contexts). If the addition was successful, the user is navigated to a dynamically generated Web page summarizing the information added to the database. If there were any problems, an error page is generated listing the reason(s) for failure to accept the page.
  • dictionary file submission which can be regarded as a script-based replacement of the above browser-based interface.
  • An example of such a dictionary file is given in a later section.
  • the form for submitting such a file may be reached via the “Submit Dictionary File” link in the standard menu, or as an option under the “Add to or Edit the W3K” option on the server home page.
  • known users may optionally create a new top-level context tree which is private to that user or to one or more groups identified by the user. This mode of usage is advantageous for private usage without incurring collisions with links in the main “global” context tree shown in FIG. 3 . It is further the only way a known user can submit large quantities of contexts, key phrases, and links by means of a dictionary file submission, since that operation is not permitted in the global public context hierarchy. Further details will be described.
  • the user submits his or her email address in a simple Web-page form, and the server (1) emails a randomly generated ASCII string to that email address, and (2) navigates the user to a Web page containing a form for receiving that random string from the user.
  • the form also instructs the user to receive the email and to paste the random string into the second authorization form and submit it. This process verifies that the email address in fact reaches the user.
  • the email address and IP address of the user are then saved on the server. Additionally, the same information is written on the user's computer using a browser cookie. If the cookie goes away for any reason, or if the user later comes in from a different IP address for which authorization has never occurred (e.g., due receiving a new dynamically assigned IP address from an ISP, or using for the first time a different home computer connected directly to the Internet), authorization is triggered once again when any editing operation is requested. Users coming in over dynamically assigned IP address generally have to be authorized for each session until all such IP addresses have been seen and logged on the server along with the user's email address.
  • the user may use the “Back” button in his or her Web browser to find the page which triggered the authorization process, and resubmit the form successfully.
  • Both a minimum 165 and maximum level 166 can be set.
  • the min and max can be set to the same value (such as 10), or the max can be left unset (which defaults to level 100 , meaning no maximum).
  • the min and max can be set appropriately to cover the estimated range.
  • the minimum level still sets the official “level” used in automatic level assignment for other documents, but the maximum level, if specified, may affect link installation when a specific level range is specified for that.
  • An educational level is implemented a floating-point number, so that a level of 10.5 can be specified, e.g., in the form 165 or 166 .
  • Resource type 167 Another important link property, also on the main form, is resource type 167 .
  • Resource types include dictionary definition, encyclopedia article, unpublished article, conference paper, talk overheads, refereed journal article, book chapter, book, tutorial, lecture notes, course readers, and the like. Sometimes authors may wish to screen out non-refereed sources such as conference papers or unpublished works.
  • refereed publications and books will typically be hosted on the website of a publisher, requiring some form of payment for access, such as a site subscription or, preferably, a per-page “micropayment” such as the well known Millicent system provides.
  • Additional optional properties may be specified on a second form by selecting the “Specify Additional Properties” link 61 .
  • Additional properties include source type (individual, educational institution, company, non-profit organization, etc.), geographical location, language (English is assumed by default), “viewer suitability” analogous to ‘PG-13’, ‘R’, etc., for movies, a list of groups to be granted editing access, and so on.
  • Link properties added automatically by the server when installing a link in a database include a unique integer ID, the email address and IP address of the link contributor, the date of submission, an initial rating of zero, an initial zero number of “hits”, an initial zero number of “installs” in documents, and the like.
  • Link properties make it convenient to specify “virtual link database directories” which include only the links satisfying certain criteria specifiable in terms of link properties. For example, a user may ask to see only tutorials and books in a certain educational level range. Alternatively, an author may specify seeing only links belonging to that author's email address, or group. Thus, properties enable selective browsing (or listing) as well as more selective link installation. Such selective browsing may be specified using the Search feature on the site home page 105 , standard menu 130 , or at the bottom of any Web page seen while browsing.
  • Link properties may also be usefully included in installed links (within HTML “comments” or in specially defined XML tags) when indirect links are being installed (that is, when the installed link points to a centralized link server which forwards the user's browser to the ultimate destination).
  • Installed link properties may be interpreted by the link server to provide additional control over link behavior. For example, a teacher using Internet documents for a 9 th grade class could configure the link server to suppress all links having an educational level greater than 10. That way, when educational level properties are available for all links, as the present invention provides, documents may be populated with hyperlinks which can be configured not to refer a student to information at a more advanced level than the teacher desires. The teacher may further suppress any links with a viewer suitability rating below a certain value.
  • installed link properties enable dynamically configurable link behavior based on link property values.
  • link properties can be associated with “classes” in “cascading style sheets” (an add-on to HTML) in order to display links to dictionary definitions in one manner, encyclopedia articles in another manner, and home pages in another, etc.
  • a subcontext When a subcontext is created, it can be marked as “restricted” to the owner (creating user) or to groups specified by the owner. Restricting a directory prevents anyone but the owner or specified groups from modifying the subdirectory.
  • the restricted directory can optionally be made “invisible” to users other than those having modification rights, in which case the restricted directory is said to be “private”.
  • An unrestricted directory is said to be “public”.
  • a restricted directory can be deleted or renamed or otherwise reorganized no matter what it contains. Typical uses of restricted directories include
  • the name of a restricted directory has the name of its first group (or owner, if no access groups are defined) automatically appended as a suffix to the name chosen by the owner in order to prevent conflicts with public directories and other restricted directories on the same topic.
  • any number of users may have restricted subdirectories on the same topic. For example, in the subdirectory “/Music/Computer_Music/Synthesis/” there could be
  • a known user may even create a new top-level hierarchy which may be designated public, restricted, or private.
  • User- or group-owned hierarchies of this nature which lie outside the primary public hierarchy may be placed in a special standard menu item entitled “Alternate Universes”, e.g., to indicate that they are not a part of the primary public context hierarchy.
  • each competing definition 152 or 153 is the hyperlink “Rate It” which navigates to a form where that definition (link) can be rated on a scale from 1 to 10, and/or a written review about that link can be submitted. If the user is not known, an attempt to submit a rating or review routes the user to the authorization page, and after a successful authorization, the rating or review is accepted by the server.
  • a primary function of the invention is to facilitate the installation of hyperlinks in documents intended for the World Wide Web. This section provides a detailed description of link installation in the preferred embodiment.
  • a hypertext link is installed directly to the top-ranked source of information on the topic identified by the matching key phrase in the user's submitted text. This is the first choice presented in the “Link Type” radio-button-group 177 of the default link installation form ( FIG. 7 ).
  • the link installation server preferably tests all links in its databases periodically and eliminates them if they are unavailable for a prolonged period of time such as more than a week. When all links containing a bad URL are automatically removed from the databases, all owners of the links are notified automatically by email and invited to submit an updated version of the link(s).
  • the second approach is to install an indirect link which links via a centralized server (such as a website providing the link installation service).
  • a centralized server such as a website providing the link installation service.
  • This choice is provided by the second radio button in the “Link Type” portion 177 of the default link installation form.
  • Such an intermediate website acts as a so-called “proxy server” for the link.
  • Indirect links may always point to the most up-to-date, top-ranked source of information on any given topic.
  • An example URL syntax for this mode of operation is
  • Context paths can similarly be assigned integer IDs in order to shorten indirect URLs.
  • a third approach is to insert a link to the “key-phrase page” itself at the centralized server (the page on the server listing all “competing definitions” for that key phrase). This is the third and final choice in the “Link Type” radio group 177 .
  • an end user following such an installed link will see all competing definitions, in ranked order, instead of only one.
  • the end user can then request that the definitions be reorganized according to various criteria such as educational level, document size, type of resource (article, book, etc.), type of source (.edu, .org, .com, etc., individuals, etc.), and so on, by making requests of the server interactively, or by means of preferences registered with the server.
  • a refinement of the third approach is to build or generate a more helpful “key-phrase home page” on the link server.
  • This page could provide, for example, a brief definition, followed by an organized presentation of all available sources of information, organized by type and ranked according to quality in each case.
  • the casual user may be satisfied with a mere dictionary-style definition, while the serious scholar can more readily pursue a wider variety of sources beyond merely the top-ranked source.
  • Providing interactive reorganization of the definition page according to end user preferences is preferable in this case as well.
  • a fourth approach is to use JavaScript features to install a snapshot of the key-phrase home page at the time of link installation.
  • a JavaScript pop-up menu may hold a list of all competing links for the linked topic.
  • the link database server preferably provides periodic link testing, average ratings computation, link reordering, automatic educational level assignment, and so on. It is also straightforward for the server to format the key-phrase home page dynamically according to user preferences based on link properties and other criteria. For full generality, it is desirable to customize and differentiate key-phrase home pages on the basis of language, educational level, and other properties. (They are already segregated according to context by the context hierarchy in which they reside.) To address the potential concentratedity of this task, a mechanism for allowing known and trusted users to submit key-phrase home pages for installation on the server can be provided. For this purpose, the server can provide a template document containing variables that are filled in by the server, in a manner often found in website construction tools.
  • any word or phrase can be selected in text displayed by the user's browser and “looked up” at a server website containing the link databases.
  • a similar mechanism is currently available in Microsoft Internet Explorer 5: The right-click menu contains an entry “See more with Lycos!” which, when selected, causes the selected phrase (or word last clicked with the mouse) to be looked up in the search engine at the Lycos website ( ⁇ http://www.lycos.com>>>).
  • the database server may act in place of the Lycos website
  • the user may be taken to the page of “competing definitions” (all links) for that topic.
  • the topic is available in multiple contexts, a list of all distinct contexts can be first displayed, so that the user can select which one he or she had in mind, and then be taken to the definition page in the selected context.
  • the term is not in the link database but coincides with a context directory name, that directory can be displayed by the browser.
  • the unrecognized phrase may be forwarded to an ordinary online dictionary (for single words), encyclopedia, or Internet search engine.
  • the link-free look-up mode described in the previous paragraphs can be supported in any number of applications, not just Web browsers.
  • the word processor Microsoft Word already supports looking up an ordinary dictionary definition of a word by selecting the word and choosing the “Define” item in the right-click pop-up menu. Another item in that menu could be “Look it up at the W3K”, for example.
  • a link-free look-up service of this nature could be provided in any application which displays text and supports text selection by the end user. The service can be provided either over an Internet connection as described above, or, in the absence of an Internet connection (or supplementary to it), using the single-computer embodiment of the present invention described in ⁇ 5.2.
  • end users of the link-free lookup service may optionally register with the database server in order to specify preferences such as whether a key-phrase lookup (sans link) should navigate to the key-phrase home page or more directly to the currently highest ranked definition for that key phrase.
  • the user may also inform the server of his or her educational level, desired viewer suitability range, and the like.
  • the database server may accept a URL containing a “virtual form submission” of a link-free lookup form.
  • a lookup request for the phrase “Hubble constant” could be sent to the database server by “navigating” to the URL
  • the CGI script linkfreelookup.cgi runs and may immediately issue a “navigation” output to the highest ranked link matching “Hubble constant”, if any.
  • the URL may also include a user name. If user preferences exist, the script may alternatively navigate to a key-phrase page of competing definitions for the Hubble constant, and so on. Additionally, any number link properties may be specified in the URL as well.
  • FIG. 7 shows the default web page for submitting documents to have hypertext links installed by the server.
  • the user pastes text to be “linkified” directly into the “Text or URL” textfield 170 .
  • a URL 77 has been specified, indicating that an entire website is being submitted for link installation, as will be described further below.
  • Three input submission formats may be specified by the “Input” radio-button group 171 : HTML, Plain ASCII, and LaTeX source.
  • “Help” link 174 which navigates the user to documentation on the relevant considerations for each choice.
  • submission of HTML format is selected in the input-format radio group 171 .
  • the output is also normally received in HTML format; this facilitates fast construction of Web pages from simple ASCII text files. It also can be used to quickly obtain a browsable Web directory from a list of keywords generated by other means. Since some HTML editors support “drag and drop” link installation from another document, an automatically generated list of HTML links can be very useful even for manual link entry in an HTML editor.
  • links are installed in the form of an invocation of the macro ⁇ htmladdnormallink ⁇ text ⁇ ⁇ target ⁇ , which is defined in the widely used html.sty LaTeX style file.
  • HTML parsing can be accomplished using the HTML Perl package (see, for example, page 716 of the Pen Cookbook by T. Christiansen and N. Torkington, O'Reilly, 1998).
  • LaTeX directives are avoided in the text matching algorithm within LaTeX source. (Perl for LaTeX parsing is available in the latex2html Perl script, freely available at ⁇ http://ctan.tug.org/ctan/>>>.)
  • Linking is preferably suppressed when the recognized phrase coincides with the name of the current section or document, i.e., a phrase that results in a link to the current page.
  • a second pair of radio buttons 176 exists for specifying that links be installed for either all emphasized words or phrases, or only emphasized words or phrases. Emphasized occurrences may appears as “ ⁇ emph ⁇ . . . ⁇ ” in LaTeX and as ⁇ I> . . . ⁇ /I> or ⁇ B> . . . ⁇ /B> in HTML.
  • the two radio-button-pairs 175 and 176 can be considered to specify “two bits” which select among the following cases:
  • any URLs found as plain text in the source are by default converted to links that display their own URLs as anchor text.
  • Many email programs and word processors presently perform this transformation on URLs detected as plain text in received email.
  • the “Link Type” radio button group 177 selects among three of the basic installed link types discussed in the first subsection of this section.
  • the “Link Color” select pop-up list 178 provides for link color selection as discussed above. In addition to the standard color names, there is a “take default” selection which does not specify the link color, thereby leaving it to the HTML cascading style sheet or user's browser to choose link color.
  • the “Contexts” radio group 179 provides some high-level choices of context selection for link installation.
  • the first choice, “All W3K contexts” corresponds to combining all context dictionaries in the entire context hierarchy. As the context hierarchy grows, this can become a computationally expensive option, even when the aggregate dictionary is maintained as an existing file at all times.
  • a “current context” exists (as result of browsing or user preferences), it and its extensions are preferably listed first in the aggregate dictionary, as will be clarified further below.
  • the second radio button in the “Contexts” radio group 179 selects only the “current context” (/Music/Computer_Music).
  • the current context is normally established by browsing or by standing user preferences. (When “Install Links” is selected in the standard menu 130 while browsing, the last context displayed in the browser becomes the default current context.)
  • a browser cookie is preferably used to remember the most recent “current context” for each user across sessions.
  • Installing links from only the current context is not as narrow as it may seem at first since normally the context dictionaries for /Music and ‘/’ (the top-level generic dictionary) are included, as well as all subcontexts of Computer_Music.
  • the two “Extensions” checkboxes 70 provide all-or-nothing control over appending parent and subcontext dictionaries to the current-context dictionary.
  • subcontext /Music/Computer_Music/Sound_Synthesis is a synonym for /Engineering/Signal_Processing/Sound_Synthesis, say, and if “multiple inheritance” is enabled at all subcontext hierarchy levels (an advanced link installation option), then the context dictionary for all of Signal_Processing and /Engineering would be folded in, at a lower precedence level, of course, since they are listed after all subcontexts of /Music.
  • the aggregate dictionary list built for link installation by the server can be rather large even when only the current context is selected for link installation.
  • the third and final option in the “Contexts” radio group 179 is to provide an explicit list of context dictionaries.
  • a list of context dictionaries can be accumulated via browsing in the manner described above, or a list can be submitted dictionary-file format. Additional “virtual context dictionaries” may be defined by means of the Search function, with the search results forming a link subset which can be assigned a name and treated as a dictionary. It is preferable to offer convenient hierarchical browsing of the selected portion of context hierarchy represented by the dictionary list. Any search result may also be displayed as a dictionary file. Dictionary files are discussed more detail below.
  • Dictionary lists may be stored on the server in a directory devoted to each user or in a file with user's email address forming part of the filename, as shown in the example of FIG. 7 . They may also be stored on the user's computer via browser cookies.
  • the “Min Level” 71 and “Max Level” 72 pop-up lists allow specification of a range of educational levels for link installation.
  • the server When the user presses the “Submit” button 173 (or the submit button of a long-form submission form), the server receives the filled-out form specifying how links are to be installed, processes the submitted text in a CGI Perl script or other server-side software to install the links, and generates output consisting of the user's submitted text with all the new links embedded.
  • an aggregate dictionary file is prepared on the server based on the user's link-installation specifications and the current contents of the server's link database. This dictionary file is then “applied” to the user's submitted documents in order to replace key phrases by hypertext links.
  • a Perl script illustrating link installation for HTML files is included in Appendix A.
  • the preferable details of the methods used depend on the relative sizes of the files involved. For example, if stringent conditions are specified on link properties for installation, and if a large file is submitted from the user, it may be the case that the aggregate link dictionary is much smaller than the combined size of the files submitted for link installation. In this case, it may be fastest to search the submitted file for each link in the aggregate dictionary.
  • the number of eligible links is large (e.g., “All W3K contexts” was selected in the Contexts section 179 of the link installation form), and if the submission itself is small, it may be preferable to search the aggregate dictionary file for each possible key phrase in the input file using well known “incremental search” techniques.
  • the output HTML may be returned to the user in the form of a “dynamic Web page.” That is, the user's browser immediately “navigates” to the automatically generated HTML page as if it were already somewhere on the Web. At that point, the user can select “Save As” in the Web browser in order to save the HTML in a local file, or “View Source” can be selected in the browser to enable copy/pasting of the generated HTML into a text editor for further editing.
  • the user specifies a URL pointing to the submitted document in place of the text of the submission itself.
  • This mode of submission is more convenient for linking entire websites.
  • the server processes the submitted file and all files reachable from the first via hyperlinks, provided that the reachable files reside somewhere on the same website (as defined by its URL). In other words, links are followed provided the first portion of the URL matches that of the submitted URL in its entirety.
  • the processed document is not returned as dynamic HTML, but rather as a hyperlink to a single binary output file on the server containing all the processed files.
  • This output file may be created by combining all processed files into one using the freely available tar program, and further compressed using the freely available gzip program.
  • the tar and gzip programs are available from the GNU Free Software Foundation ( ⁇ http://www.gnu.org/>>>).
  • the output file can then be “downloaded” to the client computer by clicking on the hyperlink pointing to the output file in the dynamically generated HTML.
  • the user then unpacks the file on his or her local computer using, e.g., gunzip and tar, or the shareware program winzip.
  • the user may specify the URL of a single composite file in “tarred and compressed” format, i.e., created using gnutar and gzip in the same way that the server's output is prepared in the case of multi-file submissions.
  • the filename extension is used by the link installation server to distinguish between pointers to websites (.html or no extension, indicating a directory) and compressed tar files (.tgz or .tar.gz). If the text appearing in the “Text or URL” textfield of the link-installation submission form starts with “http:”, “ftp:”, or “gopher:”, a URL is assumed.
  • Link installation usually occurs within a “current context” or a list of contexts.
  • the current context may be set according to the location of the browser when “Install Links” was selected by the user.
  • one or more contexts may be set explicitly in a dictionary list provided by the user when filling in the link installation form of FIG. 7 .
  • the current context dictionary is searched first for matches in the user-supplied text, and matches are transformed into links.
  • the process is “idempotent” since matches will not occur within the link syntax itself (such as in HTML anchor specifications or LaTeX macro arguments).
  • dictionary entries are ordered from longest to shortest phrasings, as discussed above.
  • the dictionary for the current context is optionally augmented by the union of all lower-level dictionaries within that context.
  • Current-level definitions take precedence over lower-level definitions in any key-phrase collisions. Collisions among lower level dictionaries are not explicitly arbitrated (since that could be accomplished by listing them explicitly), so that the first occurrence of a lower-level definition will take precedence (when not defined at the main level). This follows simply from the convention that “the first match wins”.
  • the purpose of adding in all lower level directories is to provide a reasonably complete dictionary at a high-level node without having to duplicate definitions from lower-level contexts. In principle, such duplication could be avoided by moving all lower-level definitions to the highest possible context.
  • the term “idempotent” is a math term used in many technical fields, and it is not an English-language term (according to the Funk & Wagnalls Standard Desk Dictionary). Therefore, “idempotent” can be defined without conflict in the top-level dictionary for the English language. In practice, however, it works out better to define terms in their “most natural” subcontext, and let their definitions “float up” as far as they can go without collision. Positioning a term within its “most appropriate” context makes the hierarchical dictionary better organized and instructive when browsing.
  • the parent node is normally next in the aggregate dictionary. It is searched for further matches, so that more general terms in the higher context not “overridden” by the lower contexts will be linked to their definitions. This process continues until the top-level context node is reached in the aggregate dictionary.
  • a list of context dictionaries may be specified explicitly in a variety of ways. This is analogous to specifying multiple libraries when linking a computer program. The order of specification is important since the first match is taken. This feature may be used by specifying ancillary fields after the main field of the author. For example, a physics professor might include certain math contexts after the appropriate context(s) within the field of physics.
  • a known user can contribute his or her own link database, select only it for search during link installation, and thereby obtain full control over the links which may be installed.
  • a personal link database can be very useful to the author of a book typeset in LaTeX, for example. Since LaTeX supports the generation of an index file, and since the freely available latex2html Perl script will convert a book index into an HTML page, such an index can be easily and automatically be converted (e.g., in the Emacs text editor) to a dictionary file format acceptable by the server. The entire book can then be processed by the server to install links pointing somewhere into the book for every occurrence of an indexed word in the book. Other links can of course also be included.
  • LaTeX index files Another application of LaTeX index files is to merge the indexes of related books in order to generate a link database for a particular “field,” spanning a specific set of resources.
  • a link database (or dictionary list) may be submitted in a documented ASCII format supported by the server. Since all properties are optional, the submitted file can be as simple as a list of key phrases and their corresponding URLs.
  • dictionary file which can be used to initialize a context hierarchy for the examples seen in the FIGS. 3-5 :
  • Link properties could be updated or added in this manner, for example. Ratings and reviews are not affected since they may not be submitted in a dictionary file.) Rejected submissions are listed in a message from the server delivered in a dynamic web page, as is typical. Similar action is taken for other kinds of messages to the user as needed.
  • the specified context directory does not exist, it is created, and the email address of the creating user is logged as its owner.
  • the server automatically installs an encoding of the owner's email address in each link entry by means of an additional link property. Other properties, such as initial ratings, date-of-submission, etc., are installed by the server. Only the owner or group-member or server webmaster may modify an existing link or directory.
  • Similar submission protocols can perform editing operations which would otherwise be laborious over the browser-based user interface described above, such as deleting a database subdirectory and all its contents (provided, of course, that everything to be deleted is owned by the person or group making the request).
  • editing operations which would otherwise be laborious over the browser-based user interface described above, such as deleting a database subdirectory and all its contents (provided, of course, that everything to be deleted is owned by the person or group making the request).
  • the directives can perform editing operations which would otherwise be laborious over the browser-based user interface described above, such as deleting a database subdirectory and all its contents (provided, of course, that everything to be deleted is owned by the person or group making the request).
  • dictionary file format When specifying a list of context dictionaries for link installation, it is convenient to be able to use dictionary file format. When used in this way, all PATH directives in the file are extracted to form a list of contexts. If any links are specified for a particular context PATH, then only those links will be eligible for installation. Additional directives are provided which correspond to the options available for context dictionary specification, such as include parents, include subcontexts, and allow multiple inheritance. For convenience, these aggregate-dictionary-building directives are ignored when submitting a dictionary file as a means of submitting links.
  • IP address is stored as well as the verified email address for security reasons.
  • a user with “root privileges” on a personal machine can generate any number of return email addresses, while the number of IP addresses available to an individual is usually very limited. For example, if unusually many email addresses are found to belong the same IP address, a warning can be automatically emailed to the webmaster who can look into the matter further, such as by inspecting all contributions from that IP address. If an IP address turns out to belong to a malicious “hacker”, it is straightforward using standard UNIX tools to eliminate all database entries and directories associated with that IP address, barring it from further contributions, and so on.
  • IP address When the IP address is dynamic, as is often the case when a commercial Internet Service Provider (ISP) hosts the user's account, it is less likely that many different email addresses will belong to the same person, and the ISP can be contacted for assistance. Note that it is very easy to arbitrarily set the “From:” field in any email message; therefore, the “Received” fields in received email may be analyzed by the server to get closer to the true originating location. In Netscape Navigator, for example, select “View/Headers/All” to see such fields in received email.
  • ISP Internet Service Provider
  • Each link database may be implemented on the server as a plain ASCII file in a directory structure that corresponds to the hierarchical organization of the link databases.
  • the complete hierarchy can also be implemented in a single file which contains path information for each link entry.
  • the initial prototype of the present invention used a single link database file based on the links-2.0 software scripts from Gossamer Threads, Inc. ( ⁇ http://www.gossamer-threads.com/scripts/links/>>>).
  • the context path information is included in what is called a “link category”.
  • categories having no sub-categories are considered to be “key phrases”, and actual links within a key-phrase (bottom-level category) are treated as “competing definitions”.
  • a single links database file is not considered the best mode of carrying out the present invention. Instead, a hierarchical file system implementation is preferred in which the directory path corresponds to the context, and the database file in a context directory contains only links for that context (along with perhaps a limited number of subcontexts).
  • an evolutionary path can be followed starting out with a single database file, followed by splitting into separate database files for top-level contexts, followed by further splits as the files grow too large, etc.
  • the links-2.0 system advises a limit of 10,000 links for its one-file link database system managed by Perl CGI scripts.)
  • the first path component stored in the link database may be removed since it becomes implied by the directory in which the database file resides.
  • a database directory may contain both files and directories. Subdirectories are interpreted as subtopics, and the hypertext links for the current directory (when it is a key-phrase directory) may reside within a single ASCII file named “links.txt”, for example, preferably located in a context directory containing the key phrase.
  • the links.txt file contains a list of hypertext links for the current context in a plain ASCII format described below.
  • the name of a user's configuration file may include the user's email address, if known, and otherwise an arbitrarily assigned session ID for “unknown” users. All active sessions preferably time out after a period of inactivity, as is commonly implemented by websites featuring session management.
  • ratings.txt There may be a system file ratings.txt, parallel to links.txt in each directory, containing all contributed ratings for the links in links.txt.
  • Information stored in ratings.txt for each link includes the email address of each contributor, and the contributed rating. When a new rating is contributed, an entry is appended to ratings.txt. If there is already a rating from that email address, it is replaced with the new one. A new average rating is computed, and the updated average rating and contributor count are entered into links.txt as properties for the affected link.
  • reviews.txt also parallel to links.txt, resides in each directory and contains all contributed “link reviews”. Information stored for each link includes the email address of each contributor, and the contributed review. When a new review is contributed, it is appended to reviews.txt, replacing any previous review from that email address.
  • Links may be stored on the server in the following simple ASCII text-file format:
  • This format uses explicit property names which are convenient when specifying sparse subsets of all possible properties (and also more clear for describing the invention).
  • An alternative is the use of a fixed-format record in which the property names are implied by their field position within the record.
  • the ID is a unique integer assigned to the database record.
  • the ID therefore uniquely identifies the record and can be used to identify it in various contexts, such as in the URL for indirect links.
  • a link to a Web page about the “W3K” website could appear in the link database (in one long line which is broken for clarity below) as
  • a user database holding information such as a list of IP addresses authorized for that email address, whether the user wants to receive the W3K newsletter, the list of groups to which the user belongs (being a “trusted user” means belonging to the “trusted” group), and information logging any inappropriate use of the service such as submitting offensive links.
  • Hits Number of times link accessed by browsing Installs Number of documents link has been installed in Rating Quality rating as a number from 1 to 10 RateCount Number of users contributing ratings isNew 1 if Date is sufficiently recent isPopular 1 if Hits is large relative to other links
  • OwnerEmail Email address of link contributor ReceiveMail 1 if link contributor wants our newsletter Groups List of owning groups separated by ‘
  • the properties can be used to limit the range of links installed by a link installation. For example, a certain educational level range can be specified, or links only of a certain type may be specified. Restriction to links contributed by the owner or owning group is also easily specified.
  • KEYs will match occurrences of any case by default. When a link is installed in a user's document, the user's original case is preserved in the anchor text. KEYs may be entered in singular form since the string matching algorithm will match will ignore a trailing ‘s’.
  • a KEY is either a simple word or a phrase consisting of words separated by underbars, e.g., Funk_&_Wagnalls_Knowledge_Center. A word may not contain certain “meta-characters” such as “
  • context names must be “legal” UNIX file names after whitespace has been converted to underbars ‘_’, since the preferred embodiment uses a UNIX directory tree corresponding to at least part the context hierarchy. Restriction to legal filenames is easily relaxed by encoding the directory names in hexadecimal, as an example, or using the special character encodings of HTML.
  • the string matching algorithm used in link installation “folds” the input case to “lower” and replaces underbars and hyphen with spaces in string comparisons.
  • KEYs in text submitted for link installation can have any case and can include underbars, hyphens, or spaces separating words in the keyword phrases, yielding the same matching results in all such cases.
  • the link name is functionally equivalent “funk wagnalls knowledge center” for purposes of string matching. To include special characters where necessary, names may be quoted, as in
  • the present invention can be adapted equally well to single-computer operation, requiring no network connection.
  • the user can install a link database application from a CD-ROM, for example, in the fashion typical of many software products for personal computers. All functions formerly described as being provided by a Web browser and the remote link-installation and database server can be provided by the installed application.
  • a Web version if available, can serve to provide a supplementary collection of links.
  • Link rankings can of course change at any time, and this is normal. However, since locally installed ranking information may be a rating override by the user (rather than the result of a previous upgrade), it is not necessarily correct to overwrite the locally installed rating properties. Similarly, other properties may have been added by the user to fine tune link installation results. During installation of the incremental update, the user may be given a choice of whether or not to accept conflicting information from the incremental update on a link by link, or property by property basis.
  • the default action may of course be to avoid overwriting any user-developed information, and the default upgrade can proceed in this mode.
  • all conflicting links can be installed in a third parallel directory tree for later inspection by the user.
  • Another means for avoiding conflicts is to rename any pre-existing directories containing user modifications (by adding a private suffix to its directory name, say) before carrying out an update.
  • the educational level of a definition is a number indicating how advanced the material is. Authors generally wish to minimize the educational level as much as possible consistent with the intended audience, the material being presented, and the desired length of the document.
  • link Every definition (link) is assigned an educational level.
  • a normalized educational level may be provided manually by the link contributor as a number between 0 and 100, with the number being loosely interpreted as “years of education likely required” for someone specializing in the subject.
  • a level is automatically computed which interpolates the manually assigned levels that do exist.
  • the automatically assigned level of a definition is computed by first computing an integer “raw level” for the definition based purely on an analysis of definition interdependencies, followed by the computation and assignment of a “normalized level” which maps each raw level to the pre-existing manually assigned level when it exists.
  • the raw educational level is defined as 1 plus the maximum raw educational level of all referenced definitions, where a definition referencing no other definitions is assigned a raw level of 1.
  • a referenced definition here means any server-resident link occurring inside the definition's document. All links outside the server are assigned a level of zero (to avoid having to process external documents, all their sublinks, etc., in a potentially huge recursion over the internet).
  • a definition document which refers only to outside material, which would receive an automatically assigned raw level of 1, it is the submitting author's responsibility to reasonably assign its educational level manually when submitting it to the server. Forward references in a document can be marked as such to prevent them from artificially elevating the automatically computed level.
  • a “definition document” means the page pointed to by the definition's URL together with all information reachable from the definition page on the same website by following links.
  • the definition document corresponding to definition URL http://www.w3k.org/ includes the contents of http://www.w3k.org/index.html together with any HTML (or other format) content reachable by following links within index.html, such as http://www.w3k.org/about.html, which are at the same site, that is, expressible using URLs starting with http://www.w3k.org/.
  • documents may mention any specific recommended prerequisites at the outset, providing links whenever possible, so as to better orient the reader.
  • list of keywords appearing below an article's abstract, after being automatically linked by the present invention, can provide a good first set of background links.
  • an explicit list of prerequisite topics automatically linked to top-ranked tutorials by the present invention, can provide a more systematic and preferable approach to prerequisites.
  • Training levels are normalized to make them more intuitive to use. Normalization is a process which recomputes automatically assigned educational levels so as to make them fit more naturally among the manually assigned levels.
  • Raw numerical levels are integers which are bounded below by 0 and unbounded above.
  • the raw level of any document is at least 1 greater than the maximum level among all documents it references.
  • levels of advanced writings can tend toward very large integers.
  • the raw level number is difficult to interpret. Normalization can be easily accomplished using a piecewise-linear mapping from the raw-level scale to some “standard level” scale.
  • the standard level scale used at the server may be loosely based on the number of years of education “likely required” to understand the document for someone taking a direct educational route to the material in the given context. It is analogous to a “par score” in golf, which defines the average number of “strokes” required to complete the course, for a good player. Such a numbering scheme obviously becomes more and more debatable at levels above 14 or so (middle undergraduate level) where curricula are not uniformly standardized. In fact, it is common in many technical fields to constantly work toward moving level 17 materials (beginning graduate level) down to level 14 or even below, meaning the “ideal” educational level is itself a moving target.
  • a definition can be schematically represented by a list of numbers (the raw levels of all referenced definitions) to which a higher number must be assigned (the definition's raw level).
  • the definition's raw level For example, two definitions D1 and D2 might appear schematically in this way as follows:
  • definition node 80 is assigned a raw educational level of 20 which is consistent with the fact that its document references definitions 2, 5, 44, and 45 which have been assigned raw levels 13, 14, 18, and 19, respectively.
  • each node of the graph corresponds to a definition
  • a reference to another definition can be visualized as an arrow (also called an “edge” of the graph) pointing from the referencing node to the referenced node.
  • the arrows can alternatively be taken to all point in the other direction, from the referenced node to the referencing node.
  • a directed graph can be topologically sorted if and only if it is acyclic, meaning there are no cycles in the graph.
  • Standard topological sorting algorithms just work when there are no cycles, and when there are cycles, they are detected and reported as errors, leaving it to the user to “break the cycle” in some way.
  • Cycles can be caused inadvertently in documents by use of forward references.
  • a level 12 document about the calculus it would be quite natural to mention some calculus applications such as solving problems in elementary mechanics (which is level 13 at a typical university, requiring calculus as a prerequisite).
  • some calculus applications such as solving problems in elementary mechanics (which is level 13 at a typical university, requiring calculus as a prerequisite).
  • a mechanism is provided for authors to mark forward references in documents, so that forward-reference links are ignored in the computation of educational level.
  • the owner of the modified link record is preferably notified via email.
  • the combination of solutions 0 and 1 is employed, but more sophisticated solutions are anticipated to become preferable as the link database grows.
  • level assignment helps to disambiguate between relatively advanced and elementary contexts.
  • the proof of “Euler's theorem” contained the use of the phrase “positive real number.”
  • the term “positive real” existed in the dictionary for the same context, but at a much more advanced level. (It was entered as short for “positive real function” which is a mathematical property of functions in the complex plane characterizing “passive” driving-point impedances in the context /Engineering/Electrical/Classical_Circuit_Theory). Since links to higher educational levels are normally suppressed when installing links, the misleading link would not occur.
  • phase vocoder a tool for “stretching” sound in time, among other uses
  • normalized level 16 a college senior with a good undergraduate background in signal processing
  • phase vocoder topic may be set at level 18 or 19 so that the Music Ph.D. student has time to pick up necessary signal processing prerequisites in the electrical engineering department.
  • a convention for handling multidisciplinary documents can be that the level of the document is set according to the portion of the document which properly belongs in the context to which it is assigned.
  • a document requiring expertise in three fields say, could then receive three different levels, one in each context, where the level in each case depends only on the portion of the document pertaining to that field.
  • the paper can be placed in the context hierarchy in at least two places, under /Legal/Patents, and under /Science/Biology/Genetic_Engineering (which, by the way, could be synonymous with /Engineering/Genetic_Engineering).
  • Under /Legal/Patents its level is set according to the level of legal background required.
  • Under /Science/Biology its level is set according to the level of knowledge required to understand the technical aspects of the article.
  • link properties can be added by the user, the above example can be extended to include a secondary level specification. For example, when filing the paper under /Legal/Patents, it can be given two additional user-defined properties
  • An alternative mode of treatment for multidisciplinary or cross-disciplinary documents is to assign a manual level to the document which represents the number of years of education likely required for understanding assuming one first obtains a typical background in the current context. That way, the level numbers of links in a particular field are most meaningful for people in that field. Since people in the field are most likely to be using the links for that field, it makes sense for the numbers to be best adapted to their needs.
  • the system of the present invention allows for many choices of normalized level scale, and the scale can be changed at any time by resetting the manually set levels.
  • a link contributor may, for example, perform a property-based link database search which collects together all editable links at a particular educational level, and changes all the levels to a new value in one edit operation on the selected set.
  • the normalized educational level numbers may go down over time as shorter and shorter “critical paths” to particular advanced topics are identified. Curricula can in fact be based precisely on such identified paths, or even automatically generated from them. This is a foreseeable use of the present invention. Specifically, a student can choose a set of skills and topics he or she wishes to master, and plunge in immediately on the most advanced level, clicking on links whenever a word or phrase is not understood. After enough clicks, a level is reached where the material is digestible, and the student can eventually work back up to the desired level. This is in contrast to the usual program of beginning with elementary preliminaries which can seem unmotivated to the student. Needed educational prerequisites can be “paged in” as needed on a demand-driven basis (to use an analogy with “demand paging” in computer science). Significantly, unnecessary educational prerequisites can be bypassed entirely.
  • Such a mechanized approach to self-paced educational certification has a few obvious disadvantages relative to more traditional educational methods, especially given the limited “virtual reality” capabilities of present day computers.
  • it can provide a very efficient way to “tool up” on a technical topic, particularly for the time-limited employee, the spare-time student, or the bright young learner in a developing country seeking salable job skills in the global information economy.
  • each concept is given its own “home page” on the Web which approaches the subject in a top-down way, addressing the widest possible audience, and directing all interested readers to the most appropriate further information for them.
  • All-encompassing home-page style definitions are likely to emerge as the “best definition” in a user-ranked list. Being designated the “best definition” for a topic is significant in that, in some modes of operation, only the “best” definition is available to the end user in installed links.
  • Example “topic home pages” include one on the Kalman filter at
  • top-down, general-interest, home pages on a given topic is that its minimum educational level may be set low, allowing the link to be installed in a greater variety of materials, while the expert can skip over the introductory tutorials and proceed directly to the mode advanced material of interest. For this reason, the preferred embodiment supports specifying a range of educational levels for a single document.
  • a well designed topic home page might span levels from high-school all the way out to the research frontier.
  • An example website covering such a large range is the Physics 2000 website at
  • a useful means for generating link targets is to write a LaTeX document which explains one “concept” per section.
  • the title of the section may be the word or phrase being defined.
  • the first occurrence of the concept's name may be emphasized using the LaTeX command ⁇ emph ⁇ ⁇ .
  • the utility latex2html can then be used in a normal fashion on such a “concepts file” to create a separate HTML file for each section, and the hypertext index generation capability of latex2html can be used to generate the associated link database (dictionary file).
  • a dictionary can be generated directly from the HTML using the widely available Perl modules LWP and HTML (see, for example, the Perl Cookbook by T. Christiansen and N. Torkington, O'Reilly, 1998).
  • a Perl script for this purpose is listed in Appendix B.
  • the dictionary file so generated may be submitted to the link database server to conveniently contribute links to all the newly created concept home pages on the Web. (The script is written to retrieve all files under a URL by following links, thereby providing a starting point for a utility which creates a dictionary file from any website on the Web.)
  • the LaTeX command ⁇ section ⁇ ⁇ can be replaced by an invocation of the following more convenient macro:
  • This macro is called with the section name as usual, but additional properties may be specified in the second argument, and these are passed to the output in an HTML comment which may be interpreted by the script which creates a dictionary file from the HTML files (a starting point is given in Appendix B).
  • Context properties found in the HTML comments may be translated to PATH directives in the dictionary file.
  • An alternative use of properties embedded in link target files is that, when using indirect links, link properties do not have to be included in the link database at the link server.
  • a ⁇ emph ⁇ derivative ⁇ is a financial instrument derived from an underlying security such as ... ⁇
  • the use of the context property can be extremely important for obtaining the desired results. If the context property is not specified for a link, it may default to the path of the database directory in which the link resides.
  • the context value All can be specified in a link entry by trusted users to enforce only one meaning of a term across all Contexts. Such treatment could be appropriate for trademarks, for example. Since the top level of the link database directory tree is Context independent, link entry at that level can be restricted to trusted users.
  • properties were used to specify the context and educational level of a link target. Properties can similarly be specified in documents submitted for link installation in order to guide link installation. Thus, the document receiving links can call for links of a certain type rather than having to specify all aspects of link installation in a separate script or over the browser interface at the server.
  • a linked reference can be marked as a “forward reference”, for example, via
  • the Perl script in Appendix C can be used to convert the “Favorites” folder containing Internet bookmarks for Microsoft Internet Explorer 5 to the dictionary file format described above.
  • the inexpensive utility program LinkSync http://www.bluesquirrel.com
  • LinkSync http://www.bluesquirrel.com
  • Many “web surfers” have valuable hierarchically organized “link collections” in their browsers which might usefully be converted to dictionary files that can be submitted to the link installation server. Hierarchy is preserved by creating context paths as needed.

Abstract

A database server contains pointers to useful information, such as on the World Wide Web. Users of the server may have hypertext links added automatically into documents they submit. Users may additionally contribute to the link database, thereby extending it, and may add additional qualifying information pertaining to the links.

Description

    1 CROSS-REFERENCE TO RELATED APPLICATIONS
  • This application is a continuation of U.S. application Ser. No. 10/681,621 filed Oct. 7, 2003, which is a divisional of U.S. application Ser. No. 09/412,248 filed Oct. 5, 1999, which claims priority from U.S. Provisional Patent Application No. 60/103,089 filed Oct. 5, 1998, which is incorporated herein by reference.
  • 2 BACKGROUND OF THE INVENTION
  • 2.1 Field of the Invention
  • The present invention relates to facilitating access to information over a computer network such as the Internet. More particularly, the present invention relates to technology for partially automating the linking of documents on the World Wide Web by authors of Web content. Such techniques are particularly useful for more easily creating richly interconnected information on the Web.
  • 2.2 Description of Related Art
  • The World Wide Web provides an enormous distributed database of information interconnected physically by the Internet. One of the main difficulties for users of the Web is finding needed information out of the tremendous quantity of information that is available. Various mechanisms have been developed to address this problem.
  • One mechanism for facilitating access to information on the Web is the index website. An index website is typically a server computer connected to the World Wide Web which maintains an index of Web content that can be searched in various ways by users (clients) connected to the server over the Internet. Indexes are often updated automatically by means of “spiders” which systematically explore the Web looking for new or updated content. Most search engines also provide means for users to install information to be indexed, so that such information may be indexed immediately without waiting for a spider to find it. An example of a premier search engine is the “Alta Vista” website, accessible on the Web at the Universal Resource Locator (URL) address <<<http://www.altavista.com>>>.
  • A difficulty with search engines is that search results typically contain too much undesired information as well as the desired information. This occurs because the information content of the Web is vast, and because it is difficult for users to construct search parameters in such a way as to pass most desired content while rejecting most undesired content. As a result, users typically must spend a lot of time sifting through search-engine results and/or refining their searches with additional restrictions in the search parameters. Additionally, the information stored in the index is not organized in a form suitable for browsing in a logical order.
  • Another mechanism developed to facilitate access to information on the World Wide Web is the directory website which presents a hierarchical directory of information that can be browsed by the user. Premier sites of this nature include Yahoo (<<<http://www.yahoo.com>>>), Netscape (<<<http://www.netscape.com>>>), and Excite (<<<http://www.excite.com>>>). A visitor to such a site is first presented with a top-level list of topics. Choosing a topic by clicking on a topic's hypertext link with the mouse produces a list of subtopics, and so on, until a final level is reached at which useful information is displayed about the topic, or else a remote website pertaining to that topic is visited. Directory companies such as Yahoo typically have teams of editors who explore the Web looking for content suitable for reference at their site, and these workers perform a function analogous to the automatic “spiders” used by automated index websites. Like the search engines, directory websites normally support searching within the directory site, thus producing search results of generally higher quality and less “clutter” than typically encountered on an index site. Also like index websites, directory websites typically allow submission of content for reference, subject to editorial consideration. Thus, directory websites improve over index websites by providing editorial selection, logical organization, and browsing capability, all of which are absent in typical index websites.
  • A first difficulty, however, with directory websites is that they cannot reasonably keep up with the vastness of the information on the World Wide Web by means of manual editorial selection. As a result, directory websites tend to offer far less information relative to index websites. A second difficulty with directory servers is that their content is proprietary and controlled by a team of editors at one company. This editorial control, while ensuring consistently high quality on the site, makes it difficult and sometimes even infeasible for an information provider to obtain a desired listing in the hierarchical directory. One directory site that addresses this difficulty is the Open Directory project (<<<http://dmoz.org/>>>); The Open Directory allows any user on the Internet to become an “editor” for a particular topic at the site. A third difficulty related to the first is that typical directory sites are extremely broad in scope, contributing to the absence of specialized information that is not of interest to a wide general audience.
  • A difficulty with both index and directory websites is that information is presented without regard to the user's level of education. It is therefore often possible for a high-school senior working on a book report, for example, to encounter information understandable only by a graduate student in a specialized field. There is similarly normally no means for selecting information according to its type or source or other potentially desirable criteria.
  • To assist users in selecting sources of information, some websites provide a user rating system (or “scoring system”) to which any user may contribute. An example of this mechanism is seen in the online book-store website <<<http://www.amazon.com/>>>. Amazon allows any user to contribute a “book review” and an overall rating on a five-star scale. The average rating is displayed for each book, and books which match the user's search criteria are displayed sorted according to decreasing score (and possibly other criteria such as the number sold). An interesting feature of the Amazon rating system is that it is democratic, allowing the vast quantity of World Wide Web users to jointly develop a ranking of the information sources (in this case books). Such a scheme addresses the difficulty of sorting through enormous quantities of information by harnessing a potentially enormous base of users as contributing editors, in effect. A difficulty with rating systems is that they are generally used only at the site where the ratings are collected, and no mechanism is provided for making use of the ratings elsewhere, such as in other documents on the Web linking to the same information.
  • An important mechanism integral to the function of the World Wide Web is the HyperText Markup Language (HTML) which is a text format supported by Web browser programs (such as Netscape Navigator or Microsoft Internet Explorer). A more recent variant called XML is now gaining support, and its function is similar to that of HTML for present purposes. HTML provides for the specification of hypertext links in Web-page text displayed by the browser. At a minimum, a hypertext link consists of text to be displayed by the browser and a link target which is usually not displayed. For example, the HTML code
      • <a href=“<<<http://www.w3k.org>>>”>W3K website</a>
        contains the text (also known as the anchor) “W3K website”, while the link target is <<<http://www.w3k.org>>> which is a URL pointing to the W3K website. Thus, the link target is normally addressed by a URL pointing to information on the Web about the displayed word or phrase. (The complete HTML format specification may be found online at the URL <<<http://www.w3.org/>>>.) To the browser user, the anchor text of a hypertext link as above appears in a Web-page display as an underlined word or phrase, e.g.,
      • Visit the W3K website for more information regarding automatic link installation.
        and usually in a different color than normal, unlinked text. By clicking on the hypertext link with the mouse, the user directs the browser program to “follow the link” by “navigating” to the URL associated with the link. The link-target URL may point to another Web page anywhere on the World Wide Web, or it may simply point to another location within the same electronic document. Hypertext links in HTML documents make it much easier for the user to explore the World Wide Web by visiting Web pages and clicking on the links found therein. Web browsers further make it easy to return to the page containing the link by using the “back” button, or the “history” list of visited pages maintained by the browser.
  • A difficulty with hypertext links is that they must be laboriously added by Web content providers. Typical HTML editors merely provide a data-entry form in which the URL for the link target can be typed. A second shortcoming of HTML and Web browsers is that there is no standard mechanism for specifying link properties such as educational level, type of resource, information source, or the like, which could be supported by Web browsers to give the user finer control of link display based on link properties. After the links are typed in, they must be maintained as their URLs change, and as new and better link-targets become available. There is therefore a need for automated assistance with entering, maintaining, and improving hypertext links in documents intended for a hypertext document environment such as the Web.
  • SUMMARY OF THE INVENTION
  • It is a primary object of the present invention to facilitate the addition of hypertext links (also called “hyperlinks,” “links,” or “definitions”) to documents intended for access on the Internet via the World Wide Web. Accordingly, the present invention is designed to provide a link installation service which automatically installs hyperlinks within information submitted to the service by hypertext authors. Submissions may be in HTML format, plain ASCII format, LaTeX source format, or a variety of additional formats to be added in the future. The output returned to the user may be in either HTML or LaTeX source format (which may be compiled into HTML format). Criteria can optionally be specified which govern the installation of hyperlinks.
  • The invention further provides selectable databases of hyperlinks, organized by category (or “context”), which can be optionally selected for automatic link installation. It is further provided that content developers may add their own links to the existing link databases, and they may additionally create new link databases and specify their relation to the existing link databases. Contributing users are preferably required to have a known, verified email address. A user with a verified email address is called a “known user”. The invention further provides means for browsing the link databases in a logically organized, hierarchical tree structure, wherein higher-level nodes correspond to more general contexts, and lower-level nodes correspond to more specialized contexts. The link databases can additionally be searched for keyword matches within component fields. Users may provide ratings and/or reviews for individual links in the link databases.
  • The hyperlink databases of the present invention support various optional “properties” associated with each hyperlink. One such property, useful in the development of educational content, is a level designation which indicates the educational level required for best understanding of the link-target information. Additional optional properties include the language of the content (such as English), a viewer suitability rating such as exists for movies (PG-13, R, etc.), and properties defined by the user. Link properties can be specified by users to control the automatic installation of links, and/or to control what is displayed while browsing the link databases.
  • Educational levels not specified on submission are estimated based on the level of links found within the link target document. As a result, every link in the link database is assigned either an educational level, either manually or automatically. Determining levels automatically detects any “cycles” in the link database. (A “cycle” occurs when document A links either directly or indirectly to document B, and document B links either directly or indirectly to document A.) Cycle detection can help content providers eliminate inadvertent “forward references.” Means are provided for marking forward-reference links in submitted documents so that educational level will not be affected. Cycle-free systems of links can be more effectively used as a basis for online course materials.
  • Another feature of the present invention is the ability for users to rate (or score) the quality of any link in the database and/or to submit a written review of any link. The quality ratings may be averaged together and used to determine the relative ordering of the links when there are multiple link targets for the same word or phrase (“competing definitions”). In the typical case of HTML format, features of the JavaScript scripting language may be used to provide convenient access to multiple link targets, ranked according to score. Alternatively, the latest ranked list of competing definitions may be maintained on a central server on the Web, with the installed link pointing there, instead of containing only a snapshot at the time of link installation, which may rapidly go out of date. Alternatively, the currently highest rated link may be installed in the user's Web document for each recognized topic.
  • 4 BRIEF DESCRIPTION OF THE DRAWING FIGURES
  • FIG. 1 shows an example initial Web page seen by a visitor using a Web browser to access the online version of the service.
  • FIG. 2 shows a Web page giving an overview of the capabilities of the online service.
  • FIG. 3 shows an example Web top-level page seen while browsing the hyperlink databases.
  • FIG. 4 shows an example lower-level page seen while browsing the hyperlink databases, in which the context has been narrowed considerably.
  • FIG. 5 shows an example browsing view at the level of a key phrase in which all displayed links are interpreted as “definitions” for the key phrase.
  • FIG. 6 shows an example form for adding a new link (definition) to the link database for the current key phrase.
  • FIG. 7 shows a Web page for submitting text for link installation.
  • FIG. 8 depicts the tree structure of the hierarchical link database.
  • 5 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
  • The following is a description of the best presently contemplated modes of carrying out the invention. The descriptions are not to be taken in a limiting sense but are made for the purpose of illustrating the general principles of the invention. It is particularly noted that the invention may be implemented in a variety of different file formats, database technologies, search and replace methods, computer processors and system architectures, host operating systems, network protocols, user-interface frameworks, and the like.
  • 5.1 Client-Server Architecture on the World Wide Web
  • FIG. 1 illustrates how a World Wide Web “home page” might appear on a website embodying the principles of the present invention. The user has several choices of where to “navigate” next: The first choice 101 is a hypertext link entitled “Learn about the W3K,” where in this example, “W3K” is an acronym standing for the “World Wide Web of Knowledge.” If this choice is selected by clicking the mouse on the underlined text, the visitor “navigates” to the Web page shown in FIG. 2 which provides an overview of the online service provided by the W3K. In particular, it is explained how submitting plain text 110 to the W3K server will result in hyperlinked text 111 being returned to the user. A summary 115 of high-level functions is also provided in FIG. 2.
  • The second choice in FIG. 1 is a hypertext link 102 entitled “Browse the W3K.” If this choice is selected by clicking the mouse on the underlined text, the visitor “navigates” to the Web page shown in FIG. 3 supporting browsing of the hyperlink databases, as described further below. The third choice in FIG. 1 is a hypertext link 103 in which the text displayed by the Web browser is “Install W3K links in a Web document.” If this choice is selected, the visitor is taken to the Web page of FIG. 7 where the user can submit text for link installation in a variety of formats. The text is returned to the user by the server with hypertext links installed according to the user's specifications. Link databases to be searched can be collected into a list during the browsing operation.
      • The fourth choice, “Add to or Edit the W3K,” is a link 104 to a Web page for editing the link databases. Editing operations include submitting new links, creating new link categories, and changing previously submitted links or link properties. These editing functions are also available while browsing the databases.
      • The fifth and final choice, “Search the W3K Dictionaries,” is a link 105 to a Web page for specifying search criteria in terms of link properties. The search collects together all links in the link databases matching the search criteria, and displays them organized by properties according to user specifications. The search feature is useful for collecting various link subsets together for various purposes including link installation, editing link properties, and other functions involving groups of links. As an alternative to a list display format, a sparse context hierarchy can be generated, containing only the database information matching the search criteria; the sparse hierarchy can then be conveniently browsed by the user.
  • These functions are described in further detail below.
  • 5.1.1 Overview and Terminology
  • This section introduces the main terms which will be used hereafter.
  • Hierarchical Contexts
  • The link databases are organized hierarchically according to category, somewhat like the Dewey decimal system for library organization. Each category (or “directory”) is interpreted as a context analogous to a field of study. Each context may itself contain any number of contexts (“subcontexts,” or “subdirectories”), and it may additionally contain a database of information pertaining to that context (which may be a implemented in a file in that directory).
  • The particular sequence of directories obtained by visiting one subdirectory after another is called a path. Every context may be identified by the directory path that reaches it from the top level. Thus, the set of all contexts form a “tree structure” analogous to the hierarchical file systems used by all major computer operating systems at the present time.
  • Dictionaries
  • A link database (or “dictionary”) preferably comprises a list of (key, URL) pairs. A key (or “key phrase” or sometimes “word”) identifies a topic or concept, and the URL points to information about that topic on the Internet. In a loose analogy with an ordinary dictionary, the key is the “word being looked up”, and the URL points to its “definition”. However, unlike an ordinary dictionary, the (key, URL) pairs in the link database are interpreted within the particular context associated with the directory containing that dictionary. In a somewhat better analogy with a technical encyclopedia in a particular field, the key corresponds to the noun phrase identifying a technical topic for which an article exists in the encyclopedia, the URL may correspond to the page number on which the article begins, and the context may correspond to the technical field for which the encyclopedia was written.
  • Because dictionaries are interpreted in a particular context, alternate definitions are not allowed. In other words, a context is preferably sufficiently narrow such that all terms (words or key phrases) in that context have a unique meaning. Ordinary “flat” dictionaries must accommodate alternate definitions for a single word, while “hierarchical dictionaries” need not. Thus, if a term is found to have a second meaning in a particular context, it is time to create one or more subcontexts in which that term is disambiguated.
  • Synonyms
  • A single URL can provide only one “definition”. However, a single URL can be used to “define” any number of key phrases, which are then regarded as synonyms. Often the title of the addressed HTML page on the Web is the “key phrase” that is “defined” by the URL. When there are several (key, URL) pairs having the same URL, the different keys are treated as alternate phrasings for the same concept or topic, and are said to form a synonym group. The following example dictionary entries provide an example of a synonym group:
      • KEY=Taylor Series Expansion
        URL=<<<http://www.mathworld.org/analysis/TaylorSeries.html>>>
      • KEY=Taylor Expansion
        URL=<<<http://www.mathworld.org/analysis/TaylorSeries.html>>>
      • KEY=Taylor Series
        URL=<<<http://www.mathworld.org/analysis/TaylorSeries.html>>>
  • Order is important in the dictionary because “the first match wins” during automatic link installation. For example, with the above ordering, the key phrase “Taylor Series Expansion” will match before checking for “Taylor Expansion” or “Taylor Series”. Ordering equivalent key phrases from longest to shortest ensures that the longest possible match will occur in documents submitted for link installation.
  • Synonyms can be listed in a link's properties, or they can simply be entered as additional link entries pointing to the same link target (URL), since links take up relatively little space.
  • “Key Phrase” Directories
  • A “key phrase” may be understood as a bottom-level subdirectory of the context tree. A key-phrase directory holds a dictionary (link database) containing at least one link. This database may be implemented as a file residing in a directory having a name derived from the key phrase. Preferably, however, all key phrases in a particular context (together with their links), plus perhaps additional contexts, are implemented in a single larger database file in the parent context directory. For simplicity, however, a key phrase will nevertheless be considered logically to be a bottom-level directory (leaf node) in the hierarchical context directory, irrespective of implementation details associated with the use of a hierarchical file system.
  • All of the links in the key-phrase directory are interpreted as competing sources of information on the one topic identified by the key phrase. The tangible difference between a key-phrase directory and a context directory is that the key-phrase directory has no subcontexts, only links. Thus, a bottom-level directory in the context tree hierarchy (a “leaf node” of the context tree) corresponds to a single concept or topic, and all of the (key, URL) pairs in its dictionary pertain to that one topic. The number of distinct URLs present is the number of competing sources of information.
  • Perhaps the simplest means for handling synonyms is to add a key-phrase directory for each alternate phrasing of each topic. Because order is important when traversing a synonym group, the context-tree implementation must provide a means for ordering sub-directories, at least when those sub-directories correspond to key phrases. Alternatively, an ordered link database file may reside in the context directory containing the key phrase, and include all other key phrases in that context as well (including synonyms); the key phrase(s) corresponding to each link may be stored as link properties.
  • In the preferred embodiment, synonyms are not handled as separate key-phrase directories. Instead, a single representative is selected (usually the most descriptive or canonical), and all other equivalent phrasings (synonyms) are listed in a separate synonym file in the key-phrase directory. (Order is carefully preserved.) During browsing, synonyms are displayed at the bottom of the key-phrase page.
  • Context Synonyms
  • The preferred embodiment also supports context synonyms, as opposed to key-phrase synonyms just described. Context synonyms are presently implemented using symbolic links in a UNIX file system implementation of the context tree. As an example, the context hierarchy below illustrates two different paths to the subcontext (directory) “Sound_Synthesis”, where the notation “->” indicates a symbolic link, as is typically done when listing files in a UNIX file system:
  • Humanities
     Music
      Computer_Music
       Sound_Synthesis
    Engineering
     Electrical
      Signal_Processing
       Sound_Synthesis -> /Humanities/Music/Computer_Music/
       Sound_Synthesis
  • In this example, the “true parent” of the node sound_synthesis is Computer_Music, while the parent Signal_Processing is a “linked parent”. There can be any number of linked parents, but only one true parent.
  • Symbolic links provide a means for reaching multidisciplinary fields by browsing the constituent fields in a top-down way. At any time, a symbolic link may be deleted and replaced with a copy of some or all of the directory which was formerly linked (possibly utilizing symbolic links at a lower level). In this way, closely related contexts may start out as identical, but later may evolve into separate collections, as the maintainers see fit.
  • Context Dictionaries
  • The dictionary corresponding to a particular context is defined as the union of all key-phrase dictionaries in that context. In other words, the dictionaries belonging to all key-phrase subcategories in the particular context are logically concatenated together into one large dictionary, with additional dictionary entries added for all synonyms. This dictionary is referred to as the “context dictionary”. The order in which the key phrases are concatenated is prescribed.
  • When installing links in submitted documents, several context dictionaries are normally combined together to form a larger “aggregate dictionary” which is what is used for link installation. A typical aggregate dictionary consists of the context dictionary for the “current context” (established, e.g., by browsing), followed by the context-dictionaries of all subcontexts (usually not in any particular order, unless explicitly listed by the user), then followed by the context dictionary of the “true parent” context, followed by the context dictionary of the true parent's true parent, and so on, until the context dictionary of the top-level directory is appended (which contains extremely generic terms). Linked parent dictionaries may also be added in where desired. Since order is respected during link installation, definitions provided in the “current context” will receive first precedence, followed by definitions occurring in subcontexts (which are considered within the current context), followed by the more generic definitions of parent contexts. Since “the first match wins” in link installation, generic terms defined in parent contexts are “overridden” by more specialized definitions of the same terms of art in the current context. For example, the word “resolution” might be defined at the top level as the first definition appearing in an ordinary dictionary of the English language, while in the context of “ . . . /Signal_Processing/Spectrum_Analysis” it would be given its more arcane definition regarding the resolving power of a short-time Fourier transform.
  • 5.1.2 Browsing the Link Databases
  • There several benefits to providing browsing of the link databases:
      • it provides a unique educational resource which organizes valuable information on the Web in a manner especially well suited for educational purposes;
      • it provides a convenient means for learning what links are available for installation in documents;
      • it provides a convenient means for collecting context dictionaries for subsequent use in automatic link installation. While browsing, links and/or entire context subtrees can be marked for inclusion or exclusion in subsequent automatic link installations;
      • it provides a convenient means for navigating to contexts in which subcontexts and/or key-phrases can be added and/or edited by the user, or to key-phrase directories in which links can be added and/or edited and/or rated; and
      • by displaying links selectively according various link properties, browsing provides a means for viewing useful link subsets, such as all links entered by the user or user's group.
  • Link database browsing support on the server may be implemented in a variety of ways. As an example, there are commercially available scripts which implement directory websites, such as the links-2.0 scripts from Gossamer Threads, Inc., and such scripts can be adapted to implement the hierarchical dictionary of the present invention. FIGS. 3 and 5 illustrate the appearance of such a browsing system. Alternatively, one may use HTML SELECT pop-up menus, which are dynamically generated from the current directory contents. However, for performance reasons, static HTML pages are preferable over dynamic HTML generated by the server, when feasible. To provide more context and ease of navigation, the database directory structure may additionally be displayed in a fixed HTML frame on the left, as is currently done on many websites. For example, the way directory trees are displayed on the left in Microsoft Windows Explorer is a good model.
  • FIG. 3 illustrates a Web page display at the start of browsing. The top-level context is displayed. In this simplified example, only four top-level subcontexts are offered (Computing 131, Education 132, Legal 133, and Music 134). Each of these words is a hypertext link, which can be clicked with the mouse to navigate to the associated sub-context. For example, clicking on Music 134, then on “Computer Music” (which is available in the Music context), then on “Signal Processing”, then finally on “Sound Synthesis” produces the page shown in FIG. 4.
  • 5.1.3 The Standard Browsing Menu
  • Near the top of each page during browsing is a set of hypertext links 130 separated by a vertical bar ‘|’. This is the “standard menu” appearing at the top of every page while browsing the W3K website and at other times as well. Each of these links allows the user to carry out some available function.
  • The “W3K Home” link in the standard menu 130 takes the browser back to the initial W3K home page illustrated in FIG. 1, as does clicking on the W3K logo.
  • The “Browse from Top” link in the standard menu 130 navigates to the top-level browsing page shown in FIG. 3.
  • The “Select Hierarchy” link navigates to a page where a different context hierarchy can be selected for browsing. There is only one primary public context hierarchy (the one reached from the second choice 102 in FIG. 1). However, individual users and groups of users can set up context hierarchies for their own purposes, without having to worry about fitting into the ever-expanding primary public context hierarchy. If well known “language localization” methods are not available, as preferred, to provide alternate language selection for each Web page in the public hierarchy, alternate hierarchies can be used to support alternate languages. Alternate hierarchies can be designated by their creators as public (anyone can add to it), restricted (anyone can read it, but only the owner(s) can write it), or private (only the owner(s) can read or write it). The owners include the creator and members of any groups listed by the creator as being co-owners.
  • The “Install Links” link in the standard menu 130 navigates to the form provided for submitting documents for link installation, which will be described further below.
  • The “Add Subcontext” link navigates to the form provided for creating a new subcontext within the current context. Since FIG. 3 is at the top-level context, this operation is only allowed in a public hierarchy for a select group of “trusted” users.
  • The “Add Key-Phrase” link leads to the form for adding a new key phrase in the current context. At least one definition link is required when adding a new key phrase. At the top level of the public hierarchy, this operation is restricted to trusted users since any key phrases appearing at the top of the hierarchy are “generic terms” having definitions which are independent of context. Truly global key phrases such as domain names and trademarks are appropriate at the top level.
  • The “Add Definition” link is for adding a new definition for a key phrase. This entails supplying a URL which points to information about the URL and some other information, as will be later described. Thus, the number of distinct URLs in the set of URLs associated with a particular key phrase can be increased from 1 (its usual initial value) to any number by adding more definitions. The “Add Synonym” link in the standard menu 130 allows the addition of a key phrase to a list of “synonyms” for an existing key phrase. A synonym can also be constructed for a context. Synonyms will be described further below.
  • “Submit Dictionary File” provides convenient submission of a large number of links (key phrases and definitions) as well as the ability to specify a context path for each one, as will be described. It is additionally possible to display specified contexts and contexts selected for link installation in the form of a dictionary file. For example, a user can perform a search in order to collect all links contributed by that user, display the results as a dictionary file, save the dictionary file on his or her local computer, perform any desired editing operations, and submit the edited dictionary file back to the server to update his or her links on the server.
  • “Modify Additions” allows the user to edit (modify or delete) any information he or she submitted to the W3K site. In particular, it is possible to modify link properties, delete a link, delete a context or key-phrase directory wholly owned by the user or user's group, and so on. A user belonging to one or more groups may edit any information submitted by anyone in any of those groups. A set of records to be edited can be created by means of the search facility. A record may hold the information associated with a link, key-phrase directory, or context directory.
  • “Select Context” selects the “dictionary” associated with the current context for inclusion in subsequent “link installation”. The context dictionary normally includes each key phrase in the current context together with at least one definition for each key phrase. It may also include similar information from parent contexts and subcontexts, as will be discussed. Thus, the aggregate dictionary used in link installation is like a kind of “shopping cart” that can be filled with component dictionaries found while browsing around the context hierarchy; in this analogy, “items to be purchased” correspond to the dictionaries to be used in link installation.
  • It is also possible to assemble various context directories into an aggregate dictionary for link installation without browsing by simply providing a context dictionary list, or by selecting contexts from a number of SELECT menus in HTML listing all available contexts. After the current context is selected, the “Select Context” link changes to “Deselect Context”, so that clicking on it takes the current context out of the aggregate link installation dictionary.
  • Selection configuration information lower in the hierarchy is not modified when excluding a context, so that re-selecting the context allows the contained selection configuration to become active once again. During link-installation (FIG. 7), it is possible to override all such selection information by simply specifying an explicit list 75 of context dictionaries, or selecting “All W3K contexts” in the form entry for contexts 179.
  • Browser “cookies” are very useful for storing the context search preferences for the user across sessions; since many tend to work in one or a few fields, it is often the case that the contexts used for link installation do not change very often. Browser cookies are simply information stored on the user's computer (the client computer) by the server; cookie files are supported by the major Web browsers such as Netscape Navigator and Microsoft Internet Explorer. If cookie files are not available for any reason (they can be disabled by the user), preference information can be stored on the server indexed by the user's email address, which is unique among users.
  • “Browse All Selected” places hierarchy browsing in a special mode in which only the currently Selected contexts and links are visible. This can also be reversed so that only deselected contexts are visible. (Sometimes it is helpful to go back and forth.) This feature can help the user more quickly review what link databases (“key-phrases” and “definitions”) have been selected for link installation.
  • “Edit All Selected” is similar to “Modify Additions” except that instead of determining the list of database elements to be edited by using a search (or direct specification), it is initialized from the set of selected links owned by the user and/or groups to which the user belongs.
  • “What's New” creates a list of all contexts, key phrases, or definitions which have been added recently to the system.
  • “What's Cool” creates a list of all contexts, key phrases, or definitions which have been receiving relatively high traffic (“hits”) recently.
  • “Top Rated” creates a list of highest ranked links in the database. These are generally excellent home pages, tutorials and the like on various topics.
  • “Email Updates” allows the user to subscribe to the W3K newsletter.
  • “Random Link” takes the user to a randomly chosen definition link.
  • “Search” supports general search for information within the current context and beyond.
  • 5.1.4 Context Path Display
  • While browsing, the “context path” 140 (FIG. 4) is displayed just below the standard menu 130, with each path element separated by a colon ‘:’. In FIG. 4, for example, the context path is displayed as “Top Music: Computer Music: Signal Processing: Sound Synthesis.” Clicking on the “Lagrange Interpolation” key-phrase 144 in this context takes the browser to the definition page for Lagrange Interpolation shown in FIG. 5.
  • 5.1.5 Search Form
  • Below the horizontal line in FIG. 3 is a search form. Typing text into the field and clicking on the “Search!” button results in a dynamically generated web page listing all links (in all contexts) matching the search criteria. More refined searches can be carried out by first selecting the “More search options” link. Since links have quite a few properties (to be discussed), searches can be honed rather finely without relying entirely on typical means for selecting a subset of all names and phrases within contexts, key-phrases, and definitions.
  • 5.1.6 Topics Under a Context
  • FIG. 4 displays the contents of the context-path
      • /Music/Computer_Music/Signal_Processing/Sound_Synthesis.
  • We see that the “Sound_Synthesis” context contains two subcontexts “Acoustic Instruments” 148 and “Vintage Methods” 149.
  • In addition to subcontexts, there is a list labeled “Words and phrases defined in context Sound Synthesis” 141. (For greater convenience when browsing contexts, browsing can be configured to show only a single link to the key-phrase list on a separate page.) The phrases listed include “Commuted Synthesis” 142, “Physical Modeling” 145, and “Lagrange Interpolation” 144.
  • Technically, as far as the browsing function is concerned, “words and phrases” (key phrases) are similar to “subcontexts”. However, key phrases are browser categories with no subcategories, only links, while contexts are browser categories containing subcategories (either subcontexts or key phrases). The links under a key-phrase are treated as “competing definitions” for that key phrase.
  • FIG. 8 illustrates the relationships among contexts, key phrases, and definitions. The top level context 180 is the root node of the tree structure defined by the hierarchical link database. There can be any number of subcontexts or key phrases under the top level context 180. In the example of FIG. 8, there are two subcontexts, “Intermediate Context 1181 and “Intermediate Context 2182. Since these are context directories, they each may contain any number of subcontexts and/or key phrases. In the present example, there are two key phrases 183 and 184 in the first subcontext 181 and one synonym group 185 (two equivalent key phrases) in the second context 182. A key phrase must have at least one definition (link) associated with it. In the present example, “Key Phrase 1183 contains three competing definitions 186, “Key Phrase 2184 contains four competing definitions 187, while the synonym group 185 consisting of “Key Phrase 3” and “Key Phrase 4” contains two definitions 188 to choose from for that synonym group. Since order is important, we may choose a consistent ordering convention for tree diagrams in which the ordering of all subnodes of a node is defined as left to right in a diagram as in FIG. 8.
  • Note also in FIG. 4 that the “Sound Synthesis” 141 context includes one synonym 143. This is a context synonym identified by the path
      • Engineering: Signal Processing: Sound Synthesis
        which can be thought of as a different context path to the same place. A context synonym can be thought of as a “symbolic link,” in the sense of a UNIX file system, from one “context directory” to another. It is often appropriate for multidisciplinary fields, such as the field of sound synthesis, which belong as a subcontext of more than one high-level context. In link installation, context synonyms can provide what is analogous in computer science as “multiple inheritance”, i.e., the dictionaries of multiple parents (“Music” and “Engineering” in this example) can optionally be included automatically in the formation of the aggregate dictionary for link installation, while only the one main context (“Sound Synthesis” in this example) has to be selected for link installation.
  • To illustrate a “context synonym in FIG. 8, we could add a third subcontext box under the “Top Level Context” box 180 entitled “Intermediate Context 3” which could have a different kind of border to indicate that it is a symbolic link to some other context. We could then draw an arrow from the “Intermediate Context 3” box to its equivalent, such as either “Intermediate Context 1181 or “Intermediate Context 2182.
  • FIG. 5 shows a display of two “competing definitions” for the phrase “Lagrange Interpolation” 151. Either of the two links 152 or 153 may be installed in a document containing the phrase “Lagrange Interpolation”. They are both named “Lagrange_Interpolation” because that happens to be the title of both Web documents. However, the links point to two different targets on the Web written by two different authors.
  • The “new” superscript after a link 152 or 153 means it was added relatively recently. In this example, both links for “Lagrange Interpolation” were added on the same day.
  • The “popular” superscipt for a link 152 or 153 means it has been receiving relatively frequent visits (or “hits”) via the W3K site. The number of hits displayed in this case is 0.
  • Also displayed in FIG. 5 for each link 152 or 153 is the date 155 the link was submitted, the number of hits 156 (number of times anyone has clicked on the link at this site), a rating 157 for each link (which is 0 since the links were just added), and the number of votes included in each rating (also 0 at the moment). Available elsewhere on the website also is the number of times a link has been installed in Web documents. Finally, there are three links 154 for rating each link (assigning a quality score from 1 to 10 and optionally submitting a more detailed written review), reading the reviews written by others, and viewing all of the link's properties in tabular form. After the rating display is a hyperlink which a user can select in order to contribute a rating or a review of the link.
  • This completes a first-pass overview of the main pages and selections seen by the user while browsing the link databases. Functions available while browsing will be described further in the following sections.
  • 5.1.7 Adding or Modifying Definitions or Categories
  • In FIG. 5, the “Add a Definition” link 158 navigates to the form shown in FIG. 6 for adding another definition link for Lagrange Interpolation. The current key phrase “Lagrange Interpolation” is filled into the “Topic” field 161, and the context path leading to the key phrase is filled into the “Context” field 160. This makes it convenient to enter a new source of information (definition) on a topic (key phrase) while browsing.
  • When “Add a Category” or “Add a Key Phrase” is selected from the top-level context (or “Add . . . ” is selected on the main website home page), the “current context” field of the form becomes instead a pop-up HTML “select” list containing all of the contexts presently in the database, making it convenient to quickly select any context in which a new subcontext or key-phrase is to be added.
  • The only required fields on the add-definition form (FIG. 6) are the URL 162 and user's email address 169. All others are optional.
  • The URL is the new definition, and it is tested by the server to make sure it is responding. If the Site Title field 163 was left blank, the title of the Web page addressed by the URL, which is automatically retrieved by the server (using the Perl LWP module), is filled in automatically as the link title.
  • The contributor's email address is required because all submissions to the server in the preferred embodiment are associated with the contributor's email address. However, there are alternative means for identifying users known in the art, such as a more conventional registration procedure in which the user chooses a login name and password. The preferred embodiment ensures that the email address given really reaches the user. If the user is new, an authorization process, described in §5.1.9, is initiated which tests the user's email address.
  • While not required, the link contributor is invited to write a short description 164 of the website, specify the minimum 165 and maximum 166 educational level covered at the site (usually done by the author of the site), and specify the type of resource 167 (home page, conference paper, book chapter, or the like). The user may also type in his or her name 168.
  • Fields such as educational level 165 that are potentially confusing tend to have a “Help” link 62 next to them. For example, the educational level help 62 explains that the numerical value is in units (loosely) of “years of education likely required to understand the material”. A minimum level with no maximum level corresponds to setting one level rather than a range of levels. When no educational level at all is provided with the definition, the link server will attempt to compute it automatically based on the level of the links it contains, as will be described. In a script-based submission, finer control is possible using additional level-related properties.
  • Things like “educational level” and “resource type” are examples of link properties. The context path leading to a link is also one of its properties, as is its URL, title, description, and so on. A link can have more properties than these, some of which will be described below. The “Specify Additional Properties” link 61 takes the user to a larger form where the additional properties can be specified.
  • When the user is satisfied with the filled in definition-submission form, the submit button 60 can be pressed to send the form to the link database server (a computer at w3k.org in this case). At that point, the server tests the URL by retrieving the first page, checks that the user's email address is known and that the user's IP address and cookie information match information previously stored on the server (otherwise authorization is carried out), checks for duplication of the key phrase and URL in the given context, possibly checks the URL target for “inappropriate content”, assigns an automatic educational level if none was provided (unless automatic level assignment is already scheduled at regular intervals), and adds the new definition to the link database for the current key phrase (and context path, if the database file holds links for multiple key-phrase contexts). If the addition was successful, the user is navigated to a dynamically generated Web page summarizing the information added to the database. If there were any problems, an error page is generated listing the reason(s) for failure to accept the page.
  • A far quicker means of entering definitions is by means of dictionary file submission which can be regarded as a script-based replacement of the above browser-based interface. An example of such a dictionary file is given in a later section. The form for submitting such a file may be reached via the “Submit Dictionary File” link in the standard menu, or as an option under the “Add to or Edit the W3K” option on the server home page.
  • 5.1.8 Private Context Trees
  • As mentioned when describing the standard menu 130, known users may optionally create a new top-level context tree which is private to that user or to one or more groups identified by the user. This mode of usage is advantageous for private usage without incurring collisions with links in the main “global” context tree shown in FIG. 3. It is further the only way a known user can submit large quantities of contexts, key phrases, and links by means of a dictionary file submission, since that operation is not permitted in the global public context hierarchy. Further details will be described.
  • 5.1.9 User Authorization
  • Whenever a user requests an operation on the server requiring information to be stored on the server (any “editing operation”), the user must be “known.” Being known means the email address of the user has been given by the user to the server, and the email address has been verified by the server to work (reach the user). When an editing operation of any kind is requested (including the simplest form of link submission, or even a link rating from 1 to 10), if the user is not yet known, an “authorization process” is carried out as a preliminary step in the desired editing operation.
  • In the authorization process, the user submits his or her email address in a simple Web-page form, and the server (1) emails a randomly generated ASCII string to that email address, and (2) navigates the user to a Web page containing a form for receiving that random string from the user. The form also instructs the user to receive the email and to paste the random string into the second authorization form and submit it. This process verifies that the email address in fact reaches the user.
  • The email address and IP address of the user are then saved on the server. Additionally, the same information is written on the user's computer using a browser cookie. If the cookie goes away for any reason, or if the user later comes in from a different IP address for which authorization has never occurred (e.g., due receiving a new dynamically assigned IP address from an ISP, or using for the first time a different home computer connected directly to the Internet), authorization is triggered once again when any editing operation is requested. Users coming in over dynamically assigned IP address generally have to be authorized for each session until all such IP addresses have been seen and logged on the server along with the user's email address.
  • After a successful authorization, the user may use the “Back” button in his or her Web browser to find the page which triggered the authorization process, and resubmit the form successfully.
  • 5.1.10 Link Properties
  • Many other properties can be specified for a link besides the URL 162 and email address 169. One of the most important properties, brought out in the main form, is educational level 165. Both a minimum 165 and maximum level 166 can be set. When the link-target document is written at a single well-defined educational level, such as “10th grade”, the min and max can be set to the same value (such as 10), or the max can be left unset (which defaults to level 100, meaning no maximum). When the document spans a wide range of educational levels, such as a well designed “topic home page” might do, the min and max can be set appropriately to cover the estimated range. The minimum level still sets the official “level” used in automatic level assignment for other documents, but the maximum level, if specified, may affect link installation when a specific level range is specified for that. An educational level is implemented a floating-point number, so that a level of 10.5 can be specified, e.g., in the form 165 or 166.
  • Another important link property, also on the main form, is resource type 167. Resource types include dictionary definition, encyclopedia article, unpublished article, conference paper, talk overheads, refereed journal article, book chapter, book, tutorial, lecture notes, course readers, and the like. Sometimes authors may wish to screen out non-refereed sources such as conference papers or unpublished works. Of course, refereed publications and books will typically be hosted on the website of a publisher, requiring some form of payment for access, such as a site subscription or, preferably, a per-page “micropayment” such as the well known Millicent system provides.
  • Additional optional properties may be specified on a second form by selecting the “Specify Additional Properties” link 61. Additional properties include source type (individual, educational institution, company, non-profit organization, etc.), geographical location, language (English is assumed by default), “viewer suitability” analogous to ‘PG-13’, ‘R’, etc., for movies, a list of groups to be granted editing access, and so on.
  • Link properties added automatically by the server when installing a link in a database include a unique integer ID, the email address and IP address of the link contributor, the date of submission, an initial rating of zero, an initial zero number of “hits”, an initial zero number of “installs” in documents, and the like.
  • Link properties make it convenient to specify “virtual link database directories” which include only the links satisfying certain criteria specifiable in terms of link properties. For example, a user may ask to see only tutorials and books in a certain educational level range. Alternatively, an author may specify seeing only links belonging to that author's email address, or group. Thus, properties enable selective browsing (or listing) as well as more selective link installation. Such selective browsing may be specified using the Search feature on the site home page 105, standard menu 130, or at the bottom of any Web page seen while browsing.
  • Link properties may also be usefully included in installed links (within HTML “comments” or in specially defined XML tags) when indirect links are being installed (that is, when the installed link points to a centralized link server which forwards the user's browser to the ultimate destination). Installed link properties may be interpreted by the link server to provide additional control over link behavior. For example, a teacher using Internet documents for a 9th grade class could configure the link server to suppress all links having an educational level greater than 10. That way, when educational level properties are available for all links, as the present invention provides, documents may be populated with hyperlinks which can be configured not to refer a student to information at a more advanced level than the teacher desires. The teacher may further suppress any links with a viewer suitability rating below a certain value. In summary, installed link properties enable dynamically configurable link behavior based on link property values.
  • In another use of installed link properties (which requires either browser support and/or local editing of the HTML containing the installed links), link properties can be associated with “classes” in “cascading style sheets” (an add-on to HTML) in order to display links to dictionary definitions in one manner, encyclopedia articles in another manner, and home pages in another, etc.
  • 5.1.11 Restricted Directories
  • When a subcontext is created, it can be marked as “restricted” to the owner (creating user) or to groups specified by the owner. Restricting a directory prevents anyone but the owner or specified groups from modifying the subdirectory. The restricted directory can optionally be made “invisible” to users other than those having modification rights, in which case the restricted directory is said to be “private”. An unrestricted directory is said to be “public”. A restricted directory can be deleted or renamed or otherwise reorganized no matter what it contains. Typical uses of restricted directories include
      • Retaining the ability delete the entire directory and rebuild it with a dictionary file submission.
      • Supporting a private dictionary corresponding to a particular project, such as a book, in which it is desired to have complete control over all links used in link installation.
  • The name of a restricted directory has the name of its first group (or owner, if no access groups are defined) automatically appended as a suffix to the name chosen by the owner in order to prevent conflicts with public directories and other restricted directories on the same topic. With this convention, any number of users may have restricted subdirectories on the same topic. For example, in the subdirectory “/Music/Computer_Music/Synthesis/” there could be
  • Commuted_Synthesis_by_mak@vipunen.hut.fi/
    Commuted_Synthesis_by_jos@ccrma.stanford.edu/
  • In this way, any number of experts may provide their own “packages” of links on the same topic.
  • A known user may even create a new top-level hierarchy which may be designated public, restricted, or private. User- or group-owned hierarchies of this nature which lie outside the primary public hierarchy may be placed in a special standard menu item entitled “Alternate Universes”, e.g., to indicate that they are not a part of the primary public context hierarchy.
  • 5.1.12 Link Ratings and Reviews
  • When browsing reaches a key-phrase directory, as shown in FIG. 5, following each competing definition 152 or 153 is the hyperlink “Rate It” which navigates to a form where that definition (link) can be rated on a scale from 1 to 10, and/or a written review about that link can be submitted. If the user is not known, an attempt to submit a rating or review routes the user to the authorization page, and after a successful authorization, the rating or review is accepted by the server.
  • All ratings and reviews are stored on the server along with the email address (and IP address) of the contributor. Only one rating and review are allowed per item per email address, but the user owning the rating or review can modify either at any time. Certain “trusted” users, such as website editors or expert consultants enlisted to help with ratings and reviews, may be given higher weighting in the ratings, and the reviews may be organized by editors according to their quality. Otherwise, the rating system is straightforward and similar in functionality to the five-star rating and review system used at <<<http://www.amazon.com>>> for books.
  • 5.1.13 Link Installation
  • A primary function of the invention is to facilitate the installation of hyperlinks in documents intended for the World Wide Web. This section provides a detailed description of link installation in the preferred embodiment.
  • Installed-Link Types
  • There are at least four alternative ways to install a link in a document.
  • In the first mode, a hypertext link is installed directly to the top-ranked source of information on the topic identified by the matching key phrase in the user's submitted text. This is the first choice presented in the “Link Type” radio-button-group 177 of the default link installation form (FIG. 7). A disadvantage of this approach is that links often become “stale” due to changing ISPs, changing filenames, etc., requiring the links to be re-installed from time to time. (The link installation server preferably tests all links in its databases periodically and eliminates them if they are unavailable for a prolonged period of time such as more than a week. When all links containing a bad URL are automatically removed from the databases, all owners of the links are notified automatically by email and invited to submit an updated version of the link(s).)
  • The second approach is to install an indirect link which links via a centralized server (such as a website providing the link installation service). This choice is provided by the second radio button in the “Link Type” portion 177 of the default link installation form. Such an intermediate website acts as a so-called “proxy server” for the link. Indirect links may always point to the most up-to-date, top-ranked source of information on any given topic. An example URL syntax for this mode of operation is
      • <<<http://www.w3k.org/jump.cgi?ID=35>>>
        where it is assumed that each link has a unique integer identifier on the proxy server, and jump.cgi is a CGI script which is passed the identifier as if it were a form submission in which the form contained a field named “ID” with the value 35. To avoid having to assign unique identifiers across all contexts, the context path can be included in the URL, e.g.,
      • <<<http://www.w3k.org/jump.cgi?ID=3&PATH=Engineering+Signal_Processing>>>
  • Context paths can similarly be assigned integer IDs in order to shorten indirect URLs.
  • A third approach is to insert a link to the “key-phrase page” itself at the centralized server (the page on the server listing all “competing definitions” for that key phrase). This is the third and final choice in the “Link Type” radio group 177. In this case, an end user following such an installed link will see all competing definitions, in ranked order, instead of only one. The end user can then request that the definitions be reorganized according to various criteria such as educational level, document size, type of resource (article, book, etc.), type of source (.edu, .org, .com, etc., individuals, etc.), and so on, by making requests of the server interactively, or by means of preferences registered with the server.
  • A refinement of the third approach is to build or generate a more helpful “key-phrase home page” on the link server. This page could provide, for example, a brief definition, followed by an organized presentation of all available sources of information, organized by type and ranked according to quality in each case. In this format, the casual user may be satisfied with a mere dictionary-style definition, while the serious scholar can more readily pursue a wider variety of sources beyond merely the top-ranked source. Providing interactive reorganization of the definition page according to end user preferences is preferable in this case as well.
  • A fourth approach is to use JavaScript features to install a snapshot of the key-phrase home page at the time of link installation. In this approach, a JavaScript pop-up menu may hold a list of all competing links for the linked topic.
  • Example Key-Phrase Home Page Format
  • Below is an example of how a very simple “key-phrase home page” might be laid out:
  • TABLE 1
    Key Phrase: Dictionary-style definition
    Link to highest-rated online encyclopedia-style article
    Link to highest-rated online tutorial, if available
    Link to highest-rated textbook covering this topic, if any
    Link to educational resources (online courses, degree programs,
    etc.)
    Highest rated related links (“See also” type information)
    Rank-ordered list of encyclopedia-style links
    Rank-ordered list of online tutorials
    Rank-ordered list of other online information
    Rank-ordered list of contributed links of unknown type
    . . .
    Last unrated contributed link of unknown type
  • The link database server preferably provides periodic link testing, average ratings computation, link reordering, automatic educational level assignment, and so on. It is also straightforward for the server to format the key-phrase home page dynamically according to user preferences based on link properties and other criteria. For full generality, it is desirable to customize and differentiate key-phrase home pages on the basis of language, educational level, and other properties. (They are already segregated according to context by the context hierarchy in which they reside.) To address the potential enormity of this task, a mechanism for allowing known and trusted users to submit key-phrase home pages for installation on the server can be provided. For this purpose, the server can provide a template document containing variables that are filled in by the server, in a manner often found in website construction tools.
  • Link Color
  • While copious linking makes a set of documents very convenient to navigate among, the high density of links can be distracting to the eye. For this reason, the link installation submission form provides a checkbox for requesting that the hypertext links be set to the same color as the surrounding text. This leaves only an underline to indicate each link. Presumably, future versions of HTML and browsers will allow finer control over the display modes of links, and it may in some cases be possible to offer turning off all visual indications that a link is a link. This is because when links are installed at very high density, such as this invention makes possible, the reader can assume that essentially all nontrivial words are linked. Links become the rule rather than the exception for all “uncommon” words in a document.
  • Avoiding Installed Links Altogether
  • In an alternate mode of usage, any word or phrase can be selected in text displayed by the user's browser and “looked up” at a server website containing the link databases. A similar mechanism is currently available in Microsoft Internet Explorer 5: The right-click menu contains an entry “See more with Lycos!” which, when selected, causes the selected phrase (or word last clicked with the mouse) to be looked up in the search engine at the Lycos website (<<<http://www.lycos.com>>>).
  • In the case of the present invention, in which the database server may act in place of the Lycos website, if the word or phrase is found in the link database, the user may be taken to the page of “competing definitions” (all links) for that topic. If the topic is available in multiple contexts, a list of all distinct contexts can be first displayed, so that the user can select which one he or she had in mind, and then be taken to the definition page in the selected context. If the term is not in the link database but coincides with a context directory name, that directory can be displayed by the browser. As a last alternative, the unrecognized phrase may be forwarded to an ordinary online dictionary (for single words), encyclopedia, or Internet search engine. The link-free look-up mode described in the previous paragraphs can be supported in any number of applications, not just Web browsers. For example, the word processor Microsoft Word already supports looking up an ordinary dictionary definition of a word by selecting the word and choosing the “Define” item in the right-click pop-up menu. Another item in that menu could be “Look it up at the W3K”, for example. A link-free look-up service of this nature could be provided in any application which displays text and supports text selection by the end user. The service can be provided either over an Internet connection as described above, or, in the absence of an Internet connection (or supplementary to it), using the single-computer embodiment of the present invention described in §5.2.
  • In the preferred embodiment, end users of the link-free lookup service may optionally register with the database server in order to specify preferences such as whether a key-phrase lookup (sans link) should navigate to the key-phrase home page or more directly to the currently highest ranked definition for that key phrase. The user may also inform the server of his or her educational level, desired viewer suitability range, and the like.
  • To support link-free lookup mode, the database server may accept a URL containing a “virtual form submission” of a link-free lookup form. As a simple example, a lookup request for the phrase “Hubble constant” could be sent to the database server by “navigating” to the URL
      • <<<http://www.w3k.org/linkfreelookup.cgi?TEXT=Hubble+constant>>>
  • The CGI script linkfreelookup.cgi runs and may immediately issue a “navigation” output to the highest ranked link matching “Hubble constant”, if any. The URL may also include a user name. If user preferences exist, the script may alternatively navigate to a key-phrase page of competing definitions for the Hubble constant, and so on. Additionally, any number link properties may be specified in the URL as well.
  • Link Installation Form Operation
  • FIG. 7 shows the default web page for submitting documents to have hypertext links installed by the server. The user pastes text to be “linkified” directly into the “Text or URL” textfield 170. In this example, a URL 77 has been specified, indicating that an entire website is being submitted for link installation, as will be described further below.
  • Three input submission formats may be specified by the “Input” radio-button group 171: HTML, Plain ASCII, and LaTeX source. In addition, there is a “Help” link 174 which navigates the user to documentation on the relevant considerations for each choice.
  • In the example of FIG. 7, submission of HTML format is selected in the input-format radio group 171. In the case of “plain ASCII” submission, the output is also normally received in HTML format; this facilitates fast construction of Web pages from simple ASCII text files. It also can be used to quickly obtain a browsable Web directory from a list of keywords generated by other means. Since some HTML editors support “drag and drop” link installation from another document, an automatically generated list of HTML links can be very useful even for manual link entry in an HTML editor.
  • In the case of LaTeX source format, links are installed in the form of an invocation of the macro \ htmladdnormallink{text} {target}, which is defined in the widely used html.sty LaTeX style file.
  • When the input format is HTML, it is parsed to prevent accidental replacement of HTML tag data with links. In particular, it is important not to install links within the anchor text of existing links. HTML parsing can be accomplished using the HTML Perl package (see, for example, page 716 of the Pen Cookbook by T. Christiansen and N. Torkington, O'Reilly, 1998).
  • In a similar manner, LaTeX directives are avoided in the text matching algorithm within LaTeX source. (Perl for LaTeX parsing is available in the latex2html Perl script, freely available at <<<http://ctan.tug.org/ctan/>>>.)
  • Linking is preferably suppressed when the recognized phrase coincides with the name of the current section or document, i.e., a phrase that results in a link to the current page.
  • When “Link only the first occurrence . . . ” is selected in the first half 175 “Occurrences” section of the link-installation submission form (FIG. 7), only the first occurrence of the phrase is linked each page (HTML) or section (LaTeX). Otherwise all occurrences are linked.
  • A second pair of radio buttons 176 exists for specifying that links be installed for either all emphasized words or phrases, or only emphasized words or phrases. Emphasized occurrences may appears as “\emph{ . . . }” in LaTeX and as <I> . . . </I> or <B> . . . </B> in HTML.
  • The two radio-button- pairs 175 and 176 can be considered to specify “two bits” which select among the following cases:
  • TABLE 2
    00 link all occurrence of a key phrase, whether emphasized
    or not;
    01 link all occurrences of a key phrase, but only when
    emphasized;
    10 link the first occurrence of a key phrase in each page
    (whether emphasized or not), and all emphasized
    occurrences; and
    11 link only the first emphasized occurrence of a key
    phrase.
  • As a further special case, any URLs found as plain text in the source are by default converted to links that display their own URLs as anchor text. Many email programs and word processors presently perform this transformation on URLs detected as plain text in received email.
  • The “Link Type” radio button group 177 selects among three of the basic installed link types discussed in the first subsection of this section.
  • The “Link Color” select pop-up list 178 provides for link color selection as discussed above. In addition to the standard color names, there is a “take default” selection which does not specify the link color, thereby leaving it to the HTML cascading style sheet or user's browser to choose link color.
  • The “Contexts” radio group 179 provides some high-level choices of context selection for link installation. The first choice, “All W3K contexts” corresponds to combining all context dictionaries in the entire context hierarchy. As the context hierarchy grows, this can become a computationally expensive option, even when the aggregate dictionary is maintained as an existing file at all times. When a “current context” exists (as result of browsing or user preferences), it and its extensions are preferably listed first in the aggregate dictionary, as will be clarified further below.
  • The second radio button in the “Contexts” radio group 179 selects only the “current context” (/Music/Computer_Music). The current context is normally established by browsing or by standing user preferences. (When “Install Links” is selected in the standard menu 130 while browsing, the last context displayed in the browser becomes the default current context.) A browser cookie is preferably used to remember the most recent “current context” for each user across sessions.
  • Installing links from only the current context is not as narrow as it may seem at first since normally the context dictionaries for /Music and ‘/’ (the top-level generic dictionary) are included, as well as all subcontexts of Computer_Music. The two “Extensions” checkboxes 70 provide all-or-nothing control over appending parent and subcontext dictionaries to the current-context dictionary. Additionally, if the subcontext /Music/Computer_Music/Sound_Synthesis is a synonym for /Engineering/Signal_Processing/Sound_Synthesis, say, and if “multiple inheritance” is enabled at all subcontext hierarchy levels (an advanced link installation option), then the context dictionary for all of Signal_Processing and /Engineering would be folded in, at a lower precedence level, of course, since they are listed after all subcontexts of /Music. In summary, the aggregate dictionary list built for link installation by the server can be rather large even when only the current context is selected for link installation.
  • The third and final option in the “Contexts” radio group 179 is to provide an explicit list of context dictionaries. A list of context dictionaries can be accumulated via browsing in the manner described above, or a list can be submitted dictionary-file format. Additional “virtual context dictionaries” may be defined by means of the Search function, with the search results forming a link subset which can be assigned a name and treated as a dictionary. It is preferable to offer convenient hierarchical browsing of the selected portion of context hierarchy represented by the dictionary list. Any search result may also be displayed as a dictionary file. Dictionary files are discussed more detail below.
  • Dictionary lists may be stored on the server in a directory devoted to each user or in a file with user's email address forming part of the filename, as shown in the example of FIG. 7. They may also be stored on the user's computer via browser cookies.
  • The “Min Level” 71 and “Max Level” 72 pop-up lists allow specification of a range of educational levels for link installation.
  • While any number of properties may be associated with links, the top-level default submission form of FIG. 7 for link installation invites link selection according to only a few properties such as context 179 (determined by dictionary selections) and educational level 71,72. Installation specifications based on additional properties may be obtained by following the “Specify Additional Properties” link 174 and filling out a larger form allowing specification according to more criteria, using well known principles of database subset selection according record properties.
  • When the user presses the “Submit” button 173 (or the submit button of a long-form submission form), the server receives the filled-out form specifying how links are to be installed, processes the submitted text in a CGI Perl script or other server-side software to install the links, and generates output consisting of the user's submitted text with all the new links embedded.
  • Link Installation on the Server
  • Actual link installation from an aggregate dictionary by the server, while one of the more complex and resource-demanding operations, is based on well known database technology and methods in computer science for string search and replacement. The Perl language is well suited for this task.
  • In the preferred embodiment, an aggregate dictionary file is prepared on the server based on the user's link-installation specifications and the current contents of the server's link database. This dictionary file is then “applied” to the user's submitted documents in order to replace key phrases by hypertext links. A Perl script illustrating link installation for HTML files is included in Appendix A.
  • The preferable details of the methods used depend on the relative sizes of the files involved. For example, if stringent conditions are specified on link properties for installation, and if a large file is submitted from the user, it may be the case that the aggregate link dictionary is much smaller than the combined size of the files submitted for link installation. In this case, it may be fastest to search the submitted file for each link in the aggregate dictionary.
  • If, on the other hand, the number of eligible links is large (e.g., “All W3K contexts” was selected in the Contexts section 179 of the link installation form), and if the submission itself is small, it may be preferable to search the aggregate dictionary file for each possible key phrase in the input file using well known “incremental search” techniques.
  • In either case, if the user has specified that only emphasized words or phrases are to be linked, then all phrase boundaries are known, and this can be used to greatly reduce the computational burden of the string-matching task.
  • Single-Page Submission
  • For single-page text submitted using the HTML form of FIG. 7, the output HTML may be returned to the user in the form of a “dynamic Web page.” That is, the user's browser immediately “navigates” to the automatically generated HTML page as if it were already somewhere on the Web. At that point, the user can select “Save As” in the Web browser in order to save the HTML in a local file, or “View Source” can be selected in the browser to enable copy/pasting of the generated HTML into a text editor for further editing.
  • Submission of an Entire Website
  • In an alternative mode of submission, shown in FIG. 7, the user specifies a URL pointing to the submitted document in place of the text of the submission itself. This mode of submission is more convenient for linking entire websites. In a typical configuration, the server processes the submitted file and all files reachable from the first via hyperlinks, provided that the reachable files reside somewhere on the same website (as defined by its URL). In other words, links are followed provided the first portion of the URL matches that of the submitted URL in its entirety.
  • In the case of URL submission, the processed document is not returned as dynamic HTML, but rather as a hyperlink to a single binary output file on the server containing all the processed files. This output file may be created by combining all processed files into one using the freely available tar program, and further compressed using the freely available gzip program. The tar and gzip programs are available from the GNU Free Software Foundation (<<<http://www.gnu.org/>>>). The output file can then be “downloaded” to the client computer by clicking on the hyperlink pointing to the output file in the dynamically generated HTML. The user then unpacks the file on his or her local computer using, e.g., gunzip and tar, or the shareware program winzip. As a third alternative, preferred for large submissions or over slow internet connections, the user may specify the URL of a single composite file in “tarred and compressed” format, i.e., created using gnutar and gzip in the same way that the server's output is prepared in the case of multi-file submissions.
  • The filename extension is used by the link installation server to distinguish between pointers to websites (.html or no extension, indicating a directory) and compressed tar files (.tgz or .tar.gz). If the text appearing in the “Text or URL” textfield of the link-installation submission form starts with “http:”, “ftp:”, or “gopher:”, a URL is assumed.
  • The following sections will describe further details of the operations indicated above.
  • 5.1.14 Dictionary Search Order
  • Link installation usually occurs within a “current context” or a list of contexts. In the example described above, the current context may be set according to the location of the browser when “Install Links” was selected by the user. Alternatively, one or more contexts may be set explicitly in a dictionary list provided by the user when filling in the link installation form of FIG. 7.
  • In the simplest mode, the current context dictionary is searched first for matches in the user-supplied text, and matches are transformed into links. The process is “idempotent” since matches will not occur within the link syntax itself (such as in HTML anchor specifications or LaTeX macro arguments). As a result, dictionary entries are ordered from longest to shortest phrasings, as discussed above.
  • As described above, the dictionary for the current context is optionally augmented by the union of all lower-level dictionaries within that context. Current-level definitions take precedence over lower-level definitions in any key-phrase collisions. Collisions among lower level dictionaries are not explicitly arbitrated (since that could be accomplished by listing them explicitly), so that the first occurrence of a lower-level definition will take precedence (when not defined at the main level). This follows simply from the convention that “the first match wins”.
  • The purpose of adding in all lower level directories is to provide a reasonably complete dictionary at a high-level node without having to duplicate definitions from lower-level contexts. In principle, such duplication could be avoided by moving all lower-level definitions to the highest possible context. As a simple example, the term “idempotent” is a math term used in many technical fields, and it is not an English-language term (according to the Funk & Wagnalls Standard Desk Dictionary). Therefore, “idempotent” can be defined without conflict in the top-level dictionary for the English language. In practice, however, it works out better to define terms in their “most natural” subcontext, and let their definitions “float up” as far as they can go without collision. Positioning a term within its “most appropriate” context makes the hierarchical dictionary better organized and instructive when browsing.
  • When an undesired definition is encountered, it can be “fixed” (the first time) by defining the term in the current context, since that will take precedence over all subcontexts and parent contexts. A conflict cannot occur in the current context (in principle) because a context is by definition a name space in which every term has a unique definition. Another solution is to list a specific ordering of lower-level dictionaries so that the first match is the desired one.
  • After the current-level dictionary is “applied”, including all subcontexts, the parent node is normally next in the aggregate dictionary. It is searched for further matches, so that more general terms in the higher context not “overridden” by the lower contexts will be linked to their definitions. This process continues until the top-level context node is reached in the aggregate dictionary.
  • Note that it is not necessary to create an explicit aggregate dictionary. It is equivalent to instead apply context dictionaries sequentially in the proper order.
  • As mentioned above, a list of context dictionaries may be specified explicitly in a variety of ways. This is analogous to specifying multiple libraries when linking a computer program. The order of specification is important since the first match is taken. This feature may be used by specifying ancillary fields after the main field of the author. For example, a physics professor might include certain math contexts after the appropriate context(s) within the field of physics.
  • 5.1.15 Maximizing Match Length in Key Phrase String Matching
  • As discussed above, there may be several forms of a key phrase (“synonyms”) corresponding to the same URL. It is normally preferable to match the longest form present in the text so as to avoid multiple generic matches such as
      • Taylor Series Expansion
        when there exists a longer match
      • Taylor Series Expansion
        having a completely different meaning. Maximal-length matching is implemented in the preferred embodiment by maintaining the key phrases in order of longest to shortest and then traversing the dictionary in the prescribed order.
    5.1.16 Contributing Links
      • Link submission support on the server enables all users to assist in the expansion of the “knowledge tree” represented by the link database dictionaries, and enables individual users to augment the link installation system to meet their special needs.
  • For example, a known user can contribute his or her own link database, select only it for search during link installation, and thereby obtain full control over the links which may be installed.
  • A personal link database can be very useful to the author of a book typeset in LaTeX, for example. Since LaTeX supports the generation of an index file, and since the freely available latex2html Perl script will convert a book index into an HTML page, such an index can be easily and automatically be converted (e.g., in the Emacs text editor) to a dictionary file format acceptable by the server. The entire book can then be processed by the server to install links pointing somewhere into the book for every occurrence of an indexed word in the book. Other links can of course also be included.
  • Another application of LaTeX index files is to merge the indexes of related books in order to generate a link database for a particular “field,” spanning a specific set of resources.
  • Only known users can submit links and/or create subcontexts or key phrases. All submissions are “owned” by the submitting email address or groups defined by the submitting user. (Email addresses are verified by the authorization process described earlier.) Only the owner, group member, or server webmaster may make changes in submissions (except for their ratings and reviews, of course, which any known user can affect).
  • Since any number of users may be trying to submit link databases simultaneously, one of many known schemes for “file locking” is needed for the database files and directories during a submission. To avoid periods of database unavailability, submitted public databases can be first prepared in a temporary directory and extensively checked for correctness by the server, including owner checking, name-collision checking, URL validations, format checks, and so on. During this process, the eventual destination directory is preferably write-locked. Since final installation may be carried out by rapidly renaming the two directories, downtime for read access is minimized. Implementing link databases as many files distributed throughout a context directory tree makes database updates simpler, since updates in one context need not affect activities going on in other contexts.
  • 5.1.17 Dictionary File Format
  • A link database (or dictionary list) may be submitted in a documented ASCII format supported by the server. Since all properties are optional, the submitted file can be as simple as a list of key phrases and their corresponding URLs. Below is a “dictionary file” which can be used to initialize a context hierarchy for the examples seen in the FIGS. 3-5:
  • GROUPS = CM_DSP
    PATH = /Education/Technology
    KEY = W3K
    URL = <<<http://www.w3k.org>>>
    # -------------------------------------------
    KEY = / Legal / GNU General Public License
    URL = <<<http://www.fsf.org/copyleft/gpl.html>>>
    # -------------------------------------------
    PATH = /Music/Computer_Music/Signal_Processing/People
    KEY = Julius O. Smith III | Julius O. Smith | Julius Smith
    URL = <<<http://www-ccrma.stanford.edu/~jos/>>>
    KEY = JOS
    URL = <<<http://www-ccrma.stanford.edu/~jos/>>>
    # -------------------------------------------
    PATH = /Music/Computer_Music/Centers/CCRMA
    KEY = CCRMA Courses
    URL = <<<http://www-ccrma.stanford.edu/CCRMA/Overview/courses.html>>>
    KEY = CCRMA Research
    URL = <<<http://www-ccrma.stanford.edu/CCRMA/Overview/research.html>>>
    KEY = CCRMA Overview
    URL = <<<http://www-ccrma.stanford.edu/CCRMA/Overview/Overview.html>>>
    KEY = CCRMA
    URL = <<<http://www-ccrma.stanford.edu/>>>
    # -------------------------------------------
    PATH = /Music/Computer_Music/Sound_Synthesis
    SYNM = /Engineering/Signal_Processing/Sound_Synthesis
    KEY = Lagrange Interpolation
    URL = <<<http://www-ccrma.stanford.edu/~jos/Lagrange_Interpolation.html>>>
    LEVEL = 12
    KEY = Lagrange Interpolation
    URL = <<<http://www.acoustics.hut.fi/~vpv/publications/vesa_phd.html>>>
    KEY = Digital Waveguide Synthesis
    URL = <<<http://www-ccrma.stanford.edu/~jos/wg.html>>>
    KEY = Commuted Synthesis
    URL = <<<http://www-ccrma.stanford.edu/~jos/book2000/ \
      CommutedSynth.html>>>
    KEY = Virtual Analog Synthesis
    URL = <<<http://www-ccrma.stanford.edu/~jos/VirtualAnalog/ \
      VirtualAnalog.html>>>
    KEY = Physical Modeling Synthesis
    URL = <<<http://www-ccrma.stanford.edu/~jos/pmupd/PMSynthesis.html>>>
    # -------------------------------------------
    PATH = Music/Computer_Music/Signal_Processing/ \
      Sound_Synthesis/Vintage_Methods
    KEY = Additive Synthesis
    URL = <<<http://www-ccrma.stanford.edu/~jos/SMS_PVC/ \
      AdditiveSynth.html>>>
    KEY = Sampling Synthesis
    URL = <<<http://www-ccrma.stanford.edu/~jos/samplingsynth.html>>>
    KEY = Cross-Synthesis
    URL = <<<http://www-ccrma.stanford.edu/~jos/crosssynth.html>>>
    # -------------------------------------------
    PATH= Music/Computer_Music/Signal_Processing/Sound_Synthesis/ \
       Acoustic_Instruments
    KEY = Bowed String Synthesis
    URL = <<<http://www-ccrma.stanford.edu/~jos/book2000/ \
      Bowed_Strings.html>>>
    KEY = Brass Synthesis
    URL = <<<http://www-ccrma.stanford.edu/~jos/pmupd/Brasses.html>>>

    Several features of the ASCII dictionary format may be noted:
      • The GROUP directive lists the names of all groups which share ownership the submitted links. In this example, only one group, CM_DSP, is specified. Group specification is optional.
      • The PATH directive sets the default context for subsequent entries.
      • Anything after ‘#’ is interpreted as a “comment” and ignored.
      • An entry can override the default path by including its own “absolute path” specification, as illustrated by the entry for the “GNU General Public License”.
      • Path components are separated by ‘/’ as is conventional in UNIX file systems. Spaces before and after a ‘/’ are removed by the interpreter, and spaces within KEY s are converted to ‘_’. (Any number of adjacent “whitespace characters” are converted to a single ‘_’.)
      • The SYNM directive declares a synonym for the current default context. In this example, /Music/Computer_Music/Sound_Synthesis is declared to be synonymous with /Engineering/Signal_Processing/Sound_Synthesis.
      • KEY synonyms may be declared in a single entry by separating them with vertical bars ‘|’.
      • KEY synonyms may also be created by specifying the same URL in two different entries (as in the JOS entry).
      • Order is important: The phrases “CCRMA Overview” and “CCRMA Research” will be transformed into links before the word “CCRMA”, as a result of the ordering shown.
      • The only example of “competing definitions” in this dictionary is the case of “Lagrange Interpolation”.
      • The first entry for “Lagrange Interpolation” is accompanied by an education level range specification using the LEVEL directive. It is set to 12 indicating that a high-school senior (at least one on the “math track”) should be able to fully understand the main thrust of it. Alternatively, a minimum and maximum educational level could have been specified using the MIN_LEVEL and MAX_LEVEL directives. The arbitrarily set maximum value of 100 means “no maximum”. Level ranges are more appropriate for “home pages” and the like which link to a variety of documents at a variety of educational levels.
  • Only trusted users can submit links and contexts wholesale in this manner to the link database server. However, any known user can submit such a set of links to a restricted or private directory. Otherwise, known users are allowed to submit one link at a time using the “Add a Resource” submission form described earlier.
  • If there are any pre-existing links in the same context directory with the same name and URL as a newly submitted link, the pre-existing link is retained unless the new submission is by the same owner. (Link properties could be updated or added in this manner, for example. Ratings and reviews are not affected since they may not be submitted in a dictionary file.) Rejected submissions are listed in a message from the server delivered in a dynamic web page, as is typical. Similar action is taken for other kinds of messages to the user as needed.
  • If the specified context directory does not exist, it is created, and the email address of the creating user is logged as its owner. The server automatically installs an encoding of the owner's email address in each link entry by means of an additional link property. Other properties, such as initial ratings, date-of-submission, etc., are installed by the server. Only the owner or group-member or server webmaster may modify an existing link or directory.
  • Similar submission protocols can perform editing operations which would otherwise be laborious over the browser-based user interface described above, such as deleting a database subdirectory and all its contents (provided, of course, that everything to be deleted is owned by the person or group making the request). For example, the directives
      • DELETE_LINK /Physics/Quantum_Mechanics/Planck's_Constant
      • DELETE_PATH /Physics/Quantum_Mechanics/Schroedinger's_Wave_Equation/
        can be used in place of online interactive editing of the server link database. In general, there is preferably a script-style equivalent for all operations performable interactively via a graphical user interface such as Web browsers provide. In addition to performing the operations more quickly and conveniently, script-style alternative interfaces are very important for the visually impaired. Scripting also provides a means of conveniently resubmitting all links contributed by the user, thereby making it convenient for users to maintain “back-ups” of their submissions in a form that can be easily restored on the link-installation server. Browsing and Search features can be used to obtain a dictionary-file display of all links owned by the user.
  • There may be a limit placed on the number of database links and subdirectories that can be submitted by any one user (email address) or group. This is to guard against accidents, malicious “hacking,” and to facilitate editorial tracking of contributed content. A certain amount of automatic checking for inappropriate content is possible, based on searching link targets for inappropriate words. Users can apply for “trusted” status by sending email to the server webmaster or other authorized agent. Trusted users may be given a higher contribution limit and perhaps also a higher weighting in link ratings. A group of users can be formed in which each member is trusted within that group.
  • 5.1.18 Use of Dictionary File Format to Specify Context Lists and Dictionary Lists
  • When specifying a list of context dictionaries for link installation, it is convenient to be able to use dictionary file format. When used in this way, all PATH directives in the file are extracted to form a list of contexts. If any links are specified for a particular context PATH, then only those links will be eligible for installation. Additional directives are provided which correspond to the options available for context dictionary specification, such as include parents, include subcontexts, and allow multiple inheritance. For convenience, these aggregate-dictionary-building directives are ignored when submitting a dictionary file as a means of submitting links.
  • Using the previous example dictionary file now to specify an aggregate dictionary for link installation gives results equivalent to the following dictionary file:
  •   GROUPS = CM_DSP # Only operative if selecting based on group
      PATH = /Education/Technology
      PATH = /Music/Computer_Music/Signal_Processing/People
      PATH = /Music/Computer_Music/Centers/CCRMA
      PATH = /Music/Computer_Music/Sound_Synthesis
      SYNM = /Engineering/Signal_Processing/Sound_Synthesis
      PATH = Music/Computer_Music/Signal_Processing/
      Sound_Synthesis/Vintage_Methods
      PATH=
      Music/Computer_Music/Signal_Processing/Sound_Synthesis/
      Acoustic_Instruments
    Adding some typical directives and eliminating some redundant
    specification leads to
      GROUPS = CM_DSP # Only operative if selecting based on group
      PATH = /Music/Computer_Music
      MULTIPLE_INHERITANCE_DEPTH = 2
      OWNERS_ONLY
      MIN_LEVEL = 12
      MAX_LEVEL = 100
      SUITABILITY = PG-13 # Movie and V-chip names understood
      SOURCE = ALL
      TYPE = Refereed
      PATH = /Education/Technology

    Several features of this aggregate-dictionary specification may be noted:
      • MULTIPLE_INHERITANCE_DEPTH=1 means that the context dictionaries of linked parents are appended to the aggregate dictionary for context synonyms occurring 1 level below the current context or less. This is just sufficient to pick up the “engineering parents” of context Sound_Synthesis without also including linked parents of lower levels.
      • The OWNERS_ONLY directive restricts the aggregate dictionary to links owned by members of group CM_DSP.
      • The minimum and maximum educational level restrict link installation to links rated at 12th grade or higher.
      • Source “ALL” means any source. Other choices include EDUCATIONAL (.edu), COMMERCIAL (.com), and so on. As usual, multiple sources can be separated by vertical bar ‘|’.
      • The TYPE is resource type. “Refereed” is a symbol for all refereed source types (journal article, book, etc.) If no type was specified by the contributor, it is UNKNOWN.
      • Order is important: The listed contexts will be appended in the order given, with the first one listed being considered the “current context”.
      • Dictionary combining directives as shown in this example are “sticky”, meaning that they apply also to subsequently listed context paths unless they are explicitly reset, or set to “NIL” indicating no value (to obtain the system default behavior).
    5.1.19 Security Considerations
  • The IP address is stored as well as the verified email address for security reasons. A user with “root privileges” on a personal machine can generate any number of return email addresses, while the number of IP addresses available to an individual is usually very limited. For example, if unusually many email addresses are found to belong the same IP address, a warning can be automatically emailed to the webmaster who can look into the matter further, such as by inspecting all contributions from that IP address. If an IP address turns out to belong to a malicious “hacker”, it is straightforward using standard UNIX tools to eliminate all database entries and directories associated with that IP address, barring it from further contributions, and so on. When the IP address is dynamic, as is often the case when a commercial Internet Service Provider (ISP) hosts the user's account, it is less likely that many different email addresses will belong to the same person, and the ISP can be contacted for assistance. Note that it is very easy to arbitrarily set the “From:” field in any email message; therefore, the “Received” fields in received email may be analyzed by the server to get closer to the true originating location. In Netscape Navigator, for example, select “View/Headers/All” to see such fields in received email.
  • 5.1.20 Link Database Implementation
  • Each link database may be implemented on the server as a plain ASCII file in a directory structure that corresponds to the hierarchical organization of the link databases.
  • The complete hierarchy can also be implemented in a single file which contains path information for each link entry. The initial prototype of the present invention used a single link database file based on the links-2.0 software scripts from Gossamer Threads, Inc. (<<<http://www.gossamer-threads.com/scripts/links/>>>). In this implementation, the context path information is included in what is called a “link category”. In adapting the links-2.0 scripts, categories having no sub-categories are considered to be “key phrases”, and actual links within a key-phrase (bottom-level category) are treated as “competing definitions”.
  • For a variety of reasons, use of a single links database file is not considered the best mode of carrying out the present invention. Instead, a hierarchical file system implementation is preferred in which the directory path corresponds to the context, and the database file in a context directory contains only links for that context (along with perhaps a limited number of subcontexts).
  • Alternatively, an evolutionary path can be followed starting out with a single database file, followed by splitting into separate database files for top-level contexts, followed by further splits as the files grow too large, etc. (The links-2.0 system advises a limit of 10,000 links for its one-file link database system managed by Perl CGI scripts.) On each split, the first path component stored in the link database may be removed since it becomes implied by the directory in which the database file resides.
  • A database directory may contain both files and directories. Subdirectories are interpreted as subtopics, and the hypertext links for the current directory (when it is a key-phrase directory) may reside within a single ASCII file named “links.txt”, for example, preferably located in a context directory containing the key phrase. The links.txt file contains a list of hypertext links for the current context in a plain ASCII format described below.
  • There may be a temporary “system file” for each active user which lists current selections and other state information pertaining to that user. Multiple selection configurations may be stored on client computers by means of the “cookie” mechanism supported by the major Web browsers. The name of a user's configuration file may include the user's email address, if known, and otherwise an arbitrarily assigned session ID for “unknown” users. All active sessions preferably time out after a period of inactivity, as is commonly implemented by websites featuring session management.
  • There may be a system file ratings.txt, parallel to links.txt in each directory, containing all contributed ratings for the links in links.txt. Information stored in ratings.txt for each link includes the email address of each contributor, and the contributed rating. When a new rating is contributed, an entry is appended to ratings.txt. If there is already a rating from that email address, it is replaced with the new one. A new average rating is computed, and the updated average rating and contributor count are entered into links.txt as properties for the affected link.
  • Another system file, reviews.txt, also parallel to links.txt, resides in each directory and contains all contributed “link reviews”. Information stored for each link includes the email address of each contributor, and the contributed review. When a new review is contributed, it is appended to reviews.txt, replacing any previous review from that email address.
  • 5.1.21 Link Database Details
  • Links may be stored on the server in the following simple ASCII text-file format:
  • ID | KEY | URL | PropertyName:Value | PropertyName:Value | ...
    ID | KEY | URL | PropertyName:Value | PropertyName:Value | ...
    ...
  • This format uses explicit property names which are convenient when specifying sparse subsets of all possible properties (and also more clear for describing the invention). An alternative is the use of a fixed-format record in which the property names are implied by their field position within the record.
  • The ID is a unique integer assigned to the database record. The ID therefore uniquely identifies the record and can be used to identify it in various contexts, such as in the URL for indirect links.
  • For example, a link to a Web page about the “W3K” website could appear in the link database (in one long line which is broken for clarity below) as
  • 23 | W3K | <<<http://www.w3k.org>>> | Date:2-Sep-99 |
        Context : /Education/Technology/W3K |
        Level:All | Rating:5 | RatingCount:7 |
        Hits: 20 | Installs: 4 |
        Owner : Julius Smith |
        Group : CM_DSP |
    OwnerEmail : jos@w3k.org
  • In addition to link databases, there is preferably a user database holding information such as a list of IP addresses authorized for that email address, whether the user wants to receive the W3K newsletter, the list of groups to which the user belongs (being a “trusted user” means belonging to the “trusted” group), and information logging any inappropriate use of the service such as submitting offensive links. (See the system for dealing with “trolls” at <<<http://www.slashdot.org for an example system>>>.)
  • 5.1.22 Example Link Properties
  • Example PropertyNames and their meanings are as follows:
  • TABLE 3
    Property Meaning
    Level Educational level of the link, if not a range (1-100, All)
    MinLevel Lower bound of educational level range, if applicable
    MaxLevel Upper bound of educational level range, if applicable
    FullTitle Contents of URL's HTML <TITLE> tag in quoted string
    Description Description of link by submitting user
    Date Date link was submitted by user
    Type Type of information (Encyclopedia, Tutorial, Book,
    Course, . . . )
    Language English, French, German, Spanish, . . .
    Suitability Similar to rating system used in the “V chip” for
    television
    Context Context path (when handling many contexts per database
    file)
    Synonyms List of equivalent phrases separated by ‘|’. Order is
    important.
    Hits Number of times link accessed by browsing
    Installs Number of documents link has been installed in
    Rating Quality rating as a number from 1 to 10
    RateCount Number of users contributing ratings
    isNew
    1 if Date is sufficiently recent
    isPopular
    1 if Hits is large relative to other links
    OwnerEmail Email address of link contributor
    ReceiveMail
    1 if link contributor wants our newsletter
    Groups List of owning groups separated by ‘|’
    User1 Property defined by user
    User2 Property defined by user
    . . . . . .
  • The properties can be used to limit the range of links installed by a link installation. For example, a certain educational level range can be specified, or links only of a certain type may be specified. Restriction to links contributed by the owner or owning group is also easily specified.
  • KEYs will match occurrences of any case by default. When a link is installed in a user's document, the user's original case is preserved in the anchor text. KEYs may be entered in singular form since the string matching algorithm will match will ignore a trailing ‘s’. A KEY is either a simple word or a phrase consisting of words separated by underbars, e.g., Funk_&_Wagnalls_Knowledge_Center. A word may not contain certain “meta-characters” such as “|” or “#” which have system meanings, and all such meta-characters are stripped out by a regular expression (in Perl) on input. Similarly, context names must be “legal” UNIX file names after whitespace has been converted to underbars ‘_’, since the preferred embodiment uses a UNIX directory tree corresponding to at least part the context hierarchy. Restriction to legal filenames is easily relaxed by encoding the directory names in hexadecimal, as an example, or using the special character encodings of HTML. The string matching algorithm used in link installation “folds” the input case to “lower” and replaces underbars and hyphen with spaces in string comparisons. As a result, KEYs in text submitted for link installation can have any case and can include underbars, hyphens, or spaces separating words in the keyword phrases, yielding the same matching results in all such cases. In the above example, the link name is functionally equivalent “funk wagnalls knowledge center” for purposes of string matching. To include special characters where necessary, names may be quoted, as in
      • ‘Funk & Wagnalls Knowledge Center’
  • In the case of quoted names, string-matching is exact. Other details regarding string matching for link installation may be seen in the example of Appendix A.
  • 5.2 Single-Computer Implementation
  • The present invention can be adapted equally well to single-computer operation, requiring no network connection. In this case, the user can install a link database application from a CD-ROM, for example, in the fashion typical of many software products for personal computers. All functions formerly described as being provided by a Web browser and the remote link-installation and database server can be provided by the installed application. A Web version, if available, can serve to provide a supplementary collection of links.
  • There are several advantages to this mode of operation:
      • Since all data and software are local, response time can be greatly improved relative to use over the Internet.
      • A link database application may take advantage of native graphical user interface (GUI) facilities on the personal computer, which are typically more advanced than the platform-independent HTML and Web-browser facilities.
      • Since link database extensions may occur on the local hard disk instead of on a remote website, security requirements are alleviated, and user privacy is enhanced, especially for “private” database directories.
      • The link databases are not constantly changing, particularly the ratings, thereby automatically giving repeatable results on repeated link installations.
      • The link databases can be customized by manually setting alternative link orderings, and eliminating unwanted alternative links.
  • The following implementation differences apply to the single-computer embodiment:
      • Instead of one master link database directory, there may be two parallel link database directory trees having a common directory structure. The first may be “read only” so that it can be distributed and used on a CD-ROM, for example, while the second is “writable” and contains any user-developed databases, as well as the temporary “system files” generated during use of the system. The writable directory tree will normally reside on a local hard disk.
      • In operation, the writable directory is searched first so that it takes precedence over the read-only directory,
      • Logically, the links.txt files in the writable and read-only directory images are treated as one file, with the read-only version being appended to the writable version.
      • Links on the CD-ROM may be “deleted” by adding a corresponding entry for them in the writable directory tree consisting of exactly the same keyword or phrase, the same URL, and the single property “DELETED”. Read-only directories cannot be deleted or renamed, but they can be excluded from link searches in the normal way (which applies also to the corresponding directory in the writable tree, if any, since they are logically the same directory).
      • Link database updates may be obtained over the Internet and installed locally to keep the single-computer software up to date. To facilitate this process, it is convenient to maintain on the server listings of database directories and contents for each software release. During an update, the server can traverse the link database directory, compare against the listing applicable to the user's current release, and generate an incremental update to bring the user up to the latest state. The incremental update is installed in the writable database directory on the user's local computer, automatically shadowing any older corresponding information on the CD-ROM. Updates may be obtained at any time to obtain the latest links. Information can be stored locally on the user's machine to enable each update to be incremental relative to the previous update as opposed to the latest official release.
      • URLs submitted in the “Text or URL” textfield of the link-installation submission form may also include “file:” type URLs.
  • It may occur that the user has locally extended the link database in a way that conflicts with the server's extensions since the time of the user's release or last update. The directory path, keyword or phrase, and URL all have to be identical to create a link conflict, and so actual conflicts can only occur in link properties. Link rankings can of course change at any time, and this is normal. However, since locally installed ranking information may be a rating override by the user (rather than the result of a previous upgrade), it is not necessarily correct to overwrite the locally installed rating properties. Similarly, other properties may have been added by the user to fine tune link installation results. During installation of the incremental update, the user may be given a choice of whether or not to accept conflicting information from the incremental update on a link by link, or property by property basis. The default action may of course be to avoid overwriting any user-developed information, and the default upgrade can proceed in this mode. In the default mode, all conflicting links can be installed in a third parallel directory tree for later inspection by the user. Another means for avoiding conflicts is to rename any pre-existing directories containing user modifications (by adding a private suffix to its directory name, say) before carrying out an update.
  • 5.3 Educational Levels
  • The educational level of a definition is a number indicating how advanced the material is. Authors generally wish to minimize the educational level as much as possible consistent with the intended audience, the material being presented, and the desired length of the document.
  • Every definition (link) is assigned an educational level. A normalized educational level may be provided manually by the link contributor as a number between 0 and 100, with the number being loosely interpreted as “years of education likely required” for someone specializing in the subject. When no manual assignment is made by the link contributor, a level is automatically computed which interpolates the manually assigned levels that do exist.
  • 5.3.1 Automatic Assignment of Educational Levels
  • The automatically assigned level of a definition is computed by first computing an integer “raw level” for the definition based purely on an analysis of definition interdependencies, followed by the computation and assignment of a “normalized level” which maps each raw level to the pre-existing manually assigned level when it exists.
  • The raw educational level is defined as 1 plus the maximum raw educational level of all referenced definitions, where a definition referencing no other definitions is assigned a raw level of 1. A referenced definition here means any server-resident link occurring inside the definition's document. All links outside the server are assigned a level of zero (to avoid having to process external documents, all their sublinks, etc., in a potentially huge recursion over the internet). In a definition document which refers only to outside material, which would receive an automatically assigned raw level of 1, it is the submitting author's responsibility to reasonably assign its educational level manually when submitting it to the server. Forward references in a document can be marked as such to prevent them from artificially elevating the automatically computed level.
  • In the preceding paragraph, a “definition document” means the page pointed to by the definition's URL together with all information reachable from the definition page on the same website by following links. Thus, as an example, the definition document corresponding to definition URL http://www.w3k.org/ includes the contents of http://www.w3k.org/index.html together with any HTML (or other format) content reachable by following links within index.html, such as http://www.w3k.org/about.html, which are at the same site, that is, expressible using URLs starting with http://www.w3k.org/.
  • In addition to setting an appropriate educational level when possible, documents may mention any specific recommended prerequisites at the outset, providing links whenever possible, so as to better orient the reader. The often-used “list of keywords” appearing below an article's abstract, after being automatically linked by the present invention, can provide a good first set of background links. However, an explicit list of prerequisite topics, automatically linked to top-ranked tutorials by the present invention, can provide a more systematic and preferable approach to prerequisites.
  • 5.3.2 Normalizing Numerical Educational Levels
  • Educational levels are normalized to make them more intuitive to use. Normalization is a process which recomputes automatically assigned educational levels so as to make them fit more naturally among the manually assigned levels.
  • Raw numerical levels are integers which are bounded below by 0 and unbounded above. The raw level of any document is at least 1 greater than the maximum level among all documents it references. As automatic level assignment proceeds through a large collection of documents, levels of advanced writings can tend toward very large integers. As a result, the raw level number is difficult to interpret. Normalization can be easily accomplished using a piecewise-linear mapping from the raw-level scale to some “standard level” scale.
  • The standard level scale used at the server may be loosely based on the number of years of education “likely required” to understand the document for someone taking a direct educational route to the material in the given context. It is analogous to a “par score” in golf, which defines the average number of “strokes” required to complete the course, for a good player. Such a numbering scheme obviously becomes more and more debatable at levels above 14 or so (middle undergraduate level) where curricula are not uniformly standardized. In fact, it is common in many technical fields to constantly work toward moving level 17 materials (beginning graduate level) down to level 14 or even below, meaning the “ideal” educational level is itself a moving target.
  • A nominal listing of normalized educational levels is given in the following table:
  • TABLE 4
    Numerical Named
    Range Range
    0-1 Preschool to 1st grade
    1-2 1st to 2nd grade
    2-3 2nd to 3rd grade
    3-4 3rd grade
    4-5 4th grade
    . . . . . .
    12-13 12th grade
    13-14 College Freshman
    16-17 College Senior
    17-18 1st year graduate student
    20-21 4th year graduate student
    . . . . . .
    100 Arbitrary maximum level
  • To find the nonlinear mapping necessary for level normalization, certain documents need to be assigned a “known standard level”, such as “8th grade.” An obvious choice for such documents would be those occurring in actual standardized courseware. A few such documents spread out between zero and the maximum level are sufficient, but preferably there is at least one per year, to make the mapping reasonably accurate. With this information, a piecewise-linear mapping from raw level to standard level can be constructed which takes the automatically computed levels of the known-standard-level documents to their known standard levels. Thus, the piecewise linear mapping will have “break-points” at the level of each standard-level document.
  • For purposes of level normalization, a definition can be schematically represented by a list of numbers (the raw levels of all referenced definitions) to which a higher number must be assigned (the definition's raw level). For example, two definitions D1 and D2 might appear schematically in this way as follows:
      • D1(20): 13, 14, 18, 19
      • D2(19): 17, 14, 18
  • Here, the number in parentheses (20, for Definition 1) is the definition's level, while the numbers following the colon ‘:’ give the levels of all referenced definitions, not including forward references.
  • To keep track of the specific definitions providing each referenced level, let's extend the above notation to include definition number:
      • D80(20): D2(13), D5(14), D44(18), D45(19)
      • D81(19): D9(17), D5(14), D44(18)
  • Thus, in this example, definition node 80 is assigned a raw educational level of 20 which is consistent with the fact that its document references definitions 2, 5, 44, and 45 which have been assigned raw levels 13, 14, 18, and 19, respectively.
  • Forming A Directed Graph Expressing Educational Level Relations
  • From the above data structure, it is now straightforward to form a directed graph in which each node of the graph corresponds to a definition, and a reference to another definition can be visualized as an arrow (also called an “edge” of the graph) pointing from the referencing node to the referenced node. (The arrows can alternatively be taken to all point in the other direction, from the referenced node to the referencing node.)
  • The problem of assigning consistent educational levels can now be seen to be equivalent to the general problem of ordering the nodes of a directed graph so that every path through the graph traverses nodes in a monotonic numerical order. This type of ordering is known as a topological sort, and there are standard algorithms in computer science, such as Dijkstra's algorithm, for this purpose.
  • Resolving Cycles
  • It is well known that a directed graph can be topologically sorted if and only if it is acyclic, meaning there are no cycles in the graph. Standard topological sorting algorithms just work when there are no cycles, and when there are cycles, they are detected and reported as errors, leaving it to the user to “break the cycle” in some way.
  • Cycles can be caused inadvertently in documents by use of forward references. For example, in a level 12 document about the calculus, it would be quite natural to mention some calculus applications such as solving problems in elementary mechanics (which is level 13 at a typical university, requiring calculus as a prerequisite). To address this problem, a mechanism is provided for authors to mark forward references in documents, so that forward-reference links are ignored in the computation of educational level.
  • Algorithm for Educational Level Assignment
  • An algorithm for educational level assignment can now be described as follows:
      • 1. A directed graph is constructed by traversing all definition nodes and building a tree in which each node contains a list of pointers to other nodes. Each node corresponds to a definition, and each pointer points to a referenced definition. Let the definition nodes be arbitrarily numbered from 1 to N. This will be called the “lexical ordering” of the nodes, and it need not change.
      • 2. Using a topological sort algorithm on the directed graph, each node is assigned a second integer corresponding to its “raw educational level. Nodes having no references (no “incoming arrows”) are assigned a raw level of 1. A node having one or more references is assigned a raw level at least 1 greater than the raw level of all referenced nodes. (Manual assignments used for normalization are ignored during the topological sort.)
      • 3. Use the raw level assignments together with the manually assigned normalized levels to construct a piecewise-linear mapping from raw to normalized educational level for all nodes.
  • A Perl program implementing the above steps is given in Appendix D.
  • Non-Monotonic Manual Level Assignments
  • There is nothing in the system to prevent manually assigned educational levels from being “out of order” relative to the raw assigned levels. For example, suppose document A has a raw level of 200, while document B, which references document A either directly or indirectly, has a raw level of 250. Suppose further that document A has a manually assigned level of 40 while document B has a manually assigned level of 30. Since the manual ordering is the opposite of the reference-based ordering, a conflict occurs, and the piecewise linear map which takes raw to normalized levels will have a segment with negative slope, which doesn't make sense.
  • There are several possible ways to address this issue:
    • 0. The manual assignments can be forced to be monotonic by adjusting them so that the minimum piecewise-linear mapping slope is zero (or some small positive number). In the above example, the normalized level of B could simply be thrown out on the simple grounds that since B references A, it is by definition at a higher level, so that its manual level cannot be allowed to be set lower than that of A. It could alternatively be set to the same level as A. A could also be compared with other documents at comparable raw levels to see if its manually set level is unusually high, and, if so, it can be rejected as an “outlier”. The automatic normalization system does not need many manually set levels, so it is preferable practice to throw out any that seem suspect for any reason.
    • 1. Manual level assignments can be allowed only for values up to, say, 18, forcing all higher levels to be extrapolated via automatic assignment. Extrapolation may be determined by normalizing the highest raw level to 100, and other values would function as well. Alternatively, the slope of the mapping leading up to normalized level 18 can be estimated, and higher raw levels can be normalized to preserve this slope.
    • 2. A slightly more sophisticated version of the previous solution is to “fit a monotonic curve” through the manual level assignments. To see how this works, plot N(i) versus R(i) in the Cartesian plane for all i, where i ranges over all definition links in the system, R(i) is the raw level assigned to definition i, and N(i) is the normalized level manually assigned to definition i, if any. If N(i) is not assigned, do not plot the point. Since there can be any number of documents with the same raw level R(i), there may be many values of N(i) for a given value of R(i) as i ranges over all links in the database. For each R, a range of normalized values may be covered by N. Such a plot of N versus R can be called a “scatter plot.” For simplicity, suppose both R and N are normalized to range between 0 and 1, so that the plot of N versus R in the Cartesian plane starts at (0,0) and terminates at (1,1). Ideally, this distribution of points in the plane will resemble a blurry line at a 45-degree angle (or perhaps a concave or convex curve instead of a straight line, etc.). Well known curve fitting methods, such as linear regression, cubic splines, Bezier curves, and least-squares polynomial fitting can be used to fit a monotonic curve through the “scatter-plot” of N versus R.
    • 3. A more refined version of the previous solution is to replace the 2D scatter plot by a 3D histogram. As in the 2D case, the horizontal axis is R and the vertical axis is N in the plane; however, the third dimension for each plane coordinate (N,R) is now the “number of definitions having manual level N and raw level R”. The scatter plot resembling a blurry monotonic planar curve in the previous solution now becomes a raised histogram following the same curve. However, the third dimension allows seeing where “most” of the manual ratings lie, and a curve-fitting procedure can try to “follow the ridge” in the histogram. There are numerous curve fitting and “hill climbing” methods known those skilled in the art that would be straightforward to apply here.
    • 4. Certain manual level assignments can be given priority over others, such as those produced by an educational standards body, educational institution, or professional society. Deferring to these assignments in the event of a conflict may resolve it in some cases.
    • 5. An expert in the field can study the level assignments and make a recommendation for reassignment of the manual normalized levels as seems right.
  • In any case, when a manually set normalized level is thrown out or reassigned, the owner of the modified link record is preferably notified via email. In the current preferred embodiment, the combination of solutions 0 and 1 is employed, but more sophisticated solutions are anticipated to become preferable as the link database grows.
  • It is informative for users to see a histogram of the number of documents having a certain normalized level versus normalized level. Such a histogram can indicate to users how significantly the number of links installed will change as the minimum and/or maximum educational level are adjusted.
  • Educational Levels as Context Limiters
  • Note that level assignment helps to disambiguate between relatively advanced and elementary contexts. In one actual example encountered by the author, the proof of “Euler's theorem” contained the use of the phrase “positive real number.” The term “positive real” existed in the dictionary for the same context, but at a much more advanced level. (It was entered as short for “positive real function” which is a mathematical property of functions in the complex plane characterizing “passive” driving-point impedances in the context /Engineering/Electrical/Classical_Circuit_Theory). Since links to higher educational levels are normally suppressed when installing links, the misleading link would not occur. Other solutions to this problem include (1) requiring the whole phrase “positive real functions” for a match with the more advanced topic, (2) splitting contexts so as to separate elementary complex variables and more advanced network theory, and (3) supplying a trivial home page for “positive real number” which then would shadow “positive real” in this case.
  • Context Dependency of Educational Levels
  • Since a link to a particular document may be repeated in any number of contexts, its manually assigned educational level may be a function of the field associated with that context (such as a high-level parent context). For example, in an engineering curriculum, study of the “phase vocoder” (a tool for “stretching” sound in time, among other uses) can easily occur at normalized level 16 (a college senior with a good undergraduate background in signal processing). On the other hand, within a computer music graduate curriculum, the “phase vocoder” topic may be set at level 18 or 19 so that the Music Ph.D. student has time to pick up necessary signal processing prerequisites in the electrical engineering department. (In the golf analogy, 16 is “par”, while non-engineering majors (“less experienced players”) may receive a 2 or 3 stroke “handicap”.) When levels are fine-tuned in this way, the relevant “context synonyms” previously described may need to be broken. For automatic level assignments to follow context-sensitive normalized levels, it is necessary for documents to either specify the context of the link (as described below), or provide the context of the document containing the link, such as is easily provided in HTML “metadata”. In summary, a single document may have a different normalized educational level assigned to it in different contexts.
  • A convention for handling multidisciplinary documents can be that the level of the document is set according to the portion of the document which properly belongs in the context to which it is assigned. A document requiring expertise in three fields, say, could then receive three different levels, one in each context, where the level in each case depends only on the portion of the document pertaining to that field. As an example, consider a paper on the legal issues of patenting genetically engineered life forms; the paper can be placed in the context hierarchy in at least two places, under /Legal/Patents, and under /Science/Biology/Genetic_Engineering (which, by the way, could be synonymous with /Engineering/Genetic_Engineering). Under /Legal/Patents, its level is set according to the level of legal background required. Under /Science/Biology, its level is set according to the level of knowledge required to understand the technical aspects of the article.
  • Since link properties can be added by the user, the above example can be extended to include a secondary level specification. For example, when filing the paper under /Legal/Patents, it can be given two additional user-defined properties
      • SecondaryContext: /Science/Biology
      • SecondaryLevel: 14
        and the filing of a link to the paper under /Science/Biology/Genetic_Engineering can have the two extra properties
      • SecondaryContext: /Legal/Patents
      • SecondaryLevel: 16
        for example.
  • An alternative mode of treatment for multidisciplinary or cross-disciplinary documents is to assign a manual level to the document which represents the number of years of education likely required for understanding assuming one first obtains a typical background in the current context. That way, the level numbers of links in a particular field are most meaningful for people in that field. Since people in the field are most likely to be using the links for that field, it makes sense for the numbers to be best adapted to their needs.
  • The system of the present invention allows for many choices of normalized level scale, and the scale can be changed at any time by resetting the manually set levels. A link contributor may, for example, perform a property-based link database search which collects together all editable links at a particular educational level, and changes all the levels to a new value in one edit operation on the selected set.
  • Example of a Very High Educational Level
  • A specific example of a particularly high educational level is “superstring theory” which is a current “hot topic” in theoretical physics. Understanding a recent conference paper on this topic might require several prior recent papers to be read and understood, and these in turn might require the equivalent of a basic textbook on superstring theory, which would require a solid grounding in quantum field theory, which requires a basic quantum mechanics background, which requires a few years of physics and math at the college level, and so on. It is debatable how long it should take a “good student” to get through such a path of study in order fully grasp the end concepts, but this question can be decided by educators and professional societies. Since normalized educational levels are ultimately determined by manual assignment, the automatic assignment system will conform to their decisions.
  • Curricula “Critical Path” Identification and “Education On Demand”
  • As educational curricula become “finer grained”, the normalized educational level numbers may go down over time as shorter and shorter “critical paths” to particular advanced topics are identified. Curricula can in fact be based precisely on such identified paths, or even automatically generated from them. This is a foreseeable use of the present invention. Specifically, a student can choose a set of skills and topics he or she wishes to master, and plunge in immediately on the most advanced level, clicking on links whenever a word or phrase is not understood. After enough clicks, a level is reached where the material is digestible, and the student can eventually work back up to the desired level. This is in contrast to the usual program of beginning with elementary preliminaries which can seem unmotivated to the student. Needed educational prerequisites can be “paged in” as needed on a demand-driven basis (to use an analogy with “demand paging” in computer science). Significantly, unnecessary educational prerequisites can be bypassed entirely.
  • Demand-driven education can be more motivating to students prefer to work with an end goal in sight. Since everything studied is motivated by an attempt to understand a more advanced document which accomplishes a tangible desired goal (such as “stretching” sound without changing its pitch in the case of the phase vocoder), it can always be clear to the student, for example, “what is this for,” and “how are we going to use this.” It further provides a “natural selection mechanism” for educational materials, omitting required study of anything not needed to fathom the specifically chosen advanced materials on the desired track.
  • Application to Automatic Teaching and Certification
  • In a computer-assisted curriculum along these lines, the student may have to answer exam questions at the end of each document which serve to verify and reinforce the student's understanding. In the end, a course of study could result in a collection of automatically generated certificates for the topics covered, and job requirements could be stated in terms of certification levels on specific topics. There are obvious issues associated with proctoring exams in such a system, but there exist traditional solutions as well as more novel technological solutions using, say, a simple camera attached to the computer with a viewing angle large enough to include the keyboard.
  • Such a mechanized approach to self-paced educational certification has a few obvious disadvantages relative to more traditional educational methods, especially given the limited “virtual reality” capabilities of present day computers. However, it can provide a very efficient way to “tool up” on a technical topic, particularly for the time-limited employee, the spare-time student, or the bright young learner in a developing country seeking salable job skills in the global information economy.
  • 5.4 Preferred Link Targets
  • Preferably, each concept is given its own “home page” on the Web which approaches the subject in a top-down way, addressing the widest possible audience, and directing all interested readers to the most appropriate further information for them. All-encompassing home-page style definitions are likely to emerge as the “best definition” in a user-ranked list. Being designated the “best definition” for a topic is significant in that, in some modes of operation, only the “best” definition is available to the end user in installed links.
  • Example “topic home pages” include one on the Kalman filter at
  • http://www.cs.unc.edu/˜welch/kalmanLinks.html
    and the “Digital Audio Resampling Home Page” located at
      • http://www-ccrma.stanford.edu/˜jos/resample/
  • Another advantage of top-down, general-interest, home pages on a given topic is that its minimum educational level may be set low, allowing the link to be installed in a greater variety of materials, while the expert can skip over the introductory tutorials and proceed directly to the mode advanced material of interest. For this reason, the preferred embodiment supports specifying a range of educational levels for a single document. A well designed topic home page might span levels from high-school all the way out to the research frontier. An example website covering such a large range (although not devoted to a single topic) is the Physics 2000 website at
  • http://www.Colorado.EDU/physics/2000/
  • 5.4.1 Generating Link Targets in a Uniform Format
  • A useful means for generating link targets is to write a LaTeX document which explains one “concept” per section. The title of the section may be the word or phrase being defined. The first occurrence of the concept's name may be emphasized using the LaTeX command \emph{ }. The utility latex2html can then be used in a normal fashion on such a “concepts file” to create a separate HTML file for each section, and the hypertext index generation capability of latex2html can be used to generate the associated link database (dictionary file). Alternatively, a dictionary can be generated directly from the HTML using the widely available Perl modules LWP and HTML (see, for example, the Perl Cookbook by T. Christiansen and N. Torkington, O'Reilly, 1998). A Perl script for this purpose is listed in Appendix B. The dictionary file so generated may be submitted to the link database server to conveniently contribute links to all the newly created concept home pages on the Web. (The script is written to retrieve all files under a URL by following links, thereby providing a starting point for a utility which creates a dictionary file from any website on the Web.)
  • The LaTeX command \section{ } can be replaced by an invocation of the following more convenient macro:
  • \newcommand{\ConceptSection}[2]{
     \section{#1}
     \index{#1}
     \begin{htmlonly}
     \rawhtml{<!-- W3K_PROPERTIES: ‘#1’ #2 -->}
     \end{htmlonly}
    }
  • This macro is called with the section name as usual, but additional properties may be specified in the second argument, and these are passed to the output in an HTML comment which may be interpreted by the script which creates a dictionary file from the HTML files (a starting point is given in Appendix B). (A more elegant solution, feasible when XML is widely supported by Web browsers, is to define an XML tag for each link property.) Context properties found in the HTML comments may be translated to PATH directives in the dictionary file. An alternative use of properties embedded in link target files is that, when using indirect links, link properties do not have to be included in the link database at the link server.
  • As an example, a new section could begin as follows:
  • \ConceptSection{Derivative}{Level:12
     Context:/Mathematics
     Synonyms:Differentiation}{
       The  \emph{derivative} with respect to $x$ of the continuous
       function $f(x)$ is defined by the \emph{limit} ...
    }

    and a related example might be
  • \ConceptSection{Derivative}{Level:11 Context:/Financial/Investing}{
     A  \emph{derivative} is a financial instrument derived from an
     underlying security such as ...
    }
  • Note how the use of the context property can be extremely important for obtaining the desired results. If the context property is not specified for a link, it may default to the path of the database directory in which the link resides. The context value All can be specified in a link entry by trusted users to enforce only one meaning of a term across all Contexts. Such treatment could be appropriate for trademarks, for example. Since the top level of the link database directory tree is Context independent, link entry at that level can be restricted to trusted users.
  • 5.5 Guiding Link References
  • In the previous example, properties were used to specify the context and educational level of a link target. Properties can similarly be specified in documents submitted for link installation in order to guide link installation. Thus, the document receiving links can call for links of a certain type rather than having to specify all aspects of link installation in a separate script or over the browser interface at the server.
  • As an example, in a document using the term “derivative,” the context may be specified as follows:
  • The  \kref{derivative}{Context:/Mathematics} with respect to $x$ of the
    continuous function $f(x)$ ...

    where the kref macro might be defined along the following lines:
  • \newcommand{\kref}[2]{
     \emph{#1}
     \begin{htmlonly}
     \rawhtml{<!-- W3K_PROPERTIES: ‘#1’ #2 -->}
     \end{htmlonly}
    }
  • A linked reference can be marked as a “forward reference”, for example, via
  • The  \kref{derivative}{Context:/Mathematics ForwardReference} with
    respect to $x$ of the continuous function $f(x)$ ...

    in order to avoid having it affect the educational level of the current document. While these examples pertain to LaTeX submission format, preparation of plain HTML submissions is specified implicitly as well, since LaTeX is compiled into HTML in this application.
  • 5.6 Converting Browser Bookmarks to Dictionaries
  • The Perl script in Appendix C can be used to convert the “Favorites” folder containing Internet bookmarks for Microsoft Internet Explorer 5 to the dictionary file format described above. The inexpensive utility program LinkSync (http://www.bluesquirrel.com) can be used to convert Netscape Navigator bookmarks to MSIE format (and vice versa). Many “web surfers” have valuable hierarchically organized “link collections” in their browsers which might usefully be converted to dictionary files that can be submitted to the link installation server. Hierarchy is preserved by creating context paths as needed.
  • REFERENCES CITED 6.1 U.S. Patent Documents
    • U.S. Pat. No. 4,982,344; Jan. 1, 1991; Daniel S. Jordan; Accelerating link creation
    6.2 Other Publications
    • L. Carr, “The link fifty years on: A personal view of hypertext linking,” SIGWEB Newsletter, vol. 8, pp. 41-43, February 1991.
    • L. Carr, D. De Roure, W. Hall, and G. Hill, “Implementing an open link service for the world-wide web,” World Wide Web, vol. 1, no. 2, pp. 61-71, 1998.
    • S. Hitchcock, “Open journal project: final report to elib,” tech. rep., Multimedia Research Group, University of Southampton, November 1998.
    • H. Wells, “World brain: The idea of a permanent world encyclopedia,” in Encyclopédia Française, 1937.

Claims (12)

1. A computer-implemented method for automatically linking phrases in text, the method comprising:
a) storing linkable phrases in a database;
b) receiving submitted text, wherein the submitted text contains at least one submitted phrase that matches a first linkable phrase in the database, and wherein the submitted phrase contains a shorter key that matches a second linkable phrase in the database;
c) searching the submitted text for an occurrence of a linkable phrase;
d) giving precedence to the submitted phrase over the shorter key it contains;
e) returning information corresponding to the found submitted phrase as output.
2. The method of claim 1 wherein storing linkable phrases in a database further includes the step of associating a linkable phrase with a context.
3. The method of claim 2 wherein searching occurs within a context.
4. The method of claim 1 wherein the precedence is established based on an ordering of the stored linkable phrases in the database.
5. The method of claim 1 wherein the returned information includes a pointer to a network resource associated with the found submitted phrase.
6. The method of claim 5 wherein the returned information includes a ranked list of pointers to network resources associated with the found submitted phrase.
7. A system for automatically linking phrases in text, comprising:
a) a computer processor;
b) a storage medium containing stored linkable phrases;
c) text input means for receiving submitted text, wherein the submitted text contains at least one submitted phrase that matches a first linkable phrase, and wherein the submitted phrase contains a shorter key that matches a second linkable phrase;
d) matching means for searching the submitted text and finding an occurrence of the submitted phrase in preference over the shorter key it contains, thereby determining a found submitted phrase;
e) output means for returning information corresponding to the found submitted phrase.
8. The system of claim 7 wherein the storage medium further contains information associating a linkable phrase with a context.
9. The system of claim 8 wherein the matching means further comprises means for finding an occurrence of the submitted phrase in a particular context.
10. The system of claim 7 wherein the stored linkable phrases are ordered so that the submitted phrase is ordered before the shorter key it contains, and wherein the matching means gives preference to the first match.
11. The system of claim 7 wherein the output means comprises means for returning a pointer to a network resource associated with the found submitted phrase.
12. The method of claim 11 wherein the output means comprises means for returning an ordered list of pointers to network resources.
US12/925,809 1998-10-05 2010-10-29 Method and apparatus for facilitating use of hypertext links on the World Wide Web Abandoned US20110047152A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US12/925,809 US20110047152A1 (en) 1998-10-05 2010-10-29 Method and apparatus for facilitating use of hypertext links on the World Wide Web

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US10308998P 1998-10-05 1998-10-05
US09/412,248 US6772139B1 (en) 1998-10-05 1999-10-05 Method and apparatus for facilitating use of hypertext links on the world wide web
US10/681,621 US7233950B2 (en) 1998-10-05 2003-10-07 Method and apparatus for facilitating use of hypertext links on the world wide web
US11/707,788 US7877407B2 (en) 1998-10-05 2007-02-15 Method and apparatus for facilitating use of hypertext links on the world wide web
US12/925,809 US20110047152A1 (en) 1998-10-05 2010-10-29 Method and apparatus for facilitating use of hypertext links on the World Wide Web

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US11/707,788 Continuation US7877407B2 (en) 1998-10-05 2007-02-15 Method and apparatus for facilitating use of hypertext links on the world wide web

Publications (1)

Publication Number Publication Date
US20110047152A1 true US20110047152A1 (en) 2011-02-24

Family

ID=32044839

Family Applications (4)

Application Number Title Priority Date Filing Date
US09/412,248 Expired - Lifetime US6772139B1 (en) 1998-10-05 1999-10-05 Method and apparatus for facilitating use of hypertext links on the world wide web
US10/681,621 Expired - Lifetime US7233950B2 (en) 1998-10-05 2003-10-07 Method and apparatus for facilitating use of hypertext links on the world wide web
US11/707,788 Expired - Fee Related US7877407B2 (en) 1998-10-05 2007-02-15 Method and apparatus for facilitating use of hypertext links on the world wide web
US12/925,809 Abandoned US20110047152A1 (en) 1998-10-05 2010-10-29 Method and apparatus for facilitating use of hypertext links on the World Wide Web

Family Applications Before (3)

Application Number Title Priority Date Filing Date
US09/412,248 Expired - Lifetime US6772139B1 (en) 1998-10-05 1999-10-05 Method and apparatus for facilitating use of hypertext links on the world wide web
US10/681,621 Expired - Lifetime US7233950B2 (en) 1998-10-05 2003-10-07 Method and apparatus for facilitating use of hypertext links on the world wide web
US11/707,788 Expired - Fee Related US7877407B2 (en) 1998-10-05 2007-02-15 Method and apparatus for facilitating use of hypertext links on the world wide web

Country Status (1)

Country Link
US (4) US6772139B1 (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20080037879A1 (en) * 2006-07-25 2008-02-14 Paxson Dana W Method and apparatus for electronic literary macrame component referencing
US20090235150A1 (en) * 2008-03-17 2009-09-17 Digitalsmiths Corporation Systems and methods for dynamically creating hyperlinks associated with relevant multimedia content
US20100211565A1 (en) * 2008-10-20 2010-08-19 Facility Italia S.P.A. Method for searching for multimedia content items on the internet
US20110035651A1 (en) * 2006-02-24 2011-02-10 Paxson Dana W Apparatus and method for creating literary macrames
US20110179344A1 (en) * 2007-02-26 2011-07-21 Paxson Dana W Knowledge transfer tool: an apparatus and method for knowledge transfer
US20110213805A1 (en) * 2004-03-15 2011-09-01 Yahoo! Inc. Search systems and methods with integration of user annotations
US20120137209A1 (en) * 2010-11-26 2012-05-31 International Business Machines Corporation Visualizing total order relation of nodes in a structured document
US8521517B2 (en) * 2010-12-13 2013-08-27 Google Inc. Providing definitions that are sensitive to the context of a text
US8689134B2 (en) 2006-02-24 2014-04-01 Dana W. Paxson Apparatus and method for display navigation
US8782013B1 (en) * 2002-10-08 2014-07-15 Symantec Operating Corporation System and method for archiving data
US9047259B1 (en) * 2010-07-21 2015-06-02 Google Inc. System and method for renaming hyperlinks
US20150161120A1 (en) * 2012-06-05 2015-06-11 Google Inc. Identifying landing pages for images

Families Citing this family (318)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8898190B2 (en) * 1997-01-29 2014-11-25 Philip R. Krause Method and apparatus for enhancing electronic reading by identifying relationships between sections of electronic text
US6154757A (en) * 1997-01-29 2000-11-28 Krause; Philip R. Electronic text reading environment enhancement method and apparatus
US8874611B2 (en) * 1997-01-29 2014-10-28 Philip R Krause Method and apparatus for enhancing electronic reading by identifying relationships between sections of electronic text
AUPO489297A0 (en) * 1997-01-31 1997-02-27 Aunty Abha's Electronic Publishing Pty Ltd A system for electronic publishing
US7293228B1 (en) 1997-01-31 2007-11-06 Timebase Pty Limited Maltweb multi-axis viewing interface and higher level scoping
US6772139B1 (en) * 1998-10-05 2004-08-03 Smith, Iii Julius O. Method and apparatus for facilitating use of hypertext links on the world wide web
US6145000A (en) 1998-10-06 2000-11-07 Ameritech Corporation System and method for creating and navigating a linear hypermedia resource program
US7051360B1 (en) * 1998-11-30 2006-05-23 United Video Properties, Inc. Interactive television program guide with selectable languages
US7334184B1 (en) 1999-03-10 2008-02-19 American Express Travel Related Services Company, Inc. Method for online information sharing for completing electronic forms
US7752251B1 (en) * 2000-04-14 2010-07-06 Brian Mark Shuster Method, apparatus and system for hosting information exchange groups on a wide area network
US9262384B2 (en) 1999-05-21 2016-02-16 E-Numerate Solutions, Inc. Markup language system, method, and computer program product
US9262383B2 (en) 1999-05-21 2016-02-16 E-Numerate Solutions, Inc. System, method, and computer program product for processing a markup document
US9268748B2 (en) 1999-05-21 2016-02-23 E-Numerate Solutions, Inc. System, method, and computer program product for outputting markup language documents
US8438487B1 (en) * 1999-05-24 2013-05-07 Catherine Lin-Hendel Method and system for one-click navigation and browsing of electronic media and their category structure as well as tracking the navigation and browsing thereof
US7350139B1 (en) * 2000-06-16 2008-03-25 American Express Travel Related Services Company, Inc. System and method for utilizing a drag and drop technique to complete electronic forms
US7496632B2 (en) * 1999-06-28 2009-02-24 Lockheed Martin Corporation Method and system for distributing a public information release authorization (PIRA) form over an intranet
US7165069B1 (en) * 1999-06-28 2007-01-16 Alexa Internet Analysis of search activities of users to identify related network sites
US6421724B1 (en) * 1999-08-30 2002-07-16 Opinionlab, Inc. Web site response measurement tool
US7031952B1 (en) 1999-10-08 2006-04-18 Knowledge Filter, Inc. Knowledge filter
US20070219995A1 (en) * 1999-10-08 2007-09-20 Knowledge Filter Knowledge filter
US7478035B1 (en) * 1999-11-02 2009-01-13 Eclarity, Inc. Verbal classification system for the efficient sending and receiving of information
US7337389B1 (en) 1999-12-07 2008-02-26 Microsoft Corporation System and method for annotating an electronic document independently of its content
US20040268253A1 (en) * 1999-12-07 2004-12-30 Microsoft Corporation Method and apparatus for installing and using reference materials in conjunction with reading electronic content
US7458014B1 (en) * 1999-12-07 2008-11-25 Microsoft Corporation Computer user interface architecture wherein both content and user interface are composed of documents with links
US6992687B1 (en) * 1999-12-07 2006-01-31 Microsoft Corporation Bookmarking and placemarking a displayed document in a computer system
US7028267B1 (en) * 1999-12-07 2006-04-11 Microsoft Corporation Method and apparatus for capturing and rendering text annotations for non-modifiable electronic content
US9424240B2 (en) 1999-12-07 2016-08-23 Microsoft Technology Licensing, Llc Annotations for electronic content
US8321496B2 (en) * 1999-12-13 2012-11-27 Half.Com, Inc. User evaluation of content on distributed communication network
US7107526B1 (en) * 2000-01-19 2006-09-12 Xerox Corporation Transparent injection of specific content into web pages viewed while browsing
US6389467B1 (en) 2000-01-24 2002-05-14 Friskit, Inc. Streaming media search and continuous playback system of media resources located by multiple network addresses
US7228305B1 (en) * 2000-01-24 2007-06-05 Friskit, Inc. Rating system for streaming media playback system
JP3307625B2 (en) * 2000-02-25 2002-07-24 株式会社ガーラ Electronic bulletin board system and mail server
US20010049707A1 (en) 2000-02-29 2001-12-06 Tran Bao Q. Systems and methods for generating intellectual property
US7428505B1 (en) 2000-02-29 2008-09-23 Ebay, Inc. Method and system for harvesting feedback and comments regarding multiple items from users of a network-based transaction facility
US6721726B1 (en) * 2000-03-08 2004-04-13 Accenture Llp Knowledge management tool
AU2001243443A1 (en) 2000-03-09 2001-09-17 The Web Access, Inc. Method and apparatus for performing a research task by interchangeably utilizinga multitude of search methodologies
US7340464B2 (en) * 2000-03-10 2008-03-04 General Electric Company Method for automated web site maintenance via searching
US7979881B1 (en) * 2000-03-30 2011-07-12 Microsoft Corporation System and method for identifying audio/visual programs to be recorded
WO2001076205A1 (en) * 2000-03-31 2001-10-11 Coppercom, Inc. Telecommunications system and methods
FR2807537B1 (en) * 2000-04-06 2003-10-17 France Telecom HYPERMEDIA RESOURCE SEARCH ENGINE AND INDEXING METHOD THEREOF
US7020679B2 (en) * 2000-05-12 2006-03-28 Taoofsearch, Inc. Two-level internet search service system
JP4037999B2 (en) * 2000-05-15 2008-01-23 インターナショナル・ビジネス・マシーンズ・コーポレーション Website, robot type search engine response system, robot type search engine registration method, storage medium, and program transmission device
US7269160B1 (en) 2000-05-26 2007-09-11 Buffalo International, Inc. Voice over internet call center integration
US7412409B2 (en) * 2000-06-15 2008-08-12 American Express Travel Related Services Company, Inc. Online ordering medium and method
US20080162298A1 (en) * 2000-06-15 2008-07-03 American Express Travel Related Services Company, Inc. Online ordering system and method
US7305355B2 (en) 2000-06-12 2007-12-04 American Express Travel Related Services Company, Inc. Universal shopping cart and order injection system
AU2001266938A1 (en) * 2000-06-15 2001-12-24 Infospace, Inc. Unified product purchasing system and method
US6948135B1 (en) 2000-06-21 2005-09-20 Microsoft Corporation Method and systems of providing information to computer users
US7000230B1 (en) 2000-06-21 2006-02-14 Microsoft Corporation Network-based software extensions
US7346848B1 (en) 2000-06-21 2008-03-18 Microsoft Corporation Single window navigation methods and systems
US7155667B1 (en) 2000-06-21 2006-12-26 Microsoft Corporation User interface for integrated spreadsheets and word processing tables
US7191394B1 (en) 2000-06-21 2007-03-13 Microsoft Corporation Authoring arbitrary XML documents using DHTML and XSLT
US6883168B1 (en) 2000-06-21 2005-04-19 Microsoft Corporation Methods, systems, architectures and data structures for delivering software via a network
US7624356B1 (en) 2000-06-21 2009-11-24 Microsoft Corporation Task-sensitive methods and systems for displaying command sets
US7117435B1 (en) 2000-06-21 2006-10-03 Microsoft Corporation Spreadsheet fields in text
US7234108B1 (en) 2000-06-29 2007-06-19 Microsoft Corporation Ink thickness rendering for electronic annotations
US7058641B1 (en) * 2000-08-08 2006-06-06 Franz Gregory J Information distribution system and method
US8397163B1 (en) * 2000-08-14 2013-03-12 Deep Sran Device, method, and system for providing an electronic reading environment
US20020049792A1 (en) * 2000-09-01 2002-04-25 David Wilcox Conceptual content delivery system, method and computer program product
US7426513B2 (en) * 2000-10-12 2008-09-16 Sap Portals Israel Ltd. Client-based objectifying of text pages
US7003551B2 (en) 2000-11-30 2006-02-21 Bellsouth Intellectual Property Corp. Method and apparatus for minimizing storage of common attachment files in an e-mail communications server
US9600842B2 (en) 2001-01-24 2017-03-21 E-Numerate Solutions, Inc. RDX enhancement of system and method for implementing reusable data markup language (RDL)
US20020099552A1 (en) * 2001-01-25 2002-07-25 Darryl Rubin Annotating electronic information with audio clips
US7185275B2 (en) * 2001-02-27 2007-02-27 International Business Machines Corporation Method and computer system for enabling a user to construct a web-site navigation
JP3379090B2 (en) * 2001-03-02 2003-02-17 インターナショナル・ビジネス・マシーンズ・コーポレーション Machine translation system, machine translation method, and machine translation program
US7158971B1 (en) 2001-03-07 2007-01-02 Thomas Layne Bascom Method for searching document objects on a network
US7389241B1 (en) * 2001-03-07 2008-06-17 Thomas Layne Bascom Method for users of a network to provide other users with access to link relationships between documents
US7386792B1 (en) 2001-03-07 2008-06-10 Thomas Layne Bascom System and method for collecting, storing, managing and providing categorized information related to a document object
US8103737B2 (en) * 2001-03-07 2012-01-24 International Business Machines Corporation System and method for previewing hyperlinks with ‘flashback’ images
US20020156809A1 (en) * 2001-03-07 2002-10-24 O'brien Thomas A. Apparatus and method for locating and presenting electronic content
US20020143557A1 (en) * 2001-03-14 2002-10-03 Gregory Vassmer Computer program product for assisting a user to select among information units of a plurality of structured information units concerning bearings and seals
US6912582B2 (en) * 2001-03-30 2005-06-28 Microsoft Corporation Service routing and web integration in a distributed multi-site user authentication system
US7500017B2 (en) * 2001-04-19 2009-03-03 Microsoft Corporation Method and system for providing an XML binary format
US20030078934A1 (en) * 2001-04-23 2003-04-24 David Cappellucci Dynamic content delivery system
US7117247B2 (en) * 2001-04-24 2006-10-03 Ricoh Company, Ltd. System, computer program product and method for storing information in an application service provider via e-mails
US20020169872A1 (en) * 2001-05-14 2002-11-14 Hiroshi Nomiyama Method for arranging information, information processing apparatus, storage media and program tranmission apparatus
US7272594B1 (en) * 2001-05-31 2007-09-18 Autonomy Corporation Ltd. Method and apparatus to link to a related document
US7289980B2 (en) * 2001-06-12 2007-10-30 International Business Machines Corporation Method and system for managing preferred web site links
US20030041305A1 (en) * 2001-07-18 2003-02-27 Christoph Schnelle Resilient data links
CA2353682A1 (en) * 2001-07-23 2003-01-23 Ibm Canada Limited-Ibm Canada Limitee Link management of document structures
US20030055914A1 (en) * 2001-08-01 2003-03-20 Izuru Tanaka Communication apparatus and method, recording medium, program, and content providing apparatus
US6963874B2 (en) * 2002-01-09 2005-11-08 Digital River, Inc. Web-site performance analysis system and method utilizing web-site traversal counters and histograms
JP4294912B2 (en) * 2001-08-13 2009-07-15 ブラザー工業株式会社 Terminal information notification system, terminal information notification method, and network terminal device
US7363310B2 (en) * 2001-09-04 2008-04-22 Timebase Pty Limited Mapping of data from XML to SQL
US20080148193A1 (en) * 2001-09-13 2008-06-19 John Moetteli System and method of efficient web browsing
US20030084095A1 (en) * 2001-10-26 2003-05-01 Hayden Douglas Todd Method to preserve web page links using registration and notification
US7281206B2 (en) 2001-11-16 2007-10-09 Timebase Pty Limited Maintenance of a markup language document in a database
US7343417B2 (en) * 2001-11-30 2008-03-11 Knowledge Networks, Inc. System and method for rating media information
US20030128231A1 (en) * 2002-01-09 2003-07-10 Stephane Kasriel Dynamic path analysis
US20030131097A1 (en) * 2002-01-09 2003-07-10 Stephane Kasriel Interactive path analysis
US20030131106A1 (en) * 2002-01-09 2003-07-10 Stephane Kasriel Web-page performance toolbar
US7631035B2 (en) * 2002-01-09 2009-12-08 Digital River, Inc. Path-analysis toolbar
US20030145096A1 (en) * 2002-01-29 2003-07-31 International Business Machines Corporation Method and device for delivering information through a distributed information system
US7284195B2 (en) * 2002-01-31 2007-10-16 International Business Machines Corporation Structure and method for linking within a website
US20030160830A1 (en) * 2002-02-22 2003-08-28 Degross Lee M. Pop-up edictionary
US7546338B2 (en) * 2002-02-25 2009-06-09 Ascentive Llc Method and system for screening remote site connections and filtering data based on a community trust assessment
EP1338982A1 (en) * 2002-02-26 2003-08-27 Hewlett-Packard Company Process for enhancing the access to resources
US7372952B1 (en) 2002-03-07 2008-05-13 Wai Wu Telephony control system with intelligent call routing
US7873900B2 (en) * 2002-03-22 2011-01-18 Inet Spch Property Hldg., Limited Liability Company Ordering internet voice content according to content density and semantic matching
US7712020B2 (en) * 2002-03-22 2010-05-04 Khan Emdadur R Transmitting secondary portions of a webpage as a voice response signal in response to a lack of response by a user
US7716199B2 (en) * 2005-08-10 2010-05-11 Google Inc. Aggregating context data for programmable search engines
US7743045B2 (en) * 2005-08-10 2010-06-22 Google Inc. Detecting spam related and biased contexts for programmable search engines
US20070038614A1 (en) * 2005-08-10 2007-02-15 Guha Ramanathan V Generating and presenting advertisements based on context data for programmable search engines
US7693830B2 (en) * 2005-08-10 2010-04-06 Google Inc. Programmable search engine
US20030202009A1 (en) * 2002-04-24 2003-10-30 Stephane Kasriel Integration toolbar
US7493560B1 (en) * 2002-05-20 2009-02-17 Oracle International Corporation Definition links in online documentation
US7505971B2 (en) * 2002-06-28 2009-03-17 Hewlett-Packard Development Company, L.P. Shared drive that provides shared access to editable files in a database
US20040015542A1 (en) * 2002-07-22 2004-01-22 Anonsen Steven P. Hypermedia management system
US7769750B2 (en) * 2002-07-22 2010-08-03 Microsoft Corporation Metadata based hypermedia management system
US20040019688A1 (en) * 2002-07-29 2004-01-29 Opinionlab Providing substantially real-time access to collected information concerning user interaction with a web page of a website
US7370285B1 (en) * 2002-07-31 2008-05-06 Opinionlab, Inc. Receiving and reporting page-specific user feedback concerning one or more particular web pages of a website
US7478121B1 (en) 2002-07-31 2009-01-13 Opinionlab, Inc. Receiving and reporting page-specific user feedback concerning one or more particular web pages of a website
US8335779B2 (en) * 2002-08-16 2012-12-18 Gamroe Applications, Llc Method and apparatus for gathering, categorizing and parameterizing data
US8090717B1 (en) * 2002-09-20 2012-01-03 Google Inc. Methods and apparatus for ranking documents
US7194469B1 (en) * 2002-09-24 2007-03-20 Adobe Systems Incorporated Managing links in a collection of documents
US7200589B1 (en) * 2002-10-03 2007-04-03 Hewlett-Packard Development Company, L.P. Format-independent advertising of data center resource capabilities
US7072893B1 (en) * 2002-11-27 2006-07-04 Adobe Systems Incorporated Repairing links in locked files
US20040158617A1 (en) * 2003-02-06 2004-08-12 Trip Advisor, Inc. Methods and systems for processing a link
US7415672B1 (en) 2003-03-24 2008-08-19 Microsoft Corporation System and method for designing electronic forms
US7370066B1 (en) 2003-03-24 2008-05-06 Microsoft Corporation System and method for offline editing of data files
US7275216B2 (en) * 2003-03-24 2007-09-25 Microsoft Corporation System and method for designing electronic forms and hierarchical schemas
US7913159B2 (en) 2003-03-28 2011-03-22 Microsoft Corporation System and method for real-time validation of structured data files
US7296017B2 (en) 2003-03-28 2007-11-13 Microsoft Corporation Validation of XML data files
US7827487B1 (en) * 2003-06-16 2010-11-02 Opinionlab, Inc. Soliciting user feedback regarding one or more web pages of a website without obscuring visual content
US7451392B1 (en) 2003-06-30 2008-11-11 Microsoft Corporation Rendering an HTML electronic form by applying XSLT to XML using a solution
US8122014B2 (en) * 2003-07-02 2012-02-21 Vibrant Media, Inc. Layered augmentation for web content
US7599938B1 (en) 2003-07-11 2009-10-06 Harrison Jr Shelton E Social news gathering, prioritizing, tagging, searching, and syndication method
US20050044192A1 (en) * 2003-07-28 2005-02-24 Applin John R. Web site management system with link management functionality
US7406660B1 (en) 2003-08-01 2008-07-29 Microsoft Corporation Mapping between structured data and a visual surface
US7334187B1 (en) 2003-08-06 2008-02-19 Microsoft Corporation Electronic form aggregation
US7454393B2 (en) * 2003-08-06 2008-11-18 Microsoft Corporation Cost-benefit approach to automatically composing answers to questions by extracting information from large unstructured corpora
JP4400147B2 (en) * 2003-08-29 2010-01-20 富士ゼロックス株式会社 Document group structure data creation apparatus and method
US20050086634A1 (en) * 2003-10-16 2005-04-21 International Business Machines Corporation Web page development environment that displays frequency of use information
AU2003287497A1 (en) * 2003-11-04 2004-06-06 Thomson Licensing Cache server at hotspots for downloading services
JP4720079B2 (en) * 2003-11-19 2011-07-13 富士ゼロックス株式会社 Manufacturing method of light control device
US8150824B2 (en) * 2003-12-31 2012-04-03 Google Inc. Systems and methods for direct navigation to specific portion of target document
US7418456B2 (en) * 2004-01-16 2008-08-26 International Business Machines Corporation Method for defining a metadata schema to facilitate passing data between an extensible markup language document and a hierarchical database
US7290012B2 (en) * 2004-01-16 2007-10-30 International Business Machines Corporation Apparatus, system, and method for passing data between an extensible markup language document and a hierarchical database
US7548910B1 (en) * 2004-01-30 2009-06-16 The Regents Of The University Of California System and method for retrieving scenario-specific documents
US8819072B1 (en) 2004-02-02 2014-08-26 Microsoft Corporation Promoting data from structured data files
US7636941B2 (en) * 2004-03-10 2009-12-22 Microsoft Corporation Cross-domain authentication
US8577893B1 (en) * 2004-03-15 2013-11-05 Google Inc. Ranking based on reference contexts
TWI270266B (en) * 2004-03-16 2007-01-01 Ec Server Com Inc Information management method of web farm
US20050210008A1 (en) * 2004-03-18 2005-09-22 Bao Tran Systems and methods for analyzing documents over a network
US7496837B1 (en) 2004-04-29 2009-02-24 Microsoft Corporation Structural editing with schema awareness
US20060048126A1 (en) * 2004-05-11 2006-03-02 Kui-Chang Sun Method and a system thereof for generating a homepage content file capable of dynamically linking with a plurality of on-demand knowledge bases
US7296021B2 (en) * 2004-05-21 2007-11-13 International Business Machines Corporation Method, system, and article to specify compound query, displaying visual indication includes a series of graphical bars specify weight relevance, ordered segments of unique colors where each segment length indicative of the extent of match of each object with one of search parameters
US7281018B1 (en) 2004-05-26 2007-10-09 Microsoft Corporation Form template data source change
US7774620B1 (en) 2004-05-27 2010-08-10 Microsoft Corporation Executing applications at appropriate trust levels
WO2005119435A2 (en) * 2004-06-02 2005-12-15 Open Text Corporation Systems and methods for dynamic menus
US9021529B2 (en) 2004-07-15 2015-04-28 Microsoft Technology Licensing, Llc Content recordation techniques
US7540051B2 (en) * 2004-08-20 2009-06-02 Spatial Systems, Inc. Mapping web sites based on significance of contact and category
US8244726B1 (en) * 2004-08-31 2012-08-14 Bruce Matesso Computer-aided extraction of semantics from keywords to confirm match of buyer offers to seller bids
US20060053092A1 (en) * 2004-09-01 2006-03-09 Chris Foo Method and system to perform dynamic search over a network
WO2006041425A2 (en) * 2004-09-15 2006-04-20 S.M.A.R.T. Link Medical, Inc. Social network analysis
US8412706B2 (en) * 2004-09-15 2013-04-02 Within3, Inc. Social network analysis
WO2006036216A2 (en) * 2004-09-15 2006-04-06 S.M.A.R.T. Link Medical, Inc. Collections of linked databases
US20070260599A1 (en) * 2004-09-15 2007-11-08 Mcguire Heather A Social network analysis
US8880521B2 (en) * 2004-09-15 2014-11-04 3Degrees Llc Collections of linked databases
US8635217B2 (en) 2004-09-15 2014-01-21 Michael J. Markus Collections of linked databases
US8577886B2 (en) * 2004-09-15 2013-11-05 Within3, Inc. Collections of linked databases
US7692636B2 (en) 2004-09-30 2010-04-06 Microsoft Corporation Systems and methods for handwriting to a screen
US20060074843A1 (en) * 2004-09-30 2006-04-06 Pereira Luis C World wide web directory for providing live links
KR100621179B1 (en) * 2004-10-26 2006-09-19 삼성전자주식회사 Display apparatus and method for processing text information thereof
US8487879B2 (en) 2004-10-29 2013-07-16 Microsoft Corporation Systems and methods for interacting with a computer through handwriting to a screen
US7606794B2 (en) * 2004-11-11 2009-10-20 Yahoo! Inc. Active Abstracts
US20060101012A1 (en) * 2004-11-11 2006-05-11 Chad Carson Search system presenting active abstracts including linked terms
US7712022B2 (en) 2004-11-15 2010-05-04 Microsoft Corporation Mutually exclusive options in electronic forms
US7721190B2 (en) 2004-11-16 2010-05-18 Microsoft Corporation Methods and systems for server side form processing
US7904801B2 (en) 2004-12-15 2011-03-08 Microsoft Corporation Recursive sections in electronic forms
US7937651B2 (en) 2005-01-14 2011-05-03 Microsoft Corporation Structural editing operations for network forms
US7958257B2 (en) * 2005-01-19 2011-06-07 International Business Machines Corporation Message filtering and demultiplexing system
EP1691305A1 (en) * 2005-02-10 2006-08-16 Sap Ag Method for visualizing and browsing entries of a database, method for intuitively searching a database, computer system and computer program product
US7725834B2 (en) 2005-03-04 2010-05-25 Microsoft Corporation Designer-created aspect for an electronic form template
US8417825B2 (en) * 2005-03-09 2013-04-09 Apple Inc. Communications handles and proxy agents
US7912838B2 (en) * 2005-04-07 2011-03-22 International Business Machines Corporation System and method for performing just-in-time priming of a database cache
JP4653805B2 (en) * 2005-04-12 2011-03-16 富士通株式会社 Semantic search program
US8010515B2 (en) 2005-04-15 2011-08-30 Microsoft Corporation Query to an electronic form
US7725820B2 (en) * 2005-05-16 2010-05-25 Planview, Inc. Method of generating a display for a directed graph and a system for use with the method
US7797628B2 (en) * 2005-05-16 2010-09-14 Planview, Inc. Method of using a directed graph and a system for use with the method
US8200975B2 (en) 2005-06-29 2012-06-12 Microsoft Corporation Digital signatures for network forms
US8453044B2 (en) * 2005-06-29 2013-05-28 Within3, Inc. Collections of linked databases
US8190681B2 (en) * 2005-07-27 2012-05-29 Within3, Inc. Collections of linked databases and systems and methods for communicating about updates thereto
US7774335B1 (en) * 2005-08-23 2010-08-10 Amazon Technologies, Inc. Method and system for determining interest levels of online content navigation paths
US8027976B1 (en) * 2005-08-23 2011-09-27 Oracle International Corporation Enterprise content search through searchable links
US7966395B1 (en) 2005-08-23 2011-06-21 Amazon Technologies, Inc. System and method for indicating interest of online content
US8874477B2 (en) 2005-10-04 2014-10-28 Steven Mark Hoffberg Multifactorial optimization system and method
US8943035B2 (en) * 2005-11-14 2015-01-27 Patrick J. Ferrel Distributing web applications across a pre-existing web
US10395326B2 (en) * 2005-11-15 2019-08-27 3Degrees Llc Collections of linked databases
US8001459B2 (en) 2005-12-05 2011-08-16 Microsoft Corporation Enabling electronic documents for limited-capability computing devices
US7814102B2 (en) * 2005-12-07 2010-10-12 Lexisnexis, A Division Of Reed Elsevier Inc. Method and system for linking documents with multiple topics to related documents
US8307275B2 (en) * 2005-12-08 2012-11-06 International Business Machines Corporation Document-based information and uniform resource locator (URL) management
US20070135945A1 (en) * 2005-12-14 2007-06-14 Microsoft Corporation Identifying property relationships
US7805430B2 (en) * 2005-12-22 2010-09-28 Sap Ag Evaluation of name prefix and suffix during a search
JP2007179145A (en) * 2005-12-27 2007-07-12 Brother Ind Ltd Address information search system and address information search program
US7499943B2 (en) * 2006-01-09 2009-03-03 International Business Machines Corporation Mapping for mapping source and target objects
CN101375268B (en) 2006-01-27 2012-06-06 Unz.Org有限责任公司 Computer system and method for presenting digitized content on a network
US7779343B2 (en) 2006-01-30 2010-08-17 Microsoft Corporation Opening network-enabled electronic documents
US8341226B2 (en) * 2006-03-15 2012-12-25 Intel Corporation Techniques to control electronic mail delivery
US7533112B2 (en) * 2006-03-27 2009-05-12 Microsoft Corporation Context hierarchies for address searching
KR100837749B1 (en) * 2006-04-18 2008-06-13 엔에이치엔(주) Method for investing article offered in on-line system with weight and system for executing the method
GB0608552D0 (en) * 2006-04-29 2006-06-07 Enabled London Ltd Information system
KR100808991B1 (en) * 2006-05-04 2008-03-07 삼성전자주식회사 Apparatus and Method For Searching Words With Electronic Dictionary In The Mobile Station
EP1860578A1 (en) * 2006-05-22 2007-11-28 Caterpillar Inc. System for analyzing patents
US20070276812A1 (en) * 2006-05-23 2007-11-29 Joshua Rosen Search Result Ranking Based on Usage of Search Listing Collections
US20070277097A1 (en) * 2006-05-25 2007-11-29 Erik Frederick Hennum Apparatus, system, and method for context-aware authoring transform
US8898264B1 (en) 2006-06-22 2014-11-25 Emc Corporation Linking business objects and documents
US8909748B1 (en) 2006-06-22 2014-12-09 Emc Corporation Configurable views of context-relevant content
US20080028286A1 (en) * 2006-07-27 2008-01-31 Chick Walter F Generation of hyperlinks to collaborative knowledge bases from terms in text
US8775237B2 (en) 2006-08-02 2014-07-08 Opinionlab, Inc. System and method for measuring and reporting user reactions to advertisements on a web page
US8676868B2 (en) * 2006-08-04 2014-03-18 Chacha Search, Inc Macro programming for resources
EP1895459A1 (en) * 2006-08-31 2008-03-05 Opinionlab, Inc. Computer-implemented system and method for measuring and reporting business intelligence based on comments collected from web page users using software associated with accessed web pages
US8635099B1 (en) 2006-09-26 2014-01-21 Gfk Custom Research, Llc Method and system for providing surveys
US7899700B2 (en) * 2006-09-29 2011-03-01 Knowledge Networks, Inc. Method and system for providing multi-dimensional feedback
US20080127042A1 (en) * 2006-10-17 2008-05-29 Microsoft Corporation Customizable list of application specific terms
US20080109712A1 (en) * 2006-11-06 2008-05-08 Mcbrearty Gerald F Method, system, and program product supporting automatic substitution of a textual string for a url within a document
US7840561B2 (en) * 2006-11-14 2010-11-23 Microsoft Corporation Retrieving hierarchical data
JP4915219B2 (en) * 2006-11-24 2012-04-11 富士通株式会社 Hypertext conversion program, method and apparatus
US7650571B2 (en) * 2006-12-27 2010-01-19 Sap Portals Israel Ltd. Smart links and dynamic favorites
US20090106201A1 (en) * 2006-12-27 2009-04-23 White Robert A System and method for related information search and presentation from user interface content
US8504922B2 (en) * 2006-12-29 2013-08-06 Microsoft Corporation Enhanced user navigation to previously visited areas in a media environment
US8127219B1 (en) 2007-01-22 2012-02-28 Adobe Systems Incorporated Printing and rendering hyperlink destinations
US7917507B2 (en) * 2007-02-12 2011-03-29 Microsoft Corporation Web data usage platform
US8429185B2 (en) * 2007-02-12 2013-04-23 Microsoft Corporation Using structured data for online research
US8549409B2 (en) * 2007-02-22 2013-10-01 Microsoft Corporation Customizable web application navigation
GB2443580B (en) * 2007-05-18 2008-07-30 Cvon Innovations Ltd Access system and method
US20090049405A1 (en) * 2007-06-01 2009-02-19 Kendall Gregory Lockhart System and method for implementing session-based navigation
US7747600B2 (en) * 2007-06-13 2010-06-29 Microsoft Corporation Multi-level search
US8706704B2 (en) * 2007-07-11 2014-04-22 International Business Machines Corporation Method and system for creating semantic relationships using hyperlinks
US20090024621A1 (en) * 2007-07-16 2009-01-22 Yahoo! Inc. Method to set up online book collections and facilitate social interactions on books
KR100944724B1 (en) * 2007-08-21 2010-03-03 엔에이치엔비즈니스플랫폼 주식회사 User authentication system using IP address and method thereof
US8234627B2 (en) * 2007-09-21 2012-07-31 Knowledge Networks, Inc. System and method for expediting information display
US20090094224A1 (en) 2007-10-05 2009-04-09 Google Inc. Collaborative search results
US7925694B2 (en) 2007-10-19 2011-04-12 Citrix Systems, Inc. Systems and methods for managing cookies via HTTP content layer
US7809697B1 (en) * 2007-11-09 2010-10-05 Google Inc. Compressing hyperlinks in a hyperlink-based document
US7853558B2 (en) * 2007-11-09 2010-12-14 Vibrant Media, Inc. Intelligent augmentation of media content
US8375073B1 (en) * 2007-11-12 2013-02-12 Google Inc. Identification and ranking of news stories of interest
US8561026B2 (en) * 2007-11-27 2013-10-15 International Business Machines Corporation Method, apparatus and computer program for facilitating the improvement of a user interface
US9223884B2 (en) * 2007-11-29 2015-12-29 Sap Se Resource identifier personalization
US8146110B2 (en) 2007-12-10 2012-03-27 Microsoft Corporation Service platform for in-context results
US8150842B2 (en) * 2007-12-12 2012-04-03 Google Inc. Reputation of an author of online content
US8150850B2 (en) * 2008-01-07 2012-04-03 Akiban Technologies, Inc. Multiple dimensioned database architecture
US8090877B2 (en) 2008-01-26 2012-01-03 Citrix Systems, Inc. Systems and methods for fine grain policy driven cookie proxying
US20090210493A1 (en) * 2008-02-15 2009-08-20 Microsoft Corporation Communicating and Displaying Hyperlinks in a Computing Community
US20090228804A1 (en) * 2008-03-05 2009-09-10 Microsoft Corporation Service Preview And Access From an Application Page
US8412702B2 (en) * 2008-03-12 2013-04-02 Yahoo! Inc. System, method, and/or apparatus for reordering search results
US7865455B2 (en) * 2008-03-13 2011-01-04 Opinionlab, Inc. System and method for providing intelligent support
US20090234837A1 (en) * 2008-03-14 2009-09-17 Yahoo! Inc. Search query
US8868482B2 (en) * 2008-03-20 2014-10-21 Oracle International Corporation Inferring schemas from XML document collections
US8209331B1 (en) * 2008-04-02 2012-06-26 Google Inc. Context sensitive ranking
US8051068B2 (en) * 2008-04-21 2011-11-01 Yahoo! Inc. Trail-based exploration of a repository of documents
US9135328B2 (en) * 2008-04-30 2015-09-15 Yahoo! Inc. Ranking documents through contextual shortcuts
US9081765B2 (en) 2008-08-12 2015-07-14 Abbyy Infopoisk Llc Displaying examples from texts in dictionaries
US8812304B2 (en) 2008-08-12 2014-08-19 Abbyy Infopoisk Llc Method and system for downloading additional search results into electronic dictionaries
US8135581B2 (en) * 2008-08-12 2012-03-13 Abbyy Software Ltd Method and system for downloading additional search results into electronic dictionaries
US8730241B2 (en) * 2008-09-08 2014-05-20 Fair Isaac Corporation Techniques for drawing curved edges in graphs
US8280836B2 (en) * 2008-09-08 2012-10-02 Fair Isaac Corporation Converting unordered graphs to oblivious read once ordered graph representation
US7843237B2 (en) * 2008-11-17 2010-11-30 Infineon Technologies Austria Ag Circuit arrangement for actuating a transistor
US20100199185A1 (en) * 2009-02-04 2010-08-05 Microsoft Corporation Common navigation mechanism for desktop and browser-based applications
US8650182B2 (en) * 2009-02-24 2014-02-11 Oracle International Corporation Mechanism for efficiently searching XML document collections
IT1394000B1 (en) * 2009-04-20 2012-05-17 Edintorni Media S R L ELECTRONIC METHOD FOR THE CREATION OF LINKS BETWEEN INTERNET PAGES.
US20100287129A1 (en) * 2009-05-07 2010-11-11 Yahoo!, Inc., a Delaware corporation System, method, or apparatus relating to categorizing or selecting potential search results
US8312048B2 (en) * 2009-05-19 2012-11-13 Microsoft Corporation Database application navigation
JP2011003182A (en) * 2009-05-19 2011-01-06 Studio Ousia Inc Keyword display method and system thereof
US8255787B2 (en) * 2009-06-29 2012-08-28 International Business Machines Corporation Automated configuration of location-specific page anchors
US8244755B2 (en) 2009-06-29 2012-08-14 International Business Machines Corporation Search engine optimization using page anchors
US9514435B2 (en) * 2009-08-17 2016-12-06 Accenture Global Services Limited System for targeting specific users to discussion threads
US9494931B2 (en) * 2009-09-23 2016-11-15 Fisher-Rosemount Systems, Inc. Dynamic hyperlinks for process control systems
US8332232B2 (en) * 2009-11-05 2012-12-11 Opinionlab, Inc. System and method for mobile interaction
US8392449B2 (en) * 2009-12-08 2013-03-05 Google Inc. Resource search operations
US20110145138A1 (en) * 2009-12-15 2011-06-16 Martin Fitzgerald Bradley Browser extension that processes text to facilitate commerce on social media
US9665648B2 (en) * 2010-03-29 2017-05-30 Nokia Technologies Oy Method and apparatus for a user interest topology based on seeded user interest modeling
SG182695A1 (en) * 2010-04-15 2012-08-30 Bijutsu Shuppan Networks Co Ltd Electronic book capable of making content of magazine into community
US8281232B2 (en) * 2010-04-22 2012-10-02 Rockmelt, Inc. Integrated adaptive URL-shortening functionality
US8572496B2 (en) * 2010-04-27 2013-10-29 Go Daddy Operating Company, LLC Embedding variable fields in individual email messages sent via a web-based graphical user interface
US9582575B2 (en) 2010-07-09 2017-02-28 Lexisnexis, A Division Of Reed Elsevier Inc. Systems and methods for linking items to a matter
US8386915B2 (en) 2010-07-26 2013-02-26 Rockmelt, Inc. Integrated link statistics within an application
WO2012024311A1 (en) 2010-08-16 2012-02-23 Perfect Sense Digital Llc Placement of hyperlinks in documents
US8751513B2 (en) 2010-08-31 2014-06-10 Apple Inc. Indexing and tag generation of content for optimal delivery of invitational content
US8819168B2 (en) * 2010-12-14 2014-08-26 Microsoft Corporation Link expansion service
US8868406B2 (en) * 2010-12-27 2014-10-21 Avaya Inc. System and method for classifying communications that have low lexical content and/or high contextual content into groups using topics
JP5760564B2 (en) * 2011-03-22 2015-08-12 カシオ計算機株式会社 Information display device and program
EP2691846A4 (en) * 2011-03-30 2015-03-11 Nokia Corp Using metadata
US9129225B2 (en) 2011-05-25 2015-09-08 Nokia Technologies Oy Method and apparatus for providing rule-based recommendations
US9177199B2 (en) * 2011-08-03 2015-11-03 Eastman Kodak Company Semantic magazine pages
US9063964B2 (en) * 2012-01-04 2015-06-23 Trustgo Mobile, Inc. Detecting application harmful behavior and grading application risks for mobile devices
US8856640B1 (en) 2012-01-20 2014-10-07 Google Inc. Method and apparatus for applying revision specific electronic signatures to an electronically stored document
US10789412B2 (en) 2012-02-20 2020-09-29 Wix.Com Ltd. System and method for extended dynamic layout
IL225079B (en) * 2012-03-06 2018-12-31 Wix Com Ltd A method and system for master -page based integrated editing and dynamic layout activation
US9201916B2 (en) * 2012-06-13 2015-12-01 Infosys Limited Method, system, and computer-readable medium for providing a scalable bio-informatics sequence search on cloud
US10261938B1 (en) 2012-08-31 2019-04-16 Amazon Technologies, Inc. Content preloading using predictive models
CN102902780B (en) * 2012-09-28 2016-04-20 五八有限公司 A kind of uniform resource position mark URL dynamic matching method and device
CN102890718B (en) * 2012-10-10 2015-04-08 重庆大学 Electronic nose data mining method based on supervised explicit manifold learning algorithm
US11308037B2 (en) 2012-10-30 2022-04-19 Google Llc Automatic collaboration
US9495341B1 (en) * 2012-12-18 2016-11-15 Google Inc. Fact correction and completion during document drafting
US9384285B1 (en) 2012-12-18 2016-07-05 Google Inc. Methods for identifying related documents
US9514113B1 (en) 2013-07-29 2016-12-06 Google Inc. Methods for automatic footnote generation
US9842113B1 (en) 2013-08-27 2017-12-12 Google Inc. Context-based file selection
US10628411B2 (en) * 2013-11-20 2020-04-21 International Business Machines Corporation Repairing a link based on an issue
US9529791B1 (en) 2013-12-12 2016-12-27 Google Inc. Template and content aware document and template editing
US9565147B2 (en) 2014-06-30 2017-02-07 Go Daddy Operating Company, LLC System and methods for multiple email services having a common domain
US9703763B1 (en) 2014-08-14 2017-07-11 Google Inc. Automatic document citations by utilizing copied content for candidate sources
US10362368B1 (en) * 2015-03-06 2019-07-23 Amazon Technologies, Inc. Inferring entity information in media content
US10162868B1 (en) * 2015-03-13 2018-12-25 Amazon Technologies, Inc. Data mining system for assessing pairwise item similarity
US9753701B2 (en) * 2015-12-10 2017-09-05 Sap Se Generating logic with scripting language in software as a service enterprise resource planning
US9762521B2 (en) * 2016-01-15 2017-09-12 International Business Machines Corporation Semantic analysis and delivery of alternative content
US10262006B2 (en) 2016-04-29 2019-04-16 Microsoft Technology Licensing, Llc Contextually triggered entry point
US10832000B2 (en) * 2016-11-14 2020-11-10 International Business Machines Corporation Identification of textual similarity with references
CN107122494B (en) * 2017-05-22 2020-06-26 南京大学 Topic model construction method based on community discovery
US10534798B2 (en) * 2017-06-13 2020-01-14 Oracle International Corporation Computer system and method to update data aggregation configurations and control data aggregation
US10642908B2 (en) * 2017-11-15 2020-05-05 Sap Se Internet of things search and discovery dynamic alteration of results
US10726072B2 (en) 2017-11-15 2020-07-28 Sap Se Internet of things search and discovery graph engine construction
US11250038B2 (en) * 2018-01-21 2022-02-15 Microsoft Technology Licensing, Llc. Question and answer pair generation using machine learning
US11003468B2 (en) * 2018-11-07 2021-05-11 Citrix Systems, Inc. Preloading of application on a user device based on content received by the user device
US11243906B2 (en) 2018-12-14 2022-02-08 OSAAP America, LLC Drag and drop format conversion between applications
US11200294B2 (en) * 2019-03-20 2021-12-14 Hisense Visual Technology Co., Ltd. Page updating method and display device
US11321412B1 (en) 2020-11-04 2022-05-03 Capital One Services, Llc Customized navigation flow
US11550844B2 (en) * 2020-12-07 2023-01-10 Td Ameritrade Ip Company, Inc. Transformation of database entries for improved association with related content items

Citations (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4982344A (en) * 1988-05-18 1991-01-01 Xerox Corporation Accelerating link creation
US5204947A (en) * 1990-10-31 1993-04-20 International Business Machines Corporation Application independent (open) hypermedia enablement services
US5297249A (en) * 1990-10-31 1994-03-22 International Business Machines Corporation Hypermedia link marker abstract and search services
US5760771A (en) * 1996-07-17 1998-06-02 At & T Corp System and method for providing structured tours of hypertext files
US5794257A (en) * 1995-07-14 1998-08-11 Siemens Corporate Research, Inc. Automatic hyperlinking on multimedia by compiling link specifications
US5806079A (en) * 1993-11-19 1998-09-08 Smartpatents, Inc. System, method, and computer program product for using intelligent notes to organize, link, and manipulate disparate data objects
US5822539A (en) * 1995-12-08 1998-10-13 Sun Microsystems, Inc. System for adding requested document cross references to a document by annotation proxy configured to merge and a directory generator and annotation server
US5963956A (en) * 1997-02-27 1999-10-05 Telcontar System and method of optimizing database queries in two or more dimensions
US5974407A (en) * 1997-09-29 1999-10-26 Sacks; Jerome E. Method and apparatus for implementing a hierarchical database management system (HDBMS) using a relational database management system (RDBMS) as the implementing apparatus
US6092074A (en) * 1998-02-10 2000-07-18 Connect Innovations, Inc. Dynamic insertion and updating of hypertext links for internet servers
US6154750A (en) * 1998-04-01 2000-11-28 Cyberpulse Llc Method and system for navigation and data entry in heirarchically-organized database views
US6282511B1 (en) * 1996-12-04 2001-08-28 At&T Voiced interface with hyperlinked information
US6286005B1 (en) * 1998-03-11 2001-09-04 Cannon Holdings, L.L.C. Method and apparatus for analyzing data and advertising optimization
US6285999B1 (en) * 1997-01-10 2001-09-04 The Board Of Trustees Of The Leland Stanford Junior University Method for node ranking in a linked database
US6539387B1 (en) * 1995-10-23 2003-03-25 Avraham Oren Structured focused hypertext data structure
US6714926B1 (en) * 1999-02-02 2004-03-30 Amazon.Com, Inc. Use of browser cookies to store structured data
US6714936B1 (en) * 1999-05-25 2004-03-30 Nevin, Iii Rocky Harry W. Method and apparatus for displaying data stored in linked nodes
US6751736B1 (en) * 2000-03-14 2004-06-15 International Business Machines Corporation Method and apparatus for E-commerce by using optional fields for virtual bar codes
US6772139B1 (en) * 1998-10-05 2004-08-03 Smith, Iii Julius O. Method and apparatus for facilitating use of hypertext links on the world wide web
US6907449B2 (en) * 1998-09-22 2005-06-14 Qwest Communications International, Inc. Conferencing system for simultaneous broadcast of audio and transmission of documents via push technology
US7193997B2 (en) * 2001-03-19 2007-03-20 International Business Machines Corporation Packet classification
US7391865B2 (en) * 1999-09-20 2008-06-24 Security First Corporation Secure data parser method and system

Patent Citations (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4982344A (en) * 1988-05-18 1991-01-01 Xerox Corporation Accelerating link creation
US5204947A (en) * 1990-10-31 1993-04-20 International Business Machines Corporation Application independent (open) hypermedia enablement services
US5297249A (en) * 1990-10-31 1994-03-22 International Business Machines Corporation Hypermedia link marker abstract and search services
US5806079A (en) * 1993-11-19 1998-09-08 Smartpatents, Inc. System, method, and computer program product for using intelligent notes to organize, link, and manipulate disparate data objects
US5794257A (en) * 1995-07-14 1998-08-11 Siemens Corporate Research, Inc. Automatic hyperlinking on multimedia by compiling link specifications
US6539387B1 (en) * 1995-10-23 2003-03-25 Avraham Oren Structured focused hypertext data structure
US5822539A (en) * 1995-12-08 1998-10-13 Sun Microsystems, Inc. System for adding requested document cross references to a document by annotation proxy configured to merge and a directory generator and annotation server
US5760771A (en) * 1996-07-17 1998-06-02 At & T Corp System and method for providing structured tours of hypertext files
US6282511B1 (en) * 1996-12-04 2001-08-28 At&T Voiced interface with hyperlinked information
US6285999B1 (en) * 1997-01-10 2001-09-04 The Board Of Trustees Of The Leland Stanford Junior University Method for node ranking in a linked database
US5963956A (en) * 1997-02-27 1999-10-05 Telcontar System and method of optimizing database queries in two or more dimensions
US5974407A (en) * 1997-09-29 1999-10-26 Sacks; Jerome E. Method and apparatus for implementing a hierarchical database management system (HDBMS) using a relational database management system (RDBMS) as the implementing apparatus
US6092074A (en) * 1998-02-10 2000-07-18 Connect Innovations, Inc. Dynamic insertion and updating of hypertext links for internet servers
US6286005B1 (en) * 1998-03-11 2001-09-04 Cannon Holdings, L.L.C. Method and apparatus for analyzing data and advertising optimization
US6154750A (en) * 1998-04-01 2000-11-28 Cyberpulse Llc Method and system for navigation and data entry in heirarchically-organized database views
US6907449B2 (en) * 1998-09-22 2005-06-14 Qwest Communications International, Inc. Conferencing system for simultaneous broadcast of audio and transmission of documents via push technology
US6772139B1 (en) * 1998-10-05 2004-08-03 Smith, Iii Julius O. Method and apparatus for facilitating use of hypertext links on the world wide web
US7233950B2 (en) * 1998-10-05 2007-06-19 Smith Iii Julius O Method and apparatus for facilitating use of hypertext links on the world wide web
US7877407B2 (en) * 1998-10-05 2011-01-25 Smith Iii Julius O Method and apparatus for facilitating use of hypertext links on the world wide web
US6714926B1 (en) * 1999-02-02 2004-03-30 Amazon.Com, Inc. Use of browser cookies to store structured data
US6714936B1 (en) * 1999-05-25 2004-03-30 Nevin, Iii Rocky Harry W. Method and apparatus for displaying data stored in linked nodes
US7391865B2 (en) * 1999-09-20 2008-06-24 Security First Corporation Secure data parser method and system
US6751736B1 (en) * 2000-03-14 2004-06-15 International Business Machines Corporation Method and apparatus for E-commerce by using optional fields for virtual bar codes
US7193997B2 (en) * 2001-03-19 2007-03-20 International Business Machines Corporation Packet classification

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8782013B1 (en) * 2002-10-08 2014-07-15 Symantec Operating Corporation System and method for archiving data
US20110213805A1 (en) * 2004-03-15 2011-09-01 Yahoo! Inc. Search systems and methods with integration of user annotations
US9489463B2 (en) * 2004-03-15 2016-11-08 Excalibur Ip, Llc Search systems and methods with integration of user annotations
US20110035651A1 (en) * 2006-02-24 2011-02-10 Paxson Dana W Apparatus and method for creating literary macrames
US8689134B2 (en) 2006-02-24 2014-04-01 Dana W. Paxson Apparatus and method for display navigation
US8091017B2 (en) * 2006-07-25 2012-01-03 Paxson Dana W Method and apparatus for electronic literary macramé component referencing
US20080037879A1 (en) * 2006-07-25 2008-02-14 Paxson Dana W Method and apparatus for electronic literary macrame component referencing
US20110179344A1 (en) * 2007-02-26 2011-07-21 Paxson Dana W Knowledge transfer tool: an apparatus and method for knowledge transfer
US9690786B2 (en) * 2008-03-17 2017-06-27 Tivo Solutions Inc. Systems and methods for dynamically creating hyperlinks associated with relevant multimedia content
US20090235150A1 (en) * 2008-03-17 2009-09-17 Digitalsmiths Corporation Systems and methods for dynamically creating hyperlinks associated with relevant multimedia content
US9519713B2 (en) * 2008-10-20 2016-12-13 Facilitylive S.R.L. Method for searching for multimedia content items on the internet
US20100211565A1 (en) * 2008-10-20 2010-08-19 Facility Italia S.P.A. Method for searching for multimedia content items on the internet
US9047259B1 (en) * 2010-07-21 2015-06-02 Google Inc. System and method for renaming hyperlinks
US9043695B2 (en) * 2010-11-26 2015-05-26 International Business Machines Corporation Visualizing total order relation of nodes in a structured document
US20120137209A1 (en) * 2010-11-26 2012-05-31 International Business Machines Corporation Visualizing total order relation of nodes in a structured document
US8645364B2 (en) 2010-12-13 2014-02-04 Google Inc. Providing definitions that are sensitive to the context of a text
US8521517B2 (en) * 2010-12-13 2013-08-27 Google Inc. Providing definitions that are sensitive to the context of a text
US20150161120A1 (en) * 2012-06-05 2015-06-11 Google Inc. Identifying landing pages for images
US9158857B2 (en) * 2012-06-05 2015-10-13 Google Inc. Identifying landing pages for images

Also Published As

Publication number Publication date
US20070156761A1 (en) 2007-07-05
US7233950B2 (en) 2007-06-19
US6772139B1 (en) 2004-08-03
US7877407B2 (en) 2011-01-25
US20040068527A1 (en) 2004-04-08

Similar Documents

Publication Publication Date Title
US7233950B2 (en) Method and apparatus for facilitating use of hypertext links on the world wide web
US7770123B1 (en) Method for dynamically generating a “table of contents” view of a HTML-based information system
US7117199B2 (en) Spatially coding and displaying information
AU2003258037B2 (en) Desktop client interaction with a geographic text search system
US20030050927A1 (en) System and method for location, understanding and assimilation of digital documents through abstract indicia
EP1211616A2 (en) Data storage and retrieval system
US20080034285A1 (en) Information architecture for the interactive environment
WO2001029709A1 (en) System and method for location, understanding and assimilation of digital documents through abstract indicia
Brinkley et al. Information retrieval from the Internet: an evaluation of the tools
Zellweger A kowledge-based model to database retrieval
Eller An associative repository for the administration of course material
Toney et al. New Web-based interfaces to old databases
Florez Finding and Evaluating Adult ESL Resources on the World Wide Web. ERIC Q & A.
Neuman et al. Providing and Accessing Information via the Internet: The Georgetown Catalogue of Project in Electronic Text
Sarwar Automated Libraries: What we expect from Digital Libraries
Simutis New Approaches to Sharing Mathematics and Science Databases
Kaushik Design, Implementation and Evaluation of WebManthan: A Personalized, Graphical Web Browser
Rakhshan Structural Web search and query mining

Legal Events

Date Code Title Description
AS Assignment

Owner name: PRESCOTT NETWORKS, LLC, DELAWARE

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SMITH, JULIUS O., III;REEL/FRAME:025890/0001

Effective date: 20101004

STCB Information on status: application discontinuation

Free format text: ABANDONED -- AFTER EXAMINER'S ANSWER OR BOARD OF APPEALS DECISION

AS Assignment

Owner name: INTELLECTUAL VENTURES ASSETS 186 LLC, DELAWARE

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:GULA CONSULTING LIMITED LIABILITY COMPANY;REEL/FRAME:062756/0052

Effective date: 20221222

AS Assignment

Owner name: INTELLECTUAL VENTURES ASSETS 186 LLC, DELAWARE

Free format text: SECURITY INTEREST;ASSIGNOR:MIND FUSION, LLC;REEL/FRAME:063295/0001

Effective date: 20230214

Owner name: INTELLECTUAL VENTURES ASSETS 191 LLC, DELAWARE

Free format text: SECURITY INTEREST;ASSIGNOR:MIND FUSION, LLC;REEL/FRAME:063295/0001

Effective date: 20230214