US20120278160A1 - Real-time adaptive probabilistic selection of messages - Google Patents

Real-time adaptive probabilistic selection of messages Download PDF

Info

Publication number
US20120278160A1
US20120278160A1 US13/544,861 US201213544861A US2012278160A1 US 20120278160 A1 US20120278160 A1 US 20120278160A1 US 201213544861 A US201213544861 A US 201213544861A US 2012278160 A1 US2012278160 A1 US 2012278160A1
Authority
US
United States
Prior art keywords
advertising message
message
advertising
measure
messages
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/544,861
Inventor
Ion T. Ieong
Christopher R. Barkley
Matthew O. Anderson
Richard Hansen Wolniewicz
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to US13/544,861 priority Critical patent/US20120278160A1/en
Publication of US20120278160A1 publication Critical patent/US20120278160A1/en
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0242Determining effectiveness of advertisements
    • G06Q30/0244Optimization

Definitions

  • the described technology is directed to the field of providing electronic content, and, more particularly, to the field of optimized message selection.
  • Web publishers often include advertising messages in the web pages that they serve to users. Such advertising messages may either be for the benefit of an advertiser other than the publisher, or may promote products or services sold by the publisher.
  • an online merchant may include on its web pages advertisements promoting products or services sold by the online merchant.
  • each page served by a publisher may have a number of “slots” (also referred to herein as distinguished locations) that each may contain an advertising message. It is frequently true that, for each slot, several different advertising messages are eligible to be included in that slot, such that the publisher may select any of these eligible advertising messages to include in the display. It is common for a publisher to select from among the advertising messages eligible for a particular slot the advertising message expected to have the greatest value to the publisher, such as the advertising message expected to produce the greatest measure of revenue and/or profit for the publisher.
  • each eligible advertising message is expected to produce by dividing the total revenue and/or profit that the advertising message has produced when included in a slot during a foregoing period of time by the number of times the advertising message has been included in the display during that period to obtain a “mean value” or “expected value” for the advertising message in the slot.
  • This approach often provides a useful basis for selecting an eligible advertising message to include in a slot where each eligible message has been included in the display (1) a statistically significant number of times (2) under relevant conditions that are similar to those present at the time the slot is being filled. Where this situation does not exist, however, one or more of the advertising messages may be misvalued based upon a scarcity of representative experience with the advertising message, causing the publisher to select an advertising message other than the optimal advertising message and adversely affect its overall results.
  • FIG. 1 is a high-level data flow diagram showing data flow within a typical arrangement of components used to provide the facility.
  • FIG. 2 is a conceptual diagram showing a typical served page.
  • FIG. 3 is a block diagram showing some of the components typically incorporated in at least some of the computer systems and other devices on which the facility executes.
  • FIG. 4 is a graph showing sample normal distribution curves used by the facility to select scores for each of two candidate messages for a particular slot.
  • FIG. 5 is a flow diagram showing a routine performed by the facility in order to select messages to include in displays contained in a requested page in some embodiments.
  • FIG. 6 is a flow diagram showing a routine performed by the facility in order to score a message for slot in some embodiments.
  • FIG. 7 is a flow diagram showing a routine performed by the facility to track the value produced by the exposure of particular messages in particular slots in some embodiments.
  • FIG. 8 is a flow diagram showing a routine performed by the facility in order to maintain the bonus associated with a guarantee in some embodiments.
  • FIG. 9 is a flow diagram showing a routine performed by the facility in order to combine statistics collected during the current period into the overall statistics maintained and used by the facility in some embodiments.
  • Applicants have recognized that the modal exploitation/experimentation approach to selecting advertising messages described above has the disadvantage that the fixed percentage of opportunities that are allocated to experimentation versus exploitation is, at any particular moment, very likely to differ from the optimal percentage. Accordingly, they have developed an approach to selecting messages that does not rely on allocating a fixed percentage of opportunities to each of an exploitation mode and a separate experimentation mode.
  • a software facility (“the facility”) that seeks to optimize the selection of advertising messages or other content using probabilistic techniques. For each message that is eligible to be included in a particular slot in a particular served instance of a page, the facility generates an inclusion score by randomly selecting a value from a probability distribution, such as a normal distribution, corresponding to the combination of the message and the slot.
  • a probability distribution such as a normal distribution
  • the normal distribution is based on the message's history of being presented in the slot: the normal distribution is centered on the mean value for the message in the slot, and its breadth, that is, the number of standard deviations represented by the normal distribution, is generally inversely related to the number of exposures that the message has had in the slot during a foregoing time period, such that a first message that has had less exposure in the slot than a second message typically has a wider normal distribution than the second message.
  • the probabilistic selection of an inclusion score employed by the facility weights each possible value in accordance with the height of the normal distribution at that value, such that the likelihood of selecting a value near the center of a normal distribution is greater than selecting a value further from the center of that normal distribution.
  • the facility ranks the messages based upon their inclusion scores. The facility then proceeds to select eligible messages in decreasing order of their inclusion scores. This approach is well-adapted to resolving a selection between a set of candidates whose membership may vary frequently based upon which advertising messages are eligible to be included in a particular slot in a particular served instance of a page. Indeed, the approach used by the facility can be successful at selecting among sets of candidate messages whose membership is different in every iteration.
  • the facility generates inclusion scores for a slot in a manner that seeks to satisfy guarantees established for one or more messages. Such guarantees each specify a minimum percentage of a slot's exposures that should include a particular message. For example, a guarantee may specify that a particular message be included in a particular slot for 20% of the slot's total exposures.
  • the facility seeks to satisfy guarantees by establishing, for each guaranteed message, a guarantee bonus that is added to inclusion scores determined for the guaranteed message in the slot that is the subject of the guarantee.
  • the facility uses these augmented inclusion scores in its comparison of the inclusion scores of candidate messages.
  • the facility dynamically determines a guarantee bonus for each guarantee that is large enough to satisfy the guarantee, but not much larger. In particular, the facility may adjust a guarantee bonus level earlier determined for the message based upon the rate at which a message has been exposed in the slot while subject to that guarantee bonus level.
  • the facility automatically attributes to new messages having no or limited exposure history in a slot seed values that cause the facility to select these new messages at a moderate rate and establish an actual history, which is later used as a basis for selecting these messages.
  • the facility determines a seed value for a new message in a way that is sensitive to details of messages with which the new message will compete for the current slot of the current page instance.
  • the facility decays the histories it maintains in order to deemphasize past message performance that may no longer be representative of current performance.
  • the facility applies a variable decay rate obtained by simulating the operation of the facility at different decay rates, and choosing the decay rate whose simulation produces the highest overall value.
  • the facility frequently produces a high overall value, while satisfying exposure guarantees, and adjusting to new message-slot combinations and trends in the value of a message in a slot.
  • FIG. 1 is a high-level data flow diagram showing data flow within a typical arrangement of components used to provide the facility.
  • a number of web client computer systems 110 that are under user control generate and send page view requests 131 to a logical web server 100 via a network such as the Internet 120 .
  • These requests typically include page view requests for web pages that may each contain one or more slots for including messages.
  • these requests may either all be routed to a single web server computer system, or may be load-balanced among a number of web server computer systems.
  • the web server typically replies to each with a served page 132 .
  • FIG. 2 is a conceptual diagram showing a typical served page.
  • the sample page 200 contains certain content that is native to all instances of the page, such as native content 210 describing a softball item that is available for purchase.
  • the page further contains two slots 220 and 230 , each of which can contain a message selected by the facility. Three sample messages 221 - 223 are shown that are candidate messages for slot 220 .
  • the facility identifies these candidate messages from a larger pool of messages scheduled into slot 220 based on one or more preliminary eligibility tests, which can be based on such factors as the time of day, day of week, user identity, user characteristics, user behavior, etc.
  • Slot 230 may have a different set of candidate messages, depending on how messages have been scheduled into these slots.
  • the facility increments a count of the number of times that each message is included in display 220 during the current tracking period.
  • the user may select the message to obtain additional information about the subject of the message.
  • the user may go on to perform various actions related to the subject of the message, such as purchasing an item, entering a contest, or recommending an item to another user.
  • the operator of the facility may ascribe a value to any of these actions resulting from a message selection.
  • the value ascribed to an action resulting from message selection may be the level of revenue or profit that the publisher expects to derive from the action, such as the level of revenue or profit that the publisher expects to derive from an item purchase.
  • the ascribed value may correspond to an amount of money that the advertiser has agreed to pay the publisher when a message is selected, or when an action is performed.
  • the ascribed value may correspond to a level of desirability of the action to the publisher or another party that is determined without reference to any pecuniary value of the action. For example, a publisher who regards a user's provision of their postal address to be twice as useful as the user's provision of their email address may ascribe a value of $2.00 to the provision of a postal address and a value of $1.00 to the provision of an email address.
  • the facility augments a balance of a total value the message has produced when displayed in display 220 during the current tracking period.
  • the count of the number of times the message has been included in slot 220 and the balance of the total value that message has produced when displayed in slot 220 forms the basis used by the facility to score this message for future inclusion in slot 220 .
  • FIG. 3 is a block diagram showing some of the components typically incorporated in at least some of the computer systems and other devices on which the facility executes.
  • These computer systems and devices 300 may include one or more central processing units (“CPUs”) 301 for executing computer programs; a computer memory 302 for storing programs and data while they are being used; a persistent storage device 303 , such as a hard drive for persistently storing programs and data; a computer-readable media drive 304 , such as a floppy, CD-ROM, or DVD drive, for reading programs and data stored on a computer-readable medium; and a network connection 305 for connecting the computer system to other computer systems, such as via the Internet. While computer systems configured as described above are typically used to support the operation of the facility, those skilled in the art will appreciate that the facility may be implemented using devices of various types and configurations, and having various components.
  • FIG. 4 is a graph showing sample normal distribution curves used by the facility to select scores for each of two candidate messages for a particular slot.
  • the graph shows curve 410 corresponding to a first candidate message and curve 420 corresponding to a second candidate message.
  • the height of each curve at a given score indicates the relative probability that that score will be selected for the candidate message to which the curve corresponds.
  • the mean value 411 of curve 410 is larger than the mean value 421 of curve 420 , indicating that the first message has proved to have a higher average value than the second message when presented in the slot.
  • Curve 420 is broader than curve 410 , indicating that the value of the second message has a higher level of variability, or “standard error,” than the first message.
  • the standard error metric takes into account the number of exposures of a message, this may indicate that the second message has been included in the display fewer times than the first message, perhaps because the second message has only been scheduled into the slot recently.
  • the standard error is an inverse function of number of exposures, and a direct function of the total variations in the message's value.
  • the standard error is large when a message is first introduced to a slot, and tends to decline thereafter, except for messages that exhibit erratic performance.
  • the mean value adjusts over time to reflect the current average of the value produced by exposures of the message in the slot.
  • the graph also shows sample scores for these two messages, selected at random within the area of each of the curves.
  • the higher mean of curve 410 causes the first message to be scored higher than the second message in the majority of cases in which the two messages are competing for a slot.
  • the greater breadth of curve 420 ensures that the second message will be scored higher than the first message in at least a small number of cases in which the two messages are competing for the same slot. It can be seen that, in this case, the score 422 determined by the facility for the second message is larger than the score 412 selected for the first message, causing the second message to be included in the display ahead of the first message, despite the first message's higher mean value.
  • the second message is included in the display in subsequent instances of the page containing the slot, its standard error will likely decrease, reducing its likelihood of selection relative to candidates like the first message unless these additional exposures create significant value, increasing its mean value.
  • FIGS. 5-8 are flow diagrams showing various operational routines performed by some embodiments of the facility.
  • FIG. 5 is a flow diagram showing routine performed by the facility in order to select messages to include in displays contained in a requested page in some embodiments.
  • the facility loops through each slot in the page to determine which message to display in that slot.
  • the facility loops through each message scheduled in the slot to determine which of the scheduled message will be displayed in each particular slot for an instance of a requested page.
  • the facility determines whether the message is eligible for this instance of the page. If the facility determines that the message is eligible for this instance of the page, then the facility continues in block 504 , else the facility continues in block 505 .
  • block 504 the facility scores the message for the slot. Block 504 is discussed below in greater detail in connection with FIG. 6 .
  • the facility continues in block 505 .
  • block 505 if additional messages remain to be processed, then the facility returns to block 502 to select the next message, else the facility continues in block 506 .
  • the facility After each eligible message has been scored at block 504 , in blocks 506 - 509 , the facility loops through each message scored in step 504 in decreasing order of their scores until a message is selected for inclusion in the instance of the page.
  • the facility determines whether the message is suitable for this instance of the page. In some embodiments, the facility makes this determination by calling a suitability-checking routine associated with the message. If the facility determines that the message is suitable for this instance of the page, the facility continues to block 508 , else the facility continues in block 509 . If the facility determines at block 507 that the message is suitable for this instance of the page, at block 508 the facility includes the message in the slot.
  • the facility increments the number of exposures of the message in the slot performed during the current tracking period to reflect the inclusion of the message in the slot in block 508 .
  • the facility instead stores an indication that the message has been exposed in the slot, and at a later time counts the number of such indications that were stored. After block 509 , the facility continues in block 511 .
  • the facility performs blocks 506 - 510 in a different software subsystem and/or in a different computer system than blocks 502 - 505 .
  • the entity performing blocks 502 - 505 provides, in various embodiments, various sets of messages to the entity performing blocks 506 - 510 .
  • the first entity may provide a certain number of messages, or a group of messages in decreasing order of message scores up to a message that is known to be suitable for inclusion in the slot, such as a message with which no suitability test or suitability-checking routine is associated.
  • the second entity reports the identity of the message included in the slot in block 508 for inclusion in the statistics maintained and used by the facility.
  • block 510 if additional messages remain to be processed, then the facility continues in block 506 to select the next message, else the facility continues in block 510 . In block 510 , if additional slots remain to be processed, then the facility continues in block 501 to select the next slot, else the routine concludes.
  • routine shown in FIG. 5 and each of the routines discussed below may be altered in a variety of ways. For example, the order of the routines may be rearranged; some portions of the routine may be performed in parallel, omitted, additional functionality may be included, etc.
  • FIG. 6 is a flow diagram showing a routine typically performed by the facility in order to score a message for a slot, as referenced above with respect to block 504 ( FIG. 5 ).
  • decision block 601 it is determined whether adequate statistics are available for the performance of the message in the slot. If adequate statistics are available, then the facility continues to block 603 , else the facility continues to block 602 .
  • the facility seeds performance statistics for the message in the slot. The details of block 602 are discussed in greater detail below. After block 602 , the facility continues to block 603 .
  • the facility determines a score for the combination of the message and the slot using statistics about the performance of the message in the slot.
  • the facility uses the following series of formulas to determine a score for a message in the slot.
  • the total value S produced by the message in the slot is obtained from x, the value produced by each exposure:
  • the expected value EV is obtained from S and the number of exposures of the message in the slot N:
  • the standard deviation SD is obtained from SS and N:
  • the standard error SE is obtained from SD and N:
  • the facility implements normrand( ) by mapping the output of a random number generator onto a normal distribution, such as by mapping the output of the random number generator variate_generator onto a normal distribution normal_distribution, both provided as part of the Boost library available from and documented at http://www.boost.org/index.htm.
  • decision block 604 the facility determines whether the message in this slot is the subject of a guarantee. If the facility determines that the message in the slot is the subject of a guarantee, then the facility continues in block 605 , else the routine concludes. In block 605 , the facility adds a guarantee bonus determined by the facility to the score determined in block 603 . The facility's approach to determining this guarantee bonus is discussed in greater detail below in connection with FIG. 8 . After block 605 , the routine completes.
  • the facility seeds performance statistics for the message in the slot in block 602 as follows. In some embodiments, the facility seeds these statistics in the local context of filling the slot for the current page instance, selecting a seeded mean value equal to the average mean value of the other current candidates for the slot, and a seeded standard error equal to the average standard error of the other current candidates for the slot. In some embodiments, the facility seeds these statistics based on an overview of recent exposures of the slots. As one example, in some embodiments the facility uses the expression
  • FIG. 7 is a flow diagram showing a routine performed by the facility to track the value produced by the exposure of particular messages in particular slots in some embodiments.
  • the facility detects a user action associated with selection of a message in a slot. As discussed above in connection with FIG. 2 , the detected action may be the message selection itself, or any subsequent action to which the advertiser, the publisher, or another party ascribes a positive value.
  • the facility adds the value ascribed to the action detected in block 701 to a total value produced by the message in the slot during the current tracking period. After block 702 , the facility continues in step 701 to detect the next user action.
  • FIG. 8 is a flow diagram showing a routine performed by the facility in order to maintain the bonus associated with a guarantee in some embodiments.
  • the facility periodically performs the routine described with respect to FIG. 8 for each guarantee.
  • the facility determines whether the difference between the guaranteed and actual exposure rate for the message are statistically significant. If the facility determines that the guarantee and the actual exposure rate for the message are statistically significant, then the facility continues to block 802 , else the routine concludes. In particular, in some embodiments, the facility determines that the difference is statistically significant if the following test evaluates as false:
  • the facility adjusts the bonus level.
  • the facility adjusts the bonus level as follows:
  • the facility tracks ages for D H and D L . Each time the facility changes the value of D H , the facility increments a counter for D L . Similarly, each time the facility changes the value of D L , the facility increments a counter for D H . During the annealing process, the facility tests these counters to see either one exceeds a staleness threshold. In cases where one of the counters does exceed the staleness threshold, the facility bypasses the annealing process for one iteration to reestablish the bonus level at the stale end of the bracket.
  • the facility If the facility was engaged in a long, one-sided annealing, then the exposure during this iteration is likely to be similar to that during the last iteration, and the facility resumes annealing with greater confidence in the next iteration. On the other hand, if the target bonus level has escaped the bracket, on the next iteration the facility collapses the bracket and returns to explore mode.
  • block 803 if the bonus level as adjusted in block 802 exceeds a predetermined cap, then the facility continues in block 804 , else the routine concludes.
  • the facility reduces the adjusted bonus level produced in block 804 to be equal to the cap. After block 804 , the routine concludes.
  • FIG. 9 is a flow diagram showing a routine performed by the facility in order to combine statistics collected during the current period into the overall statistics maintained and used by the facility in some embodiments.
  • the facility loops through each page containing at least one slot.
  • the facility loops through each slot in the page.
  • the facility loops through each message scheduled into the slot.
  • the facility combines the statistics for the slot and message from the current period into the overall statistics for the slot and message.
  • the facility employs the following approach:
  • the facility weights the data points in the mean and standard error formulas based on their age. In practice, it is possible to store accumulated values and derive current values without revisiting the entire history. The following decomposes the formulas for mean, standard deviation, and standard error, showing how the weight decay is to be applied over time. Formulas follow for mean expected value EV, standard deviation SD, and standard error SE, at time t, where x is a computed value of an impression during time interval t:
  • the facility tracks three values for each campaign placement: the weighted count WN, the weighted sum WS, and the weighted sum-of-squares of the variance WSS.
  • the values computed for time interval t are computed from the previous time interval's values as shown:
  • d t refers to the decay factor chosen for time interval t.
  • the facility calculates derived statistics from the combined statistics produced in block 904 .
  • the routine may use the formulas below to obtain weighted expected value, standard deviation, and standard error for any given time interval:
  • WEV t WS t WN t ( 18 )
  • WSD t WSS t WN t - 1 ( 19 )
  • WSE t WSD t WN t ( 20 )
  • WEV t and WSE t are used to drive the bidding algorithm during time interval t.
  • the facility determines d t as follows:
  • the facility rather than dynamically computing d t as described above, the facility instead uses a fixed value for d t .
  • the facility re-initializes the statistics for the current period in order to prepare them to receive data from the next period.
  • the facility returns to block 903 to select the next message, else the facility continues in block 908 .
  • the facility returns to block 902 to select the next slot, else the facility continues in block 909 .
  • block 909 if additional pages remain to be processed, the facility continues to block 901 to select the next page, else the routine concludes.
  • the facility includes functionality for handling unsatisfiable schedules, where more than 100% of the available exposures of a slot have been promised via guarantees to display messages in that slot.
  • the facility seeks “fairness” by equally penalizing the guaranteed messages.
  • the facility maintains a guarantee satisfaction state for each guarantee. The facility collects statistics that reflect whether the message is receiving the exposure rate specified by its guarantee over two periods: the lifetime of the message's scheduling in its slot, and a trailing 12-hour window. If the specified exposure rate is satisfied for both of these time periods, the facility attributes a “happy” state to the guarantee. If the exposure specified by the guarantee is not satisfied over either of these two time periods, then the facility attributes an “unhappy” state to the guarantee. When the guarantee is unhappy, the facility chooses a bonus level intended to yield an exposure rate that is calculated to make up the guarantee's shortfall over the coming 12-hour period.
  • the facility further collects statistics to measure whether other messages that the guaranteed message is competing against would yield exposures to the guaranteed placement if the facility raised the bonus for the guaranteed placement. If a guarantee is in the unhappy state and has competitors determined by the facility to be unwilling to yield, the facility changes the state of the guarantee to “troubled.”
  • the facility uses a message selection process that diverges from the one shown in FIGS. 5 and 6 . In this process, the facility excludes from consideration any message not having a guarantee whose state is troubled. Where the eligible messages include more than one that is the subject of a troubled guarantee, the facility chooses among these in a manner proportional to the rate guarantee by each.
  • the facility attributes a selection weight to the first that is twice as large as the selection weight attributed to the second, yielding a two-thirds probability of selecting the first and the one-third probability of selecting the second.
  • the facility tracks the performance of some messages across two or more slots, rather than tracking the performance of the message in each of these slots individually.
  • the above-described facility may be straightforwardly adapted or extended in various ways.
  • the facility can be used to select content of a wide variety of types, based upon the performance of a wide variety of different actions with respect to that content, using values assigned to actions in a wide variety of ways.
  • a variety of different statistical distributions may be used to score messages or other types of content for inclusion in slots or other content containers.

Abstract

A facility selects a unit of content to include in an instance of a content container. For each of a plurality of units of content, across a plurality of other instances of the content container distributed during a foregoing period of time that included the unit of content, the facility determines both an average level of success achieved by including the unit of content, and a measure of the level of variability of the level of success achieved by including the unit of content. The facility scores each unit of content by selecting a value from a probability distribution that is based on the determined average level of success and measure of the level of variability of the level of success. The facility selects one of the units of content to include in the instance of the content container based on the determined scores.

Description

    CROSS-REFERENCE TO RELATED APPLICATIONS
  • This application is a continuation of U.S. patent application Ser. No. 11/758,932, entitled “REAL-TIME ADAPTIVE PROBABILISTIC SELECTION OF MESSAGES,” and filed Jun. 6, 2007, the entirety of which is hereby incorporated herein by reference.
  • TECHNICAL FIELD
  • The described technology is directed to the field of providing electronic content, and, more particularly, to the field of optimized message selection.
  • BACKGROUND
  • Web publishers often include advertising messages in the web pages that they serve to users. Such advertising messages may either be for the benefit of an advertiser other than the publisher, or may promote products or services sold by the publisher. For example, an online merchant may include on its web pages advertisements promoting products or services sold by the online merchant.
  • It is typical for each page served by a publisher to have a number of “slots” (also referred to herein as distinguished locations) that each may contain an advertising message. It is frequently true that, for each slot, several different advertising messages are eligible to be included in that slot, such that the publisher may select any of these eligible advertising messages to include in the display. It is common for a publisher to select from among the advertising messages eligible for a particular slot the advertising message expected to have the greatest value to the publisher, such as the advertising message expected to produce the greatest measure of revenue and/or profit for the publisher.
  • It is common to determine the level of revenue and/or profit each eligible advertising message is expected to produce by dividing the total revenue and/or profit that the advertising message has produced when included in a slot during a foregoing period of time by the number of times the advertising message has been included in the display during that period to obtain a “mean value” or “expected value” for the advertising message in the slot. This approach often provides a useful basis for selecting an eligible advertising message to include in a slot where each eligible message has been included in the display (1) a statistically significant number of times (2) under relevant conditions that are similar to those present at the time the slot is being filled. Where this situation does not exist, however, one or more of the advertising messages may be misvalued based upon a scarcity of representative experience with the advertising message, causing the publisher to select an advertising message other than the optimal advertising message and adversely affect its overall results.
  • In some cases, to counter this limitation of the expected value approach to selecting a message, publishers have used a modified approach where opportunities to present a message in a particular slot are allocated to one of two modes: a fixed percentage of opportunities are allocated to an “exploitation” mode that uses the expected value approach to selecting a message described above, while the remaining fixed percentage of opportunities are allocated to an “experimentation” mode that seeks to increase the number of times that underexposed advertising messages are included in the display, in order to obtain a more reliable valuation of these messages.
  • BRIEF DESCRIPTION OF DRAWINGS
  • FIG. 1 is a high-level data flow diagram showing data flow within a typical arrangement of components used to provide the facility.
  • FIG. 2 is a conceptual diagram showing a typical served page.
  • FIG. 3 is a block diagram showing some of the components typically incorporated in at least some of the computer systems and other devices on which the facility executes.
  • FIG. 4 is a graph showing sample normal distribution curves used by the facility to select scores for each of two candidate messages for a particular slot.
  • FIG. 5 is a flow diagram showing a routine performed by the facility in order to select messages to include in displays contained in a requested page in some embodiments.
  • FIG. 6 is a flow diagram showing a routine performed by the facility in order to score a message for slot in some embodiments.
  • FIG. 7 is a flow diagram showing a routine performed by the facility to track the value produced by the exposure of particular messages in particular slots in some embodiments.
  • FIG. 8 is a flow diagram showing a routine performed by the facility in order to maintain the bonus associated with a guarantee in some embodiments.
  • FIG. 9 is a flow diagram showing a routine performed by the facility in order to combine statistics collected during the current period into the overall statistics maintained and used by the facility in some embodiments.
  • DETAILED DESCRIPTION
  • Applicants have recognized that the modal exploitation/experimentation approach to selecting advertising messages described above has the disadvantage that the fixed percentage of opportunities that are allocated to experimentation versus exploitation is, at any particular moment, very likely to differ from the optimal percentage. Accordingly, they have developed an approach to selecting messages that does not rely on allocating a fixed percentage of opportunities to each of an exploitation mode and a separate experimentation mode.
  • A software facility (“the facility”) is described that seeks to optimize the selection of advertising messages or other content using probabilistic techniques. For each message that is eligible to be included in a particular slot in a particular served instance of a page, the facility generates an inclusion score by randomly selecting a value from a probability distribution, such as a normal distribution, corresponding to the combination of the message and the slot. In particular, the normal distribution is based on the message's history of being presented in the slot: the normal distribution is centered on the mean value for the message in the slot, and its breadth, that is, the number of standard deviations represented by the normal distribution, is generally inversely related to the number of exposures that the message has had in the slot during a foregoing time period, such that a first message that has had less exposure in the slot than a second message typically has a wider normal distribution than the second message. The probabilistic selection of an inclusion score employed by the facility weights each possible value in accordance with the height of the normal distribution at that value, such that the likelihood of selecting a value near the center of a normal distribution is greater than selecting a value further from the center of that normal distribution. The facility ranks the messages based upon their inclusion scores. The facility then proceeds to select eligible messages in decreasing order of their inclusion scores. This approach is well-adapted to resolving a selection between a set of candidates whose membership may vary frequently based upon which advertising messages are eligible to be included in a particular slot in a particular served instance of a page. Indeed, the approach used by the facility can be successful at selecting among sets of candidate messages whose membership is different in every iteration.
  • In some embodiments, the facility generates inclusion scores for a slot in a manner that seeks to satisfy guarantees established for one or more messages. Such guarantees each specify a minimum percentage of a slot's exposures that should include a particular message. For example, a guarantee may specify that a particular message be included in a particular slot for 20% of the slot's total exposures. The facility seeks to satisfy guarantees by establishing, for each guaranteed message, a guarantee bonus that is added to inclusion scores determined for the guaranteed message in the slot that is the subject of the guarantee. The facility then uses these augmented inclusion scores in its comparison of the inclusion scores of candidate messages. In some embodiments, the facility dynamically determines a guarantee bonus for each guarantee that is large enough to satisfy the guarantee, but not much larger. In particular, the facility may adjust a guarantee bonus level earlier determined for the message based upon the rate at which a message has been exposed in the slot while subject to that guarantee bonus level.
  • In some embodiments, the facility automatically attributes to new messages having no or limited exposure history in a slot seed values that cause the facility to select these new messages at a moderate rate and establish an actual history, which is later used as a basis for selecting these messages. In some cases, the facility determines a seed value for a new message in a way that is sensitive to details of messages with which the new message will compete for the current slot of the current page instance.
  • In some embodiments, the facility decays the histories it maintains in order to deemphasize past message performance that may no longer be representative of current performance. In some embodiments, the facility applies a variable decay rate obtained by simulating the operation of the facility at different decay rates, and choosing the decay rate whose simulation produces the highest overall value.
  • By selecting messages in some or all of the ways identified above, the facility frequently produces a high overall value, while satisfying exposure guarantees, and adjusting to new message-slot combinations and trends in the value of a message in a slot.
  • FIG. 1 is a high-level data flow diagram showing data flow within a typical arrangement of components used to provide the facility. A number of web client computer systems 110 that are under user control generate and send page view requests 131 to a logical web server 100 via a network such as the Internet 120. These requests typically include page view requests for web pages that may each contain one or more slots for including messages. Within the web server, these requests may either all be routed to a single web server computer system, or may be load-balanced among a number of web server computer systems. The web server typically replies to each with a served page 132.
  • While various embodiments are described in terms of the environment described above, those skilled in the art will appreciate that the facility may be implemented in a variety of other environments including a single, monolithic computer system, as well as various other combinations of computer systems or similar devices connected in various ways. In various embodiments, a variety of computing systems or other different client devices may be used in place of the web client computer systems, such as mobile phones, personal digital assistants, televisions, cameras, etc.
  • FIG. 2 is a conceptual diagram showing a typical served page. The sample page 200 contains certain content that is native to all instances of the page, such as native content 210 describing a softball item that is available for purchase. The page further contains two slots 220 and 230, each of which can contain a message selected by the facility. Three sample messages 221-223 are shown that are candidate messages for slot 220. In some embodiments, the facility identifies these candidate messages from a larger pool of messages scheduled into slot 220 based on one or more preliminary eligibility tests, which can be based on such factors as the time of day, day of week, user identity, user characteristics, user behavior, etc. Slot 230 may have a different set of candidate messages, depending on how messages have been scheduled into these slots.
  • When an instance of the page containing one of the three candidate messages 221-223 in slot 220 is served to a user, the facility increments a count of the number of times that each message is included in display 220 during the current tracking period. In the served page instance, the user may select the message to obtain additional information about the subject of the message. After selecting the message, the user may go on to perform various actions related to the subject of the message, such as purchasing an item, entering a contest, or recommending an item to another user. The operator of the facility may ascribe a value to any of these actions resulting from a message selection. If the publisher of the page is an online merchant, the value ascribed to an action resulting from message selection may be the level of revenue or profit that the publisher expects to derive from the action, such as the level of revenue or profit that the publisher expects to derive from an item purchase.
  • If the message is associated with a third-party advertiser, the ascribed value may correspond to an amount of money that the advertiser has agreed to pay the publisher when a message is selected, or when an action is performed. Alternatively, the ascribed value may correspond to a level of desirability of the action to the publisher or another party that is determined without reference to any pecuniary value of the action. For example, a publisher who regards a user's provision of their postal address to be twice as useful as the user's provision of their email address may ascribe a value of $2.00 to the provision of a postal address and a value of $1.00 to the provision of an email address. If the user performs any of these actions, the facility augments a balance of a total value the message has produced when displayed in display 220 during the current tracking period. The count of the number of times the message has been included in slot 220 and the balance of the total value that message has produced when displayed in slot 220 forms the basis used by the facility to score this message for future inclusion in slot 220.
  • FIG. 3 is a block diagram showing some of the components typically incorporated in at least some of the computer systems and other devices on which the facility executes. These computer systems and devices 300 may include one or more central processing units (“CPUs”) 301 for executing computer programs; a computer memory 302 for storing programs and data while they are being used; a persistent storage device 303, such as a hard drive for persistently storing programs and data; a computer-readable media drive 304, such as a floppy, CD-ROM, or DVD drive, for reading programs and data stored on a computer-readable medium; and a network connection 305 for connecting the computer system to other computer systems, such as via the Internet. While computer systems configured as described above are typically used to support the operation of the facility, those skilled in the art will appreciate that the facility may be implemented using devices of various types and configurations, and having various components.
  • FIG. 4 is a graph showing sample normal distribution curves used by the facility to select scores for each of two candidate messages for a particular slot. In particular, the graph shows curve 410 corresponding to a first candidate message and curve 420 corresponding to a second candidate message. The height of each curve at a given score indicates the relative probability that that score will be selected for the candidate message to which the curve corresponds. The mean value 411 of curve 410 is larger than the mean value 421 of curve 420, indicating that the first message has proved to have a higher average value than the second message when presented in the slot. Curve 420, on the other hand, is broader than curve 410, indicating that the value of the second message has a higher level of variability, or “standard error,” than the first message. Because the standard error metric takes into account the number of exposures of a message, this may indicate that the second message has been included in the display fewer times than the first message, perhaps because the second message has only been scheduled into the slot recently. The standard error is an inverse function of number of exposures, and a direct function of the total variations in the message's value. Thus, the standard error is large when a message is first introduced to a slot, and tends to decline thereafter, except for messages that exhibit erratic performance. Further, the mean value adjusts over time to reflect the current average of the value produced by exposures of the message in the slot.
  • The graph also shows sample scores for these two messages, selected at random within the area of each of the curves. On one hand, the higher mean of curve 410 causes the first message to be scored higher than the second message in the majority of cases in which the two messages are competing for a slot. On the other hand, the greater breadth of curve 420 ensures that the second message will be scored higher than the first message in at least a small number of cases in which the two messages are competing for the same slot. It can be seen that, in this case, the score 422 determined by the facility for the second message is larger than the score 412 selected for the first message, causing the second message to be included in the display ahead of the first message, despite the first message's higher mean value. As the second message is included in the display in subsequent instances of the page containing the slot, its standard error will likely decrease, reducing its likelihood of selection relative to candidates like the first message unless these additional exposures create significant value, increasing its mean value.
  • FIGS. 5-8 are flow diagrams showing various operational routines performed by some embodiments of the facility. FIG. 5 is a flow diagram showing routine performed by the facility in order to select messages to include in displays contained in a requested page in some embodiments. In blocks 501-510, the facility loops through each slot in the page to determine which message to display in that slot. In blocks 502-509, the facility loops through each message scheduled in the slot to determine which of the scheduled message will be displayed in each particular slot for an instance of a requested page. At decision block 503, the facility determines whether the message is eligible for this instance of the page. If the facility determines that the message is eligible for this instance of the page, then the facility continues in block 504, else the facility continues in block 505. In block 504, the facility scores the message for the slot. Block 504 is discussed below in greater detail in connection with FIG. 6. After block 504, the facility continues in block 505. In block 505, if additional messages remain to be processed, then the facility returns to block 502 to select the next message, else the facility continues in block 506.
  • After each eligible message has been scored at block 504, in blocks 506-509, the facility loops through each message scored in step 504 in decreasing order of their scores until a message is selected for inclusion in the instance of the page. At decision block 507, the facility determines whether the message is suitable for this instance of the page. In some embodiments, the facility makes this determination by calling a suitability-checking routine associated with the message. If the facility determines that the message is suitable for this instance of the page, the facility continues to block 508, else the facility continues in block 509. If the facility determines at block 507 that the message is suitable for this instance of the page, at block 508 the facility includes the message in the slot. In block 509, the facility increments the number of exposures of the message in the slot performed during the current tracking period to reflect the inclusion of the message in the slot in block 508. In some embodiments (not shown), in block 509, the facility instead stores an indication that the message has been exposed in the slot, and at a later time counts the number of such indications that were stored. After block 509, the facility continues in block 511.
  • In some embodiments, the facility performs blocks 506-510 in a different software subsystem and/or in a different computer system than blocks 502-505. Where this is the case, the entity performing blocks 502-505 provides, in various embodiments, various sets of messages to the entity performing blocks 506-510. For example, the first entity may provide a certain number of messages, or a group of messages in decreasing order of message scores up to a message that is known to be suitable for inclusion in the slot, such as a message with which no suitability test or suitability-checking routine is associated. In some embodiment, the second entity reports the identity of the message included in the slot in block 508 for inclusion in the statistics maintained and used by the facility.
  • In block 510, if additional messages remain to be processed, then the facility continues in block 506 to select the next message, else the facility continues in block 510. In block 510, if additional slots remain to be processed, then the facility continues in block 501 to select the next slot, else the routine concludes.
  • Those skilled in the art will appreciate that the routine shown in FIG. 5 and each of the routines discussed below may be altered in a variety of ways. For example, the order of the routines may be rearranged; some portions of the routine may be performed in parallel, omitted, additional functionality may be included, etc.
  • FIG. 6 is a flow diagram showing a routine typically performed by the facility in order to score a message for a slot, as referenced above with respect to block 504 (FIG. 5). At decision block 601 it is determined whether adequate statistics are available for the performance of the message in the slot. If adequate statistics are available, then the facility continues to block 603, else the facility continues to block 602. In block 602, the facility seeds performance statistics for the message in the slot. The details of block 602 are discussed in greater detail below. After block 602, the facility continues to block 603.
  • At block 603 the facility determines a score for the combination of the message and the slot using statistics about the performance of the message in the slot. In particular, in some embodiments, the facility uses the following series of formulas to determine a score for a message in the slot. The total value S produced by the message in the slot is obtained from x, the value produced by each exposure:

  • St=Σx  (1)
  • The expected value EV is obtained from S and the number of exposures of the message in the slot N:
  • EV t = S t N t ( 2 )
  • The sum of squares SS is obtained from x and EV:

  • SS t=Σ(x−EV t)2  (3)
  • The standard deviation SD is obtained from SS and N:
  • SD t = SS t N t - 1 ( 4 )
  • The standard error SE is obtained from SD and N:
  • SE t = SD t N t ( 5 )
  • As is described in greater detail below in connection with FIG. 8, the above calculations can be performed in advance for each combination of slot and message, rather than being repeated each time the facility determines a score. Finally, the score is obtained from EV and SE:

  • scorep =EV p+(SE p* normrand( ))  (6)
  • where normrand( ) is a function that returns a number chosen from a normal distribution with μ=0 and σ=1.
  • In some embodiments, the facility implements normrand( ) by mapping the output of a random number generator onto a normal distribution, such as by mapping the output of the random number generator variate_generator onto a normal distribution normal_distribution, both provided as part of the Boost library available from and documented at http://www.boost.org/index.htm.
  • In decision block 604, the facility determines whether the message in this slot is the subject of a guarantee. If the facility determines that the message in the slot is the subject of a guarantee, then the facility continues in block 605, else the routine concludes. In block 605, the facility adds a guarantee bonus determined by the facility to the score determined in block 603. The facility's approach to determining this guarantee bonus is discussed in greater detail below in connection with FIG. 8. After block 605, the routine completes.
  • The facility seeds performance statistics for the message in the slot in block 602 as follows. In some embodiments, the facility seeds these statistics in the local context of filling the slot for the current page instance, selecting a seeded mean value equal to the average mean value of the other current candidates for the slot, and a seeded standard error equal to the average standard error of the other current candidates for the slot. In some embodiments, the facility seeds these statistics based on an overview of recent exposures of the slots. As one example, in some embodiments the facility uses the expression
  • d = 1 30 h = 0 23 p = 1 n i dhp s dhp d = 1 30 h = 0 23 p = 1 n i dhp ( 7 )
      • where sdhp represents the value of a given statistic (weight or confidence) for a placement p in hour h and day d.
      • idhp is the number of impressions seen by placement p in hour h and day d.
      • n is the number of unique placements shown in hour h and day d.
  • FIG. 7 is a flow diagram showing a routine performed by the facility to track the value produced by the exposure of particular messages in particular slots in some embodiments. In block 701, the facility detects a user action associated with selection of a message in a slot. As discussed above in connection with FIG. 2, the detected action may be the message selection itself, or any subsequent action to which the advertiser, the publisher, or another party ascribes a positive value. In block 702, the facility adds the value ascribed to the action detected in block 701 to a total value produced by the message in the slot during the current tracking period. After block 702, the facility continues in step 701 to detect the next user action.
  • FIG. 8 is a flow diagram showing a routine performed by the facility in order to maintain the bonus associated with a guarantee in some embodiments. In some embodiments, the facility periodically performs the routine described with respect to FIG. 8 for each guarantee. At decision block 801, the facility determines whether the difference between the guaranteed and actual exposure rate for the message are statistically significant. If the facility determines that the guarantee and the actual exposure rate for the message are statistically significant, then the facility continues to block 802, else the routine concludes. In particular, in some embodiments, the facility determines that the difference is statistically significant if the following test evaluates as false:
  • | R T - R i - 1 | 1.96 * R i - 1 * 1 - R i - 1 N ( 8 )
  • where:
      • RA=exposure count÷eligibility rate
      • RT=the target exposure rate
      • Ri-1=the RA value for the past iteration
      • Di-1=the bonus in effect for the past iteration
      • Di=the bonus generated for the current iteration
      • N=the number of exposures during the past iteration
      • μ=Metrics-provided μ for the placement
      • DH=the bonus setting at the high end of the bracket, initially 0
      • DL=the bonus setting at the low end of the bracket, initially ∞
      • RH=the RA generated by DH
      • RL=the RA generated by DL
  • In block 802, the facility adjusts the bonus level. In particular, in some embodiments, the facility adjusts the bonus level as follows:
  • Select Mode
      • 1. If bracket not established (DL or DH is null), explore
      • 2. If bracket has inverted or collapsed (DL>=DH), explore
      • 3. If target not in bracket (RH<RT or RL>RT), explore
      • 4. Scan the recent tuning history to find the lowest and highest observed rates in it
      • 5. If the recent history brackets RT, anneal
      • 6. Scan recent history to see if we have had any tuning cycle where the error (difference from RT) increased
      • 7. If any tuning stage increased error, explore
      • 8. Anneal
  •  Find initial pace -- exploration mode
    if (1st time)
      // no Di−1 or Ri−1 available
      anchor = avg μ of top 3 competitors;
      pace = (μking - anchor) * max(RT, baseRate); //baseRate = 0.25
    else
      anchor = μ + Di−1;
      pace = |μking - anchor) * (Ri−1 - RT)|;
    if (pace < μking * initPaceSize)
      pace = μking * initPaceSize; // initPaceSize = 0.005
    Update existing pace -- exploration mode
    if (RT > 2 * Ri−1)
      inc = pace;
    else
       inc = pace * R T - R i - 1 R i - 1 ;
    if (inc > pace)
      inc = pace;
    else if (inc < minPaceSize * pace)
      inc = minPaceSize*pace; // minPaceSize = 0.01
    pace = pace + inc;
    Set new bonus -- exploration mode
    if (Ri−1 < RT)
      Di = anchor + pace;
    else
      Di = anchor − pace;
    if (Di < 0)
      Di = 0;
    Anneal mode
    // reset the pace so it must be calculated afresh the next time we explore
    anchor = 0; pace = 0;
       D i = D L + ( D H - D L ) * R T - R L R H - R L ( 9 )
  • In some embodiments, the facility tracks ages for DH and DL. Each time the facility changes the value of DH, the facility increments a counter for DL. Similarly, each time the facility changes the value of DL, the facility increments a counter for DH. During the annealing process, the facility tests these counters to see either one exceeds a staleness threshold. In cases where one of the counters does exceed the staleness threshold, the facility bypasses the annealing process for one iteration to reestablish the bonus level at the stale end of the bracket. If the facility was engaged in a long, one-sided annealing, then the exposure during this iteration is likely to be similar to that during the last iteration, and the facility resumes annealing with greater confidence in the next iteration. On the other hand, if the target bonus level has escaped the bracket, on the next iteration the facility collapses the bracket and returns to explore mode.
  • In block 803, if the bonus level as adjusted in block 802 exceeds a predetermined cap, then the facility continues in block 804, else the routine concludes. In block 804, the facility reduces the adjusted bonus level produced in block 804 to be equal to the cap. After block 804, the routine concludes.
  • FIG. 9 is a flow diagram showing a routine performed by the facility in order to combine statistics collected during the current period into the overall statistics maintained and used by the facility in some embodiments. In blocks 901-909, the facility loops through each page containing at least one slot. In blocks 902-908, the facility loops through each slot in the page. In blocks 903-907, the facility loops through each message scheduled into the slot. In block 904, the facility combines the statistics for the slot and message from the current period into the overall statistics for the slot and message. In particular, in some embodiments, the facility employs the following approach:
  • The facility weights the data points in the mean and standard error formulas based on their age. In practice, it is possible to store accumulated values and derive current values without revisiting the entire history. The following decomposes the formulas for mean, standard deviation, and standard error, showing how the weight decay is to be applied over time. Formulas follow for mean expected value EV, standard deviation SD, and standard error SE, at time t, where x is a computed value of an impression during time interval t:
  • S t = x ( 10 ) EV t = S t N t ( 11 ) SS t = ( x - EV t ) 2 ( 12 ) SD t = SS t N t - 1 ( 13 ) SE t = SD t N t ( 14 )
  • To compute historical decay, the facility tracks three values for each campaign placement: the weighted count WN, the weighted sum WS, and the weighted sum-of-squares of the variance WSS. The values computed for time interval t are computed from the previous time interval's values as shown:

  • WN t =N t +d t WH t-1  (15)

  • WS t =S t +d t WS t-1  (16)

  • WSS t =SS t +d t WSS t-1  (17)
  • where dt refers to the decay factor chosen for time interval t.
  • In block 905, the facility calculates derived statistics from the combined statistics produced in block 904. In particular, the routine may use the formulas below to obtain weighted expected value, standard deviation, and standard error for any given time interval:
  • WEV t = WS t WN t ( 18 ) WSD t = WSS t WN t - 1 ( 19 ) WSE t = WSD t WN t ( 20 )
  • Then WEVt and WSEt are used to drive the bidding algorithm during time interval t. The facility determines dt as follows:
      • 1. Let d′ be dt-1.
      • 2. Compute the overall value, as computed by the simulation model, for cases d′+ε, d′, and d′−ε.
      • 3. If the model output for d′+ε has the largest value, set d′ to d′+ε, and go to step 2.
      • 4. If the model output for d′−ε is largest, set d′ to d′−ε, and go to step 2.
      • 5. Set dt=d′.
  • In some embodiments, rather than dynamically computing dt as described above, the facility instead uses a fixed value for dt.
  • In block 906, the facility re-initializes the statistics for the current period in order to prepare them to receive data from the next period. In block 907, if additional messages remain to be processed, the facility returns to block 903 to select the next message, else the facility continues in block 908. In block 908, if additional slots remain to be processed, than the facility returns to block 902 to select the next slot, else the facility continues in block 909. In block 909, if additional pages remain to be processed, the facility continues to block 901 to select the next page, else the routine concludes.
  • In some embodiments, the facility includes functionality for handling unsatisfiable schedules, where more than 100% of the available exposures of a slot have been promised via guarantees to display messages in that slot. Here, the facility seeks “fairness” by equally penalizing the guaranteed messages. In some embodiments, the facility maintains a guarantee satisfaction state for each guarantee. The facility collects statistics that reflect whether the message is receiving the exposure rate specified by its guarantee over two periods: the lifetime of the message's scheduling in its slot, and a trailing 12-hour window. If the specified exposure rate is satisfied for both of these time periods, the facility attributes a “happy” state to the guarantee. If the exposure specified by the guarantee is not satisfied over either of these two time periods, then the facility attributes an “unhappy” state to the guarantee. When the guarantee is unhappy, the facility chooses a bonus level intended to yield an exposure rate that is calculated to make up the guarantee's shortfall over the coming 12-hour period.
  • In some embodiments, the facility further collects statistics to measure whether other messages that the guaranteed message is competing against would yield exposures to the guaranteed placement if the facility raised the bonus for the guaranteed placement. If a guarantee is in the unhappy state and has competitors determined by the facility to be unwilling to yield, the facility changes the state of the guarantee to “troubled.” When selecting a message to display in the slot, if any eligible message is the subject of a troubled guarantee, the facility uses a message selection process that diverges from the one shown in FIGS. 5 and 6. In this process, the facility excludes from consideration any message not having a guarantee whose state is troubled. Where the eligible messages include more than one that is the subject of a troubled guarantee, the facility chooses among these in a manner proportional to the rate guarantee by each. For example, where two messages subject to troubled guarantees are eligible, the first having an 80% guaranteed rate and the second having a 40% guaranteed rate, the facility attributes a selection weight to the first that is twice as large as the selection weight attributed to the second, yielding a two-thirds probability of selecting the first and the one-third probability of selecting the second.
  • In some embodiments, the facility tracks the performance of some messages across two or more slots, rather than tracking the performance of the message in each of these slots individually.
  • It will be appreciated by those skilled in the art that the above-described facility may be straightforwardly adapted or extended in various ways. For example, the facility can be used to select content of a wide variety of types, based upon the performance of a wide variety of different actions with respect to that content, using values assigned to actions in a wide variety of ways. Also, a variety of different statistical distributions may be used to score messages or other types of content for inclusion in slots or other content containers. While the foregoing description makes reference to particular embodiments, the scope of the invention is defined solely by the claims that follow and the elements recited therein.

Claims (24)

1. A method comprising:
determining, across a plurality of instances of a web page served during a foregoing period of time that included an advertising message,
(a) a set of values attributable to inclusion of the advertising message in each of the plurality of instances of the web page,
(b) an average value of the set of values attributable to the inclusion of the advertising message in the plurality of instances of the web page, and
(c) a measure of a level of variability of the set of values attributable to inclusion of the advertising message in the plurality of instances of the web page; and
determining a score for the advertising message reflective of the average value and the measure of the level of variability of the values attributable to the inclusion of the advertising message in the plurality of instances of web pages,
wherein determining the score for the advertising message includes performing a weighted random selection of a value from a normal probability distribution centered on the determined average value, and
wherein a breadth of the normal probability distribution is based on the determined measure of the level of variability of the set of values, and
wherein the method is implemented in a computing system having a processor and a memory.
2. The method of claim 1, wherein determining the score for the advertising message is further reflective of an exposure guarantee associated with the advertising message.
3. The method of claim 2, wherein determining the measure of the level of variability of the set of values attributable to the inclusion of the advertising message used as a basis for determining the score for the advertising message with the associated exposure guarantee comprises determining the measure of the level of variability of the set of values attributable to the inclusion of the advertising message augmented by a guarantee bonus value.
4. The method of claim 1, further comprising: determining that the advertising message has been included in the plurality of instances of the web page in less than a threshold number of times during the foregoing period of time.
5. The method of claim 1, wherein determining the average value of the set of values comprises determining a constructive average value based on an aggregate average value of all advertising messages included in the plurality of instances of the web page during a foregoing period of time.
6. The method of claim 1, wherein determining the measure of the level of variability of the set of values comprises determining a constructive measure of the level of variability of the set of values associated with the inclusion of all messages included in the plurality of instances of the web page during a foregoing period of time.
7. The method of claim 1, further comprising:
identifying a plurality of candidate advertising messages eligible to be included in the web page during a subsequent period of time;
determining the score for each of the plurality of candidate advertising messages;
selecting one of the plurality of candidate advertising messages as an identified advertising message to include in the web page during the subsequent period of time, wherein the selecting is based on the score of the respective candidate advertising messages.
8. The method of claim 7, wherein the selecting based on the score comprises selecting the candidate advertising message with the highest score.
9. The method of claim 7, wherein determining the score for each of the plurality of the candidate advertising messages includes determining the exposure guarantee and wherein selecting the identified candidate message includes selecting only a candidate message associated with an exposure guarantee.
10. A system for managing advertising messages, comprising:
a computing system having a processor and memory for implementing an analysis subsystem that determines, for an advertising message, across a plurality of instances of a web page distributed during a foregoing period of time that included the advertising message (a) a measure of success achieved by including the advertising message in one instance of the web page (b) an average of the measure of success achieved by including the advertising message across the plurality of instances of the web page, and (c) a measure of the level of variability of the measure of success achieved by including the advertising message; and
a computing system having a processor and memory for implementing a scoring subsystem that determines a score for the advertising message by performing a weighted random selection of a value from a probability distribution based on (a) the determined average measure of success and (b) the determined measure of the level of variability of the measure of success.
11. The system of claim 10 further comprising a computing system having a processor and memory for implementing a selection subsystem that selects at least one of a plurality of advertising messages to include in a subsequent instance of the web page based on the scores determined for each of the plurality of advertising messages.
12. The system of claim 11, wherein the selection subsystem selects the at least one of the plurality of advertising messages to include in the subsequent instance of the web page further based on application of a suitability test associated with the advertising message.
13. The system of claim 10, wherein the analysis subsystem monitors (a) the determined average of the measure of success and (b) the determined measure of the level of variability of the measure of success for the advertising message over time to take into account the inclusion of the advertising message in the subsequent instance of the web page and associated measure of success.
14. The system of claim 10, wherein an exposure guarantee is associated with the advertising message.
15. The system of claim 11, wherein the plurality of advertising messages are subject to respective exposure guarantees, and wherein, at a time when at least one of the respective exposure guarantees is not satisfied, the selection of an advertising message not subject to an exposure guarantee is precluded.
16. A computer-readable, non-transitory storage medium comprising:
one or more computer-executable components for managing advertising messages, the one or more computer-executable components:
determining, across a plurality of instances of a web page served during a previous period of time that included an advertising message,
(a) a set of success rates attributable to inclusion of the advertising message in each of the plurality of instances of the web page,
(b) an average of the success rates attributable to the inclusion of the advertising message in the plurality of instances of the web page, and
(c) a measure of a standard error of the success rates associated with the inclusion of the advertising message; and
determining a score for the advertising message reflective of the average of the success rates and the standard error of the success rates attributable to the inclusion of the advertising message in the plurality of instances of web pages,
wherein determining the score for the advertising message includes performing a weighted random selection of a success rates from a normal probability distribution centered on the determined average success rate,
wherein the breadth of the normal probability distribution is based on the determined measure of the standard error of the success rates.
17. The computer-readable medium of claim 16, the one or more computer-executable components further:
identifying a plurality of candidate advertising messages eligible to be included in the web page during a subsequent period of time; and
determining the score for each of the plurality of candidate advertising messages;
selecting one of the plurality of candidate advertising messages as an identified advertising message to include in the web page during the subsequent period of time, wherein the selecting is based on the score of the respective candidate advertising messages.
18. The computer-readable medium of claim 17, wherein the selecting based on the score comprises selecting the candidate advertising message with the highest score.
19. The computer-readable medium of claim 16, wherein an exposure guarantee is associated with the advertising message.
20. The computer-readable medium of claim 19, wherein the determined measure of the standard error of the success rates associated with the inclusion of the advertising message used as a basis for determining the score for the advertising message with the associated exposure guarantee comprises the determined measure of the standard error of the success rates associated with the inclusion of the advertising message augmented by a guarantee bonus success rate.
21. The computer-readable medium of claim 19, wherein a plurality of the identified candidate messages are each the subject of an exposure guarantee, and wherein, at a time when at least one of the exposure guarantees is not satisfied, the selection of any identified candidate messages that are not the subject of an exposure guarantee is precluded.
22. The computer-readable medium of claim 19, wherein the advertising message has been included in the plurality of instances of the web page in less than a threshold number of times during the previous period of time.
23. The computer-readable medium of claim 19, wherein the average of the success rates is a constructive average success rate based on an aggregate average success rate of all advertising messages included in the plurality of instances of the web page during the previous period of time.
24. The computer-readable medium of claim 19, wherein the measure of the standard error is a constructive measure of the standard error of the success rates associated with the inclusion of all messages included in the plurality of instances of the web page during a previous period of time.
US13/544,861 2007-06-06 2012-07-09 Real-time adaptive probabilistic selection of messages Abandoned US20120278160A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/544,861 US20120278160A1 (en) 2007-06-06 2012-07-09 Real-time adaptive probabilistic selection of messages

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/758,932 US8219447B1 (en) 2007-06-06 2007-06-06 Real-time adaptive probabilistic selection of messages
US13/544,861 US20120278160A1 (en) 2007-06-06 2012-07-09 Real-time adaptive probabilistic selection of messages

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US11/758,932 Continuation US8219447B1 (en) 2007-06-06 2007-06-06 Real-time adaptive probabilistic selection of messages

Publications (1)

Publication Number Publication Date
US20120278160A1 true US20120278160A1 (en) 2012-11-01

Family

ID=46396003

Family Applications (2)

Application Number Title Priority Date Filing Date
US11/758,932 Active 2030-05-19 US8219447B1 (en) 2007-06-06 2007-06-06 Real-time adaptive probabilistic selection of messages
US13/544,861 Abandoned US20120278160A1 (en) 2007-06-06 2012-07-09 Real-time adaptive probabilistic selection of messages

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US11/758,932 Active 2030-05-19 US8219447B1 (en) 2007-06-06 2007-06-06 Real-time adaptive probabilistic selection of messages

Country Status (1)

Country Link
US (2) US8219447B1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120023173A1 (en) * 2010-07-21 2012-01-26 At&T Intellectual Property I, L.P. System and method for prioritizing message transcriptions
US8879695B2 (en) 2010-08-06 2014-11-04 At&T Intellectual Property I, L.P. System and method for selective voicemail transcription
US9215203B2 (en) 2010-07-22 2015-12-15 At&T Intellectual Property I, L.P. System and method for efficient unified messaging system support for speech-to-text service

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8935621B1 (en) 2010-05-18 2015-01-13 Amazon Technologies, Inc. Systems and methods for selecting components for inclusion in portions of a displayable file
US9009298B2 (en) * 2010-12-10 2015-04-14 The Nielsen Company (Us), Llc Methods and apparatus to determine audience engagement indices associated with media presentations
US20130254041A1 (en) * 2012-03-23 2013-09-26 Jeffrey Sherwin Method for delivering fair advertising inventory splits given position preferences
US9665556B1 (en) 2012-07-23 2017-05-30 Amazon Technologies, Inc. Assigning slots to user interface elements
US8949712B1 (en) * 2013-02-06 2015-02-03 Amazon Technologies, Inc. Assigning slots to content in a pipeline
US10210459B2 (en) * 2016-06-29 2019-02-19 The Nielsen Company (Us), Llc Methods and apparatus to determine a conditional probability based on audience member probability distributions for media audience measurement
US10949896B2 (en) * 2018-07-30 2021-03-16 Facebook, Inc. Distribution of embedded content items by an online system
US20230325456A1 (en) * 2022-04-08 2023-10-12 Content Square SAS Insights Interface for Hidden Products

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020062247A1 (en) * 2000-08-29 2002-05-23 Allen Bradley P. Selecting web site content to be displayed to a web site visitor based upon a probability distribution
US20050021403A1 (en) * 2001-11-21 2005-01-27 Microsoft Corporation Methods and systems for selectively displaying advertisements
US20050021397A1 (en) * 2003-07-22 2005-01-27 Cui Yingwei Claire Content-targeted advertising using collected user behavior data
US20050043994A1 (en) * 1996-09-04 2005-02-24 Walker Jay S. Method for allowing a customer to obtain a discounted price for a transaction and terminal for performing the method
US20070219863A1 (en) * 2006-03-20 2007-09-20 Park Joseph C Content generation revenue sharing
US20080027799A1 (en) * 2006-07-28 2008-01-31 Verizon Directory Services - West Inc. Location-based advertising
US7493324B1 (en) * 2005-12-05 2009-02-17 Verizon Services Corp. Method and computer program product for using data mining tools to automatically compare an investigated unit and a benchmark unit
US8386315B1 (en) * 2001-05-30 2013-02-26 Carl Meyer Yield management system and method for advertising inventory
US8571930B1 (en) * 2005-10-31 2013-10-29 A9.Com, Inc. Strategies for determining the value of advertisements using randomized performance estimates

Family Cites Families (53)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5446891A (en) 1992-02-26 1995-08-29 International Business Machines Corporation System for adjusting hypertext links with weighed user goals and activities
US6202058B1 (en) 1994-04-25 2001-03-13 Apple Computer, Inc. System for ranking the relevance of information objects accessed by computer users
US6658568B1 (en) 1995-02-13 2003-12-02 Intertrust Technologies Corporation Trusted infrastructure support system, methods and techniques for secure electronic commerce transaction and rights management
US5749081A (en) 1995-04-06 1998-05-05 Firefly Network, Inc. System and method for recommending items to a user
US5774123A (en) 1995-12-15 1998-06-30 Ncr Corporation Apparatus and method for enhancing navigation of an on-line multiple-resource information service
US5848396A (en) 1996-04-26 1998-12-08 Freedom Of Information, Inc. Method and apparatus for determining behavioral profile of a computer user
US5825651A (en) 1996-09-03 1998-10-20 Trilogy Development Group, Inc. Method and apparatus for maintaining and configuring systems
US5958008A (en) 1996-10-15 1999-09-28 Mercury Interactive Corporation Software system and associated methods for scanning and mapping dynamically-generated web documents
US5870559A (en) 1996-10-15 1999-02-09 Mercury Interactive Software system and associated methods for facilitating the analysis and management of web sites
US6012051A (en) 1997-02-06 2000-01-04 America Online, Inc. Consumer profiling system with analytic decision processor
US6085229A (en) 1998-05-14 2000-07-04 Belarc, Inc. System and method for providing client side personalization of content of web pages and the like
US20010014868A1 (en) 1997-12-05 2001-08-16 Frederick Herz System for the automatic determination of customized prices and promotions
US6631372B1 (en) 1998-02-13 2003-10-07 Yahoo! Inc. Search engine using sales and revenue to weight search results
US6256633B1 (en) 1998-06-25 2001-07-03 U.S. Philips Corporation Context-based and user-profile driven information retrieval
US6321221B1 (en) 1998-07-17 2001-11-20 Net Perceptions, Inc. System, method and article of manufacture for increasing the user value of recommendations
US6334127B1 (en) 1998-07-17 2001-12-25 Net Perceptions, Inc. System, method and article of manufacture for making serendipity-weighted recommendations to a user
WO2000008802A2 (en) 1998-08-03 2000-02-17 Doubleclick Inc. Network for distribution of re-targeted advertising
US6266649B1 (en) 1998-09-18 2001-07-24 Amazon.Com, Inc. Collaborative recommendations using item-to-item similarity mappings
US6317722B1 (en) 1998-09-18 2001-11-13 Amazon.Com, Inc. Use of electronic shopping carts to generate personal recommendations
US7720723B2 (en) 1998-09-18 2010-05-18 Amazon Technologies, Inc. User interface and methods for recommending items to users
JP2000148675A (en) * 1998-11-09 2000-05-30 Nec Corp Device and method for providing customized advertisement on www
US6466970B1 (en) 1999-01-27 2002-10-15 International Business Machines Corporation System and method for collecting and analyzing information about content requested in a network (World Wide Web) environment
US7016864B1 (en) 1999-04-09 2006-03-21 Dell Usa, L.P. Interactive upsell advisor method and apparatus for internet applications
US6493702B1 (en) 1999-05-05 2002-12-10 Xerox Corporation System and method for searching and recommending documents in a collection using share bookmarks
US6473794B1 (en) 1999-05-27 2002-10-29 Accenture Llp System for establishing plan to test components of web based framework by displaying pictorial representation and conveying indicia coded components of existing network framework
JP4743740B2 (en) 1999-07-16 2011-08-10 マイクロソフト インターナショナル ホールディングス ビー.ブイ. Method and system for creating automated alternative content recommendations
KR100328670B1 (en) 1999-07-21 2002-03-20 정만원 System For Recommending Items With Multiple Analyzing Components
US20010011235A1 (en) 2000-01-29 2001-08-02 E-Net Co., Ltd. Apparatus for realizing personal shops in an electronic commerce business
US6901378B1 (en) 2000-03-02 2005-05-31 Corbis Corporation Method and system for automatically displaying an image and a product in a page based on contextual interaction and metadata
US6606102B1 (en) 2000-06-02 2003-08-12 Gary Odom Optimizing interest potential
US6873967B1 (en) 2000-07-17 2005-03-29 International Business Machines Corporation Electronic shopping assistant and method of use
AU2002227514A1 (en) * 2000-07-27 2002-02-13 Polygnostics Limited Collaborative filtering
US7599851B2 (en) 2000-09-05 2009-10-06 Renee Frengut Method for providing customized user interface and targeted marketing forum
US7844489B2 (en) 2000-10-30 2010-11-30 Buyerleverage Buyer-driven targeting of purchasing entities
DE60209947T2 (en) 2001-01-09 2007-02-22 Metabyte Networks, Inc., Fremont A system, method and software for providing targeted advertising through user profile data structure based on user preferences
US7089237B2 (en) 2001-01-26 2006-08-08 Google, Inc. Interface and system for providing persistent contextual relevance for commerce activities in a networked environment
US20020198882A1 (en) 2001-03-29 2002-12-26 Linden Gregory D. Content personalization based on actions performed during a current browsing session
US6831663B2 (en) 2001-05-24 2004-12-14 Microsoft Corporation System and process for automatically explaining probabilistic predictions
US7389201B2 (en) 2001-05-30 2008-06-17 Microsoft Corporation System and process for automatically providing fast recommendations using local probability distributions
US20030004781A1 (en) 2001-06-18 2003-01-02 Mallon Kenneth P. Method and system for predicting aggregate behavior using on-line interest data
US20030074252A1 (en) 2001-10-12 2003-04-17 Avenue A, Inc. System and method for determining internet advertising strategy
US7295995B1 (en) 2001-10-30 2007-11-13 A9.Com, Inc. Computer processes and systems for adaptively controlling the display of items
US7370002B2 (en) * 2002-06-05 2008-05-06 Microsoft Corporation Modifying advertisement scores based on advertisement response probabilities
AU2003263928A1 (en) 2002-08-19 2004-03-03 Choicestream Statistical personalized recommendation system
US7310612B2 (en) 2003-08-13 2007-12-18 Amazon.Com, Inc. Personalized selection and display of user-supplied content to enhance browsing of electronic catalogs
US8027832B2 (en) 2005-02-11 2011-09-27 Microsoft Corporation Efficient language identification
US10515374B2 (en) 2005-03-10 2019-12-24 Adobe Inc. Keyword generation method and apparatus
US7542951B1 (en) 2005-10-31 2009-06-02 Amazon Technologies, Inc. Strategies for providing diverse recommendations
US20070156887A1 (en) * 2005-12-30 2007-07-05 Daniel Wright Predicting ad quality
US7827060B2 (en) * 2005-12-30 2010-11-02 Google Inc. Using estimated ad qualities for ad filtering, ranking and promotion
US7870024B2 (en) 2006-02-17 2011-01-11 Coon Jonathan C Systems and methods for electronic marketing
US20080086444A1 (en) 2006-10-09 2008-04-10 International Business Machines Corporation System and method for improving cardinality estimation in a relational database management system
JP5381705B2 (en) * 2007-05-14 2014-01-08 富士通株式会社 Advertisement providing system, advertisement display device, advertisement management device, advertisement display program, advertisement management program, advertisement display method, and advertisement management method

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050043994A1 (en) * 1996-09-04 2005-02-24 Walker Jay S. Method for allowing a customer to obtain a discounted price for a transaction and terminal for performing the method
US20020062247A1 (en) * 2000-08-29 2002-05-23 Allen Bradley P. Selecting web site content to be displayed to a web site visitor based upon a probability distribution
US8386315B1 (en) * 2001-05-30 2013-02-26 Carl Meyer Yield management system and method for advertising inventory
US20050021403A1 (en) * 2001-11-21 2005-01-27 Microsoft Corporation Methods and systems for selectively displaying advertisements
US20050021397A1 (en) * 2003-07-22 2005-01-27 Cui Yingwei Claire Content-targeted advertising using collected user behavior data
US8571930B1 (en) * 2005-10-31 2013-10-29 A9.Com, Inc. Strategies for determining the value of advertisements using randomized performance estimates
US7493324B1 (en) * 2005-12-05 2009-02-17 Verizon Services Corp. Method and computer program product for using data mining tools to automatically compare an investigated unit and a benchmark unit
US20070219863A1 (en) * 2006-03-20 2007-09-20 Park Joseph C Content generation revenue sharing
US20080027799A1 (en) * 2006-07-28 2008-01-31 Verizon Directory Services - West Inc. Location-based advertising

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120023173A1 (en) * 2010-07-21 2012-01-26 At&T Intellectual Property I, L.P. System and method for prioritizing message transcriptions
US8612526B2 (en) * 2010-07-21 2013-12-17 At&T Intellectual Property I, L.P. System and method for prioritizing message transcriptions
US9215203B2 (en) 2010-07-22 2015-12-15 At&T Intellectual Property I, L.P. System and method for efficient unified messaging system support for speech-to-text service
US9672826B2 (en) 2010-07-22 2017-06-06 Nuance Communications, Inc. System and method for efficient unified messaging system support for speech-to-text service
US8879695B2 (en) 2010-08-06 2014-11-04 At&T Intellectual Property I, L.P. System and method for selective voicemail transcription
US9137375B2 (en) 2010-08-06 2015-09-15 At&T Intellectual Property I, L.P. System and method for selective voicemail transcription
US9992344B2 (en) 2010-08-06 2018-06-05 Nuance Communications, Inc. System and method for selective voicemail transcription

Also Published As

Publication number Publication date
US8219447B1 (en) 2012-07-10

Similar Documents

Publication Publication Date Title
US8219447B1 (en) Real-time adaptive probabilistic selection of messages
US8666809B2 (en) Advertisement campaign simulator
US8527346B2 (en) Method and system for scheduling electronic advertising
US7742944B2 (en) Method for selecting relevant campaign messages for transmission to recipients
US8473339B1 (en) Automatically switching between pricing models for services
US20150134462A1 (en) Dynamic bid price selection
EP2757516A1 (en) System and method for serving electronic content
US20060184421A1 (en) Dynamically optimizing the presentation of advertising messages
US20090063249A1 (en) Adaptive Ad Server
US20090327028A1 (en) Systems and Methods for Utilizing Assist Data to Optimize Digital Ads
US20090327029A1 (en) Systems and Methods for Utilizing Normalized Impressions To Optimize Digital Ads
US9230269B2 (en) Segment-based floors for use in online ad auctioning techniques
US20150134463A1 (en) Performance metric based bid placement
US11868826B2 (en) Real-time data processing pipeline and pacing control systems and methods
US20130346218A1 (en) Bidding on Impression Opportunities Using Confidence Indicators
US20160379277A1 (en) Systems and methods for controlling online advertising campaigns
US20100174609A1 (en) Method and system for correcting bias introduced by estimating offer values
CN111899041A (en) Information delivery processing method, information delivery device, information delivery equipment and storage medium
CN111681035B (en) Method and device for estimating liveness of liveness advertisement
CN109377273B (en) Advertisement putting method and device and computer readable storage medium
JP5699233B1 (en) Generating device, generating method, and generating program
US20160275571A1 (en) Pacing the serving of a content item
US10825041B1 (en) Real-time optimization of bid selection
US20160019583A1 (en) Systems and methods for smooth and effective budget delivery in online advertising
US20230012700A1 (en) Systems and methods for optimizing electronic content delivery for non-measurable users

Legal Events

Date Code Title Description
STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION