US20140279074A1 - Data management platform for digital advertising - Google Patents

Data management platform for digital advertising Download PDF

Info

Publication number
US20140279074A1
US20140279074A1 US13/924,343 US201313924343A US2014279074A1 US 20140279074 A1 US20140279074 A1 US 20140279074A1 US 201313924343 A US201313924343 A US 201313924343A US 2014279074 A1 US2014279074 A1 US 2014279074A1
Authority
US
United States
Prior art keywords
data
user
processor
real
analytics
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/924,343
Inventor
Songting Chen
Ali Dasdan
Hazem Elmeleegy
Santanu Kolay
Yinan Li
Yan Qi
Peter WILMOT
Mingxi Wu
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Turn Inc
Original Assignee
Turn Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Turn Inc filed Critical Turn Inc
Priority to US13/924,343 priority Critical patent/US20140279074A1/en
Publication of US20140279074A1 publication Critical patent/US20140279074A1/en
Assigned to TURN INC. reassignment TURN INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: WILMOT, PETER, DASDAN, ALI, KOLAY, SANTANU, CHEN, SONGTING, ELMELEEGY, HAZEM, QI, YAN, WU, MINGXI, LI, YINAN
Assigned to SILICON VALLEY BANK, AS ADMINISTRATIVE AGENT reassignment SILICON VALLEY BANK, AS ADMINISTRATIVE AGENT SECURITY AGREEMENT Assignors: TURN INC.
Abandoned legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0251Targeted advertisements
    • G06Q30/0269Targeted advertisements based on user profile or attribute
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0277Online advertisement
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0481Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
    • G06F3/0482Interaction with lists of selectable items, e.g. menus
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/103Formatting, i.e. changing of presentation of documents
    • G06F40/117Tagging; Marking up; Designating a block; Setting of attributes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/137Hierarchical processing, e.g. outlines

Definitions

  • the invention relates generally to digital advertising. More particularly, the invention relates to a data management platform for digital advertising.
  • RTBEs real-time bidding exchanges
  • DSPs demand-side platforms
  • RTBEs and DSPs typically takes less than 150 ms including the network time, leaving less than 50 ms for DSPs to run their runtime pipelines. It is well understood that to make such dynamic buying decisions optimal, particular data, including user data, advertiser data, contextual data, plays a central role.
  • a DMP may be a central hub to seamlessly and rapidly collect, integrate, manage, and activate large volume of data.
  • An embodiment of the invention comprises a data management platform (DMP) that integrates the following functionalities:
  • a DMP is configured to cleanse and integrate data from multiple platforms or channels with heterogeneous schema. Importantly, such integration may have to happen at the finest granular level by linking the same audience or users across different platforms. By such functionality, a deeper and more insightful audience analytics may be obtained across campaign activities.
  • Analytics A DMP provides full cross channel reporting and analytics capabilities. Examples may include, but are not limited to, aggregation, user behavior correlation analysis, multi-touch attribution, defined as attributing credit to the channels which contributed to a final action of an audience, tag management, analytical modeling, etc. Furthermore, such DMP may be delivered through cloud-based software-as-a-service (SaaS) to end users and provide them the flexibility to plug in their own analytical intelligence.
  • SaaS software-as-a-service
  • a DMP is configured to not only get data in, but also send data out in real-time. In other words, such DMP may need to make the insights actionable. For example, such DMP may be configured to perform modeling and scoring in real-time by combining online and offline data and sending the data to other platforms to optimize the downstream media and enhance the customer experience.
  • an embodiment of the invention provides a data management apparatus for digital advertising.
  • a data integration processor is provided for collecting and storing data from providers, resolving heterogeneity of the data at schema and data levels, and performing validity checks of the data.
  • An analytics processor is provided for receiving validated data from the data integration processor and providing to users custom, nesting-aware, SQL-like query language and a library of data mining methods, machine learning models, and analytical user profiles (AUP).
  • AUP analytical user profiles
  • an activation processor is provided for encapsulating complex computations performed in real-time, segment evaluation, and online user classification using runtime user profiles (RUP).
  • FIG. 1 is a schematic diagram showing an architecture of a DMP according to an embodiment of the invention.
  • FIG. 2 is a block schematic diagram of a system in the exemplary form of a computer system according to an embodiment of the invention.
  • An embodiment of the invention comprises a data management platform (DMP) that integrates the following functionalities:
  • a DMP is configured to cleanse and integrate data from multiple platforms or channels with heterogeneous schema. Importantly, such integration may have to happen at the finest granular level by linking the same audience or users across different platforms. By such functionality, a deeper and more insightful audience analytics may be obtained across campaign activities.
  • Analytics A DMP provides full cross channel reporting and analytics capabilities. Examples may include, but are not limited to, aggregation, user behavior correlation analysis, multi-touch attribution, defined as attributing credit to the channels which contributed to a final action of an audience, tag management, analytical modeling, etc. Furthermore, such DMP may be delivered through cloud-based software-as-a-service (SaaS) to end users and provide them the flexibility to plug in their own analytical intelligence.
  • SaaS software-as-a-service
  • a DMP is configured to not only get data in, but also send data out in real-time. In other words, such DMP may need to make the insights actionable. For example, such DMP may be configured to perform modeling and scoring in real-time by combining online and offline data and sending the data to other platforms to optimize the downstream media and enhance the customer experience.
  • an embodiment of the invention provides a data management apparatus for digital advertising.
  • a data integration processor is provided for collecting and storing data from providers, resolving heterogeneity of the data at schema and data levels, and performing validity checks of the data.
  • An analytics processor is provided for receiving validated data from the data integration processor and providing to users custom, nesting-aware, SQL-like query language and a library of data mining methods, machine learning models, and analytical user profiles (AUP).
  • AUP analytical user profiles
  • an activation processor is provided for encapsulating complex computations performed in real-time, segment evaluation, and online user classification using runtime user profiles (RUP).
  • DMPs are able to handle big data in batch mode, as well as in real-time, thus unifying techniques from multiple fields of data science, including databases, data mining, streaming, distributed systems, key-value stores, and machine learning as disclosed in K.-C. Lee, B. Orten, A. Dasdan, and W. Li, Estimating Conversion Rate in Display Advertising from Past Performance Data , in KDD, pages 768-776, 2012; X. Shao and L. Li. Data - driven Multi - touch Attribution Models , in KDD, pages 258-264, 2011 (“Shao”); etc.
  • an audience or user profile covers available information for a given anonymized user, including but not limited to, demographics, psychographics, campaign, and behavioral data.
  • User profile data may be typically collected from various sources. Such data may be first party data, i.e. historical user data collected by advertisers in their own private customer relationship management (CRM) systems, or third party data, i.e. data provided by third party data partners, typically each specializing in a specific type of data, e.g. credit scores, buying intentions, etc.
  • CCM customer relationship management
  • third party data i.e. data provided by third party data partners, typically each specializing in a specific type of data, e.g. credit scores, buying intentions, etc.
  • user profiles are treated as first class citizen and are the basic units for offline analytics, as well as for real-time applications.
  • user profile data may arrive in various types, formats, and cardinalities, which may be best captured using a nested relational data model.
  • each user profile is one record, where some attributes of this record could be another table storing certain type of events.
  • the use of the nested relational data model has already gained wide adoption in the field of big data such as disclosed in S. Melnik, A. Gubarev, J. J. Long, G. Romer, S. Shivakumar, M. Tolton, and T. Vassilakis. Dremel: Interactive analysis of web - scale datasets . PVLDB, 3(1):330-339, 2010.
  • a DMP maintains two versions of the user profiles.
  • the analytical user profile (AUP) is designed for the purpose of offline analytics and data mining.
  • the AUP is stored in a Hadoop File System (HDFS), such as disclosed in Hadoop, Open Source Implementation of MapReduce at hadoop.apache.org.
  • HDFS Hadoop File System
  • ROP runtime user profile
  • FIG. 1 shows an overall architecture of a DMP 3402 .
  • the DMP 3402 comprises three key components: a data integration engine 3404 ; an analytics engine 3406 ; and a real-time activation engine 3408 , also referred to herein as a runtime engine.
  • the data integration engine 3404 is responsible for gathering and storing data from first and third party providers, resolving heterogeneity at schema and data levels, e.g. disparate user ids, and performing the necessary validity checks. Once the data are received from the external partners, such data then flow into the other two components.
  • the analytics engine 3406 may be known as Cheetah, in S. Chen, “ Cheetah: A high performance, custom data warehouse on top of mapreduce ,” PVLDB, 3(2):1459-1468, 2010 (“Cheetah document”) and has an AUP store as a data layer.
  • the analytics engine 3406 provides the data analysts with a custom, nesting-aware, SQL-like query language called Cheetah Query Language (CQL), in addition to a rich library of data mining methods and machine learning models.
  • CQL Cheetah Query Language
  • a runtime engine 3408 runs on top of an RUP store. The runtime engine 3408 encapsulates the complex computations performed in real-time, such as but not limited to segment evaluation, online user classification, etc.
  • FIG. 1 also depicts the interaction between the DMP 3402 and the other components in the digital advertising ecosystem in accordance with an embodiment of the invention.
  • the DSPs 3410 are the entities responsible for real-time bidding (RTB) or programmatic buying of ad space from publishers, i.e. supply side, on behalf of advertisers 3411 , i.e. demand side.
  • the DSPs 3410 interact directly with the runtime engine 3408 of the DMP 3402 to obtain the information that is necessary for ad selection and bid optimization.
  • the DSPs 3410 mainly respond to bid requests from the RTBEs.
  • the RTBEs are the entities where publishers make their inventory of ad space available for the highest bidders.
  • the RTBEs have support for public exchanges 3412 and private exchanges 3414 . Unlike public exchanges 3412 , private exchanges 3414 provide publishers 3416 with more control over which advertisers may use their media channels and which ads may run on their media channels.
  • user online activity data e.g. in the form of impressions, clicks, and actions
  • the DMP 3402 is sent to the DMP 3402 by the DSPs 3410 , e.g. for impression and click data, and the advertiser, e.g. for action data.
  • the online data may be obtained from those external DSPs. In such cases, they are considered to be third party media providers 3418 , analogous to third party offline data providers 3420 .
  • a user profile may be a central data repository in the DMP.
  • Each profile contains marketing campaign data, online behavior data, CRM data, etc. Some of such data are collected online, while others are collected by loading offline data files, which are typically keyed off disparate user ids from another platform.
  • the DMP designed integration software is referred to as the Datahub and is used to receive offline data files.
  • the Datahub implements three steps:
  • the Datahub handles scalability through multiple FTP servers, multi-pipeline concurrent loading, a Hadoop MapReduce computation model, and its own job scheduler to prioritize specific jobs.
  • the Datahub also achieves immunity from bad data by an initial validation of offline data, thus shielding an embodiment of the DMP from any dirty data.
  • the Datahub uses configuration files for instantiating different loading templates and a centralized catalog, supporting schema evolvement, to mitigate heterogeneity issues.
  • the Datahub consistently saves metrics, such as files that were received and stored, records processed, records rejected, last successful pipeline step, and profiling times, in a database. Such monitoring information enables system alerts, client notifications, and billing statements.
  • the Datahub may be configured to recover after failure through a fault tolerance protocol relying on persistent status files.
  • the Datahub may incorporate more custom logic into a join algorithm, e.g. two data files may easily be differentiated and loaded incrementally.
  • analytics over AUPs may be based on Cheetah, which is a high performance, custom data warehouse, as disclosed in the Cheetah document, supra.
  • Cheetah has a SQL-like query language (CQL), which also supports queries over nested data models. Below is an example query:
  • nested tables in the user profile there are two nested tables in the user profile: prof.actions and prof.impressions, which record user's actions or conversions and impressions, respectively. Both nested tables have the field, advertiser, to identify which advertiser the action/impression is related to; and the field, ts, as the time stamp. Therefore, the query above applies GROUP-BY to the column advertiser of the nested table prof.actions, to compute the total occurrence of actions, i.e. count( ) and the number of users who have the action, i.e. count(distinct uid), given the WHERE clause indicating at least one impression from the same advertiser should take place before an action.
  • Cheetah employs a number of optimization techniques for AUP queries. To name a few, but not to be limiting:
  • CQL allows for SQL-based aggregations and correlations between different audience events.
  • marketers look for more advanced analytics, such as modeling and machine learning.
  • MTA multi-touch attribution
  • MTA is a billing model that defines how advertisers distribute credit, e.g. customer purchase, to their campaigns in different media channels, e.g. video, display, mobile, etc. For example, suppose a user sees a car ad on a Web browser. Later, the user sees a TV commercial about the same car again, which makes him more interested. Finally, after the user sees this ad again on his mobile phone, he takes action and registers for a test drive. Marketers know that such media channels may contribute to a final conversion of an audience. However, a current common practice is last-touch attribution (LTA), where the last impression, the one on the mobile phone, gets the credit.
  • LTA last-touch attribution
  • CQL as well as the data mining UDFs are exposed to external clients as a data service in the cloud and are configured such that the external clients may perform ad hoc analysis and obtain very unique insights on their own.
  • RUPs may refer to user profiles stored in profile stores for real-time applications.
  • RUPs also have a nested data model and are updated incrementally and in real-time with new user events.
  • Profile stores as with other Not only SQL (NoSQL) systems, are high-performance, key-value stores for RUPs, with keys being user ids and values being RUPs.
  • NoSQL Not only SQL
  • profile stores are highly optimized to provide low-latency read/write RUP access, typically within a few milliseconds to support peak 1,000,000 queries per second across multiple, geographically distributed data centers.
  • a design of an embodiment of a profile store is inspired by G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels' disclosure entitled, “ Dynamo: Amazon's highly available key - value store ,” in SOSP, pages 205-220, 2007 and by Voldemort, which may be found at www.project-voldemort.com/voldemort.
  • a software layer is built on top of Berkeley DB (BDB) that uses consistent hashing to achieve sharding, replication, consistency, and fault tolerance.
  • BDB Berkeley DB
  • the embodiment of the profile store also employs flash drives because hard disks may not be fast enough for the purpose.
  • RUPs are replicated locally in each data center, as well as globally between data centers, to achieve high availability and local low-latency access.
  • an infrastructure called the replication bus is built and employed that incrementally replicates user events across data centers and distributes such to profile stores to keep RUPs up-to-date.
  • the replication bus is highly optimized to synchronize tens of billions of events daily between data centers with an average end-to-end service level agreement (end-to-end SLA) of within a few seconds.
  • DMP data management platform
  • both AUP and RUP may store arbitrary user level data in a nested format.
  • Ingress servers are responsible for receiving and storing data look up user profiles in real-time and performing mapping between cookies when necessary.
  • the platform supports multiple types of data as impression/click events, structured data events, or arbitrary key-value pair data events. These data events are available in RUPs in real-time for the platform to use for algorithmic computation and decision making as well as analytics. Eventually such events from RUP are replicated to AUP.
  • the platform supports multiple real-time operations on the received data.
  • Many of such operations may be modeled as complex event processing. For example, one entity might want to find if a user belongs to a particular set of predefined segments in real-time.
  • the segments are represented as a complex Boolean expression of attributes defined by some predefined taxonomy. Often the segments may be significantly more complicated than simple Boolean expressions, e.g. having some user behavior constraints such as having seen a display advertisement in the last seven days.
  • Such complex segments may be represented by some form of executable code that is evaluated against the RUP data in real-time.
  • Another example use of real-time computation on a user profile in accordance with an embodiment of the invention involves evaluating a user against machine-learned models.
  • Such models may be specified by the users of the DMP in some proprietary format or by using industry standard model specification language, such as Predictive Model Markup Language (PMML), an example of which may be found at en.wikipedia.org/wiki/predictive model markup language.
  • PMML Predictive Model Markup Language
  • An example model may predict a car buyer based on the latest online activity or a person likely to apply for a credit card. Having such knowledge in real-time may be enormous valuable to clients because they may use such prediction as signals to bias the campaigns or take other actions in real-time.
  • a computation for a particular algorithm may be significantly complex requiring multiple stages of a computation layer.
  • Such style of computations may be simply thought of as a series of real-time MapReduce jobs processing the data step-by-step.
  • the computation is represented by a continuous query language or by predefined operators using UDFs, which operate on RUPs in real-time.
  • UDFs which operate on RUPs in real-time.
  • Such approach may solve complex tasks, such as learning a classification model, performing anomaly detection, or performing other data stream algorithms, such as maintaining top-K elements in a stream, as disclosed for example in A. Metwally, D. Agrawal, and A. El Abbadi. Efficient computation of frequent and top - k elements in data streams .
  • signals generated out of the computation layer are stored as unstructured data in RUPs and AUPs and may also be sent back to the clients through egress servers for immediate action.
  • DSPs or other platforms may immediately leverage such signals for better user behavior prediction to achieve better campaign performance.
  • Digital advertising has now reached a state where the pipeline between publishers on the supply side and advertisers on the demand site necessitates many technology partners to help publishers and advertisers deal with real-time optimal decisioning on a huge scale.
  • data management platforms may occupy a prominent role as the hub where data relevant to reaching the audience over different channels is integrated, analyzed, and shared.
  • a high-level overview of one or more embodiments of the DMP as an example demand side platform has been disclosed. It is contemplated that due to efficiencies gained through real-time decisioning and the scales involved with more online usage, the future of advertising may be more real-time, which may imply more data and components in real-time.
  • FIG. 2 is a block schematic diagram of a system in the exemplary form of a computer system 3500 within which a set of instructions for causing the system to perform any one of the foregoing methodologies may be executed.
  • the system may comprise a network router, a network switch, a network bridge, personal digital assistant (PDA), a cellular telephone, a Web appliance or any system capable of executing a sequence of instructions that specify actions to be taken by that system.
  • PDA personal digital assistant
  • the computer system 3500 includes a processor 3502 , a main memory 3504 and a static memory 3506 , which communicate with each other via a bus 3508 .
  • the computer system 3500 may further include a display unit 3510 , for example, a liquid crystal display (LCD) or a cathode ray tube (CRT).
  • the computer system 3500 also includes an alphanumeric input device 3512 , for example, a keyboard; a cursor control device 3514 , for example, a mouse; a disk drive unit 3516 , a signal generation device 3518 , for example, a speaker, and a network interface device 3520 .
  • the disk drive unit 3516 includes a machine-readable medium 3524 on which is stored a set of executable instructions, i.e. software, 3526 embodying any one, or all, of the methodologies described herein below.
  • the software 3526 is also shown to reside, completely or at least partially, within the main memory 3504 and/or within the processor 3502 .
  • the software 3526 may further be transmitted or received over a network 3528 , 3530 by means of a network interface device 3520 .
  • a different embodiment uses logic circuitry instead of computer-executed instructions to implement processing entities.
  • this logic may be implemented by constructing an application-specific integrated circuit (ASIC) having thousands of tiny integrated transistors.
  • ASIC application-specific integrated circuit
  • Such an ASIC may be implemented with CMOS (complementary metal oxide semiconductor), TTL (transistor-transistor logic), VLSI (very large systems integration), or another suitable construction.
  • DSP digital signal processing chip
  • FPGA field programmable gate array
  • PLA programmable logic array
  • PLD programmable logic device
  • a machine-readable medium includes any mechanism for storing or transmitting information in a form readable by a machine, e.g. a computer.
  • a machine readable medium includes read-only memory (ROM); random access memory (RAM); magnetic disk storage media; optical storage media; flash memory devices; electrical, optical, acoustical or other form of propagated signals, for example, carrier waves, infrared signals, digital signals, etc.; or any other type of media suitable for storing or transmitting information.
  • embodiments may include performing operations and using storage with cloud computing.
  • cloud computing may mean executing algorithms on any network that is accessible by internet-enabled or network-enabled devices, servers, or clients and that do not require complex hardware configurations, e.g. requiring cables and complex software configurations, e.g. requiring a consultant to install.
  • embodiments may provide one or more cloud computing solutions that enable users, e.g. users on the go, to obtain advertising analytics or universal tag management in accordance with embodiments herein on such internet-enabled or other network-enabled devices, servers, or clients.
  • one or more cloud computing embodiments may include providing or obtaining advertising analytics or performing universal tag management using mobile devices, tablets, and the like, as such devices are becoming standard consumer devices.

Abstract

A data management apparatus for digital advertising includes a data integration processor for collecting and storing data from providers, resolving heterogeneity of the data at schema and data levels, and performing validity checks of the data; an analytics processor for receiving validated data from the data integration processor and providing to users custom, nesting-aware, SQL-like query language and a library of data mining methods, machine learning models, and analytical user profiles (AUP); and an activation processor for encapsulating complex computations performed in real-time, segment evaluation, and online user classification using runtime user profiles (RUP).

Description

    CROSS REFERENCE TO RELATED APPLICATIONS
  • This application claims priority to U.S. provisional patent application Ser. No. 61/801,001, filed Mar. 22, 2013, which application is incorporated herein in its entirety by this reference thereto.
  • BACKGROUND OF THE INVENTION
  • 1. Technical Field
  • The invention relates generally to digital advertising. More particularly, the invention relates to a data management platform for digital advertising.
  • 2. Description of the Related Art
  • Over the last decade, a number of radical changes have reshaped the worlds of digital advertising, marketing, and media. The first is an innovation called programmatic buying, which is the process of executing media buys in an automated fashion through digital platforms, such as real-time bidding exchanges (RTBEs) and demand-side platforms (DSPs). This method replaces the traditional use of manual processes and negotiations to purchase digital media. Instead, an advertisement (ad) impression is made available through an auction in a RTBE in real-time. Upon requests from RTBEs, DSPs then choose to respond with bids and proposed ads on behalf of their advertisers for this impression. The entire end-to-end buying process between RTBEs and DSPs typically takes less than 150 ms including the network time, leaving less than 50 ms for DSPs to run their runtime pipelines. It is well understood that to make such dynamic buying decisions optimal, particular data, including user data, advertiser data, contextual data, plays a central role.
  • A second important shift is the prolific use of mobile devices, social networks, and video sites. As a result, marketers have gained powerful tools to reach customers through multiple channels such as but not limited to mobile, social, video, display, email, and search. There are numerous platforms dedicated to single channel optimization. For example, video channel platforms aim to maximize the user engagements with video ads, while social ad platforms aim to increase the number of fans and likes of a given product. Regardless of channel, data driven approaches have been proven to be very effective to lift the campaign performance.
  • With the advance of such technologies, one challenge to the marketers today is that the marketing strategy becomes more complicated than ever before. While much work has been done to optimize each individual channel, how different channels interact with each other is little understood. This is however very important as customers often interact with multiple touch points through multiple channels. One main obstacle is that while there are abundant data to leverage, such data may be in different platforms and in different forms. As a result, it may be a non-trivial task to create a global dashboard by extracting aggregated reporting data from different platforms. Performing even finer grain analytics across channels may be virtually impossible, which may be important to the effectiveness, attributions, and accurate rate of return of different channels.
  • Recently, data management platforms (DMPs) have been emerging as the solution to address the above challenge. A DMP may be a central hub to seamlessly and rapidly collect, integrate, manage, and activate large volume of data.
  • SUMMARY OF THE INVENTION
  • An embodiment of the invention comprises a data management platform (DMP) that integrates the following functionalities:
  • 1. Data integration: A DMP is configured to cleanse and integrate data from multiple platforms or channels with heterogeneous schema. Importantly, such integration may have to happen at the finest granular level by linking the same audience or users across different platforms. By such functionality, a deeper and more insightful audience analytics may be obtained across campaign activities.
    2. Analytics: A DMP provides full cross channel reporting and analytics capabilities. Examples may include, but are not limited to, aggregation, user behavior correlation analysis, multi-touch attribution, defined as attributing credit to the channels which contributed to a final action of an audience, tag management, analytical modeling, etc. Furthermore, such DMP may be delivered through cloud-based software-as-a-service (SaaS) to end users and provide them the flexibility to plug in their own analytical intelligence.
    3. Activation: A DMP is configured to not only get data in, but also send data out in real-time. In other words, such DMP may need to make the insights actionable. For example, such DMP may be configured to perform modeling and scoring in real-time by combining online and offline data and sending the data to other platforms to optimize the downstream media and enhance the customer experience.
  • Thus, an embodiment of the invention provides a data management apparatus for digital advertising. A data integration processor is provided for collecting and storing data from providers, resolving heterogeneity of the data at schema and data levels, and performing validity checks of the data. An analytics processor is provided for receiving validated data from the data integration processor and providing to users custom, nesting-aware, SQL-like query language and a library of data mining methods, machine learning models, and analytical user profiles (AUP). Further, an activation processor is provided for encapsulating complex computations performed in real-time, segment evaluation, and online user classification using runtime user profiles (RUP).
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a schematic diagram showing an architecture of a DMP according to an embodiment of the invention; and
  • FIG. 2 is a block schematic diagram of a system in the exemplary form of a computer system according to an embodiment of the invention.
  • DETAILED DESCRIPTION OF THE INVENTION
  • An embodiment of the invention comprises a data management platform (DMP) that integrates the following functionalities:
  • 1. Data integration: A DMP is configured to cleanse and integrate data from multiple platforms or channels with heterogeneous schema. Importantly, such integration may have to happen at the finest granular level by linking the same audience or users across different platforms. By such functionality, a deeper and more insightful audience analytics may be obtained across campaign activities.
    2. Analytics: A DMP provides full cross channel reporting and analytics capabilities. Examples may include, but are not limited to, aggregation, user behavior correlation analysis, multi-touch attribution, defined as attributing credit to the channels which contributed to a final action of an audience, tag management, analytical modeling, etc. Furthermore, such DMP may be delivered through cloud-based software-as-a-service (SaaS) to end users and provide them the flexibility to plug in their own analytical intelligence.
    3. Activation: A DMP is configured to not only get data in, but also send data out in real-time. In other words, such DMP may need to make the insights actionable. For example, such DMP may be configured to perform modeling and scoring in real-time by combining online and offline data and sending the data to other platforms to optimize the downstream media and enhance the customer experience.
  • Thus, an embodiment of the invention provides a data management apparatus for digital advertising. A data integration processor is provided for collecting and storing data from providers, resolving heterogeneity of the data at schema and data levels, and performing validity checks of the data. An analytics processor is provided for receiving validated data from the data integration processor and providing to users custom, nesting-aware, SQL-like query language and a library of data mining methods, machine learning models, and analytical user profiles (AUP). Further, an activation processor is provided for encapsulating complex computations performed in real-time, segment evaluation, and online user classification using runtime user profiles (RUP).
  • The following is an overview of an exemplary DMP in accordance with an embodiment of the invention. It has been found that DMPs are able to handle big data in batch mode, as well as in real-time, thus unifying techniques from multiple fields of data science, including databases, data mining, streaming, distributed systems, key-value stores, and machine learning as disclosed in K.-C. Lee, B. Orten, A. Dasdan, and W. Li, Estimating Conversion Rate in Display Advertising from Past Performance Data, in KDD, pages 768-776, 2012; X. Shao and L. Li. Data-driven Multi-touch Attribution Models, in KDD, pages 258-264, 2011 (“Shao”); etc.
  • The remainder of the discussion herein is organized as a high-level overview of an embodiment of a DMP and three main components thereof: data integration, analytics, and activation.
  • Audience and Nested Data Model
  • In an embodiment of the invention, an audience or user profile covers available information for a given anonymized user, including but not limited to, demographics, psychographics, campaign, and behavioral data. User profile data may be typically collected from various sources. Such data may be first party data, i.e. historical user data collected by advertisers in their own private customer relationship management (CRM) systems, or third party data, i.e. data provided by third party data partners, typically each specializing in a specific type of data, e.g. credit scores, buying intentions, etc. In one embodiment, user profiles are treated as first class citizen and are the basic units for offline analytics, as well as for real-time applications.
  • In an embodiment of the invention, user profile data may arrive in various types, formats, and cardinalities, which may be best captured using a nested relational data model. Logically, each user profile is one record, where some attributes of this record could be another table storing certain type of events. In addition to the digital marketing domain, the use of the nested relational data model has already gained wide adoption in the field of big data such as disclosed in S. Melnik, A. Gubarev, J. J. Long, G. Romer, S. Shivakumar, M. Tolton, and T. Vassilakis. Dremel: Interactive analysis of web-scale datasets. PVLDB, 3(1):330-339, 2010.
  • Based on the above-described functionality of DMPs, in accordance with an embodiment of the invention, a DMP maintains two versions of the user profiles. First, the analytical user profile (AUP) is designed for the purpose of offline analytics and data mining. In an embodiment of the invention, the AUP is stored in a Hadoop File System (HDFS), such as disclosed in Hadoop, Open Source Implementation of MapReduce at hadoop.apache.org. Second, a runtime user profile (RUP) is stored in a globally replicated key-value store to enable fast and reliable retrieval in few milliseconds for real-time applications.
  • System Architecture
  • An embodiment of the invention can be understood with reference to FIG. 1, which shows an overall architecture of a DMP 3402. The DMP 3402 comprises three key components: a data integration engine 3404; an analytics engine 3406; and a real-time activation engine 3408, also referred to herein as a runtime engine.
  • The data integration engine 3404, referred to herein as the Datahub, is responsible for gathering and storing data from first and third party providers, resolving heterogeneity at schema and data levels, e.g. disparate user ids, and performing the necessary validity checks. Once the data are received from the external partners, such data then flow into the other two components. In an embodiment, the analytics engine 3406 may be known as Cheetah, in S. Chen, “Cheetah: A high performance, custom data warehouse on top of mapreduce,” PVLDB, 3(2):1459-1468, 2010 (“Cheetah document”) and has an AUP store as a data layer. The analytics engine 3406 provides the data analysts with a custom, nesting-aware, SQL-like query language called Cheetah Query Language (CQL), in addition to a rich library of data mining methods and machine learning models. In an embodiment, a runtime engine 3408 runs on top of an RUP store. The runtime engine 3408 encapsulates the complex computations performed in real-time, such as but not limited to segment evaluation, online user classification, etc.
  • Ecosystem
  • FIG. 1 also depicts the interaction between the DMP 3402 and the other components in the digital advertising ecosystem in accordance with an embodiment of the invention. The DSPs 3410 are the entities responsible for real-time bidding (RTB) or programmatic buying of ad space from publishers, i.e. supply side, on behalf of advertisers 3411, i.e. demand side. The DSPs 3410 interact directly with the runtime engine 3408 of the DMP 3402 to obtain the information that is necessary for ad selection and bid optimization. The DSPs 3410 mainly respond to bid requests from the RTBEs. The RTBEs are the entities where publishers make their inventory of ad space available for the highest bidders. The RTBEs have support for public exchanges 3412 and private exchanges 3414. Unlike public exchanges 3412, private exchanges 3414 provide publishers 3416 with more control over which advertisers may use their media channels and which ads may run on their media channels.
  • In an embodiment of the invention, user online activity data, e.g. in the form of impressions, clicks, and actions, is sent to the DMP 3402 by the DSPs 3410, e.g. for impression and click data, and the advertiser, e.g. for action data. Because an embodiment of the DMP may be integrated both with its own DSP, as well as other DSPs in the ecosystem, the online data may be obtained from those external DSPs. In such cases, they are considered to be third party media providers 3418, analogous to third party offline data providers 3420.
  • In the following discussion, particular, important components of one or more embodiments of the DMP 3402 are explained in more detail.
  • Data Integration
  • In an embodiment of the invention, a user profile may be a central data repository in the DMP. Each profile contains marketing campaign data, online behavior data, CRM data, etc. Some of such data are collected online, while others are collected by loading offline data files, which are typically keyed off disparate user ids from another platform.
  • In an embodiment of the invention, the DMP designed integration software is referred to as the Datahub and is used to receive offline data files. At a high level, the Datahub implements three steps:
      • Upload offline data files to HDFS;
      • Check the offline data files for data type or out of range errors; and
      • Record lineage by joining the data files with the user profiles based on user id mappings.
  • In an embodiment of the invention, the Datahub handles scalability through multiple FTP servers, multi-pipeline concurrent loading, a Hadoop MapReduce computation model, and its own job scheduler to prioritize specific jobs. The Datahub also achieves immunity from bad data by an initial validation of offline data, thus shielding an embodiment of the DMP from any dirty data. Additionally, the Datahub uses configuration files for instantiating different loading templates and a centralized catalog, supporting schema evolvement, to mitigate heterogeneity issues. The Datahub consistently saves metrics, such as files that were received and stored, records processed, records rejected, last successful pipeline step, and profiling times, in a database. Such monitoring information enables system alerts, client notifications, and billing statements. Furthermore, the Datahub may be configured to recover after failure through a fault tolerance protocol relying on persistent status files. As well, by leveraging the nested data model of user profiles, the Datahub may incorporate more custom logic into a join algorithm, e.g. two data files may easily be differentiated and loaded incrementally.
  • Analytics Cheetah Query Language
  • In an embodiment of the invention, analytics over AUPs may be based on Cheetah, which is a high performance, custom data warehouse, as disclosed in the Cheetah document, supra. Cheetah has a SQL-like query language (CQL), which also supports queries over nested data models. Below is an example query:
  • SELECT advertiser, count(*) actions, count(distinct uid)
    FROM prof.actions a
    WHERE(
      SELECT count(impression id)
      FROM prof.impressions b
      WHERE a.advertiser = b.advertiser and b.ts < a.ts) > 0
    GROUP BY advertiser
  • In an embodiment of the invention, there are two nested tables in the user profile: prof.actions and prof.impressions, which record user's actions or conversions and impressions, respectively. Both nested tables have the field, advertiser, to identify which advertiser the action/impression is related to; and the field, ts, as the time stamp. Therefore, the query above applies GROUP-BY to the column advertiser of the nested table prof.actions, to compute the total occurrence of actions, i.e. count( ) and the number of users who have the action, i.e. count(distinct uid), given the WHERE clause indicating at least one impression from the same advertiser should take place before an action. The filtering condition is composed of a sub-query, which calculates the total number of impressions occurring before the concerned action, i.e. b.ts<a.ts, from the same advertiser, i.e. a.advertiser=b.advertiser, by querying on the nested table prof.impressions of the same user profile.
  • In an embodiment of the invention, Cheetah employs a number of optimization techniques for AUP queries. To name a few, but not to be limiting:
      • Cheetah optimizes correlated sub-queries over nested tables as in the above example.
      • Cheetah organizes the columns in the user profile in the PAX-like format to achieve better compression ratios, such as disclosed in A. Ailamaki, D. J. DeWitt, M. D. Hill, and M. Skounakis. Weaving Relations for Cache Performance, in VLDB, pages 169-180, 2001. Cheetah also leverages the Hadoop column store format as disclosed in Trevni: A column file format, avro.apache.org/docs/current/trevni/spec.html to avoid scanning irrelevant data.
      • Multi-query execution: Cheetah allows multiple queries to be submitted simultaneously and executed in a batch mode, where the input data are scanned only once for all those queries.
    Advanced Analytics
  • In an embodiment of the invention, CQL allows for SQL-based aggregations and correlations between different audience events. Sometimes, marketers look for more advanced analytics, such as modeling and machine learning. One example is multi-touch attribution (MTA) as described in Shao, supra.
  • In an implementation of an embodiment, MTA is a billing model that defines how advertisers distribute credit, e.g. customer purchase, to their campaigns in different media channels, e.g. video, display, mobile, etc. For example, suppose a user sees a car ad on a Web browser. Later, the user sees a TV commercial about the same car again, which makes him more interested. Finally, after the user sees this ad again on his mobile phone, he takes action and registers for a test drive. Marketers know that such media channels may contribute to a final conversion of an audience. However, a current common practice is last-touch attribution (LTA), where the last impression, the one on the mobile phone, gets the credit. A better and fairer advertising ecosystem is expected to distribute the credit to the channels that contributed to her final action. This is the so-called multi-touch attribution problem. In an embodiment of the DMP, different MTA models are incorporated as user defined functions (UDFs) into CQL. This way, CQL users have the freedom to feed an MTA algorithm with arbitrary input data.
  • In an embodiment of the invention, CQL as well as the data mining UDFs are exposed to external clients as a data service in the cloud and are configured such that the external clients may perform ad hoc analysis and obtain very unique insights on their own.
  • Real-Time Activation Runtime User Profile
  • For purposes of understanding herein, RUPs may refer to user profiles stored in profile stores for real-time applications. In an embodiment of the invention, as with AUPs, RUPs also have a nested data model and are updated incrementally and in real-time with new user events. Profile stores, as with other Not only SQL (NoSQL) systems, are high-performance, key-value stores for RUPs, with keys being user ids and values being RUPs. As important runtime components, profile stores are highly optimized to provide low-latency read/write RUP access, typically within a few milliseconds to support peak 1,000,000 queries per second across multiple, geographically distributed data centers.
  • A design of an embodiment of a profile store is inspired by G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels' disclosure entitled, “Dynamo: Amazon's highly available key-value store,” in SOSP, pages 205-220, 2007 and by Voldemort, which may be found at www.project-voldemort.com/voldemort. In an embodiment of the invention, a software layer is built on top of Berkeley DB (BDB) that uses consistent hashing to achieve sharding, replication, consistency, and fault tolerance. The embodiment of the profile store also employs flash drives because hard disks may not be fast enough for the purpose. RUPs are replicated locally in each data center, as well as globally between data centers, to achieve high availability and local low-latency access.
  • In an embodiment of the invention, to guarantee real-time synchronized RUPs in every data center, an infrastructure called the replication bus is built and employed that incrementally replicates user events across data centers and distributes such to profile stores to keep RUPs up-to-date. The replication bus is highly optimized to synchronize tens of billions of events daily between data centers with an average end-to-end service level agreement (end-to-end SLA) of within a few seconds.
  • Real-Time Processing Pipeline
  • It has been found that an important feature of a modern data management platform (DMP) is its ability to cope with data flow in real-time. An embodiment of the DMP herein disclosed is equipped with many real-time data processing components. In an embodiment of the invention, some real-time DMP components may consist of data, analytics, user modeling, complex event processing (CEP), and actionable signal generation.
  • In an embodiment of the invention, both AUP and RUP may store arbitrary user level data in a nested format. Ingress servers are responsible for receiving and storing data look up user profiles in real-time and performing mapping between cookies when necessary. The platform supports multiple types of data as impression/click events, structured data events, or arbitrary key-value pair data events. These data events are available in RUPs in real-time for the platform to use for algorithmic computation and decision making as well as analytics. Eventually such events from RUP are replicated to AUP.
  • In an embodiment of the invention, the platform supports multiple real-time operations on the received data. Many of such operations may be modeled as complex event processing. For example, one entity might want to find if a user belongs to a particular set of predefined segments in real-time. The segments are represented as a complex Boolean expression of attributes defined by some predefined taxonomy. Often the segments may be significantly more complicated than simple Boolean expressions, e.g. having some user behavior constraints such as having seen a display advertisement in the last seven days. Such complex segments may be represented by some form of executable code that is evaluated against the RUP data in real-time.
  • Another example use of real-time computation on a user profile in accordance with an embodiment of the invention involves evaluating a user against machine-learned models. Such models may be specified by the users of the DMP in some proprietary format or by using industry standard model specification language, such as Predictive Model Markup Language (PMML), an example of which may be found at en.wikipedia.org/wiki/predictive model markup language. An example model may predict a car buyer based on the latest online activity or a person likely to apply for a credit card. Having such knowledge in real-time may be immensely valuable to clients because they may use such prediction as signals to bias the campaigns or take other actions in real-time.
  • In an embodiment of the invention, in some cases, a computation for a particular algorithm may be significantly complex requiring multiple stages of a computation layer. Such style of computations may be simply thought of as a series of real-time MapReduce jobs processing the data step-by-step. The computation is represented by a continuous query language or by predefined operators using UDFs, which operate on RUPs in real-time. Such approach may solve complex tasks, such as learning a classification model, performing anomaly detection, or performing other data stream algorithms, such as maintaining top-K elements in a stream, as disclosed for example in A. Metwally, D. Agrawal, and A. El Abbadi. Efficient computation of frequent and top-k elements in data streams. In ICDT, pages 398-412, 2005.
  • In an embodiment of the invention, signals generated out of the computation layer are stored as unstructured data in RUPs and AUPs and may also be sent back to the clients through egress servers for immediate action. DSPs or other platforms may immediately leverage such signals for better user behavior prediction to achieve better campaign performance.
  • Conclusion
  • Digital advertising has now reached a state where the pipeline between publishers on the supply side and advertisers on the demand site necessitates many technology partners to help publishers and advertisers deal with real-time optimal decisioning on a huge scale. Among such technology partners, data management platforms may occupy a prominent role as the hub where data relevant to reaching the audience over different channels is integrated, analyzed, and shared. A high-level overview of one or more embodiments of the DMP as an example demand side platform has been disclosed. It is contemplated that due to efficiencies gained through real-time decisioning and the scales involved with more online usage, the future of advertising may be more real-time, which may imply more data and components in real-time.
  • Machine Implementation
  • FIG. 2 is a block schematic diagram of a system in the exemplary form of a computer system 3500 within which a set of instructions for causing the system to perform any one of the foregoing methodologies may be executed. In alternative embodiments, the system may comprise a network router, a network switch, a network bridge, personal digital assistant (PDA), a cellular telephone, a Web appliance or any system capable of executing a sequence of instructions that specify actions to be taken by that system.
  • The computer system 3500 includes a processor 3502, a main memory 3504 and a static memory 3506, which communicate with each other via a bus 3508. The computer system 3500 may further include a display unit 3510, for example, a liquid crystal display (LCD) or a cathode ray tube (CRT). The computer system 3500 also includes an alphanumeric input device 3512, for example, a keyboard; a cursor control device 3514, for example, a mouse; a disk drive unit 3516, a signal generation device 3518, for example, a speaker, and a network interface device 3520.
  • The disk drive unit 3516 includes a machine-readable medium 3524 on which is stored a set of executable instructions, i.e. software, 3526 embodying any one, or all, of the methodologies described herein below. The software 3526 is also shown to reside, completely or at least partially, within the main memory 3504 and/or within the processor 3502. The software 3526 may further be transmitted or received over a network 3528, 3530 by means of a network interface device 3520.
  • In contrast to the system 3500 discussed above, a different embodiment uses logic circuitry instead of computer-executed instructions to implement processing entities. Depending upon the particular requirements of the application in the areas of speed, expense, tooling costs, and the like, this logic may be implemented by constructing an application-specific integrated circuit (ASIC) having thousands of tiny integrated transistors. Such an ASIC may be implemented with CMOS (complementary metal oxide semiconductor), TTL (transistor-transistor logic), VLSI (very large systems integration), or another suitable construction. Other alternatives include a digital signal processing chip (DSP), discrete circuitry (such as resistors, capacitors, diodes, inductors, and transistors), field programmable gate array (FPGA), programmable logic array (PLA), programmable logic device (PLD), and the like.
  • It is to be understood that embodiments may be used as or to support software programs or software modules executed upon some form of processing core (such as the CPU of a computer) or otherwise implemented or realized upon or within a system or computer readable medium. A machine-readable medium includes any mechanism for storing or transmitting information in a form readable by a machine, e.g. a computer. For example, a machine readable medium includes read-only memory (ROM); random access memory (RAM); magnetic disk storage media; optical storage media; flash memory devices; electrical, optical, acoustical or other form of propagated signals, for example, carrier waves, infrared signals, digital signals, etc.; or any other type of media suitable for storing or transmitting information.
  • Further, it is to be understood that embodiments may include performing operations and using storage with cloud computing. For the purposes of discussion herein, cloud computing may mean executing algorithms on any network that is accessible by internet-enabled or network-enabled devices, servers, or clients and that do not require complex hardware configurations, e.g. requiring cables and complex software configurations, e.g. requiring a consultant to install. For example, embodiments may provide one or more cloud computing solutions that enable users, e.g. users on the go, to obtain advertising analytics or universal tag management in accordance with embodiments herein on such internet-enabled or other network-enabled devices, servers, or clients. It further should be appreciated that one or more cloud computing embodiments may include providing or obtaining advertising analytics or performing universal tag management using mobile devices, tablets, and the like, as such devices are becoming standard consumer devices.
  • Although the invention is described herein with reference to the preferred embodiment, one skilled in the art will readily appreciate that other applications may be substituted for those set forth herein without departing from the spirit and scope of the present invention. Accordingly, the invention should only be limited by the Claims included below.

Claims (18)

1. A data management apparatus for digital advertising, comprising:
a data integration processor configured for collecting and storing data from providers, resolving heterogeneity of the data at schema and data levels, and performing validity checks of the data;
an analytics processor configured for receiving validated data from the data integration processor and providing to users a custom, nesting-aware, SQL-like query language and a library of data mining methods, machine learning models, and analytical user profiles (AUP); and
an activation processor configured for encapsulating complex computations performed in real-time, segment evaluation, and online user classification using runtime user profiles (RUP).
2. The apparatus of claim 1, wherein user online activity data is sent from a demand-side platform (DSP) and wherein the user online activity data comprises any of impressions, clicks, and actions;
3. The apparatus of claim 1, wherein user online activity data is obtained from an internal DSP and external DSPs.
4. The apparatus of claim 1, wherein a user profile comprises a central data repository and the profile comprises any of: marketing campaign data, online behavior data, and customer relations management (CRM) data.
5. The apparatus of claim 1, wherein the data integration processor is configured to:
upload offline data files to an Hadoop File System (HDFS);
check the offline data files for data type or out of range errors; and
record lineage by joining the data files with the user profiles based on user id mappings.
6. The apparatus of claim 1, wherein the data integration processor is configured to use configuration files for instantiating different loading templates and a centralized catalog, supporting schema evolvement, to mitigate heterogeneity issues.
7. The apparatus of claim 1, wherein the data integration processor is configured to consistently save metrics comprising any of files that were received and stored, records processed, records rejected, last successful pipeline step, and profiling times in a database.
8. The apparatus of claim 1, wherein the analytics processor is configured to employ optimization techniques for AUP queries, said techniques comprising any of:
correlating sub-queries over nested tables;
organizing columns in the user profile in a PAX-like format to achieve better compression ratios; and
multi-query execution.
9. The apparatus of claim 1, wherein the analytics processor is configured to incorporate different multi-touch attribution (MTA) models as user defined functions (UDFs) into a Cheetah Query Language (CQL).
10. The apparatus of claim 1, wherein the analytics processor is configured to allow users to perform form their own ad-hoc analysis to obtain unique insights.
11. The apparatus of claim 2, wherein profile stores are high-performance key-value stores for RUPs, with keys being user ids and values being RUPs.
12. The apparatus of claim 1, wherein RUPs are replicated locally in each data center and globally between data centers to achieve high availability and local low-latency access.
13. The apparatus of claim 1, wherein the activation processor is configured to provide a replication bus that incrementally replicates user events across data centers and distributes the user events to profile stores to keep RUPs up-to-date.
14. The apparatus of claim 1, wherein the activation processor is configured to support multiple types of data as any of impression and click events, structured data events, and arbitrary key-value pair data events, wherein the data events are available in RUPs in real-time to use for any of algorithmic computation, decision making, and analytics, and wherein the data events from RUP are replicated to AUP.
15. The apparatus of claim 1, wherein the activation processor is configured to process complex segments, where complex segments are represented by executable code that is evaluated against the RUP data in real-time.
16. The apparatus of claim 1, wherein the activation processor is configured to perform a computation for a particular algorithm that is complex and requires multiple stages of a computation layer, comprising a series of real-time MapReduce jobs processing the data step-by-step, wherein the computation is represented by a continuous query language or by predefined operators using UDFs, which operate on RUPs in real-time.
17. The apparatus of claim 1, wherein the activation processor is configured to generate signals out of the computation layer, store the signals as unstructured data in RUPs and AUPs, and send back to clients for any of immediate action or for better user behavior prediction to achieve better campaign performance.
18. A computer implemented data management method for digital advertising, comprising:
collecting and storing, by a data integration processor, data from providers, resolving heterogeneity of the data at schema and data levels, and performing validity checks of the data;
receiving, by an analytics processor, validated data from the data integration processor and providing to users custom, nesting-aware, SQL-like query language and a library of data mining methods, machine learning models, and analytical user profiles (AUP); and
encapsulating, by an activation processor, complex computations performed in real-time, segment evaluation, and online user classification using runtime user profiles (RUP).
US13/924,343 2013-03-15 2013-06-21 Data management platform for digital advertising Abandoned US20140279074A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/924,343 US20140279074A1 (en) 2013-03-15 2013-06-21 Data management platform for digital advertising

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201361801001P 2013-03-15 2013-03-15
US13/924,343 US20140279074A1 (en) 2013-03-15 2013-06-21 Data management platform for digital advertising

Publications (1)

Publication Number Publication Date
US20140279074A1 true US20140279074A1 (en) 2014-09-18

Family

ID=51532086

Family Applications (4)

Application Number Title Priority Date Filing Date
US13/924,263 Active 2033-07-06 US9858600B2 (en) 2013-03-15 2013-06-21 Universal tag for page analytics and campaign creation
US13/924,343 Abandoned US20140279074A1 (en) 2013-03-15 2013-06-21 Data management platform for digital advertising
US13/924,319 Abandoned US20140279724A1 (en) 2013-03-15 2013-06-21 Taxonomy configuration for page analytics and campaign creation
US15/298,170 Active US10217139B2 (en) 2013-03-15 2016-10-19 On-page configuration of page analytics and campaign creation

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US13/924,263 Active 2033-07-06 US9858600B2 (en) 2013-03-15 2013-06-21 Universal tag for page analytics and campaign creation

Family Applications After (2)

Application Number Title Priority Date Filing Date
US13/924,319 Abandoned US20140279724A1 (en) 2013-03-15 2013-06-21 Taxonomy configuration for page analytics and campaign creation
US15/298,170 Active US10217139B2 (en) 2013-03-15 2016-10-19 On-page configuration of page analytics and campaign creation

Country Status (2)

Country Link
US (4) US9858600B2 (en)
WO (1) WO2014144014A1 (en)

Cited By (38)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140279045A1 (en) * 2013-03-15 2014-09-18 Turn Inc. Cross-domain id synchronization in online advertisement
US9237138B2 (en) 2013-12-31 2016-01-12 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
US9313294B2 (en) 2013-08-12 2016-04-12 The Nielsen Company (Us), Llc Methods and apparatus to de-duplicate impression information
US9497090B2 (en) 2011-03-18 2016-11-15 The Nielsen Company (Us), Llc Methods and apparatus to determine an adjustment factor for media impressions
US9519914B2 (en) 2013-04-30 2016-12-13 The Nielsen Company (Us), Llc Methods and apparatus to determine ratings information for online media presentations
US9596151B2 (en) 2010-09-22 2017-03-14 The Nielsen Company (Us), Llc. Methods and apparatus to determine impressions using distributed demographic information
US9697533B2 (en) 2013-04-17 2017-07-04 The Nielsen Company (Us), Llc Methods and apparatus to monitor media presentations
US9838754B2 (en) 2015-09-01 2017-12-05 The Nielsen Company (Us), Llc On-site measurement of over the top media
US9852163B2 (en) 2013-12-30 2017-12-26 The Nielsen Company (Us), Llc Methods and apparatus to de-duplicate impression information
US20180055636A1 (en) * 2016-08-29 2018-03-01 Francisco Valencia Methods of Steering and Delivery of Intravascular Devices
US9912482B2 (en) 2012-08-30 2018-03-06 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
CN107993097A (en) * 2017-05-16 2018-05-04 广州舜飞信息科技有限公司 A kind of marketing method for transformation of BiddingX platforms
TWI623900B (en) * 2016-07-14 2018-05-11 現觀科技股份有限公司 Bidding and campaign management method and system and token generating and campaign management server
US10027773B2 (en) 2012-06-11 2018-07-17 The Nielson Company (Us), Llc Methods and apparatus to share online media impressions data
US10045082B2 (en) 2015-07-02 2018-08-07 The Nielsen Company (Us), Llc Methods and apparatus to correct errors in audience measurements for media accessed using over-the-top devices
US10045057B2 (en) 2015-12-23 2018-08-07 The Nielsen Company (Us), Llc Methods and apparatus to generate audience measurement data from population sample data having incomplete demographic classifications
US10068246B2 (en) 2013-07-12 2018-09-04 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions
WO2018197732A1 (en) * 2017-04-25 2018-11-01 Izquierdo Domenech Alejandro Method for automatically making and delivering personalised videos with audio, using browsing information from each user or group of users
US10147114B2 (en) 2014-01-06 2018-12-04 The Nielsen Company (Us), Llc Methods and apparatus to correct audience measurement data
US20180373764A1 (en) * 2015-11-25 2018-12-27 Nec Corporation Information processing system, descriptor creation method, and descriptor creation program
WO2019018509A1 (en) * 2017-07-19 2019-01-24 Allstate Insurance Company Processing system having machine learning engine for providing customized user functions
US10205994B2 (en) 2015-12-17 2019-02-12 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions
US10270673B1 (en) 2016-01-27 2019-04-23 The Nielsen Company (Us), Llc Methods and apparatus for estimating total unique audiences
US10311464B2 (en) 2014-07-17 2019-06-04 The Nielsen Company (Us), Llc Methods and apparatus to determine impressions corresponding to market segments
US10332158B2 (en) 2015-09-24 2019-06-25 The Nielsen Company (Us), Llc Methods and apparatus to adjust media impressions based on media impression notification loss rates in network communications
US10380633B2 (en) 2015-07-02 2019-08-13 The Nielsen Company (Us), Llc Methods and apparatus to generate corrected online audience measurement data
US10628855B2 (en) * 2018-09-25 2020-04-21 Microsoft Technology Licensing, Llc Automatically merging multiple content item queues
CN111459646A (en) * 2020-05-09 2020-07-28 南京大学 Big data quality management task scheduling method based on pipeline model and task combination
US10803475B2 (en) 2014-03-13 2020-10-13 The Nielsen Company (Us), Llc Methods and apparatus to compensate for server-generated errors in database proprietor impression data due to misattribution and/or non-coverage
WO2021025726A1 (en) * 2019-08-02 2021-02-11 Roku Dx Holdings, Inc. Predictive platform for determining incremental lift
US10956947B2 (en) 2013-12-23 2021-03-23 The Nielsen Company (Us), Llc Methods and apparatus to measure media using media object characteristics
US10963907B2 (en) 2014-01-06 2021-03-30 The Nielsen Company (Us), Llc Methods and apparatus to correct misattributions of media impressions
US11042549B2 (en) * 2019-04-11 2021-06-22 Sas Institute Inc. Database server embedded process and code accelerator
US11514062B2 (en) 2017-10-05 2022-11-29 Dotdata, Inc. Feature value generation device, feature value generation method, and feature value generation program
US11562394B2 (en) 2014-08-29 2023-01-24 The Nielsen Company (Us), Llc Methods and apparatus to associate transactions with media impressions
US11715130B2 (en) 2021-12-13 2023-08-01 Fmr Llc Systems and methods for designing targeted marketing campaigns
US11727203B2 (en) 2017-03-30 2023-08-15 Dotdata, Inc. Information processing system, feature description method and feature description program
US11971922B2 (en) 2023-01-23 2024-04-30 The Nielsen Company (Us), Llc Methods and apparatus for estimating total unique audiences

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101585985B1 (en) * 2015-01-19 2016-01-15 경희대학교 산학협력단 Device of transmitting undiscriminating personal information and method of the same
US9807184B1 (en) * 2016-06-02 2017-10-31 Tealium Inc. Configuration of content site user interaction monitoring in data networks
USD821425S1 (en) * 2016-09-26 2018-06-26 General Electric Company Display screen or portion thereof with graphical user interface
US11222268B2 (en) * 2017-03-09 2022-01-11 Adobe Inc. Determining algorithmic multi-channel media attribution based on discrete-time survival modeling

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060026064A1 (en) * 2004-07-30 2006-02-02 Collins Robert J Platform for advertising data integration and aggregation
US20060190497A1 (en) * 2005-02-18 2006-08-24 International Business Machines Corporation Support for schema evolution in a multi-node peer-to-peer replication environment
US20080320467A1 (en) * 2007-06-21 2008-12-25 Karunakar Bojjireddy Generically Managing the Configuration of Heterogeneous Software Artifacts
US20120046996A1 (en) * 2010-08-17 2012-02-23 Vishal Shah Unified data management platform
US20130086116A1 (en) * 2011-10-04 2013-04-04 International Business Machines Corporation Declarative specification of data integraton workflows for execution on parallel processing platforms
US20140120864A1 (en) * 2012-03-29 2014-05-01 Velti Mobile Platforms Limited Cross-Channel User Tracking Systems, Methods and Devices

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020059099A1 (en) * 2000-06-26 2002-05-16 Coletta Craig J. Method and apparatus for collecting on-line consumer data and streaming advertisements in response to sweepstakes participation
US7181488B2 (en) * 2001-06-29 2007-02-20 Claria Corporation System, method and computer program product for presenting information to a user utilizing historical information about the user
US10510043B2 (en) * 2005-06-13 2019-12-17 Skyword Inc. Computer method and apparatus for targeting advertising
US8311888B2 (en) * 2005-09-14 2012-11-13 Jumptap, Inc. Revenue models associated with syndication of a behavioral profile using a monetization platform
CN101779180B (en) * 2007-08-08 2012-08-15 贝诺特公司 Method and apparatus for context-based content recommendation
US20090076887A1 (en) * 2007-09-16 2009-03-19 Nova Spivack System And Method Of Collecting Market-Related Data Via A Web-Based Networking Environment
US8631116B2 (en) * 2007-11-28 2014-01-14 Ccip Corp. System and method for active business configured website monitoring
US20090197616A1 (en) * 2008-02-01 2009-08-06 Lewis Robert C Critical mass billboard
IL191978A0 (en) * 2008-06-05 2009-02-11 Yuval Elovici Gesture avatar communication
US20100205024A1 (en) * 2008-10-29 2010-08-12 Haggai Shachar System and method for applying in-depth data mining tools for participating websites
EP2196922B1 (en) * 2008-12-10 2013-02-13 Sitecore A/S A method for collecting human experience analytics data
WO2011127049A1 (en) * 2010-04-07 2011-10-13 Liveperson, Inc. System and method for dynamically enabling customized web content and applications
US8560610B2 (en) 2010-06-16 2013-10-15 Brighttag Inc. Unified collection and distribution of data
US20120232985A1 (en) * 2011-03-07 2012-09-13 Pontilex, Inc. Advertising Using Mobile Devices
US8797920B2 (en) 2011-04-20 2014-08-05 IBT—Internet Business Technologies Methods and systems for access to real-time full-duplex web communications platforms
US9165308B2 (en) * 2011-09-20 2015-10-20 TagMan Inc. System and method for loading of web page assets
US20130339839A1 (en) * 2012-06-14 2013-12-19 Emre Yavuz Baran Analyzing User Interaction
US8959073B2 (en) * 2012-11-02 2015-02-17 Swiftype, Inc. Automatically modifying a custom search engine for a web site based on user input

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060026064A1 (en) * 2004-07-30 2006-02-02 Collins Robert J Platform for advertising data integration and aggregation
US20060190497A1 (en) * 2005-02-18 2006-08-24 International Business Machines Corporation Support for schema evolution in a multi-node peer-to-peer replication environment
US20080320467A1 (en) * 2007-06-21 2008-12-25 Karunakar Bojjireddy Generically Managing the Configuration of Heterogeneous Software Artifacts
US20120046996A1 (en) * 2010-08-17 2012-02-23 Vishal Shah Unified data management platform
US20130086116A1 (en) * 2011-10-04 2013-04-04 International Business Machines Corporation Declarative specification of data integraton workflows for execution on parallel processing platforms
US20140120864A1 (en) * 2012-03-29 2014-05-01 Velti Mobile Platforms Limited Cross-Channel User Tracking Systems, Methods and Devices

Cited By (94)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9596151B2 (en) 2010-09-22 2017-03-14 The Nielsen Company (Us), Llc. Methods and apparatus to determine impressions using distributed demographic information
US11682048B2 (en) 2010-09-22 2023-06-20 The Nielsen Company (Us), Llc Methods and apparatus to determine impressions using distributed demographic information
US10504157B2 (en) 2010-09-22 2019-12-10 The Nielsen Company (Us), Llc Methods and apparatus to determine impressions using distributed demographic information
US11144967B2 (en) 2010-09-22 2021-10-12 The Nielsen Company (Us), Llc Methods and apparatus to determine impressions using distributed demographic information
US9497090B2 (en) 2011-03-18 2016-11-15 The Nielsen Company (Us), Llc Methods and apparatus to determine an adjustment factor for media impressions
US10536543B2 (en) 2012-06-11 2020-01-14 The Nielsen Company (Us), Llc Methods and apparatus to share online media impressions data
US10027773B2 (en) 2012-06-11 2018-07-17 The Nielson Company (Us), Llc Methods and apparatus to share online media impressions data
US11356521B2 (en) 2012-06-11 2022-06-07 The Nielsen Company (Us), Llc Methods and apparatus to share online media impressions data
US10778440B2 (en) 2012-08-30 2020-09-15 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
US11870912B2 (en) 2012-08-30 2024-01-09 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
US11483160B2 (en) 2012-08-30 2022-10-25 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
US9912482B2 (en) 2012-08-30 2018-03-06 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
US10063378B2 (en) 2012-08-30 2018-08-28 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
US11792016B2 (en) 2012-08-30 2023-10-17 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
US20140279045A1 (en) * 2013-03-15 2014-09-18 Turn Inc. Cross-domain id synchronization in online advertisement
US10489805B2 (en) 2013-04-17 2019-11-26 The Nielsen Company (Us), Llc Methods and apparatus to monitor media presentations
US11282097B2 (en) 2013-04-17 2022-03-22 The Nielsen Company (Us), Llc Methods and apparatus to monitor media presentations
US9697533B2 (en) 2013-04-17 2017-07-04 The Nielsen Company (Us), Llc Methods and apparatus to monitor media presentations
US11687958B2 (en) 2013-04-17 2023-06-27 The Nielsen Company (Us), Llc Methods and apparatus to monitor media presentations
US11669849B2 (en) 2013-04-30 2023-06-06 The Nielsen Company (Us), Llc Methods and apparatus to determine ratings information for online media presentations
US11410189B2 (en) 2013-04-30 2022-08-09 The Nielsen Company (Us), Llc Methods and apparatus to determine ratings information for online media presentations
US10937044B2 (en) 2013-04-30 2021-03-02 The Nielsen Company (Us), Llc Methods and apparatus to determine ratings information for online media presentations
US10643229B2 (en) 2013-04-30 2020-05-05 The Nielsen Company (Us), Llc Methods and apparatus to determine ratings information for online media presentations
US9519914B2 (en) 2013-04-30 2016-12-13 The Nielsen Company (Us), Llc Methods and apparatus to determine ratings information for online media presentations
US10192228B2 (en) 2013-04-30 2019-01-29 The Nielsen Company (Us), Llc Methods and apparatus to determine ratings information for online media presentations
US10068246B2 (en) 2013-07-12 2018-09-04 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions
US11205191B2 (en) 2013-07-12 2021-12-21 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions
US11830028B2 (en) 2013-07-12 2023-11-28 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions
US10552864B2 (en) 2013-08-12 2020-02-04 The Nielsen Company (Us), Llc Methods and apparatus to de-duplicate impression information
US11651391B2 (en) 2013-08-12 2023-05-16 The Nielsen Company (Us), Llc Methods and apparatus to de-duplicate impression information
US11222356B2 (en) 2013-08-12 2022-01-11 The Nielsen Company (Us), Llc Methods and apparatus to de-duplicate impression information
US9928521B2 (en) 2013-08-12 2018-03-27 The Nielsen Company (Us), Llc Methods and apparatus to de-duplicate impression information
US9313294B2 (en) 2013-08-12 2016-04-12 The Nielsen Company (Us), Llc Methods and apparatus to de-duplicate impression information
US11854049B2 (en) 2013-12-23 2023-12-26 The Nielsen Company (Us), Llc Methods and apparatus to measure media using media object characteristics
US10956947B2 (en) 2013-12-23 2021-03-23 The Nielsen Company (Us), Llc Methods and apparatus to measure media using media object characteristics
US9852163B2 (en) 2013-12-30 2017-12-26 The Nielsen Company (Us), Llc Methods and apparatus to de-duplicate impression information
US10846430B2 (en) 2013-12-31 2020-11-24 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
US9641336B2 (en) 2013-12-31 2017-05-02 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
US9979544B2 (en) 2013-12-31 2018-05-22 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
US10498534B2 (en) 2013-12-31 2019-12-03 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
US11562098B2 (en) 2013-12-31 2023-01-24 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
US9237138B2 (en) 2013-12-31 2016-01-12 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions and search terms
US11068927B2 (en) 2014-01-06 2021-07-20 The Nielsen Company (Us), Llc Methods and apparatus to correct audience measurement data
US10147114B2 (en) 2014-01-06 2018-12-04 The Nielsen Company (Us), Llc Methods and apparatus to correct audience measurement data
US10963907B2 (en) 2014-01-06 2021-03-30 The Nielsen Company (Us), Llc Methods and apparatus to correct misattributions of media impressions
US11727432B2 (en) 2014-01-06 2023-08-15 The Nielsen Company (Us), Llc Methods and apparatus to correct audience measurement data
US10803475B2 (en) 2014-03-13 2020-10-13 The Nielsen Company (Us), Llc Methods and apparatus to compensate for server-generated errors in database proprietor impression data due to misattribution and/or non-coverage
US11568431B2 (en) 2014-03-13 2023-01-31 The Nielsen Company (Us), Llc Methods and apparatus to compensate for server-generated errors in database proprietor impression data due to misattribution and/or non-coverage
US11068928B2 (en) 2014-07-17 2021-07-20 The Nielsen Company (Us), Llc Methods and apparatus to determine impressions corresponding to market segments
US10311464B2 (en) 2014-07-17 2019-06-04 The Nielsen Company (Us), Llc Methods and apparatus to determine impressions corresponding to market segments
US11854041B2 (en) 2014-07-17 2023-12-26 The Nielsen Company (Us), Llc Methods and apparatus to determine impressions corresponding to market segments
US11562394B2 (en) 2014-08-29 2023-01-24 The Nielsen Company (Us), Llc Methods and apparatus to associate transactions with media impressions
US10368130B2 (en) 2015-07-02 2019-07-30 The Nielsen Company (Us), Llc Methods and apparatus to correct errors in audience measurements for media accessed using over the top devices
US10785537B2 (en) 2015-07-02 2020-09-22 The Nielsen Company (Us), Llc Methods and apparatus to correct errors in audience measurements for media accessed using over the top devices
US11645673B2 (en) 2015-07-02 2023-05-09 The Nielsen Company (Us), Llc Methods and apparatus to generate corrected online audience measurement data
US11259086B2 (en) 2015-07-02 2022-02-22 The Nielsen Company (Us), Llc Methods and apparatus to correct errors in audience measurements for media accessed using over the top devices
US10045082B2 (en) 2015-07-02 2018-08-07 The Nielsen Company (Us), Llc Methods and apparatus to correct errors in audience measurements for media accessed using over-the-top devices
US10380633B2 (en) 2015-07-02 2019-08-13 The Nielsen Company (Us), Llc Methods and apparatus to generate corrected online audience measurement data
US11706490B2 (en) 2015-07-02 2023-07-18 The Nielsen Company (Us), Llc Methods and apparatus to correct errors in audience measurements for media accessed using over-the-top devices
US9838754B2 (en) 2015-09-01 2017-12-05 The Nielsen Company (Us), Llc On-site measurement of over the top media
US10332158B2 (en) 2015-09-24 2019-06-25 The Nielsen Company (Us), Llc Methods and apparatus to adjust media impressions based on media impression notification loss rates in network communications
US11526914B2 (en) 2015-09-24 2022-12-13 The Nielsen Company (Us), Llc Methods and apparatus to adjust media impressions based on media impression notification loss rates in network communications
US11055752B2 (en) 2015-09-24 2021-07-06 The Nielsen Company (Us), Llc Methods and apparatus to adjust media impressions based on media impression notification loss rates in network communications
US10885011B2 (en) * 2015-11-25 2021-01-05 Dotdata, Inc. Information processing system, descriptor creation method, and descriptor creation program
US20180373764A1 (en) * 2015-11-25 2018-12-27 Nec Corporation Information processing system, descriptor creation method, and descriptor creation program
US11785293B2 (en) 2015-12-17 2023-10-10 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions
US11272249B2 (en) 2015-12-17 2022-03-08 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions
US10827217B2 (en) 2015-12-17 2020-11-03 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions
US10205994B2 (en) 2015-12-17 2019-02-12 The Nielsen Company (Us), Llc Methods and apparatus to collect distributed user information for media impressions
US11349999B2 (en) 2015-12-23 2022-05-31 The Nielsen Company (Us), Llc Methods and apparatus to generate audience measurement data from population sample data having incomplete demographic classifications
US11825015B2 (en) 2015-12-23 2023-11-21 The Nielsen Company (Us), Llc Methods and apparatus to generate audience measurement data from population sample data having incomplete demographic classifications
US10237419B2 (en) 2015-12-23 2019-03-19 The Nielsen Company (Us), Llc Method and apparatus to generate audience measurement data from population sample data having incomplete demographic classifications
US11102357B2 (en) 2015-12-23 2021-08-24 The Nielsen Company (Us), Llc Methods and apparatus to generate audience measurement data from population sample data having incomplete demographic classifications
US10045057B2 (en) 2015-12-23 2018-08-07 The Nielsen Company (Us), Llc Methods and apparatus to generate audience measurement data from population sample data having incomplete demographic classifications
US10694045B2 (en) 2015-12-23 2020-06-23 The Nielsen Company (Us), Llc Methods and apparatus to generate audience measurement data from population sample data having incomplete demographic classifications
US10270673B1 (en) 2016-01-27 2019-04-23 The Nielsen Company (Us), Llc Methods and apparatus for estimating total unique audiences
US10536358B2 (en) 2016-01-27 2020-01-14 The Nielsen Company (Us), Llc Methods and apparatus for estimating total unique audiences
US10979324B2 (en) 2016-01-27 2021-04-13 The Nielsen Company (Us), Llc Methods and apparatus for estimating total unique audiences
US11562015B2 (en) 2016-01-27 2023-01-24 The Nielsen Company (Us), Llc Methods and apparatus for estimating total unique audiences
US11232148B2 (en) 2016-01-27 2022-01-25 The Nielsen Company (Us), Llc Methods and apparatus for estimating total unique audiences
TWI623900B (en) * 2016-07-14 2018-05-11 現觀科技股份有限公司 Bidding and campaign management method and system and token generating and campaign management server
US20180055636A1 (en) * 2016-08-29 2018-03-01 Francisco Valencia Methods of Steering and Delivery of Intravascular Devices
US11727203B2 (en) 2017-03-30 2023-08-15 Dotdata, Inc. Information processing system, feature description method and feature description program
WO2018197732A1 (en) * 2017-04-25 2018-11-01 Izquierdo Domenech Alejandro Method for automatically making and delivering personalised videos with audio, using browsing information from each user or group of users
CN107993097A (en) * 2017-05-16 2018-05-04 广州舜飞信息科技有限公司 A kind of marketing method for transformation of BiddingX platforms
US11314798B2 (en) 2017-07-19 2022-04-26 Allstate Insurance Company Processing system having machine learning engine for providing customized user functions
WO2019018509A1 (en) * 2017-07-19 2019-01-24 Allstate Insurance Company Processing system having machine learning engine for providing customized user functions
US11514062B2 (en) 2017-10-05 2022-11-29 Dotdata, Inc. Feature value generation device, feature value generation method, and feature value generation program
US10628855B2 (en) * 2018-09-25 2020-04-21 Microsoft Technology Licensing, Llc Automatically merging multiple content item queues
US11042549B2 (en) * 2019-04-11 2021-06-22 Sas Institute Inc. Database server embedded process and code accelerator
WO2021025726A1 (en) * 2019-08-02 2021-02-11 Roku Dx Holdings, Inc. Predictive platform for determining incremental lift
CN111459646A (en) * 2020-05-09 2020-07-28 南京大学 Big data quality management task scheduling method based on pipeline model and task combination
US11715130B2 (en) 2021-12-13 2023-08-01 Fmr Llc Systems and methods for designing targeted marketing campaigns
US11971922B2 (en) 2023-01-23 2024-04-30 The Nielsen Company (Us), Llc Methods and apparatus for estimating total unique audiences

Also Published As

Publication number Publication date
US10217139B2 (en) 2019-02-26
WO2014144014A1 (en) 2014-09-18
US20170039600A1 (en) 2017-02-09
US20140278748A1 (en) 2014-09-18
US9858600B2 (en) 2018-01-02
US20140279724A1 (en) 2014-09-18

Similar Documents

Publication Publication Date Title
US20140279074A1 (en) Data management platform for digital advertising
KR102627690B1 (en) Dimensional context propagation techniques for optimizing SKB query plans
JP6887544B2 (en) Enriching events with dynamically typed big data for event processing
US10217256B2 (en) Visually exploring and analyzing event streams
US11848916B2 (en) Secure electronic messaging system
US10853847B2 (en) Methods and systems for near real-time lookalike audience expansion in ads targeting
US10120907B2 (en) Scaling event processing using distributed flows and map-reduce operations
US20210263906A1 (en) Recreating an oltp table and reapplying database transactions for real-time analytics
US20150213109A1 (en) System and method for providing big data analytics on dynamically-changing data models
AU2019236628B2 (en) Integrated entity view across distributed systems
US20160063072A1 (en) Systems, methods, and apparatuses for detecting activity patterns
US10089362B2 (en) Systems and/or methods for investigating event streams in complex event processing (CEP) applications
US10877971B2 (en) Logical queries in a distributed stream processing system
Elmeleegy et al. Overview of turn data management platform for digital advertising
US9965772B2 (en) System and method for unifying user-level data across different media platforms
US10409813B2 (en) Imputing data for temporal data store joins
US20180165349A1 (en) Generating and associating tracking events across entity lifecycles
US10572562B2 (en) Methods and systems for performing time-partitioned collaborative filtering
JP2020523655A (en) System and method for eliminating bias in media mix modeling
US20220114483A1 (en) Unified machine learning feature data pipeline
US20200104398A1 (en) Unified management of targeting attributes in a/b tests
US20240045859A1 (en) Executing aggregate computing operations in complex computing networks
Shankar et al. Recent Trends in Big Data Analytics and Role in Business Decision Making

Legal Events

Date Code Title Description
AS Assignment

Owner name: TURN INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:CHEN, SONGTING;DASDAN, ALI;ELMELEEGY, HAZEM;AND OTHERS;SIGNING DATES FROM 20140814 TO 20141028;REEL/FRAME:034168/0736

AS Assignment

Owner name: SILICON VALLEY BANK, AS ADMINISTRATIVE AGENT, CALI

Free format text: SECURITY AGREEMENT;ASSIGNOR:TURN INC.;REEL/FRAME:034484/0523

Effective date: 20141126

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION