WO2010033521A3 - Methods for enabling a scalable transformation of diverse data into models to drive the discovery of new knowledge

Info

Publication number
WO2010033521A3
Authority
WO
WIPO (PCT)
Prior art keywords
data
drive
models
discovery
enabling
Prior art date
Application number
PCT/US2009/057046
Other languages
French (fr)
Other versions
WO2010033521A2 (en)
Inventor
Akhileswar Ganesh Vaidyanathan
Stephen D. Prior
Jijun Wang
Bin Yu
Original Assignee
Quantum Leap Research, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2008-09-16
Filing date
2009-09-15
Publication date
2010-05-20
Application filed by Quantum Leap Research, Inc.
Publication of WO2010033521A2
Publication of WO2010033521A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/12 Computing arrangements based on biological models using genetic models
    • G06N3/126 Evolutionary algorithms, e.g. genetic algorithms or genetic programming
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/211 Selection of the most significant subset of features
    • G06F18/2115 Selection of the most significant subset of features by evaluating different subsets according to an optimisation criterion, e.g. class separability, forward selection or backward elimination
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/60 Type of objects
    • G06V20/69 Microscopic objects, e.g. biological cells or cellular parts
    • G06V20/698 Matching; Classification
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16H HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00 ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/50 ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for simulation or modelling of medical disorders
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00 Computing arrangements using knowledge-based models
    • G06N5/01 Dynamic search techniques; Heuristics; Dynamic trees; Branch-and-bound
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00 Computing arrangements based on specific mathematical models
    • G06N7/01 Probabilistic graphical models, e.g. probabilistic networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00 Indexing scheme relating to image or video recognition or understanding
    • G06V2201/03 Recognition of patterns in medical or anatomical images
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16H HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00 ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70 ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
    • G PHYSICS
    • G16 INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16H HEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H70/00 ICT specially adapted for the handling or processing of medical references
    • G16H70/60 ICT specially adapted for the handling or processing of medical references relating to pathologies

Abstract

The present invention relates to a method for the automatic identification of at least one informative data filter from a data set that can be used to identify at least one relevant data subset against a target feature for subsequent hypothesis generation, model building and model testing. The present invention describes methods, and an initial implementation, for efficiently linking relevant data both within and across multiple domains and identifying informative statistical relationships across this data that can be integrated into agent-based models. The relationships, encoded by the agents, can then drive emergent behavior across the global system that is described in the integrated data environment.
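
As a rough illustration only, and not the method claimed in the application, the idea of ranking candidate data filters against a target feature can be sketched in Python. Everything here is an assumption made for the example: filters are simple "column == value" tests, informativeness is measured by information gain (entropy reduction) on the target, and the function names and toy records are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy, in bits, of a list of discrete labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(rows, target, predicate):
    """Drop in target entropy when rows are split by a boolean filter."""
    if not rows:
        return 0.0
    inside = [r[target] for r in rows if predicate(r)]
    outside = [r[target] for r in rows if not predicate(r)]
    n = len(rows)
    weighted = (len(inside) / n) * entropy(inside) + (len(outside) / n) * entropy(outside)
    return entropy([r[target] for r in rows]) - weighted


def rank_filters(rows, target):
    """Score every hypothetical 'column == value' filter against the target."""
    candidates = {(col, val) for r in rows for col, val in r.items() if col != target}
    scored = []
    for col, val in candidates:
        gain = information_gain(rows, target, lambda r, c=col, v=val: r.get(c) == v)
        scored.append((gain, col, val))
    # Highest gain first; ties broken by name so the order is deterministic.
    scored.sort(key=lambda t: (-t[0], t[1], repr(t[2])))
    return scored


# Invented toy records, used only to exercise the functions above.
records = [
    {"smoker": "yes", "region": "north", "outcome": "sick"},
    {"smoker": "yes", "region": "south", "outcome": "sick"},
    {"smoker": "no", "region": "north", "outcome": "well"},
    {"smoker": "no", "region": "south", "outcome": "well"},
]

best_gain, best_col, best_val = rank_filters(records, "outcome")[0]
relevant_subset = [r for r in records if r[best_col] == best_val]
print(best_col, best_val, round(best_gain, 3), len(relevant_subset))
```

In this toy data the "smoker" column separates the outcomes completely while "region" carries no information, so a filter on "smoker" ranks first and the rows it selects form the subset that would be passed on to hypothesis generation or model building.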

Applications Claiming Priority (6)

Application Number Priority Date Filing Date Title
US9751208P 2008-09-16 2008-09-16
US61/097,512 2008-09-16
US21898609P 2009-06-21 2009-06-21
US61/218,986 2009-06-21
US12/556,591 2009-09-10
US12/556,591 US20120004893A1 (en) 2008-09-16 2009-09-10 Methods for Enabling a Scalable Transformation of Diverse Data into Hypotheses, Models and Dynamic Simulations to Drive the Discovery of New Knowledge

Publications (2)

Publication Number Publication Date
WO2010033521A2 (en) 2010-03-25
WO2010033521A3 (en) 2010-05-20

Family

ID=42040096

Family Applications (1)

Application Number Publication Number Priority Date Filing Date Title
PCT/US2009/057046 WO2010033521A2 (en) 2008-09-16 2009-09-15 Methods for enabling a scalable transformation of diverse data into hypotheses, models and dynamic simulations to drive the discovery of new knowledge

Country Status (2)

Country Link
US (1) US20120004893A1 (en)
WO (1) WO2010033521A2 (en)

Families Citing this family (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8874477B2 (en) 2005-10-04 2014-10-28 Steven Mark Hoffberg Multifactorial optimization system and method
US11562323B2 (en) * 2009-10-01 2023-01-24 DecisionQ Corporation Application of bayesian networks to patient screening and treatment
US20120041989A1 (en) * 2010-08-16 2012-02-16 Tata Consultancy Services Limited Generating assessment data
US8909685B2 (en) * 2011-12-16 2014-12-09 Sap Se Pattern recognition of a distribution function
US8880446B2 (en) * 2012-11-15 2014-11-04 Purepredictive, Inc. Predictive analytics factory
WO2014110167A2 (en) 2013-01-08 2014-07-17 Purepredictive, Inc. Integrated machine learning for a data management product
US9218574B2 (en) 2013-05-29 2015-12-22 Purepredictive, Inc. User interface for machine learning
US9646262B2 (en) 2013-06-17 2017-05-09 Purepredictive, Inc. Data intelligence using machine learning
US9874859B1 (en) * 2015-02-09 2018-01-23 Wells Fargo Bank, N.A. Framework for simulations of complex-adaptive systems
US10430716B2 (en) * 2016-02-10 2019-10-01 Ground Rounds, Inc. Data driven featurization and modeling
US10394929B2 (en) * 2016-12-20 2019-08-27 Mediatek, Inc. Adaptive execution engine for convolution computing systems
CA3055187A1 (en) * 2017-03-02 2018-09-07 The Johns Hopkins University Medical adverse event prediction, reporting, and prevention
US10762111B2 (en) 2017-09-25 2020-09-01 International Business Machines Corporation Automatic feature learning from a relational database for predictive modelling
US11177024B2 (en) * 2017-10-31 2021-11-16 International Business Machines Corporation Identifying and indexing discriminative features for disease progression in observational data
US11281995B2 (en) 2018-05-21 2022-03-22 International Business Machines Corporation Finding optimal surface for hierarchical classification task on an ontology
EP3857555A4 (en) * 2018-10-17 2022-12-21 Tempus Labs Data based cancer research and treatment systems and methods
US11455234B2 (en) * 2018-11-21 2022-09-27 Amazon Technologies, Inc. Robotics application development architecture
US11836577B2 (en) 2018-11-27 2023-12-05 Amazon Technologies, Inc. Reinforcement learning model training through simulation
US11429762B2 (en) 2018-11-27 2022-08-30 Amazon Technologies, Inc. Simulation orchestration for training reinforcement learning models
US10970272B2 (en) 2019-01-31 2021-04-06 Sap Se Data cloud—platform for data enrichment
US11676043B2 (en) 2019-03-04 2023-06-13 International Business Machines Corporation Optimizing hierarchical classification with adaptive node collapses
WO2020227383A1 (en) 2019-05-09 2020-11-12 Aspen Technology, Inc. Combining machine learning with domain knowledge and first principles for modeling in the process industries
US11705226B2 (en) * 2019-09-19 2023-07-18 Tempus Labs, Inc. Data based cancer research and treatment systems and methods
CN110569543B (en) * 2019-08-02 2023-08-15 中国船舶工业系统工程研究院 Complex system self-adaption method and system supporting mapping dimension lifting
US11782401B2 (en) 2019-08-02 2023-10-10 Aspentech Corporation Apparatus and methods to build deep learning controller using non-invasive closed loop exploration
WO2021076760A1 (en) 2019-10-18 2021-04-22 Aspen Technology, Inc. System and methods for automated model development from plant historical data for advanced process control
EP4127955A4 (en) * 2020-04-03 2024-03-27 Insurance Services Office Inc Systems and methods for computer modeling using incomplete data
US20220215243A1 (en) * 2021-01-05 2022-07-07 Capital One Services, Llc Risk-Reliability Framework for Evaluating Synthetic Data Models
CN112783005B (en) * 2021-01-07 2022-05-17 北京航空航天大学 System theoretical process analysis method based on simulation
US11630446B2 (en) * 2021-02-16 2023-04-18 Aspentech Corporation Reluctant first principles models
EP4075282A1 (en) * 2021-04-16 2022-10-19 Siemens Aktiengesellschaft Automated verification of a test model for a plurality of defined bdd test scenarios
CN116418828B (en) * 2021-12-28 2023-11-14 北京领航智联物联网科技有限公司 Video and audio equipment integrated management method based on artificial intelligence
CN115631326B (en) * 2022-08-15 2023-10-31 无锡东如科技有限公司 Knowledge-driven 3D visual detection method for intelligent robot
CN117634502A (en) * 2024-01-26 2024-03-01 中国农业科学院农业信息研究所 Technical opportunity identification method, device, computer equipment and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040088116A1 (en) * 2002-11-04 2004-05-06 Gene Network Sciences, Inc. Methods and systems for creating and using comprehensive and data-driven simulations of biological systems for pharmacological and industrial applications
US20060167784A1 (en) * 2004-09-10 2006-07-27 Hoffberg Steven M Game theoretic prioritization scheme for mobile ad hoc networks permitting hierarchal deference
US20070053513A1 (en) * 1999-10-05 2007-03-08 Hoffberg Steven M Intelligent electronic appliance system and method
US20070087756A1 (en) * 2005-10-04 2007-04-19 Hoffberg Steven M Multifactorial optimization system and method
US20070287473A1 (en) * 1998-11-24 2007-12-13 Tracbeam Llc Platform and applications for wireless location and other complex services
US20080077375A1 (en) * 2003-08-22 2008-03-27 Fernandez Dennis S Integrated Biosensor and Simulation System for Diagnosis and Therapy
US20080091471A1 (en) * 2005-10-18 2008-04-17 Bioveris Corporation Systems and methods for obtaining, storing, processing and utilizing immunologic and other information of individuals and populations

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5809499A (en) * 1995-10-20 1998-09-15 Pattern Discovery Software Systems, Ltd. Computational method for discovering patterns in data sets
US7475048B2 (en) * 1998-05-01 2009-01-06 Health Discovery Corporation Pre-processed feature ranking for a support vector machine
US7444308B2 (en) * 2001-06-15 2008-10-28 Health Discovery Corporation Data mining platform for bioinformatics and other knowledge discovery
US6774917B1 (en) * 1999-03-11 2004-08-10 Fuji Xerox Co., Ltd. Methods and apparatuses for interactive similarity searching, retrieval, and browsing of video
US7007001B2 (en) * 2002-06-26 2006-02-28 Microsoft Corporation Maximizing mutual information between observations and hidden states to minimize classification errors
US20070214133A1 (en) * 2004-06-23 2007-09-13 Edo Liberty Methods for filtering data and filling in missing data using nonlinear inference
US20060217925A1 (en) * 2005-03-23 2006-09-28 Taron Maxime G Methods for entity identification
US20070130206A1 (en) * 2005-08-05 2007-06-07 Siemens Corporate Research Inc System and Method For Integrating Heterogeneous Biomedical Information

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
TISSEAU.: "Virtual Reality - in virtuo autonomy", THESIS, UNIVERSITY OF RENNES, 6 December 2001 (2001-12-06), Retrieved from the Internet <URL:http://www.enib.fr/-tisseau/doc/hdr/hdrJTuk.pdf> [retrieved on 20100318] *

Also Published As

Publication number Publication date
US20120004893A1 (en) 2012-01-05
WO2010033521A2 (en) 2010-03-25

Similar Documents

Publication Publication Date Title
WO2010033521A3 (en) Methods for enabling a scalable transformation of diverse data into models to drive the discovery of new knowledge
WO2010011918A3 (en) Methods for prognosing mechanical systems
WO2008033394A3 (en) Complexity management tool
WO2010047794A3 (en) Environmental data collection
WO2007143405A3 (en) Automatic fault classification for model-based process monitoring
WO2011031328A3 (en) Systems and methods for management of projects for development of embedded systems
WO2007117592A3 (en) System and method for managing product information
ATE450830T1 (en) SENSOR FAULT DIAGNOSIS AND PREDICTION USING A COMPONENT MODEL AND TIMESCALE ORTHOGONAL DEVELOPMENTS
WO2010006303A3 (en) Methods and apparatus related to management of experiments
TW200951652A (en) Autonomous adaptive semiconductor manufacturing
WO2006023744A3 (en) Methods and apparatus for local outlier detection
WO2011091116A3 (en) Automated agent for social media systems
WO2009002949A3 (en) System, method and apparatus for predictive modeling of specially distributed data for location based commercial services
WO2008060620A3 (en) Systems and methods for modeling and analyzing networks
WO2007099058A3 (en) Software testing automation framework
WO2007106786A3 (en) Methods and systems for multi-credit reporting agency data modeling
WO2010051164A8 (en) System and method for stereo-view multiple animal behavior characterization
GB2445305A (en) Method and system for integrated asset management utilizing multi-level modeling of oil field assets
WO2012124953A3 (en) Refrigerator, and apparatus and method for refrigerator diagnosis
WO2010073171A3 (en) A system for using three-dimensional models to enable image comparisons independent of image source
WO2007143272A3 (en) Artificial intelligence analyzer and generator
TW200636411A (en) Automated throughput control system and method of operating the same
WO2007100969A3 (en) Apparatus and method for selecting a subset of report templates based on specified criteria
WO2010038063A3 (en) Assisting with updating a model for diagnosing failures in a system
WO2010043706A3 (en) Method for the deterministic execution and synchronisation of an information processing system comprising a plurality of processing cores executing system tasks

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application
    Ref document number: 09815070
    Country of ref document: EP
    Kind code of ref document: A2

NENP Non-entry into the national phase
    Ref country code: DE

122 EP: PCT application non-entry in European phase
    Ref document number: 09815070
    Country of ref document: EP
    Kind code of ref document: A2