DE60004507D1 - Schnelle gruppierung durch spärlich bestückte datensätze - Google Patents

Schnelle gruppierung durch spärlich bestückte datensätze

Info

Publication number
DE60004507D1
DE60004507D1 DE60004507T DE60004507T DE60004507D1 DE 60004507 D1 DE60004507 D1 DE 60004507D1 DE 60004507 T DE60004507 T DE 60004507T DE 60004507 T DE60004507 T DE 60004507T DE 60004507 D1 DE60004507 D1 DE 60004507D1
Authority
DE
Germany
Prior art keywords
attribute
sparkly
record
records
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60004507T
Other languages
English (en)
Other versions
DE60004507T2 (de
Inventor
Maxwell Chickering
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Application granted granted Critical
Publication of DE60004507D1 publication Critical patent/DE60004507D1/de
Publication of DE60004507T2 publication Critical patent/DE60004507T2/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99937Sorting
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99941Database schema or data structure
    • Y10S707/99942Manipulating data structure, e.g. compression, compaction, compilation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Databases & Information Systems (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Probability & Statistics with Applications (AREA)
  • Evolutionary Computation (AREA)
  • Complex Calculations (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Measurement And Recording Of Electrical Phenomena And Electrical Characteristics Of The Living Body (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Radar Systems Or Details Thereof (AREA)
  • Traffic Control Systems (AREA)
  • Spark Plugs (AREA)
DE60004507T 1999-04-23 2000-04-22 Schnelle gruppierung durch spärlich bestückte datensätze Expired - Lifetime DE60004507T2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US298600 1999-04-23
US09/298,600 US6556958B1 (en) 1999-04-23 1999-04-23 Fast clustering with sparse data
PCT/US2000/010769 WO2000065481A1 (en) 1999-04-23 2000-04-22 Fast clustering with sparse data

Publications (2)

Publication Number Publication Date
DE60004507D1 true DE60004507D1 (de) 2003-09-18
DE60004507T2 DE60004507T2 (de) 2004-02-26

Family

ID=23151219

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60004507T Expired - Lifetime DE60004507T2 (de) 1999-04-23 2000-04-22 Schnelle gruppierung durch spärlich bestückte datensätze

Country Status (6)

Country Link
US (1) US6556958B1 (de)
EP (1) EP1173816B1 (de)
AT (1) ATE247300T1 (de)
AU (1) AU4653100A (de)
DE (1) DE60004507T2 (de)
WO (1) WO2000065481A1 (de)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6735253B1 (en) 1997-05-16 2004-05-11 The Trustees Of Columbia University In The City Of New York Methods and architecture for indexing and editing compressed video over the world wide web
US7143434B1 (en) * 1998-11-06 2006-11-28 Seungyup Paek Video description system and method
US6941325B1 (en) * 1999-02-01 2005-09-06 The Trustees Of Columbia University Multimedia archive description scheme
US6745157B1 (en) * 2000-06-02 2004-06-01 Mitsubishi Electric Research Laboratories, Inc Super-node normalized belief propagation for probabilistic systems
US6895398B2 (en) * 2000-07-18 2005-05-17 Inferscape, Inc. Decision engine and method and applications thereof
US7155668B2 (en) * 2001-04-19 2006-12-26 International Business Machines Corporation Method and system for identifying relationships between text documents and structured variables pertaining to the text documents
US7644102B2 (en) * 2001-10-19 2010-01-05 Xerox Corporation Methods, systems, and articles of manufacture for soft hierarchical clustering of co-occurring objects
WO2003051031A2 (en) * 2001-12-06 2003-06-19 The Trustees Of Columbia University In The City Of New York Method and apparatus for planarization of a material by growing and removing a sacrificial film
US6970882B2 (en) * 2002-04-04 2005-11-29 International Business Machines Corporation Unified relational database model for data mining selected model scoring results, model training results where selection is based on metadata included in mining model control table
JP2005525011A (ja) * 2002-04-26 2005-08-18 ザ トラスティーズ オブ コロンビア ユニヴァーシティ イン ザ シティ オブ ニューヨーク ユーティリティ関数記述にもとづく最適なビデオ・トランスコーディング用の方法及びシステム
US7848909B2 (en) * 2004-01-14 2010-12-07 Sap Aktiengesellschaft Computing prediction results during an unbroken online interactive session
WO2006096612A2 (en) * 2005-03-04 2006-09-14 The Trustees Of Columbia University In The City Of New York System and method for motion estimation and mode decision for low-complexity h.264 decoder
US20070233651A1 (en) * 2006-03-31 2007-10-04 International Business Machines Corporation Online analytic processing in the presence of uncertainties
US9087335B2 (en) * 2006-09-29 2015-07-21 American Express Travel Related Services Company, Inc. Multidimensional personal behavioral tomography
US7465241B2 (en) * 2007-03-23 2008-12-16 Acushnet Company Functionalized, crosslinked, rubber nanoparticles for use in golf ball castable thermoset layers
US8055095B2 (en) * 2008-01-23 2011-11-08 Sparsense, Inc. Parallel and adaptive signal processing
WO2009126785A2 (en) * 2008-04-10 2009-10-15 The Trustees Of Columbia University In The City Of New York Systems and methods for image archaeology
WO2009155281A1 (en) * 2008-06-17 2009-12-23 The Trustees Of Columbia University In The City Of New York System and method for dynamically and interactively searching media data
US8671069B2 (en) 2008-12-22 2014-03-11 The Trustees Of Columbia University, In The City Of New York Rapid image annotation via brain state decoding and visual pattern mining
US8566270B2 (en) * 2010-09-24 2013-10-22 Nuance Communications, Inc. Sparse representations for text classification
US9026591B2 (en) 2011-02-28 2015-05-05 Avaya Inc. System and method for advanced communication thread analysis
US8565486B2 (en) * 2012-01-05 2013-10-22 Gentex Corporation Bayesian classifier system using a non-linear probability function and method thereof
BR112016022665A2 (pt) 2014-03-31 2017-08-15 Ingrain Inc Determinação de volume elementar representativo por meio de estatística baseada em agrupamento
US20150309987A1 (en) 2014-04-29 2015-10-29 Google Inc. Classification of Offensive Words

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5603022A (en) 1994-09-23 1997-02-11 The Regents Of The University Of Michigan Data compression system and method representing records as differences between sorted domain ordinals representing field values
GB2310055A (en) 1996-02-08 1997-08-13 Ibm Compression of structured data
US5704017A (en) 1996-02-16 1997-12-30 Microsoft Corporation Collaborative filtering utilizing a belief network
US5832182A (en) 1996-04-24 1998-11-03 Wisconsin Alumni Research Foundation Method and system for data clustering for very large databases
US5977889A (en) * 1997-06-05 1999-11-02 International Business Machines Corporation Optimization of data representations for transmission of storage using differences from reference data
US6032146A (en) * 1997-10-21 2000-02-29 International Business Machines Corporation Dimension reduction for data mining application
US6360224B1 (en) * 1999-04-23 2002-03-19 Microsoft Corporation Fast extraction of one-way and two-way counts from sparse data

Also Published As

Publication number Publication date
WO2000065481A1 (en) 2000-11-02
EP1173816A1 (de) 2002-01-23
DE60004507T2 (de) 2004-02-26
AU4653100A (en) 2000-11-10
US6556958B1 (en) 2003-04-29
ATE247300T1 (de) 2003-08-15
EP1173816B1 (de) 2003-08-13

Similar Documents

Publication Publication Date Title
DE60004507D1 (de) Schnelle gruppierung durch spärlich bestückte datensätze
WO2000065427A3 (en) Fast extraction of counts from sparse data
Smirnov Algorithm FIRE—Feynman integral reduction
US9069831B2 (en) Retrieving data objects
TW322562B (de)
KR102028708B1 (ko) 대용량 이벤트 파일에서 시간 관계를 병렬 탐사하기 위한 방법
TW202040387A (zh) 資料記錄的索引創建方法
CN104424256B (zh) 布隆过滤器生成方法和装置
Johnson et al. Entity-relationship-attribute designs and sketches
CN107423404A (zh) 流程实例数据同步处理方法和装置
Calzetta et al. Decoherence of correlation histories
Navathe et al. Active Database Modeling and Design Tools: Issues, Approache, and Architecture
Helmbold et al. Fast scheduling algorithms on parallel computers
CN101814064A (zh) 报表模板的创建方法、报表生成方法及报表系统
Torpey Semigroup congruences: computational techniques and theoretical applications
Ewetz et al. Cost-effective robustness in clock networks using near-tree structures
CN107506361A (zh) 栅格数据聚合方法和装置、栅格数据解耦方法和装置及系统
Aubert An in-between" implicit" and" explicit" complexity: Automata
Sethi et al. Efficient Algorithms for Mining Rare Itemset over Time Variant Transactional Database
CN111599014B (zh) 基于Unity3D的OBJ文件解析面渲染的方法、系统及介质
CN111339566B (zh) 区块摘要方法、装置、计算机设备和存储介质
Candan et al. Management and rendering of multimedia views
JP4025180B2 (ja) 文書管理装置
CN117348888A (zh) 低代码实现简单表单业务场景的方法
Hoag et al. Applications of Synthetic Data Generation

Legal Events

Date Code Title Description
8364 No opposition during term of opposition