WO2004036544A3 - Methods and apparatus for audio data analysis and data mining using speech recognition - Google Patents

Methods and apparatus for audio data analysis and data mining using speech recognition Download PDF

Info

Publication number
WO2004036544A3
WO2004036544A3 PCT/US2003/033042 US0333042W WO2004036544A3 WO 2004036544 A3 WO2004036544 A3 WO 2004036544A3 US 0333042 W US0333042 W US 0333042W WO 2004036544 A3 WO2004036544 A3 WO 2004036544A3
Authority
WO
WIPO (PCT)
Prior art keywords
methods
speech recognition
audio data
data analysis
provides
Prior art date
Application number
PCT/US2003/033042
Other languages
French (fr)
Other versions
WO2004036544A2 (en
Inventor
Robert Scarano
Lawrence Mark
Original Assignee
Ser Solutions Inc
Robert Scarano
Lawrence Mark
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ser Solutions Inc, Robert Scarano, Lawrence Mark filed Critical Ser Solutions Inc
Priority to AU2003301373A priority Critical patent/AU2003301373B9/en
Priority to CA2502543A priority patent/CA2502543C/en
Priority to EP03809142A priority patent/EP1554719A4/en
Publication of WO2004036544A2 publication Critical patent/WO2004036544A2/en
Publication of WO2004036544A3 publication Critical patent/WO2004036544A3/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/64Browsing; Visualisation therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/685Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using automatically derived transcript of audio data, e.g. lyrics
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/42221Conversation recording systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/50Centralised arrangements for answering calls; Centralised arrangements for recording messages for absent or busy subscribers ; Centralised arrangements for recording messages
    • H04M3/51Centralised call answering arrangements requiring operator intervention, e.g. call or contact centers for telemarketing

Abstract

The present invention provides an audio analysis intelligence tool that provides ad-hoc search capabilities using spoken words as an organized data form. The present invention provides an SQL like interface to process and search audio data and combine it with other traditional data forms.
PCT/US2003/033042 2002-10-18 2003-10-20 Methods and apparatus for audio data analysis and data mining using speech recognition WO2004036544A2 (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
AU2003301373A AU2003301373B9 (en) 2002-10-18 2003-10-20 Methods and apparatus for audio data analysis and data mining using speech recognition
CA2502543A CA2502543C (en) 2002-10-18 2003-10-20 Methods and apparatus for audio data analysis and data mining using speech recognition
EP03809142A EP1554719A4 (en) 2002-10-18 2003-10-20 Methods and apparatus for audio data analysis and data mining using speech recognition

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US41973802P 2002-10-18 2002-10-18
US60/419,738 2002-10-18

Publications (2)

Publication Number Publication Date
WO2004036544A2 WO2004036544A2 (en) 2004-04-29
WO2004036544A3 true WO2004036544A3 (en) 2004-07-15

Family

ID=32108133

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2003/033042 WO2004036544A2 (en) 2002-10-18 2003-10-20 Methods and apparatus for audio data analysis and data mining using speech recognition

Country Status (4)

Country Link
EP (1) EP1554719A4 (en)
AU (1) AU2003301373B9 (en)
CA (1) CA2502543C (en)
WO (1) WO2004036544A2 (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0833489A2 (en) * 1996-09-26 1998-04-01 Eyretel Limited Signal monitoring apparatus
US6185527B1 (en) * 1999-01-19 2001-02-06 International Business Machines Corporation System and method for automatic audio content analysis for word spotting, indexing, classification and retrieval
US20010040942A1 (en) * 1999-06-08 2001-11-15 Dictaphone Corporation System and method for recording and storing telephone call information
US6434520B1 (en) * 1999-04-16 2002-08-13 International Business Machines Corporation System and method for indexing and querying audio archives

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE69420096T2 (en) 1993-09-22 1999-12-09 Teknekron Infowitch Corp Telecommunication system monitoring
US6263049B1 (en) 1996-10-10 2001-07-17 Envision Telephony, Inc. Non-random call center supervisory method and apparatus
US6047060A (en) 1998-02-20 2000-04-04 Genesys Telecommunications Laboratories, Inc. Method and apparatus for enabling full interactive monitoring of calls to and from a call-in center
US6542602B1 (en) 2000-02-14 2003-04-01 Nice Systems Ltd. Telephone call monitoring system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0833489A2 (en) * 1996-09-26 1998-04-01 Eyretel Limited Signal monitoring apparatus
US6185527B1 (en) * 1999-01-19 2001-02-06 International Business Machines Corporation System and method for automatic audio content analysis for word spotting, indexing, classification and retrieval
US6434520B1 (en) * 1999-04-16 2002-08-13 International Business Machines Corporation System and method for indexing and querying audio archives
US20010040942A1 (en) * 1999-06-08 2001-11-15 Dictaphone Corporation System and method for recording and storing telephone call information

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
FRAKES ET AL.: "Information Retrieval Data Strucutres & Algorithms", 1992, PRENTICE-HALL, ISBN: 0-13-463837-9, pages: 264 - 268, XP002977283 *
See also references of EP1554719A4 *

Also Published As

Publication number Publication date
CA2502543A1 (en) 2004-04-29
CA2502543C (en) 2014-08-05
EP1554719A2 (en) 2005-07-20
AU2003301373A1 (en) 2004-05-04
AU2003301373B9 (en) 2009-07-23
EP1554719A4 (en) 2006-01-18
WO2004036544A2 (en) 2004-04-29
AU2003301373B2 (en) 2009-03-12

Similar Documents

Publication Publication Date Title
AU2002336458A1 (en) Methods, systems, and programming for performing speech recognition
HK1082315A1 (en) Method and device for gain quantization in variable bit rate wideband speech coding
GB0211398D0 (en) System and method for combining voice annotation and recognition search criteria with traditional search criteria into metadata
EP1546923A4 (en) Index structure of metadata, method for providing indices of metadata, and metadata searching method and apparatus using the indices of metadata
AU2003295628A1 (en) Method and apparatus for selective speech recognition
ATE410768T1 (en) SYSTEM AND METHOD FOR OPERATING A VOICE RECOGNITION SYSTEM IN A VEHICLE
EP1923866A4 (en) Sound source separating device, speech recognizing device, portable telephone, and sound source separating method, and program
AU2003295976A1 (en) Method and apparatus for selective distributed speech recognition
AU2003293119A1 (en) Method and apparatus for selective distributed speech recognition
WO2006070373A3 (en) A system and a method for representing unrecognized words in speech to text conversions as syllables
WO2007008943A3 (en) Optimized anti-ep-cam antibodies
WO2004034377A3 (en) Apparatus, methods and programming for speech synthesis via bit manipulations of compressed data base
AU2003298685A1 (en) Method and apparatus for displaying speech recognition results
AU2003285697A1 (en) Method and system for three-dimentional handwriting recognition
WO2005070019A3 (en) Contextual searching
WO2004095419A3 (en) System and method for text-to-speech processing in a portable device
WO2006122106A3 (en) Processing information from selected sources via a single website
WO2008108076A1 (en) Encoding device and encoding method
AU2002325930A1 (en) Method for automatic speech recognition
WO2007124178A3 (en) Methods for processing formatted data
AU2003217049A1 (en) Database searching method and system
WO2005015546A8 (en) Speech input interface for dialog systems
ATE377241T1 (en) METHOD FOR SPEECH RECOGNITION WITH AUTOMATIC CORRECTION
AU2003266397A1 (en) Method for the production of polyisobutene
WO2004036544A3 (en) Methods and apparatus for audio data analysis and data mining using speech recognition

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
WWE Wipo information: entry into national phase

Ref document number: 2003301373

Country of ref document: AU

WWE Wipo information: entry into national phase

Ref document number: 1-2005-500697

Country of ref document: PH

WWE Wipo information: entry into national phase

Ref document number: 2502543

Country of ref document: CA

Ref document number: 283/MUMNP/2005

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 2003809142

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2003809142

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: JP

WWW Wipo information: withdrawn in national office

Ref document number: JP