WO2000077772A3 - Speech and voice signal preprocessing - Google Patents

Speech and voice signal preprocessing Download PDF

Info

Publication number
WO2000077772A3
WO2000077772A3 PCT/GB2000/002332 GB0002332W WO0077772A3 WO 2000077772 A3 WO2000077772 A3 WO 2000077772A3 GB 0002332 W GB0002332 W GB 0002332W WO 0077772 A3 WO0077772 A3 WO 0077772A3
Authority
WO
WIPO (PCT)
Prior art keywords
speech
voice
voice signal
signal preprocessing
signal
Prior art date
Application number
PCT/GB2000/002332
Other languages
French (fr)
Other versions
WO2000077772A2 (en
Inventor
Ronald Chalmers
Mark Christopher Simpson
Steven Leslie Pae
Original Assignee
Cyber Technology Iom Liminted
Ronald Chalmers
Mark Christopher Simpson
Steven Leslie Pae
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Cyber Technology Iom Liminted, Ronald Chalmers, Mark Christopher Simpson, Steven Leslie Pae filed Critical Cyber Technology Iom Liminted
Priority to GB0200735A priority Critical patent/GB2367938A/en
Priority to AU55471/00A priority patent/AU5547100A/en
Publication of WO2000077772A2 publication Critical patent/WO2000077772A2/en
Publication of WO2000077772A3 publication Critical patent/WO2000077772A3/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit

Abstract

In a system or method of voice or speech recognition, a voice waveform signal modeled as the product of a power component and an informational component is divided into higher and lower frequency signals, corresponding to the information signal and the power signal. The signals are amplified separately and then combined. By applying higher amplification to the information signal, a more detailed sample of the initial waveform can be provided to voice recognition or word recognition apparatus.
PCT/GB2000/002332 1999-06-14 2000-06-14 Speech and voice signal preprocessing WO2000077772A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
GB0200735A GB2367938A (en) 1999-06-14 2000-06-14 Speech and voice signal processing
AU55471/00A AU5547100A (en) 1999-06-14 2000-06-14 Speech and voice signal processing

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB9913773.9 1999-06-14
GBGB9913773.9A GB9913773D0 (en) 1999-06-14 1999-06-14 Speech signal processing

Publications (2)

Publication Number Publication Date
WO2000077772A2 WO2000077772A2 (en) 2000-12-21
WO2000077772A3 true WO2000077772A3 (en) 2002-10-10

Family

ID=10855289

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2000/002332 WO2000077772A2 (en) 1999-06-14 2000-06-14 Speech and voice signal preprocessing

Country Status (3)

Country Link
AU (1) AU5547100A (en)
GB (2) GB9913773D0 (en)
WO (1) WO2000077772A2 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9460722B2 (en) 2013-07-17 2016-10-04 Verint Systems Ltd. Blind diarization of recorded calls with arbitrary number of speakers
US9503571B2 (en) 2005-04-21 2016-11-22 Verint Americas Inc. Systems, methods, and media for determining fraud patterns and creating fraud behavioral models
US9571652B1 (en) 2005-04-21 2017-02-14 Verint Americas Inc. Enhanced diarization systems, media and methods of use

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8930261B2 (en) 2005-04-21 2015-01-06 Verint Americas Inc. Method and system for generating a fraud risk score using telephony channel based audio and non-audio data
US8903859B2 (en) 2005-04-21 2014-12-02 Verint Americas Inc. Systems, methods, and media for generating hierarchical fused risk scores
US9113001B2 (en) 2005-04-21 2015-08-18 Verint Americas Inc. Systems, methods, and media for disambiguating call data to determine fraud
US8924285B2 (en) 2005-04-21 2014-12-30 Verint Americas Inc. Building whitelists comprising voiceprints not associated with fraud and screening calls using a combination of a whitelist and blacklist
US8793131B2 (en) 2005-04-21 2014-07-29 Verint Americas Inc. Systems, methods, and media for determining fraud patterns and creating fraud behavioral models
RU2419890C1 (en) * 2009-09-24 2011-05-27 Общество с ограниченной ответственностью "Центр речевых технологий" Method of identifying speaker from arbitrary speech phonograms based on formant equalisation
US9368116B2 (en) 2012-09-07 2016-06-14 Verint Systems Ltd. Speaker separation in diarization
US10134400B2 (en) 2012-11-21 2018-11-20 Verint Systems Ltd. Diarization using acoustic labeling
US9984706B2 (en) 2013-08-01 2018-05-29 Verint Systems Ltd. Voice activity detection using a soft decision mechanism
US9875742B2 (en) 2015-01-26 2018-01-23 Verint Systems Ltd. Word-level blind diarization of recorded calls with arbitrary number of speakers
CN106683686A (en) * 2016-11-18 2017-05-17 祝洋 Examinee gender statistical equipment and statistical method of same
US11538128B2 (en) 2018-05-14 2022-12-27 Verint Americas Inc. User interface for fraud alert management
US10887452B2 (en) 2018-10-25 2021-01-05 Verint Americas Inc. System architecture for fraud detection
IL288671B1 (en) 2019-06-20 2024-02-01 Verint Americas Inc Systems and methods for authentication and fraud detection
US11868453B2 (en) 2019-11-07 2024-01-09 Verint Americas Inc. Systems and methods for customer authentication based on audio-of-interest

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4827516A (en) * 1985-10-16 1989-05-02 Toppan Printing Co., Ltd. Method of analyzing input speech and speech analysis apparatus therefor
EP0625775A1 (en) * 1993-05-18 1994-11-23 International Business Machines Corporation Speech recognition system with improved rejection of words and sounds not contained in the system vocabulary
US5495522A (en) * 1993-02-01 1996-02-27 Multilink, Inc. Method and apparatus for audio teleconferencing a plurality of phone channels
WO1998043237A1 (en) * 1997-03-25 1998-10-01 The Secretary Of State For Defence Recognition system
US5878392A (en) * 1991-04-12 1999-03-02 U.S. Philips Corporation Speech recognition using recursive time-domain high-pass filtering of spectral feature vectors

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4827516A (en) * 1985-10-16 1989-05-02 Toppan Printing Co., Ltd. Method of analyzing input speech and speech analysis apparatus therefor
US5878392A (en) * 1991-04-12 1999-03-02 U.S. Philips Corporation Speech recognition using recursive time-domain high-pass filtering of spectral feature vectors
US5495522A (en) * 1993-02-01 1996-02-27 Multilink, Inc. Method and apparatus for audio teleconferencing a plurality of phone channels
EP0625775A1 (en) * 1993-05-18 1994-11-23 International Business Machines Corporation Speech recognition system with improved rejection of words and sounds not contained in the system vocabulary
WO1998043237A1 (en) * 1997-03-25 1998-10-01 The Secretary Of State For Defence Recognition system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
OPPENHEIM A V, SCHAFER R W, STOCKHAM T G: "Nonlinear Filtering of Multiplied and Convolved Signals", PROCEEDINGS OF THE IEEE, no. 56, August 1968 (1968-08-01), pages 1264 - 1291, XP000946572, ISSN: 0165-1684 *
PINOLI J: "A general comparative study of the multiplicative homomorphic, log-ratio and logarithmic image processing approaches", SIGNAL PROCESSING. EUROPEAN JOURNAL DEVOTED TO THE METHODS AND APPLICATIONS OF SIGNAL PROCESSING,NL,ELSEVIER SCIENCE PUBLISHERS B.V. AMSTERDAM, vol. 58, no. 1, 1 April 1997 (1997-04-01), pages 11 - 45, XP004082677, ISSN: 0165-1684 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9503571B2 (en) 2005-04-21 2016-11-22 Verint Americas Inc. Systems, methods, and media for determining fraud patterns and creating fraud behavioral models
US9571652B1 (en) 2005-04-21 2017-02-14 Verint Americas Inc. Enhanced diarization systems, media and methods of use
US9460722B2 (en) 2013-07-17 2016-10-04 Verint Systems Ltd. Blind diarization of recorded calls with arbitrary number of speakers

Also Published As

Publication number Publication date
GB2367938A (en) 2002-04-17
GB9913773D0 (en) 1999-08-11
GB0200735D0 (en) 2002-02-27
AU5547100A (en) 2001-01-02
WO2000077772A2 (en) 2000-12-21

Similar Documents

Publication Publication Date Title
WO2000077772A3 (en) Speech and voice signal preprocessing
AU7750700A (en) Method and apparatus for the provision of information signals based upon speech recognition
EP0899719A3 (en) Method for aligning text with audio signals
AU1191899A (en) System and method for representing complex information auditorially
AU7339000A (en) A system, method, and article of manufacture for detecting emotion in voice signals through analysis of a plurality of voice signal parameters
CA2333137A1 (en) Multiple waveform software radio
TW200509065A (en) System and method for combined frequency-domain and time-domain pitch extraction for speech signals
AU1632100A (en) Method and apparatus for pitch tracking
WO2002070989A3 (en) Automated method for a takeoff estimate of construction drawings
AU2003254288A1 (en) Distributed speech recognition with back-end voice activity detection apparatus and method
ATE354849T1 (en) METHOD AND DEVICES FOR ANALYZING SIGNALS
CA2413658A1 (en) System and method of spoken language understanding in human computer dialogs
WO1998013754A3 (en) Method and apparatus for processing the output of a speech recognition engine
WO1999066496A8 (en) Intelligent text-to-speech synthesis
EP0773532A3 (en) Continuous speech recognition
EP0608833A3 (en) Method of and apparatus for performing time-scale modification of speech signals.
WO2000067626A3 (en) Oat extracts: refining, compositions and methods of use
EP1168306A3 (en) Method and apparatus for improving the intelligibility of digitally compressed speech
EP1130043A3 (en) Polyhydroxyalkanoate containing 3-hydroxythienylalkanoic acid as monomer unit and method for producing the same
EP1071073A3 (en) Dictionary organizing method for variable context speech synthesis
EP0998051A3 (en) Block size determination and adaptation method for audio transform coding
EP1013758A3 (en) Novel carbonyl reductase, method for producing said enzyme, DNA encoding said enzyme, and method for producing alcohol using said enzyme
WO2001031633A8 (en) Speech recognition
CA2426001A1 (en) Method and system for estimating artificial high band signal in speech codec
DE3275779D1 (en) Recognition of speech or speech-like sounds

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
ENP Entry into the national phase

Ref country code: GB

Ref document number: 200200735

Kind code of ref document: A

Format of ref document f/p: F

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
AK Designated states

Kind code of ref document: A3

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY CA CH CN CR CU CZ DE DK DM DZ EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT TZ UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A3

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

NENP Non-entry into the national phase

Ref country code: JP