DE69534942D1 - Method and apparatus for speaker recognition and verification - Google Patents

Method and apparatus for speaker recognition and verification

Info

Publication number
DE69534942D1
Authority
DE
Germany
Prior art keywords
cepstrum
speaker recognition
speech
improved
components
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE69534942T
Other languages
English (en)
Other versions
DE69534942T2 (de)
Inventor
J Mammone
T Assaleh
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Rutgers State University of New Jersey
Original Assignee
Rutgers State University of New Jersey
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Rutgers State University of New Jersey
Application granted
Publication of DE69534942D1
Publication of DE69534942T2
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification
    • G10L17/02 Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/12 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/24 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
DE69534942T 1994-02-28 1995-02-28 System for speaker identification and verification Expired - Lifetime DE69534942T2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US203988 1994-02-28
US08/203,988 US5522012A (en) 1994-02-28 1994-02-28 Speaker identification and verification system
PCT/US1995/002801 WO1995023408A1 (en) 1994-02-28 1995-02-28 Speaker identification and verification system

Publications (2)

Publication Number Publication Date
DE69534942D1 (de) 2006-05-24
DE69534942T2 (de) 2006-12-07

Family

ID=22756137

Family Applications (1)

Application Number Title Priority Date Filing Date
DE69534942T Expired - Lifetime DE69534942T2 (de) 1994-02-28 1995-02-28 System for speaker identification and verification

Country Status (9)

Country Link
US (1) US5522012A (de)
EP (1) EP0748500B1 (de)
JP (1) JPH10500781A (de)
CN (1) CN1142274A (de)
AT (1) ATE323933T1 (de)
AU (1) AU683370B2 (de)
CA (1) CA2184256A1 (de)
DE (1) DE69534942T2 (de)
WO (1) WO1995023408A1 (de)

Families Citing this family (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5666466A (en) * 1994-12-27 1997-09-09 Rutgers, The State University Of New Jersey Method and apparatus for speaker recognition using selected spectral information
JPH08211897A (ja) * 1995-02-07 1996-08-20 Toyota Motor Corp Speech recognition device
US5839103A (en) * 1995-06-07 1998-11-17 Rutgers, The State University Of New Jersey Speaker verification system using decision fusion logic
JP3397568B2 (ja) * 1996-03-25 2003-04-14 Canon Inc. Speech recognition method and apparatus
FR2748343B1 (fr) * 1996-05-03 1998-07-24 Univ Paris Curie Method for voice recognition of a speaker using a predictive model, in particular for access control applications
US6078664A (en) * 1996-12-20 2000-06-20 Moskowitz; Scott A. Z-transform implementation of digital watermarks
US6038528A (en) * 1996-07-17 2000-03-14 T-Netix, Inc. Robust speech processing with affine transform replicated data
SE515447C2 (sv) * 1996-07-25 2001-08-06 Telia Ab Method and apparatus for speech verification
US5946654A (en) * 1997-02-21 1999-08-31 Dragon Systems, Inc. Speaker identification using unsupervised speech models
SE511418C2 (sv) * 1997-03-13 1999-09-27 Telia Ab Method for speaker verification/identification via modelling of typical non-typical characteristics.
US5995924A (en) * 1997-05-05 1999-11-30 U.S. West, Inc. Computer-based method and apparatus for classifying statement types based on intonation analysis
US6182037B1 (en) * 1997-05-06 2001-01-30 International Business Machines Corporation Speaker recognition over large population with fast and detailed matches
US5940791A (en) * 1997-05-09 1999-08-17 Washington University Method and apparatus for speech analysis and synthesis using lattice ladder notch filters
US7630895B2 (en) * 2000-01-21 2009-12-08 At&T Intellectual Property I, L.P. Speaker verification method
US6076055A (en) * 1997-05-27 2000-06-13 Ameritech Speaker verification method
US6192353B1 (en) 1998-02-09 2001-02-20 Motorola, Inc. Multiresolutional classifier with training system and method
US6243695B1 (en) * 1998-03-18 2001-06-05 Motorola, Inc. Access control system and method therefor
US6317710B1 (en) * 1998-08-13 2001-11-13 At&T Corp. Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data
US6400310B1 (en) * 1998-10-22 2002-06-04 Washington University Method and apparatus for a tunable high-resolution spectral estimator
US6684186B2 (en) * 1999-01-26 2004-01-27 International Business Machines Corporation Speaker recognition using a hierarchical speaker model tree
CN1148720C (zh) * 1999-03-11 2004-05-05 British Telecommunications plc Speaker recognition
US20030115047A1 (en) * 1999-06-04 2003-06-19 Telefonaktiebolaget Lm Ericsson (Publ) Method and system for voice recognition in mobile communication systems
US6401063B1 (en) * 1999-11-09 2002-06-04 Nortel Networks Limited Method and apparatus for use in speaker verification
US6901362B1 (en) * 2000-04-19 2005-05-31 Microsoft Corporation Audio segmentation and classification
KR100366057B1 (ko) * 2000-06-26 2002-12-27 Korea Advanced Institute of Science and Technology Efficient speech recognition apparatus using a human auditory model
US6754373B1 (en) * 2000-07-14 2004-06-22 International Business Machines Corporation System and method for microphone activation using visual speech cues
US20040190688A1 (en) * 2003-03-31 2004-09-30 Timmins Timothy A. Communications methods and systems using voiceprints
JP2002306492A (ja) * 2001-04-16 2002-10-22 Electronic Navigation Research Institute Chaos-theoretic human factor evaluation apparatus
CN1236423C (zh) * 2001-05-10 2006-01-11 Koninklijke Philips Electronics N.V. Background learning of speaker voices
US20040158462A1 (en) * 2001-06-11 2004-08-12 Rutledge Glen J. Pitch candidate selection method for multi-channel pitch detectors
US6898568B2 (en) * 2001-07-13 2005-05-24 Innomedia Pte Ltd Speaker verification utilizing compressed audio formants
US20030149881A1 (en) * 2002-01-31 2003-08-07 Digital Security Inc. Apparatus and method for securing information transmitted on computer networks
KR100488121B1 (ko) * 2002-03-18 2005-05-06 정희석 Speaker authentication apparatus applying individual cepstrum weights to improve inter-speaker discrimination, and method thereof
JP3927559B2 (ja) * 2004-06-01 2007-06-13 Toshiba Tec Corp. Speaker recognition device, program, and speaker recognition method
CN1811911B (zh) * 2005-01-28 2010-06-23 Beijing Jietong Huasheng Speech Technology Co., Ltd. Adaptive speech conversion processing method
US7603275B2 (en) * 2005-10-31 2009-10-13 Hitachi, Ltd. System, method and computer program product for verifying an identity using voiced to unvoiced classifiers
US7788101B2 (en) * 2005-10-31 2010-08-31 Hitachi, Ltd. Adaptation method for inter-person biometrics variability
CN101051464A (zh) * 2006-04-06 2007-10-10 Toshiba Corp. Enrollment and verification method and apparatus for speaker authentication
DE102007011831A1 (de) * 2007-03-12 2008-09-18 Voice.Trust Ag Digital method and arrangement for authenticating a person
CN101303854B (zh) * 2007-05-10 2011-11-16 Motorola Mobility Inc. Method for providing recognized speech output
US8849432B2 (en) * 2007-05-31 2014-09-30 Adobe Systems Incorporated Acoustic pattern identification using spectral characteristics to synchronize audio and/or video
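CN101339765B (zh) * 2007-07-04 2011-04-13 黎自奋 A method for recognizing Mandarin monosyllables
CN101281746A (zh) * 2008-03-17 2008-10-08 黎自奋 A Mandarin monosyllable and sentence recognition method with a 100% recognition rate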
DE102009051508B4 (de) * 2009-10-30 2020-12-03 Continental Automotive Gmbh Device, system and method for voice dialogue activation and guidance
EP2897076B8 (de) * 2014-01-17 2018-02-07 Cirrus Logic International Semiconductor Ltd. Tamper-resistant element for use in speaker recognition
GB2552722A (en) * 2016-08-03 2018-02-07 Cirrus Logic Int Semiconductor Ltd Speaker recognition
GB2552723A (en) 2016-08-03 2018-02-07 Cirrus Logic Int Semiconductor Ltd Speaker recognition
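JP6791258B2 (ja) * 2016-11-07 2020-11-25 Yamaha Corp. Speech synthesis method, speech synthesis device, and program
WO2018163279A1 (ja) * 2017-03-07 2018-09-13 NEC Corp. Speech processing device, speech processing method, and speech processing program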
GB201801875D0 (en) * 2017-11-14 2018-03-21 Cirrus Logic Int Semiconductor Ltd Audio processing

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4058676A (en) * 1975-07-07 1977-11-15 International Communication Sciences Speech analysis and synthesis system
JPS58129682A (ja) * 1982-01-29 1983-08-02 Toshiba Corp Personal verification device
US5131043A (en) * 1983-09-05 1992-07-14 Matsushita Electric Industrial Co., Ltd. Method of and apparatus for speech recognition wherein decisions are made based on phonemes
US4991216A (en) * 1983-09-22 1991-02-05 Matsushita Electric Industrial Co., Ltd. Method for speech recognition
IT1160148B (it) * 1983-12-19 1987-03-04 Cselt Centro Studi Lab Telecom Device for speaker verification
CA1229681A (en) * 1984-03-06 1987-11-24 Kazunori Ozawa Method and apparatus for speech-band signal coding
US5146539A (en) * 1984-11-30 1992-09-08 Texas Instruments Incorporated Method for utilizing formant frequencies in speech recognition
US4773093A (en) * 1984-12-31 1988-09-20 Itt Defense Communications Text-independent speaker recognition system and method based on acoustic segment matching
US4922539A (en) * 1985-06-10 1990-05-01 Texas Instruments Incorporated Method of encoding speech signals involving the extraction of speech formant candidates in real time
JPH0760318B2 (ja) * 1986-09-29 1995-06-28 Toshiba Corp. Continuous speech recognition system
US4837830A (en) * 1987-01-16 1989-06-06 Itt Defense Communications, A Division Of Itt Corporation Multiple parameter speaker recognition system and methods
US4926488A (en) * 1987-07-09 1990-05-15 International Business Machines Corporation Normalization of speech by adaptive labelling
US5001761A (en) * 1988-02-09 1991-03-19 Nec Corporation Device for normalizing a speech spectrum
US5048088A (en) * 1988-03-28 1991-09-10 Nec Corporation Linear predictive speech analysis-synthesis apparatus
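CN1013525B (zh) * 1988-11-16 1991-08-14 Institute of Acoustics, Chinese Academy of Sciences Method and apparatus for speaker-dependent and speaker-independent real-time speech recognition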
US5293448A (en) * 1989-10-02 1994-03-08 Nippon Telegraph And Telephone Corporation Speech analysis-synthesis method and apparatus therefor
US5007094A (en) * 1989-04-07 1991-04-09 Gte Products Corporation Multipulse excited pole-zero filtering approach for noise reduction
JPH02309820A (ja) * 1989-05-25 1990-12-25 Sony Corp Digital signal processing device
US4975956A (en) * 1989-07-26 1990-12-04 Itt Corporation Low-bit-rate speech coder using LPC data reduction processing
US5167004A (en) * 1991-02-28 1992-11-24 Texas Instruments Incorporated Temporal decorrelation method for robust speaker verification
US5165008A (en) * 1991-09-18 1992-11-17 U S West Advanced Technologies, Inc. Speech synthesis using perceptual linear prediction parameters
WO1993018505A1 (en) * 1992-03-02 1993-09-16 The Walt Disney Company Voice transformation system

Also Published As

Publication number Publication date
EP0748500B1 (de) 2006-04-19
JPH10500781A (ja) 1998-01-20
DE69534942T2 (de) 2006-12-07
CN1142274A (zh) 1997-02-05
WO1995023408A1 (en) 1995-08-31
AU683370B2 (en) 1997-11-06
AU2116495A (en) 1995-09-11
ATE323933T1 (de) 2006-05-15
US5522012A (en) 1996-05-28
CA2184256A1 (en) 1995-08-31
EP0748500A1 (de) 1996-12-18
MX9603686A (es) 1997-12-31
EP0748500A4 (de) 1998-09-23

Similar Documents

Publication Publication Date Title
ATE323933T1 (de) Method and apparatus for speaker recognition and verification
Kim et al. Auditory processing of speech signals for robust speech recognition in real-world noisy environments
US7319959B1 (en) Multi-source phoneme classification for noise-robust automatic speech recognition
US20030061037A1 (en) Method and apparatus for identifying noise environments from noisy signals
Kingsbury Perceptually inspired signal processing strategies for robust speech recognition in reverberant environments
CN1238058A (zh) Speech processing system
CN111489763B (zh) Adaptive speaker recognition method for complex environments based on a GMM model
US9269352B2 (en) Speech recognition with a plurality of microphones
Strand et al. Cepstral mean and variance normalization in the model domain
CN112116909A (zh) Speech recognition method, apparatus, and system
Bäckström et al. Voice activity detection
CN86100298A (zh) Speech recognition
Dai et al. An improved model of masking effects for robust speech recognition system
KR100741355B1 (ko) Preprocessing method using a perceptual weighting filter
Thomsen et al. Speech enhancement and noise-robust automatic speech recognition
JPH024920B2 (de)
Gajic Auditory based methods for robust speech feature extraction
Christiansen et al. Noise reduction in speech using adaptive filtering I: Signal processing algorithms
Kitamura et al. Speaker individualities in speech spectral envelopes and fundamental frequency contours
JPS6148898A (ja) Voiced/unvoiced speech determination device
Power et al. Consistency among speech parameter vectors: Application to predicting speech intelligibility
Gerazov et al. Overview of Feature Selection for Automatic Speech Recognition
Shridhar et al. A unified approach to speaker verification with noisy speech inputs
Bonde et al. Noise robust automatic speech recognition with adaptive quantile based noise estimation and speech band emphasizing filter bank
Hermansky et al. Recent advances in addressing sources of non-linguistic information

Legal Events

Date Code Title Description
8364 No opposition during term of opposition