DE69534942D1 - Method and apparatus for speaker recognition and verification

Info

Publication number: DE69534942D1
Authority: DE; Germany
Prior art keywords: cepstrum; speaker recognition; speech; improved; components
Prior art date: 1994-02-28
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.): Expired - Lifetime

Application number

DE69534942T

Other versions

DE69534942T2 (de)

Inventor

J Mammone

T Assaleh

Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

Rutgers State University of New Jersey

Original Assignee

Rutgers State University of New Jersey

Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)

1994-02-28

Filing date

1995-02-28

Publication date

2006-05-24

1995-02-28 Application filed by Rutgers State University of New Jersey

2006-05-24 Application granted

2006-05-24 Publication of DE69534942D1

2006-12-07 Publication of DE69534942T2

2015-03-01 Anticipated expiration

Status: Expired - Lifetime

Classifications

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/12—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum

DE69534942T 1994-02-28 1995-02-28 Speaker identification and verification system Expired - Lifetime DE69534942T2 (de)

Applications Claiming Priority (3)

Application Number	Priority Date	Filing Date	Title
US203988		1994-02-28
US08/203,988 US5522012A (en)	1994-02-28	1994-02-28	Speaker identification and verification system
PCT/US1995/002801 WO1995023408A1 (en)	1994-02-28	1995-02-28	Speaker identification and verification system

Publications (2)

Publication Number	Publication Date
DE69534942D1 (de)	2006-05-24
DE69534942T2 (de)	2006-12-07

Family

ID=22756137

Family Applications (1)

Application Number	Priority Date	Filing Date	Title
DE69534942T Expired - Lifetime DE69534942T2 (de)	1994-02-28	1995-02-28	Speaker identification and verification system

Country Status (9)

Country	Link
US (1)	US5522012A (de)
EP (1)	EP0748500B1 (de)
JP (1)	JPH10500781A (de)
CN (1)	CN1142274A (de)
AT (1)	ATE323933T1 (de)
AU (1)	AU683370B2 (de)
CA (1)	CA2184256A1 (de)
DE (1)	DE69534942T2 (de)
WO (1)	WO1995023408A1 (de)

Families Citing this family (50)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US5666466A (en) *	1994-12-27	1997-09-09	Rutgers, The State University Of New Jersey	Method and apparatus for speaker recognition using selected spectral information
JPH08211897A (ja) *	1995-02-07	1996-08-20	Toyota Motor Corp	Speech recognition device
US5839103A (en) *	1995-06-07	1998-11-17	Rutgers, The State University Of New Jersey	Speaker verification system using decision fusion logic
JP3397568B2 (ja) *	1996-03-25	2003-04-14	キヤノン株式会社	Speech recognition method and apparatus
FR2748343B1 (fr) *	1996-05-03	1998-07-24	Univ Paris Curie	Method for voice recognition of a speaker using a predictive model, in particular for access control applications
US6078664A (en) *	1996-12-20	2000-06-20	Moskowitz; Scott A.	Z-transform implementation of digital watermarks
US6038528A (en) *	1996-07-17	2000-03-14	T-Netix, Inc.	Robust speech processing with affine transform replicated data
SE515447C2 (sv) *	1996-07-25	2001-08-06	Telia Ab	Method and device for speech verification
US5946654A (en) *	1997-02-21	1999-08-31	Dragon Systems, Inc.	Speaker identification using unsupervised speech models
SE511418C2 (sv) *	1997-03-13	1999-09-27	Telia Ab	Method for speaker verification/identification via modelling of typical non-typical characteristics.
US5995924A (en) *	1997-05-05	1999-11-30	U.S. West, Inc.	Computer-based method and apparatus for classifying statement types based on intonation analysis
US6182037B1 (en) *	1997-05-06	2001-01-30	International Business Machines Corporation	Speaker recognition over large population with fast and detailed matches
US5940791A (en) *	1997-05-09	1999-08-17	Washington University	Method and apparatus for speech analysis and synthesis using lattice ladder notch filters
US7630895B2 (en) *	2000-01-21	2009-12-08	At&T Intellectual Property I, L.P.	Speaker verification method
US6076055A (en) *	1997-05-27	2000-06-13	Ameritech	Speaker verification method
US6192353B1 (en)	1998-02-09	2001-02-20	Motorola, Inc.	Multiresolutional classifier with training system and method
US6243695B1 (en) *	1998-03-18	2001-06-05	Motorola, Inc.	Access control system and method therefor
US6317710B1 (en) *	1998-08-13	2001-11-13	At&T Corp.	Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data
US6400310B1 (en) *	1998-10-22	2002-06-04	Washington University	Method and apparatus for a tunable high-resolution spectral estimator
US6684186B2 (en) *	1999-01-26	2004-01-27	International Business Machines Corporation	Speaker recognition using a hierarchical speaker model tree
CN1148720C (zh) *	1999-03-11	2004-05-05	英国电讯有限公司	Speaker recognition
US20030115047A1 (en) *	1999-06-04	2003-06-19	Telefonaktiebolaget Lm Ericsson (Publ)	Method and system for voice recognition in mobile communication systems
US6401063B1 (en) *	1999-11-09	2002-06-04	Nortel Networks Limited	Method and apparatus for use in speaker verification
US6901362B1 (en) *	2000-04-19	2005-05-31	Microsoft Corporation	Audio segmentation and classification
KR100366057B1 (ko) *	2000-06-26	2002-12-27	한국과학기술원	Efficient speech recognition apparatus using a human auditory model
US6754373B1 (en) *	2000-07-14	2004-06-22	International Business Machines Corporation	System and method for microphone activation using visual speech cues
US20040190688A1 (en) *	2003-03-31	2004-09-30	Timmins Timothy A.	Communications methods and systems using voiceprints
JP2002306492A (ja) *	2001-04-16	2002-10-22	Electronic Navigation Research Institute	Chaos-theoretic human factor evaluation apparatus
CN1236423C (zh) *	2001-05-10	2006-01-11	皇家菲利浦电子有限公司	Background learning of speaker voices
US20040158462A1 (en) *	2001-06-11	2004-08-12	Rutledge Glen J.	Pitch candidate selection method for multi-channel pitch detectors
US6898568B2 (en) *	2001-07-13	2005-05-24	Innomedia Pte Ltd	Speaker verification utilizing compressed audio formants
US20030149881A1 (en) *	2002-01-31	2003-08-07	Digital Security Inc.	Apparatus and method for securing information transmitted on computer networks
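KR100488121B1 (ko) *	2002-03-18	2005-05-06	정희석	Speaker verification apparatus and method applying per-speaker cepstrum weights to improve inter-speaker discrimination
JP3927559B2 (ja) *	2004-06-01	2007-06-13	東芝テック株式会社	Speaker recognition device, program, and speaker recognition method
CN1811911B (zh) *	2005-01-28	2010-06-23	北京捷通华声语音技术有限公司	Adaptive voice transformation processing method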
US7603275B2 (en) *	2005-10-31	2009-10-13	Hitachi, Ltd.	System, method and computer program product for verifying an identity using voiced to unvoiced classifiers
US7788101B2 (en) *	2005-10-31	2010-08-31	Hitachi, Ltd.	Adaptation method for inter-person biometrics variability
CN101051464A (zh) *	2006-04-06	2007-10-10	株式会社东芝	Enrollment and verification method and apparatus for speaker authentication
DE102007011831A1 (de) *	2007-03-12	2008-09-18	Voice.Trust Ag	Digital method and arrangement for authenticating a person
CN101303854B (zh) *	2007-05-10	2011-11-16	摩托罗拉移动公司	Method for providing recognized speech output
US8849432B2 (en) *	2007-05-31	2014-09-30	Adobe Systems Incorporated	Acoustic pattern identification using spectral characteristics to synchronize audio and/or video
CN101339765B (zh) *	2007-07-04	2011-04-13	黎自奋	A method for recognizing Mandarin monosyllables
CN101281746A (zh) *	2008-03-17	2008-10-08	黎自奋	A Mandarin monosyllable and sentence recognition method with a one hundred percent recognition rate
DE102009051508B4 (de) *	2009-10-30	2020-12-03	Continental Automotive Gmbh	Device, system and method for voice dialogue activation and guidance
EP2897076B8 (de) *	2014-01-17	2018-02-07	Cirrus Logic International Semiconductor Ltd.	Tamper-resistant element for use in speaker recognition
GB2552722A (en) *	2016-08-03	2018-02-07	Cirrus Logic Int Semiconductor Ltd	Speaker recognition
GB2552723A (en)	2016-08-03	2018-02-07	Cirrus Logic Int Semiconductor Ltd	Speaker recognition
JP6791258B2 (ja) *	2016-11-07	2020-11-25	ヤマハ株式会社	Speech synthesis method, speech synthesis device, and program
WO2018163279A1 (ja) *	2017-03-07	2018-09-13	日本電気株式会社	Speech processing device, speech processing method, and speech processing program
GB201801875D0 (en) *	2017-11-14	2018-03-21	Cirrus Logic Int Semiconductor Ltd	Audio processing

Family Cites Families (22)

* Cited by examiner, † Cited by third party
Publication number	Priority date	Publication date	Assignee	Title
US4058676A (en) *	1975-07-07	1977-11-15	International Communication Sciences	Speech analysis and synthesis system
JPS58129682A (ja) *	1982-01-29	1983-08-02	Toshiba Corp	Personal verification device
US5131043A (en) *	1983-09-05	1992-07-14	Matsushita Electric Industrial Co., Ltd.	Method of and apparatus for speech recognition wherein decisions are made based on phonemes
US4991216A (en) *	1983-09-22	1991-02-05	Matsushita Electric Industrial Co., Ltd.	Method for speech recognition
IT1160148B (it) *	1983-12-19	1987-03-04	Cselt Centro Studi Lab Telecom	Device for speaker verification
CA1229681A (en) *	1984-03-06	1987-11-24	Kazunori Ozawa	Method and apparatus for speech-band signal coding
US5146539A (en) *	1984-11-30	1992-09-08	Texas Instruments Incorporated	Method for utilizing formant frequencies in speech recognition
US4773093A (en) *	1984-12-31	1988-09-20	Itt Defense Communications	Text-independent speaker recognition system and method based on acoustic segment matching
US4922539A (en) *	1985-06-10	1990-05-01	Texas Instruments Incorporated	Method of encoding speech signals involving the extraction of speech formant candidates in real time
JPH0760318B2 (ja) *	1986-09-29	1995-06-28	株式会社東芝	Continuous speech recognition system
US4837830A (en) *	1987-01-16	1989-06-06	Itt Defense Communications, A Division Of Itt Corporation	Multiple parameter speaker recognition system and methods
US4926488A (en) *	1987-07-09	1990-05-15	International Business Machines Corporation	Normalization of speech by adaptive labelling
US5001761A (en) *	1988-02-09	1991-03-19	Nec Corporation	Device for normalizing a speech spectrum
US5048088A (en) *	1988-03-28	1991-09-10	Nec Corporation	Linear predictive speech analysis-synthesis apparatus
CN1013525B (zh) *	1988-11-16	1991-08-14	中国科学院声学研究所	Method and apparatus for real-time speaker-dependent and speaker-independent speech recognition
US5293448A (en) *	1989-10-02	1994-03-08	Nippon Telegraph And Telephone Corporation	Speech analysis-synthesis method and apparatus therefor
US5007094A (en) *	1989-04-07	1991-04-09	Gte Products Corporation	Multipulse excited pole-zero filtering approach for noise reduction
JPH02309820A (ja) *	1989-05-25	1990-12-25	Sony Corp	Digital signal processing device
US4975956A (en) *	1989-07-26	1990-12-04	Itt Corporation	Low-bit-rate speech coder using LPC data reduction processing
US5167004A (en) *	1991-02-28	1992-11-24	Texas Instruments Incorporated	Temporal decorrelation method for robust speaker verification
US5165008A (en) *	1991-09-18	1992-11-17	U S West Advanced Technologies, Inc.	Speech synthesis using perceptual linear prediction parameters
WO1993018505A1 (en) *	1992-03-02	1993-09-16	The Walt Disney Company	Voice transformation system

1994
- 1994-02-28 US: application US08/203,988, publication US5522012A, status Expired - Lifetime (not active)
1995
- 1995-02-28 AU: application AU21164/95A, publication AU683370B2, status Ceased (not active)
- 1995-02-28 EP: application EP95913980A, publication EP0748500B1, status Expired - Lifetime (not active)
- 1995-02-28 CN: application CN95191853.2A, publication CN1142274A, status Pending (active)
- 1995-02-28 JP: application JP7522534A, publication JPH10500781A, status Ceased (not active)
- 1995-02-28 WO: application PCT/US1995/002801, publication WO1995023408A1, status IP Right Grant (active)
- 1995-02-28 AT: application AT95913980T, publication ATE323933T1, status IP Right Cessation (not active)
- 1995-02-28 DE: application DE69534942T, publication DE69534942T2, status Expired - Lifetime (not active)
- 1995-02-28 CA: application CA002184256A, publication CA2184256A1, status Abandoned (not active)

Also Published As

Publication number	Publication date
EP0748500B1 (de)	2006-04-19
JPH10500781A (ja)	1998-01-20
DE69534942T2 (de)	2006-12-07
CN1142274A (zh)	1997-02-05
WO1995023408A1 (en)	1995-08-31
AU683370B2 (en)	1997-11-06
AU2116495A (en)	1995-09-11
ATE323933T1 (de)	2006-05-15
US5522012A (en)	1996-05-28
CA2184256A1 (en)	1995-08-31
EP0748500A1 (de)	1996-12-18
MX9603686A (es)	1997-12-31
EP0748500A4 (de)	1998-09-23

Legal Events

Date	Code	Title	Description
2007-05-31	8364	No opposition during term of opposition

Publication	Publication Date	Title
ATE323933T1 (de)	2006-05-15	Method and apparatus for speaker recognition and verification
Kim et al.	1999	Auditory processing of speech signals for robust speech recognition in real-world noisy environments
US7319959B1 (en)	2008-01-15	Multi-source phoneme classification for noise-robust automatic speech recognition
US20030061037A1 (en)	2003-03-27	Method and apparatus for identifying noise environments from noisy signals
Kingsbury	1998	Perceptually inspired signal processing strategies for robust speech recognition in reverberant environments
CN1238058A (zh)	1999-12-08	Speech processing system
CN111489763B (zh)	2023-06-20	GMM-based adaptive speaker recognition method for complex environments
US9269352B2 (en)	2016-02-23	Speech recognition with a plurality of microphones
Strand et al.	2004	Cepstral mean and variance normalization in the model domain
CN112116909A (zh)	2020-12-22	Speech recognition method, apparatus, and system
Bäckström et al.	2017	Voice activity detection
CN86100298A (zh)	1986-08-06	Speech recognition
Dai et al.	2013	An improved model of masking effects for robust speech recognition system
KR100741355B1 (ko)	2007-07-20	Preprocessing method using a perceptual weighting filter
Thomsen et al.	2015	Speech enhancement and noise-robust automatic speech recognition
JPH024920B2 (de)	1990-01-30
Gajic	2003	Auditory based methods for robust speech feature extraction
Christiansen et al.	1982	Noise reduction in speech using adaptive filtering I: Signal processing algorithms
Kitamura et al.	2007	Speaker individualities in speech spectral envelopes and fundamental frequency contours
JPS6148898A (ja)	1986-03-10	Voiced/unvoiced speech determination device
Power et al.	1996	Consistency among speech parameter vectors: Application to predicting speech intelligibility
Gerazov et al.	2012	Overview of Feature Selection for Automatic Speech Recognition
Shridhar et al.	1982	A unified approach to speaker verification with noisy speech inputs
Bonde et al.	2005	Noise robust automatic speech recognition with adaptive quantile based noise estimation and speech band emphasizing filter bank
Hermansky et al.	1997	Recent advances in addressing sources of non-linguistic information