DE69534942D1 - Verfahren und vorrichtung zur sprechererkennung und -verifizierung - Google Patents
Verfahren und vorrichtung zur sprechererkennung und -verifizierungInfo
- Publication number
- DE69534942D1 DE69534942D1 DE69534942T DE69534942T DE69534942D1 DE 69534942 D1 DE69534942 D1 DE 69534942D1 DE 69534942 T DE69534942 T DE 69534942T DE 69534942 T DE69534942 T DE 69534942T DE 69534942 D1 DE69534942 D1 DE 69534942D1
- Authority
- DE
- Germany
- Prior art keywords
- cepstrum
- speaker recognition
- speech
- improved
- components
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/02—Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/12—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/18—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/24—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being the cepstrum
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US203988 | 1994-02-28 | ||
US08/203,988 US5522012A (en) | 1994-02-28 | 1994-02-28 | Speaker identification and verification system |
PCT/US1995/002801 WO1995023408A1 (en) | 1994-02-28 | 1995-02-28 | Speaker identification and verification system |
Publications (2)
Publication Number | Publication Date |
---|---|
DE69534942D1 true DE69534942D1 (de) | 2006-05-24 |
DE69534942T2 DE69534942T2 (de) | 2006-12-07 |
Family
ID=22756137
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE69534942T Expired - Lifetime DE69534942T2 (de) | 1994-02-28 | 1995-02-28 | System zur sprecher-identifizierung und-überprüfung |
Country Status (9)
Country | Link |
---|---|
US (1) | US5522012A (de) |
EP (1) | EP0748500B1 (de) |
JP (1) | JPH10500781A (de) |
CN (1) | CN1142274A (de) |
AT (1) | ATE323933T1 (de) |
AU (1) | AU683370B2 (de) |
CA (1) | CA2184256A1 (de) |
DE (1) | DE69534942T2 (de) |
WO (1) | WO1995023408A1 (de) |
Families Citing this family (50)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5666466A (en) * | 1994-12-27 | 1997-09-09 | Rutgers, The State University Of New Jersey | Method and apparatus for speaker recognition using selected spectral information |
JPH08211897A (ja) * | 1995-02-07 | 1996-08-20 | Toyota Motor Corp | 音声認識装置 |
US5839103A (en) * | 1995-06-07 | 1998-11-17 | Rutgers, The State University Of New Jersey | Speaker verification system using decision fusion logic |
JP3397568B2 (ja) * | 1996-03-25 | 2003-04-14 | キヤノン株式会社 | 音声認識方法及び装置 |
FR2748343B1 (fr) * | 1996-05-03 | 1998-07-24 | Univ Paris Curie | Procede de reconnaissance vocale d'un locuteur mettant en oeuvre un modele predictif, notamment pour des applications de controle d'acces |
US6078664A (en) * | 1996-12-20 | 2000-06-20 | Moskowitz; Scott A. | Z-transform implementation of digital watermarks |
US6038528A (en) * | 1996-07-17 | 2000-03-14 | T-Netix, Inc. | Robust speech processing with affine transform replicated data |
SE515447C2 (sv) * | 1996-07-25 | 2001-08-06 | Telia Ab | Metod och anordning för talverifiering |
US5946654A (en) * | 1997-02-21 | 1999-08-31 | Dragon Systems, Inc. | Speaker identification using unsupervised speech models |
SE511418C2 (sv) * | 1997-03-13 | 1999-09-27 | Telia Ab | Metod för talarverifiering/identifiering via modellering av typiska icke-typiska egenskaper. |
US5995924A (en) * | 1997-05-05 | 1999-11-30 | U.S. West, Inc. | Computer-based method and apparatus for classifying statement types based on intonation analysis |
US6182037B1 (en) * | 1997-05-06 | 2001-01-30 | International Business Machines Corporation | Speaker recognition over large population with fast and detailed matches |
US5940791A (en) * | 1997-05-09 | 1999-08-17 | Washington University | Method and apparatus for speech analysis and synthesis using lattice ladder notch filters |
US7630895B2 (en) * | 2000-01-21 | 2009-12-08 | At&T Intellectual Property I, L.P. | Speaker verification method |
US6076055A (en) * | 1997-05-27 | 2000-06-13 | Ameritech | Speaker verification method |
US6192353B1 (en) | 1998-02-09 | 2001-02-20 | Motorola, Inc. | Multiresolutional classifier with training system and method |
US6243695B1 (en) * | 1998-03-18 | 2001-06-05 | Motorola, Inc. | Access control system and method therefor |
US6317710B1 (en) * | 1998-08-13 | 2001-11-13 | At&T Corp. | Multimedia search apparatus and method for searching multimedia content using speaker detection by audio data |
US6400310B1 (en) * | 1998-10-22 | 2002-06-04 | Washington University | Method and apparatus for a tunable high-resolution spectral estimator |
US6684186B2 (en) * | 1999-01-26 | 2004-01-27 | International Business Machines Corporation | Speaker recognition using a hierarchical speaker model tree |
CN1148720C (zh) * | 1999-03-11 | 2004-05-05 | 英国电讯有限公司 | 说话者识别 |
US20030115047A1 (en) * | 1999-06-04 | 2003-06-19 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and system for voice recognition in mobile communication systems |
US6401063B1 (en) * | 1999-11-09 | 2002-06-04 | Nortel Networks Limited | Method and apparatus for use in speaker verification |
US6901362B1 (en) * | 2000-04-19 | 2005-05-31 | Microsoft Corporation | Audio segmentation and classification |
KR100366057B1 (ko) * | 2000-06-26 | 2002-12-27 | 한국과학기술원 | 인간 청각 모델을 이용한 효율적인 음성인식 장치 |
US6754373B1 (en) * | 2000-07-14 | 2004-06-22 | International Business Machines Corporation | System and method for microphone activation using visual speech cues |
US20040190688A1 (en) * | 2003-03-31 | 2004-09-30 | Timmins Timothy A. | Communications methods and systems using voiceprints |
JP2002306492A (ja) * | 2001-04-16 | 2002-10-22 | Electronic Navigation Research Institute | カオス論的ヒューマンファクタ評価装置 |
CN1236423C (zh) * | 2001-05-10 | 2006-01-11 | 皇家菲利浦电子有限公司 | 说话人声音的后台学习 |
US20040158462A1 (en) * | 2001-06-11 | 2004-08-12 | Rutledge Glen J. | Pitch candidate selection method for multi-channel pitch detectors |
US6898568B2 (en) * | 2001-07-13 | 2005-05-24 | Innomedia Pte Ltd | Speaker verification utilizing compressed audio formants |
US20030149881A1 (en) * | 2002-01-31 | 2003-08-07 | Digital Security Inc. | Apparatus and method for securing information transmitted on computer networks |
KR100488121B1 (ko) * | 2002-03-18 | 2005-05-06 | 정희석 | 화자간 변별력 향상을 위하여 개인별 켑스트럼 가중치를 적용한 화자 인증 장치 및 그 방법 |
JP3927559B2 (ja) * | 2004-06-01 | 2007-06-13 | 東芝テック株式会社 | 話者認識装置、プログラム及び話者認識方法 |
CN1811911B (zh) * | 2005-01-28 | 2010-06-23 | 北京捷通华声语音技术有限公司 | 自适应的语音变换处理方法 |
US7603275B2 (en) * | 2005-10-31 | 2009-10-13 | Hitachi, Ltd. | System, method and computer program product for verifying an identity using voiced to unvoiced classifiers |
US7788101B2 (en) * | 2005-10-31 | 2010-08-31 | Hitachi, Ltd. | Adaptation method for inter-person biometrics variability |
CN101051464A (zh) * | 2006-04-06 | 2007-10-10 | 株式会社东芝 | 说话人认证的注册和验证方法及装置 |
DE102007011831A1 (de) * | 2007-03-12 | 2008-09-18 | Voice.Trust Ag | Digitales Verfahren und Anordnung zur Authentifizierung einer Person |
CN101303854B (zh) * | 2007-05-10 | 2011-11-16 | 摩托罗拉移动公司 | 用于提供识别的语音输出的方法 |
US8849432B2 (en) * | 2007-05-31 | 2014-09-30 | Adobe Systems Incorporated | Acoustic pattern identification using spectral characteristics to synchronize audio and/or video |
CN101339765B (zh) * | 2007-07-04 | 2011-04-13 | 黎自奋 | 一种国语单音辨认方法 |
CN101281746A (zh) * | 2008-03-17 | 2008-10-08 | 黎自奋 | 一个百分之百辨认率的国语单音与句子辨认方法 |
DE102009051508B4 (de) * | 2009-10-30 | 2020-12-03 | Continental Automotive Gmbh | Vorrichtung, System und Verfahren zur Sprachdialogaktivierung und -führung |
EP2897076B8 (de) * | 2014-01-17 | 2018-02-07 | Cirrus Logic International Semiconductor Ltd. | Manipulationssicheres Element zur Verwendung bei der Sprechererkennung |
GB2552722A (en) * | 2016-08-03 | 2018-02-07 | Cirrus Logic Int Semiconductor Ltd | Speaker recognition |
GB2552723A (en) | 2016-08-03 | 2018-02-07 | Cirrus Logic Int Semiconductor Ltd | Speaker recognition |
JP6791258B2 (ja) * | 2016-11-07 | 2020-11-25 | ヤマハ株式会社 | 音声合成方法、音声合成装置およびプログラム |
WO2018163279A1 (ja) * | 2017-03-07 | 2018-09-13 | 日本電気株式会社 | 音声処理装置、音声処理方法、および音声処理プログラム |
GB201801875D0 (en) * | 2017-11-14 | 2018-03-21 | Cirrus Logic Int Semiconductor Ltd | Audio processing |
Family Cites Families (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4058676A (en) * | 1975-07-07 | 1977-11-15 | International Communication Sciences | Speech analysis and synthesis system |
JPS58129682A (ja) * | 1982-01-29 | 1983-08-02 | Toshiba Corp | 個人照合装置 |
US5131043A (en) * | 1983-09-05 | 1992-07-14 | Matsushita Electric Industrial Co., Ltd. | Method of and apparatus for speech recognition wherein decisions are made based on phonemes |
US4991216A (en) * | 1983-09-22 | 1991-02-05 | Matsushita Electric Industrial Co., Ltd. | Method for speech recognition |
IT1160148B (it) * | 1983-12-19 | 1987-03-04 | Cselt Centro Studi Lab Telecom | Dispositivo per la verifica del parlatore |
CA1229681A (en) * | 1984-03-06 | 1987-11-24 | Kazunori Ozawa | Method and apparatus for speech-band signal coding |
US5146539A (en) * | 1984-11-30 | 1992-09-08 | Texas Instruments Incorporated | Method for utilizing formant frequencies in speech recognition |
US4773093A (en) * | 1984-12-31 | 1988-09-20 | Itt Defense Communications | Text-independent speaker recognition system and method based on acoustic segment matching |
US4922539A (en) * | 1985-06-10 | 1990-05-01 | Texas Instruments Incorporated | Method of encoding speech signals involving the extraction of speech formant candidates in real time |
JPH0760318B2 (ja) * | 1986-09-29 | 1995-06-28 | 株式会社東芝 | 連続音声認識方式 |
US4837830A (en) * | 1987-01-16 | 1989-06-06 | Itt Defense Communications, A Division Of Itt Corporation | Multiple parameter speaker recognition system and methods |
US4926488A (en) * | 1987-07-09 | 1990-05-15 | International Business Machines Corporation | Normalization of speech by adaptive labelling |
US5001761A (en) * | 1988-02-09 | 1991-03-19 | Nec Corporation | Device for normalizing a speech spectrum |
US5048088A (en) * | 1988-03-28 | 1991-09-10 | Nec Corporation | Linear predictive speech analysis-synthesis apparatus |
CN1013525B (zh) * | 1988-11-16 | 1991-08-14 | 中国科学院声学研究所 | 认人与不认人实时语音识别的方法和装置 |
US5293448A (en) * | 1989-10-02 | 1994-03-08 | Nippon Telegraph And Telephone Corporation | Speech analysis-synthesis method and apparatus therefor |
US5007094A (en) * | 1989-04-07 | 1991-04-09 | Gte Products Corporation | Multipulse excited pole-zero filtering approach for noise reduction |
JPH02309820A (ja) * | 1989-05-25 | 1990-12-25 | Sony Corp | デイジタル信号処理装置 |
US4975956A (en) * | 1989-07-26 | 1990-12-04 | Itt Corporation | Low-bit-rate speech coder using LPC data reduction processing |
US5167004A (en) * | 1991-02-28 | 1992-11-24 | Texas Instruments Incorporated | Temporal decorrelation method for robust speaker verification |
US5165008A (en) * | 1991-09-18 | 1992-11-17 | U S West Advanced Technologies, Inc. | Speech synthesis using perceptual linear prediction parameters |
WO1993018505A1 (en) * | 1992-03-02 | 1993-09-16 | The Walt Disney Company | Voice transformation system |
-
1994
- 1994-02-28 US US08/203,988 patent/US5522012A/en not_active Expired - Lifetime
-
1995
- 1995-02-28 AU AU21164/95A patent/AU683370B2/en not_active Ceased
- 1995-02-28 EP EP95913980A patent/EP0748500B1/de not_active Expired - Lifetime
- 1995-02-28 CN CN95191853.2A patent/CN1142274A/zh active Pending
- 1995-02-28 JP JP7522534A patent/JPH10500781A/ja not_active Ceased
- 1995-02-28 WO PCT/US1995/002801 patent/WO1995023408A1/en active IP Right Grant
- 1995-02-28 AT AT95913980T patent/ATE323933T1/de not_active IP Right Cessation
- 1995-02-28 DE DE69534942T patent/DE69534942T2/de not_active Expired - Lifetime
- 1995-02-28 CA CA002184256A patent/CA2184256A1/en not_active Abandoned
Also Published As
Publication number | Publication date |
---|---|
EP0748500B1 (de) | 2006-04-19 |
JPH10500781A (ja) | 1998-01-20 |
DE69534942T2 (de) | 2006-12-07 |
CN1142274A (zh) | 1997-02-05 |
WO1995023408A1 (en) | 1995-08-31 |
AU683370B2 (en) | 1997-11-06 |
AU2116495A (en) | 1995-09-11 |
ATE323933T1 (de) | 2006-05-15 |
US5522012A (en) | 1996-05-28 |
CA2184256A1 (en) | 1995-08-31 |
EP0748500A1 (de) | 1996-12-18 |
MX9603686A (es) | 1997-12-31 |
EP0748500A4 (de) | 1998-09-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ATE323933T1 (de) | Verfahren und vorrichtung zur sprechererkennung und -verifizierung | |
Kim et al. | Auditory processing of speech signals for robust speech recognition in real-world noisy environments | |
US7319959B1 (en) | Multi-source phoneme classification for noise-robust automatic speech recognition | |
US20030061037A1 (en) | Method and apparatus for identifying noise environments from noisy signals | |
Kingsbury | Perceptually inspired signal processing strategies for robust speech recognition in reverberant environments | |
CN1238058A (zh) | 语音处理系统 | |
CN111489763B (zh) | 一种基于gmm模型的复杂环境下说话人识别自适应方法 | |
US9269352B2 (en) | Speech recognition with a plurality of microphones | |
Strand et al. | Cepstral mean and variance normalization in the model domain | |
CN112116909A (zh) | 语音识别方法、装置及系统 | |
Bäckström et al. | Voice activity detection | |
CN86100298A (zh) | 语音识别 | |
Dai et al. | An improved model of masking effects for robust speech recognition system | |
KR100741355B1 (ko) | 인지 가중 필터를 이용한 전처리 방법 | |
Thomsen et al. | Speech enhancement and noise-robust automatic speech recognition | |
JPH024920B2 (de) | ||
Gajic | Auditory based methods for robust speech feature extraction | |
Christiansen et al. | Noise reduction in speech using adaptive filtering I: Signal processing algorithms | |
Kitamura et al. | Speaker individualities in speech spectral envelopes and fundamental frequency contours | |
JPS6148898A (ja) | 音声の有声無声判定装置 | |
Power et al. | Consistency among speech parameter vectors: Application to predicting speech intelligibility | |
Gerazov et al. | Overview of Feature Selection for Automatic Speech Recognition | |
Shridhar et al. | A unified approach to speaker verification with noisy speech inputs | |
Bonde et al. | Noise robust automatic speech recognition with adaptive quantile based noise estimation and speech band emphasizing filter bank | |
Hermansky et al. | Recent advances in addressing sources of non-linguistic information |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition |