EP0993671B1 - Procede de recherche d'un modele de bruit dans des signaux sonores bruites - Google Patents
Procede de recherche d'un modele de bruit dans des signaux sonores bruites Download PDFInfo
- Publication number
- EP0993671B1 EP0993671B1 EP98935094A EP98935094A EP0993671B1 EP 0993671 B1 EP0993671 B1 EP 0993671B1 EP 98935094 A EP98935094 A EP 98935094A EP 98935094 A EP98935094 A EP 98935094A EP 0993671 B1 EP0993671 B1 EP 0993671B1
- Authority
- EP
- European Patent Office
- Prior art keywords
- model
- noise
- frames
- energy
- search
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02168—Noise filtering characterised by the method used for estimating noise the estimation exclusively taking place during speech pauses
Definitions
- the invention relates to improving the intelligibility of voice communications in the presence of noise. It no longer applies especially but not exclusively to telephone communications or by radiotelephone or other electronic means, at the speech recognition, etc., whenever the recording environment sound is noisy and may deteriorate the perception or recognition of the transmitted voice.
- noise comes from engines, air conditioning, ventilation of on-board equipment, aerodynamic noise. These noises are picked up by the microphone in which the pilot or a member of the crew.
- the invention provides a method of searching for a model of noise which can be used in particular in treatments for reducing noise.
- Noise reduction treatments based on the noise model found allow to increase the signal / noise ratio of the transmitted signal, a aim being to deteriorate the intelligibility of the signal as little as possible.
- the denoising denoising and denoising will be used to speak operations to remove or reduce noise components present in the signal.
- the denoising can be based as we will see on the permanent search for an ambient noise model, on spectral analysis of this noise, and on the digital reconstruction of a useful signal eliminating as much as possible the modeled noise.
- the noise model is sought in the noisy signals themselves and whenever a plausible noise pattern has been found, this noise model is stored for use. Then a new research begins to find a more suitable model or simply more recent.
- the invention provides a search method automatic noise patterns in audio input signals noisy, in which we digitize the input signals, and we process these signals from a model found (for example in order to eliminate at best noise corresponding to the model), characterized in that the signals input are cut into successive frames of P samples each, and a repetitive search for a noise model is performed in permanence in the input signals themselves, looking for N successive frames having the expected characteristics of a noise, in storing the corresponding NxP samples to constitute a model of noise useful for processing denoising of input signals, and repeating the research to find a new noise model and store the new one model to replace the previous one or keep the previous model according to the respective characteristics of the two models.
- the noise model used in particular for denoising is not a known predetermined model or a chosen model among several predetermined models, but this is a model found in the noisy signal itself, which allows not only to adapt denoising to the real annoying noise, but also to adapt the denoising to variations of this noise.
- the noise model is obtained by considering that the signals whose energy is stable (and preferably, as we will see, whose energy is minimal), over a certain period probably represent noise; the invention is characterized, according to claim 1, in that the searching for a noise model then includes searching for N frames successive whose energies are close to each other (N being between a minimum value N1 and a maximum value N2), the calculation of the average energy of the N successive frames found, and the storage NxP samples as a new active model if the relationship between this average energy and the average energy of the frames of the active model previously stored is below a determined replacement threshold.
- the search for N successive frames then comprises at minus the following iterative steps: calculation of the energy of a frame current of rank n can be added to a current model of preparation already comprising n-1 successive frames; ratio calculation between this energy and the energy of the previous frame of rank n-1 (and of preferably that of other previous frames between 1 and n-1); comparison of this ratio with a low threshold less than 1 and a high threshold greater than 1; and decision on the possibility of incorporating the frame of rank n in the model in being developed: the frame is not incorporated into the model if the report is not between the two thresholds; it is incorporated into the model if the ratio is between the two thresholds. The procedure is repeated on the next current frame of input signals, with incrementation of n, until the model is stopped.
- n reaches the high value N2
- the model developed cannot be taken count as an active model only if n-1 is already greater than or equal to the minimum N1, because the principle is that a noise model is representative if it has an approximately stable energy on at least N1 frames.
- the model developed does not become active in place of the previous model only if the ratio between its average energy per frame and the average energy of the previous model does not exceed a threshold of predetermined replacement.
- the search for a new model starts again as soon as the preparation of the previous one is interrupted.
- the replacement of a previous model by a new model is inhibited as soon as speech is detected in noisy signals.
- the presence of speech can indeed be detected by digital signal processing procedures (such as than those that can be used in speech recognition).
- the signal analysis which allows denoising will be based on spectral analysis of signals in time intervals of duration D, which we will call “frames”, and which will have approximately this duration.
- the general principle of the denoising process is based on a permanent and automatic search for a noise model which will be used to process the input signal to denois it.
- This research is done on digitized u (t) signal samples stored in a buffer input.
- This memory is capable of memorizing all the samples of several frames of the input signal (e.g. at least 2 frames).
- the noise model sought consists of a succession multiple frames including energy stability and energy level relative suggest that it is an ambient noise and not a speech signal or some other disturbing noise. We will see later how this automatic search.
- the denoising of the input signal u (t) is done from the model of noise that is in memory, and more precisely from the characteristics spectral of this model.
- a Fourier transform and an estimate of average spectral density of noise are therefore performed on the model of stored noise.
- the denoising operation is preferably done using a digital filtering from Wiener which will be discussed in more detail.
- the filter of Wiener is parameterized by the spectral characteristics of the model of noise recorded and by the spectral characteristics of the signal u (t) to denoise.
- the digitized input signal therefore undergoes a transform of Fourier and an estimate of spectral density.
- the numerical values of the Fourier transform i.e. the input signal represented by its frequency components, are processed by the Wiener filter and the output of the Wiener filter represents, in frequency space, the signal digital denoised, that is to say rid as much as possible of the noise represented by the registered model.
- the filtered digital signal is used either for the reconstruction of a sound signal in which the ambient noise has been partly eliminated, i.e. at the speech Recognition.
- phase of automatic search for a noise model and the permanent updating of this model are crucial steps in the process and are more precisely the subject of the invention.
- noise ambient is a signal with a stable minimum energy in the short term.
- the number of frames intended to assess the noise stability is 5 to 20.
- Energy must be stable over several frames, otherwise we must assume that the signal contains rather speech or noise other than ambient noise. It must be minimal, fault what we consider that the signal contains breathing or phonetic speech elements resembling noise but overlapping to ambient noise.
- Figure 2 shows a typical evolution configuration temporal energy of a microphone signal at the time of a start speech emission, with a breath noise phase, which goes out for a few tens to hundreds of milliseconds to make room for the ambient noise alone, after which a high energy level indicates the presence speech, to finally return to ambient noise.
- N1 5
- a determined range of values for example between 1/3 and 3
- the noise model is generally based on permanent ambient noise. Even before speaking, preceded by breathing, there is a phase where ambient noise alone is present for a sufficient time to be taken into account as an active noise model. This phase ambient noise alone after breathing is brief; the number N1 is chosen relatively weak, so that we have time to readjust the noise model on ambient noise after the breathing phase.
- the ambient noise changes slowly, the change will be taken into account. account of the fact that the comparison threshold with the stored model is greater than 1. If it evolves more rapidly in the increasing direction, the evolution may not be taken into account, so it is best to plan to reset the search for a model from time to time noise. For example, in a stopped ground plane, the ambient noise will be relatively weak, and it should not be that during the phase of takeoff the noise model remains frozen on what it was at a standstill because a noise model is only replaced by a less energetic model or not much more energetic. The methods of reset envisaged.
- FIG. 3 represents a flowchart of the operations of automatic search for an ambient noise pattern.
- the input signal u (t), sampled at the frequency F e 1 / T e and digitized by an analog-digital converter, is stored in a buffer memory capable of storing all the samples of at least 2 frames.
- n The number of the current frame in an operation of looking for a noise pattern is denoted by n and is counted by a counter as you search. At the initialization of the search, n is set to 1. This number n will be incremented progressively the development of a model of several successive frames. when analyzes the current frame n, the model already understands by hypothesis n-1 successive frames meeting the conditions imposed to be part of a model.
- the signal energy of the frame is calculated by summing the squares of the numerical values of the samples of the frame. She is kept in memory.
- the ratio between the energies of the two frames is calculated. If this ratio is between two thresholds S and S 'one of which is greater than 1 and the other is less than 1, we consider that the energies of the two frames are close and that the two frames can be part of a noise model.
- the frames are declared incompatible and the search is reset by resetting n to 1.
- the rank n is incremented of the current frame, and we perform, in a procedure loop iterative, an energy calculation of the next frame and a comparison with the energy of the previous frame or previous frames, using the thresholds S and S '.
- the first type of comparison consists in comparing only the energy of the frame n to the energy of the n-1 frame.
- the second type is to compare the energy of frame n at each of frames 1 to n-1. The second way leads to greater homogeneity of the model but it has the disadvantage of not not take sufficiently into account the cases where the noise level increases or decreases rapidly.
- the energy of the frame of rank n is compared with the energy of the frame of rank n-1 and possibly of other frames previous (not necessarily all of them for that matter).
- N2 is chosen so as to limit the calculation time in subsequent noise spectral density estimation operations.
- n is less than N2
- the homogeneous frame is added to the to help build the noise model, n is incremented and the next frame is analyzed.
- n is equal to N2
- the frame is also added to the n-1 previous homogeneous frames and the model of n homogeneous frames is stored for use in noise elimination. Searching for a model is also reset by resetting n to 1.
- the previous steps relate to the first search for model. But once a model has been stored, it can at any time be replaced by a more recent model.
- the replacement condition is still a condition of energy, but this time it relates to the average energy of the model and not more about the energy of each frame.
- the new model is considered better and we store it in place of the previous one. Otherwise, the new model is rejected and the old remains in force.
- the threshold SR is preferably slightly greater than 1.
- the SR threshold was less than or equal to 1, we would store at each times the least energetic homogeneous frames, which corresponds well to the fact that ambient noise is considered to be the energy level at below which we never descend. But, we would eliminate any possibility evolution of the model if the ambient noise starts to increase.
- SR threshold was too high above 1, there is a risk of poorly distinguish ambient noise and other disturbing noises (breathing), or even some phonemes that sound like noise (consonants hissing or hissing for example). Noise removal from a noise pattern stalled on breath or on whistling consonants or hissing could then harm the intelligibility of the denoised signal.
- the threshold SR is approximately 1.5. At above this threshold we will keep the old model; below this threshold we will replace the old model with the new one. In both cases, we will reset the search by restarting the reading of a first frame of the input signal u (t), and by putting n at 1.
- the digital treatments of commonly used signal in speech detection identify the presence of words based on the characteristic spectra of periodicity of certain phonemes, in particular the corresponding phonemes to vowels or voiced consonants.
- This inhibition is to prevent certain sounds from being taken for noise when these are useful phonemes, that a model of noise based on these sounds is stored and that noise suppression after the development of the model then tends to suppress all the sounds Similar.
- Ambient noise can indeed increase significantly and fast, for example during the acceleration phase of the engines of a plane or other vehicle, air, land or sea. But the SR threshold requires that the previous noise model be kept when the energy average noise increases too quickly.
- Periodicity can be based on duration average speech in the intended application; for example the durations of speech are on average a few seconds for the crew of a airplane, and the reset can take place with a periodicity of a few seconds.
- the proper denoising treatment carried out from of a stored noise model, can be performed as follows, by working on the Fourier transforms of the input signal.
- the Fourier transform of the input signal is carried out frame by frame and provides for each frame P samples in the frequency space, each sample corresponding to a frequency F e / i with i varying from 1 to P. These P samples will be processed preferably in a Wiener filter.
- the Wiener filter is a digital filter of P coefficients each corresponding to one of the frequencies F e / i of the frequency space.
- Each sample of the input signal in the frequency space is multiplied by the respective coefficient W i of the filter.
- the set of P samples thus processed constitutes a denoised signal frame, in the frequency space.
- these denoised frames are used directly in the frequency space.
- the coefficients W i of the Wiener filter are calculated from the spectral density of the noisy input signal and the noise spectral density of the stored noise model.
- the spectral density of a frame of the input signal is obtained from the Fourier transform of the noisy input signal. For each frequency, we take the squared module of the sample provided by the Fourier transform, to obtain a value DS i for each frequency F e / i.
- the module squared of the P samples is calculated for each frame, and the N squared modules corresponding to the same frequency F e / i are averaged over the N frames of the noise model.
- P noise density values DB i are obtained.
- the sample of rank i of the Fourier transform of an input signal frame is multiplied by W i and the succession of the P samples thus multiplied by P Wiener coefficients constitutes the denoised input frame.
- the implementation of the method according to the invention can be done at from non-specialized computers, provided with calculation programs required and receiving the digital signal samples as they are supplied by an analog-to-digital converter.
- This implementation can also be done from a specialized computer based on digital signal processors, which allows more signals to be processed more quickly digital.
- FIG. 4 represents an example of general architecture of a specialized computer receiving the sound signal to denois and providing real time an audible noise signal.
- the computer includes two signal processors digital DSP1 and DSP2 and working memories associated with these processors.
- Noise signals are passed through a converter analog-digital CA / D and are stored in parallel in two FIFO1 and FIFO2 buffers (of the "first-in, first-out" type, i.e. first in first out).
- One of the memories is connected to the processor DSP1, the other to the DSP2 processor.
- the DSP1 processor is the master processor and it is dedicated rather looking for a noise model. It is therefore programmed to execute at least the following operations: frame energy calculation, energy averaging, comparison with thresholds, comparison frame rank with N1 and N2, etc. It also calculates densities energy spectral of the noise model.
- This DSP1 processor is coupled to a dynamic working memory DRAM1 in which we store the current frame sample during a calculation, the energy of a frame current, the energy of the previous frame (s), the samples of Fourier transform of the noise model. It is also coupled with a static working memory in which the tables used are stored the computation of Fourier transforms, and the comparison thresholds S and SR.
- the DSP2 processor is dedicated rather to the calculation of transforms Fourier signal to denois, calculating the spectral density of this signal, calculating Wiener coefficients, Wiener filtering, and inverse Fourier transform if the latter is to be performed.
- the DSP2 processor is coupled to a dynamic working memory DRAM2 and a static working memory SRAM2.
- DRAM2 memory stores current frame samples, transform calculation results from Fourier, results of calculation of spectral energy density of the signal, the calculated Wiener coefficients, etc.
- the SRAM2 memory stores in particular tables used for the computation of Fourier transforms.
- the denoised sound signal samples calculated by the DSP2 processor are transmitted, through a circulating buffer FIFO3, to a digital analog converter CNIA, and to a circuit of smoothing which reconstructs the denoised sound signal in analog form.
Description
- la figure 1 représente un organigramme général d'un procédé de réduction de bruit utilisant le procédé de l'invention;
- la figure 2 représente un exemple typique de signal issu d'une prise de son bruitée;
- la figure 3 représente l'organigramme des étapes de recherche d'un modèle de bruit dans le signal d'entrée;
- la figure 4 représente un exemple d'architecture de circuit électronique pour la mise en oeuvre d'opérations de débruitage utilisant le procédé selon l'invention.
- le bruit qu'on veut éliminer est le bruit de fond ambiant;
- le bruit ambiant a une énergie relativement stable à court terme,
- la parole est le plus souvent précédée d'un bruit de respiration du pilote qu'il ne faut pas confondre avec le bruit ambiant; mais ce bruit de respiration s'éteint quelques centaines de millisecondes avant la première émission de parole proprement dite, de sorte qu'on ne retrouve que le bruit ambiant juste avant l'émission de parole;
- et enfin, les bruits et la parole se superposent en termes d'énergie de signal, de sorte qu'un signal contenant de la parole ou un bruit perturbateur, y compris la respiration dans le microphone, contient forcément plus d'énergie qu'un signal de bruit ambiant.
- ou bien n est inférieur ou égal à un nombre minimal N1 en dessous duquel le modèle ne peut pas être considéré comme significatif du bruit ambiant parce que la durée d'homogénéité est trop courte; par exemple N1 = 5; dans ce cas on abandonne le modèle en cours d'élaboration, et on réinitialise la recherche au début en remettant n à 1;
- ou bien n est supérieur au nombre minimal N1. Dans ce cas, puisqu'on trouve maintenant un manque d'homogénéité, on considère qu'il y a peut-être un début de parole après une phase de bruit homogène, et on conserve à titre de modèle de bruit tous les échantillons des n-1 trames de bruit homogènes qui ont précédé le manque d'homogénéité. Ce modèle reste stocké jusqu'à ce qu'on trouve un modèle plus récent qui semble également représenter du bruit ambiant. La recherche est réinitialisée de toutes façons en remettant n à 1.
Claims (8)
- Procédé de recherche automatique de modèles de bruit dans des signaux d'entrée sonores bruités, comprenant la numérisation des signaux d'entrée, et le traitement de ces signaux à partir d'un modèle trouvé, procédé dans lequel. les signaux d'entrée sont découpés en trames successives de P échantillons chacune, et une recherche répétitive d'un modèle de bruit est effectuée en permanence dans les signaux d'entrée eux-mêmes, en recherchant N trames successives ayant les caractéristiques attendues d'un bruit, en stockant les NxP échantillons correspondants pour constituer un modèle de bruit utile au traitement de débruitage des signaux d'entrée, et en réitérant la recherche pour trouver un nouveau modèle de bruit et stocker le nouveau modèle en remplacement du précédent ou conserver le modèle précédent selon les caractéristiques respectives des deux modèles, caractérisé en ce que la recherche d'un modèle de bruit comprend la recherche de N trames successives dont les énergies sont proches les unes des autres, N étant compris entre une valeur minimale N1 et une valeur maximale N2, le calcul de l'énergie moyenne des N trames successives trouvées, et le stockage des NxP échantillons à titre de nouveau modèle actif si le rapport entre cette énergie moyenne et l'énergie moyenne des trames du modèle actif précédemment stocké est inférieur à un seuil de remplacement déterminé.
- Procédé selon la revendication 1, caractérisé en ce que la recherche de N trames successives comprend alors au moins les étapes itératives suivantes : calcul de l'énergie d'une trame courante de rang n susceptible d'être ajoutée à un modèle en cours d'élaboration comprenant déjà n-1 trames successives; calcul du rapport entre cette énergie et l'énergie de la trame précédente de rang n-1; comparaison de ce rapport avec un seuil bas inférieur à 1 et un seuil haut supérieur à 1; et décision sur la possibilité d'incorporer la trame de rang n au modèle en cours d'élaboration en fonction du résultat de la comparaison.
- Procédé selon la revendication 2, caractérisé en ce que la recherche de N trames successives comprend également le calcul du rapport entre l'énergie de la trame courante et l'énergie d'une ou plusieurs autres trames précédentes, la comparaison avec les seuils, la trame étant incorporée au modèle en cours d'élaboration en fonction du résultat de la comparaison.
- Procédé selon l'une des revendications 2 et 3, caractérisé en ce que dans le cas où la trame de rang n est incorporée au modèle, on incrémente n d'une unité pour continuer l'élaboration du modèle si n est inférieur à N2, et, dans le cas contraire, on arrête l'élaboration du modèle, on calcule l'énergie moyenne des n trames, on calcule le rapport entre cette énergie et l'énergie moyenne des trames du modèle précédemment stocké, on conserve le modèle précédent ou on le remplace par le modèle en cours d'élaboration selon la valeur du rapport, et on recommence la recherche itérative d'un nouveau modèle.
- Procédé selon l'une des revendications 2 et 3, caractérisé en ce que dans le cas où la trame courante de rang n n'est pas incorporée au modèle en cours d'élaboration,on arrête l'élaboration du modèle de n-1 trames;si n est supérieur à N1, on calcule le rapport entre l'énergie moyenne des trames du modèle en cours d'élaboration et l'énergie moyenne des trames du modèle précédemment stocké, et on conserve le modèle précédent ou on le remplace par le nouveau modèle selon la valeur du rapport,et on recommence une recherche itérative d'un nouveau modèle.
- Procédé selon l'une des revendications précédentes, caractérisé en ce que l'on recherche la présence de parole dans le signal d'entrée, et on inhibe la recherche d'un nouveau modèle si la présence de parole est détectée.
- Procédé selon l'une des revendications précédentes, caractérisé en ce qu'on réinitialise périodiquement la recherche en imposant le nouveau modèle quelles que soient les caractéristiques respectives du nouveau modèle et du modèle précédent
- Procédé selon l'une des revendications précédentes, caractérisé en ce que l'on traite les signaux d'entrée bruités à partir d'un modèle de bruit trouvé, par filtrage spectral, en vue d'éliminer au mieux le bruit correspondant au modèle.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR9708509A FR2765715B1 (fr) | 1997-07-04 | 1997-07-04 | Procede de recherche d'un modele de bruit dans des signaux sonores bruites |
FR9708509 | 1997-07-04 | ||
PCT/FR1998/001428 WO1999001862A1 (fr) | 1997-07-04 | 1998-07-03 | Procede de recherche d'un modele de bruit dans des signaux sonores bruites |
Publications (2)
Publication Number | Publication Date |
---|---|
EP0993671A1 EP0993671A1 (fr) | 2000-04-19 |
EP0993671B1 true EP0993671B1 (fr) | 2002-06-12 |
Family
ID=9508879
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP98935094A Expired - Lifetime EP0993671B1 (fr) | 1997-07-04 | 1998-07-03 | Procede de recherche d'un modele de bruit dans des signaux sonores bruites |
Country Status (6)
Country | Link |
---|---|
US (1) | US6438513B1 (fr) |
EP (1) | EP0993671B1 (fr) |
JP (1) | JP4338226B2 (fr) |
DE (1) | DE69806006T2 (fr) |
FR (1) | FR2765715B1 (fr) |
WO (1) | WO1999001862A1 (fr) |
Families Citing this family (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6633842B1 (en) * | 1999-10-22 | 2003-10-14 | Texas Instruments Incorporated | Speech recognition front-end feature extraction for noisy speech |
EP1104925A1 (fr) * | 1999-12-03 | 2001-06-06 | Siemens Aktiengesellschaft | Procédé de traitement de la parole par soustraction d'une fonction du bruit |
EP1152399A1 (fr) * | 2000-05-04 | 2001-11-07 | Faculte Polytechniquede Mons | Traitement en sous bandes de signal de parole par réseaux de neurones |
FR2808917B1 (fr) * | 2000-05-09 | 2003-12-12 | Thomson Csf | Procede et dispositif de reconnaissance vocale dans des environnements a niveau de bruit fluctuant |
US7010483B2 (en) * | 2000-06-02 | 2006-03-07 | Canon Kabushiki Kaisha | Speech processing system |
US7072833B2 (en) * | 2000-06-02 | 2006-07-04 | Canon Kabushiki Kaisha | Speech processing system |
US6954745B2 (en) * | 2000-06-02 | 2005-10-11 | Canon Kabushiki Kaisha | Signal processing system |
US20020026253A1 (en) * | 2000-06-02 | 2002-02-28 | Rajan Jebu Jacob | Speech processing apparatus |
US7035790B2 (en) * | 2000-06-02 | 2006-04-25 | Canon Kabushiki Kaisha | Speech processing system |
EP1170728A1 (fr) * | 2000-07-05 | 2002-01-09 | Alcatel | Dispositif de réduction adaptive du bruit dans des signaux de parole |
US7062442B2 (en) * | 2001-02-23 | 2006-06-13 | Popcatcher Ab | Method and arrangement for search and recording of media signals |
EP1417583B1 (fr) * | 2001-02-23 | 2006-10-11 | Popcatcher Ab | Procédé de réception d'un signal multimédia |
GB2380644A (en) * | 2001-06-07 | 2003-04-09 | Canon Kk | Speech detection |
FR2842064B1 (fr) * | 2002-07-02 | 2004-12-03 | Thales Sa | Systeme de spatialisation de sources sonores a performances ameliorees |
SE524162C2 (sv) * | 2002-08-23 | 2004-07-06 | Rickard Berg | Förfarande för att behandla signaler |
WO2004109661A1 (fr) * | 2003-06-05 | 2004-12-16 | Matsushita Electric Industrial Co., Ltd. | Appareil et procede de reglage de la qualite sonore |
EP1494040A1 (fr) * | 2003-06-30 | 2005-01-05 | Sulzer Markets and Technology AG | Méthode de compensation de bruit de quantification et utilisation de la méthode |
US8718298B2 (en) * | 2003-12-19 | 2014-05-06 | Lear Corporation | NVH dependent parallel compression processing for automotive audio systems |
JP4340686B2 (ja) * | 2004-03-31 | 2009-10-07 | パイオニア株式会社 | 音声認識装置及び音声認識方法 |
US7139701B2 (en) * | 2004-06-30 | 2006-11-21 | Motorola, Inc. | Method for detecting and attenuating inhalation noise in a communication system |
CN101031963B (zh) * | 2004-09-16 | 2010-09-15 | 法国电信 | 处理有噪声的声音信号的方法以及实现该方法的装置 |
JP5724361B2 (ja) * | 2010-12-17 | 2015-05-27 | 富士通株式会社 | 音声認識装置、音声認識方法および音声認識プログラム |
CN104301064B (zh) | 2013-07-16 | 2018-05-04 | 华为技术有限公司 | 处理丢失帧的方法和解码器 |
US9633669B2 (en) * | 2013-09-03 | 2017-04-25 | Amazon Technologies, Inc. | Smart circular audio buffer |
DE102013111784B4 (de) * | 2013-10-25 | 2019-11-14 | Intel IP Corporation | Audioverarbeitungsvorrichtungen und audioverarbeitungsverfahren |
CN105225666B (zh) | 2014-06-25 | 2016-12-28 | 华为技术有限公司 | 处理丢失帧的方法和装置 |
EP3248191B1 (fr) * | 2015-01-20 | 2021-09-29 | Dolby Laboratories Licensing Corporation | Modélisation et réduction de bruit de systèmes de propulsion de drone |
CN105991900B (zh) * | 2015-02-05 | 2019-08-09 | 扬智科技股份有限公司 | 噪声检测方法和去噪方法 |
CN106067847B (zh) * | 2016-05-25 | 2019-10-22 | 腾讯科技(深圳)有限公司 | 一种语音数据传输方法及装置 |
CN109087659A (zh) * | 2018-08-03 | 2018-12-25 | 三星电子(中国)研发中心 | 音频优化方法及设备 |
Family Cites Families (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4630304A (en) * | 1985-07-01 | 1986-12-16 | Motorola, Inc. | Automatic background noise estimator for a noise suppression system |
US5029118A (en) * | 1985-12-04 | 1991-07-02 | Nissan Motor Co. Ltd. | Periodic noise canceling system and method |
FR2677828B1 (fr) | 1991-06-14 | 1993-08-20 | Sextant Avionique | Procede de detection d'un signal utile bruite. |
FR2697101B1 (fr) | 1992-10-21 | 1994-11-25 | Sextant Avionique | Procédé de détection de la parole. |
FR2704111B1 (fr) | 1993-04-16 | 1995-05-24 | Sextant Avionique | Procédé de détection énergétique de signaux noyés dans du bruit. |
US5521851A (en) * | 1993-04-26 | 1996-05-28 | Nihon Kohden Corporation | Noise reduction method and apparatus |
WO1995002288A1 (fr) * | 1993-07-07 | 1995-01-19 | Picturetel Corporation | Reduction de bruits de fond pour l'amelioration de la qualite de voix |
JPH07193548A (ja) * | 1993-12-25 | 1995-07-28 | Sony Corp | 雑音低減処理方法 |
JP3453898B2 (ja) * | 1995-02-17 | 2003-10-06 | ソニー株式会社 | 音声信号の雑音低減方法及び装置 |
JP2685031B2 (ja) * | 1995-06-30 | 1997-12-03 | 日本電気株式会社 | 雑音消去方法及び雑音消去装置 |
US5659622A (en) * | 1995-11-13 | 1997-08-19 | Motorola, Inc. | Method and apparatus for suppressing noise in a communication system |
FR2744871B1 (fr) | 1996-02-13 | 1998-03-06 | Sextant Avionique | Systeme de spatialisation sonore, et procede de personnalisation pour sa mise en oeuvre |
US5937381A (en) * | 1996-04-10 | 1999-08-10 | Itt Defense, Inc. | System for voice verification of telephone transactions |
US6144937A (en) * | 1997-07-23 | 2000-11-07 | Texas Instruments Incorporated | Noise suppression of speech by signal processing including applying a transform to time domain input sequences of digital signals representing audio information |
TW333610B (en) * | 1997-10-16 | 1998-06-11 | Winbond Electronics Corp | The phonetic detecting apparatus and its detecting method |
US6216103B1 (en) * | 1997-10-20 | 2001-04-10 | Sony Corporation | Method for implementing a speech recognition system to determine speech endpoints during conditions with background noise |
US6182018B1 (en) * | 1998-08-25 | 2001-01-30 | Ford Global Technologies, Inc. | Method and apparatus for identifying sound in a composite sound signal |
US6188981B1 (en) * | 1998-09-18 | 2001-02-13 | Conexant Systems, Inc. | Method and apparatus for detecting voice activity in a speech signal |
US6108610A (en) * | 1998-10-13 | 2000-08-22 | Noise Cancellation Technologies, Inc. | Method and system for updating noise estimates during pauses in an information signal |
US6289309B1 (en) * | 1998-12-16 | 2001-09-11 | Sarnoff Corporation | Noise spectrum tracking for speech enhancement |
-
1997
- 1997-07-04 FR FR9708509A patent/FR2765715B1/fr not_active Expired - Fee Related
-
1998
- 1998-07-03 EP EP98935094A patent/EP0993671B1/fr not_active Expired - Lifetime
- 1998-07-03 DE DE69806006T patent/DE69806006T2/de not_active Expired - Fee Related
- 1998-07-03 WO PCT/FR1998/001428 patent/WO1999001862A1/fr active IP Right Grant
- 1998-07-03 JP JP50654799A patent/JP4338226B2/ja not_active Expired - Fee Related
- 1998-07-03 US US09/446,886 patent/US6438513B1/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
WO1999001862A1 (fr) | 1999-01-14 |
FR2765715A1 (fr) | 1999-01-08 |
JP4338226B2 (ja) | 2009-10-07 |
DE69806006T2 (de) | 2002-12-19 |
FR2765715B1 (fr) | 1999-09-17 |
US6438513B1 (en) | 2002-08-20 |
DE69806006D1 (de) | 2002-07-18 |
EP0993671A1 (fr) | 2000-04-19 |
JP2002513479A (ja) | 2002-05-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0993671B1 (fr) | Procede de recherche d'un modele de bruit dans des signaux sonores bruites | |
EP0918317B1 (fr) | Procédé de filtrage fréquentiel appliqué au débruitage de signaux sonores mettant en oeuvre un filtre de Wiener | |
EP1789956B1 (fr) | Procede de traitement d'un signal sonore bruite et dispositif pour la mise en oeuvre du procede | |
EP0594480B1 (fr) | Procédé de détection de la parole | |
EP1154405B1 (fr) | Procédé et dispositif de reconnaissance vocale dans des environnements a niveau de bruit fluctuant | |
EP2057835B1 (fr) | Procédé de réduction de l'écho acoustique résiduel après suppression d'écho dans un dispositif "mains libres" | |
EP1830349B1 (fr) | Procédé de débruitage d'un signal audio | |
EP2772916B1 (fr) | Procédé de débruitage d'un signal audio par un algorithme à gain spectral variable à dureté modulable dynamiquement | |
WO2002061731A1 (fr) | Procede et dispositif de reduction de bruit | |
FR2943875A1 (fr) | Procede et dispositif de classification du bruit de fond contenu dans un signal audio. | |
WO1999005831A1 (fr) | Procede et dispositif d'egalisation aveugle des effets d'un canal de transmission sur un signal de parole numerique | |
WO2003048711A2 (fr) | System de detection de parole dans un signal audio en environnement bruite | |
EP1131813B1 (fr) | Procede de reconnaissance vocale dans un signal acoustique bruite et systeme mettant en oeuvre ce procede | |
CA2404441C (fr) | Parametres robustes pour la reconnaissance de parole bruitee | |
EP0534837B1 (fr) | Procédé de traitement de la parole en présence de bruits acoustiques utilisant la sous traction spectrale non-linéaire et les modèles de Markov cachés | |
EP3627510A1 (fr) | Filtrage d'un signal sonore acquis par un systeme de reconnaissance vocale | |
EP1016073A1 (fr) | Procede de debruitage d'un signal de parole numerique | |
EP2515300B1 (fr) | Procédé et système de réduction du bruit | |
EP1021805B1 (fr) | Procede et disposition de conditionnement d'un signal de parole numerique | |
WO1999027523A1 (fr) | Procede de reconstruction, apres debruitage, de signaux sonores |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20000105 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): DE FR GB NL |
|
17Q | First examination report despatched |
Effective date: 20000921 |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: THALES AVIONICS S.A. |
|
GRAG | Despatch of communication of intention to grant |
Free format text: ORIGINAL CODE: EPIDOS AGRA |
|
RIC1 | Information provided on ipc code assigned before grant |
Free format text: 7G 10L 21/02 A |
|
GRAG | Despatch of communication of intention to grant |
Free format text: ORIGINAL CODE: EPIDOS AGRA |
|
GRAH | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOS IGRA |
|
GRAH | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOS IGRA |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): DE FR GB NL |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
REF | Corresponds to: |
Ref document number: 69806006 Country of ref document: DE Date of ref document: 20020718 |
|
GBT | Gb: translation of ep patent filed (gb section 77(6)(a)/1977) |
Effective date: 20020813 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
26N | No opposition filed |
Effective date: 20030313 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: NL Payment date: 20090705 Year of fee payment: 12 Ref country code: GB Payment date: 20090701 Year of fee payment: 12 Ref country code: DE Payment date: 20090626 Year of fee payment: 12 |
|
REG | Reference to a national code |
Ref country code: NL Ref legal event code: V1 Effective date: 20110201 |
|
GBPC | Gb: european patent ceased through non-payment of renewal fee |
Effective date: 20100703 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: DE Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20110201 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R119 Ref document number: 69806006 Country of ref document: DE Effective date: 20110201 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: NL Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20110201 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GB Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20100703 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 19 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: FR Payment date: 20160628 Year of fee payment: 19 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: ST Effective date: 20180330 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: FR Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES Effective date: 20170731 |