WO2000021194A1 - Dual-sensor voice transmission system - Google Patents

Dual-sensor voice transmission system Download PDF

Info

Publication number
WO2000021194A1
WO2000021194A1 PCT/US1999/023234 US9923234W WO0021194A1 WO 2000021194 A1 WO2000021194 A1 WO 2000021194A1 US 9923234 W US9923234 W US 9923234W WO 0021194 A1 WO0021194 A1 WO 0021194A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
electrical
level
predetermined
signals
Prior art date
Application number
PCT/US1999/023234
Other languages
French (fr)
Inventor
Steven H. Puthuff
Jon C. Taenzer
Earl R. Geddes
Original Assignee
Resound Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Resound Corporation filed Critical Resound Corporation
Priority to AU62915/99A priority Critical patent/AU6291599A/en
Publication of WO2000021194A1 publication Critical patent/WO2000021194A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00Circuits for transducers, loudspeakers or microphones
    • H04R3/005Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03GCONTROL OF AMPLIFICATION
    • H03G3/00Gain control in amplifiers or frequency changers without distortion of the input signal
    • H03G3/20Automatic control
    • H03G3/30Automatic control in amplifiers having semiconductor devices
    • H03G3/3005Automatic control in amplifiers having semiconductor devices in amplifiers suitable for low-frequencies, e.g. audio amplifiers
    • H03G3/301Automatic control in amplifiers having semiconductor devices in amplifiers suitable for low-frequencies, e.g. audio amplifiers the gain being continuously variable
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03GCONTROL OF AMPLIFICATION
    • H03G3/00Gain control in amplifiers or frequency changers without distortion of the input signal
    • H03G3/20Automatic control
    • H03G3/30Automatic control in amplifiers having semiconductor devices
    • H03G3/32Automatic control in amplifiers having semiconductor devices the control being dependent upon ambient noise level or sound level
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2460/00Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
    • H04R2460/13Hearing devices using bone conduction transducers

Definitions

  • the invention relates to two-way communication devices, and more particularly, to ear level voice pickups used in two-way communications.
  • Existing ear level pickup technology uses a conventional microphone to sense and relay the wearer's voice to a listener at a remote location.
  • the microphone responds to sound signal air pressure disturbances and converts these to a representative electrical signal which is then provided to a transmitter.
  • the transmitter relays the electrical signal, via any appropriate means such as radio waves or telephone lines, to a receiver at a remote location, where the electrical signal is reconverted, using a loud speaker, to an audible sound signal for hearing by the listener at the remote location.
  • a problem afflicting such systems is their susceptibility to degradation by high ambient noise levels present in the environment.
  • the environmental sounds interfere with the talker's voice and, above certain levels, overshadow that voice and frustrate intelligibility at the remote location.
  • these systems are often targeted for wearers whose lines of work necessitate exposure to high ambient noise levels— e.g.
  • a near-field microphone such as a boom microphone
  • sounds that originate at a distance to the microphone can be de-emphasized, such that only sounds originating in the vicinity of the microphone— i.e. , those from the talker's mouth— are picked up for transmission.
  • a near-field microphone such as a boom microphone
  • sounds that originate at a distance to the microphone can be de-emphasized, such that only sounds originating in the vicinity of the microphone— i.e. , those from the talker's mouth— are picked up for transmission.
  • disadvantages to such configurations including the need to place the microphone close to the mouth, thereby detracting from the discreetness and convenience of the device while increasing its size and cost and the number of components used.
  • the user of the microphone must rotate the microphone into place before speaking and must be careful not to knock the headset supporting the microphone out of position or off his head during wear. All these problems detract from the utility of the device.
  • Bone conduction involves conveyance of speech-induced vibrations directly to the ear canal, or, for that matter, to any part of a person's body supported by bone structure. Since the bones provide a direct pathway from the sound source—that is, from the larynx and vocal chords for example— a device which is disposed against suitable bone structures of the talker so as to detect the bone vibrations is able to circumvent a noisy acoustic environment.
  • bone conduction derives from a talker's natural reaction to high ambient noise levels. Under normal circumstances, bone conduction is a poor medium for high frequency, low energy sound transmission, but is an excellent medium for low frequency, high energy transmission. Raising the level of speech, by louder speaking in reaction to a noisy environment, imparts greater energy to the bones—especially information-bearing consonant energy which normally lies at the higher frequencies and to which, all things being equal, air pressure conduction is normally better suited than bone conduction.
  • the talker by raising his or her voice, is disproportionately raising the sound level of the consonant component of speech, to which is attributed 80- 90% of intelligibility, and is thus naturally compensating for the normally consonant-unfriendly nature of bone conduction by imparting greater energy precisely at the sound frequencies at which bone conduction is poor.
  • throat microphones were first introduced in WWII and worn by bomber pilots who needed to communicate in extremely noisy environments. The throat microphones picked up vibrations at the talker's throat and provided a clearer representation of his voice than a sound pressure microphone, which would have picked up the overshadowing ambient acoustic noise as well. Such devices were crude and the quality of their signal poor.
  • a device known as the VoiceducerTM manufactured by TempcoTM , also takes advantage of bone conduction. It is placed in the user's ear canal and operates to pick up vibrations induced therein by the user's speech apparatus as described above.
  • Andrea ElectronicsTM manufactures a device having two conventional air pressure microphones, each oriented differently from the other, and one of which is designed to pick up air pressure vibrations in the ear canal whose source is in fact due to bone vibration.
  • the dual conversion relied upon by this device produces a further degradation of performance.
  • Other prior art bone conduction pickup apparatus include bone vibration sensors disposed directly atop the user's head, inertial microphones, and ceramic microphones operating without diaphragms.
  • the present invention exploits the phenomenon of bone conduction to suppress the deleterious effects of high acoustic noise environments without suffering the degradation of voice pickup quality in low noise environments demonstrated by prior art bone conduction pickup systems.
  • the above-described human speech characteristics are exploited by providing a device which adaptively senses sound through bone conduction when acoustic noise levels are high.
  • bone conduction alone is generally a poor expedient for transfer of intelligible speech, and in an acoustically quiet environment an air pressure microphone is much better suited to picking up a talker's voice. It is only when acoustic noise levels are high that the quality of air pressure microphone detection is degraded enough to make bone conduction a better alternative.
  • the device in accordance with the invention accordingly relies on both bone conduction and air pressure vibration to pick up the talker's voice.
  • the bone conducted speech is detected using a motion transducer comprising a vibration sensor, and specifically, in a preferred embodiment, an accelerometer, although use of other types of vibration sensors, such as displacement and velocity sensors, is also contemplated.
  • a motion transducer comprising a vibration sensor, and specifically, in a preferred embodiment, an accelerometer, although use of other types of vibration sensors, such as displacement and velocity sensors, is also contemplated.
  • sound detection is effected primarily through a conventional microphone, which may comprise, inter alia, a directional microphone, a pressure difference microphone or an omni-directional microphone.
  • An adjustable transition circuit is provided for fading out the conventional microphone signal and fading in the bone vibration sensor signal with an increase in ambient noise levels.
  • the transition circuit in one embodiment, combines the signals from the microphone and vibration sensor, sensing the average amplitude of the microphone signal and inversely adjusting the gain of pre-amplifiers and/or amplifiers connected to the outputs of the vibration sensor and microphone.
  • the gain adjustment can be controlled by sensing the overall noise level detected by the vibration sensor rather than the microphone. Gain adjustment is effected such that the amplitude of the overall output signal is maintained at a smoothly changing level throughout the transition range.
  • the signal from the microphone is amplified linearly up to a predetermined level only, after which the signal level is limited and prevented from further increase.
  • the signal from the vibration sensor initially amplified at a lower gain level than that of the microphone, is not limited and eventually, after the predetermined level, surpasses the limited microphone signal and comes to dominate, thereby improving system intelligibility and increasing immunity to ambient noise effects.
  • Such a configuration can be implemented using available sound processing circuitry and off-the-shelf technology, at reduced production costs and power consumption requirements.
  • An additional advantage of the invention is its particular suitability to voice-activated applications, as the bone conduction expedient improves system immunity to false triggering due to noisy environments.
  • the invention realizes power conservation advantages by for example turning off a radio transmitter in the absence of the user's voice.
  • Yet another advantage of the invention derives from the use of the two modalities of sound transfer—namely, air pressure waves and bone conduction.
  • the two modalities permit the use of signal correlation techniques to enhance the voice signal, thus improving overall system performance.
  • Characteristics of the loudness of speech derived from signals from either or both modalities can be used to control the system of the invention.
  • the amplitude envelope of the bone conducted signal can be used to dynamically vary the gain of the microphone amplifier, so that speech portions of the microphone signal are enhanced while noise portions are suppressed.
  • These characteristics include signal RMS value, peak value, average value, and energy level.
  • the invention finds utility in numerous voice pickup applications, including but not limited to ear level pickups, telephony and dual microphone directional pickup systems. BRIEF DESCRIPTION OF THE DRAWINGS
  • FIG. 1 is a schematic diagram of a dual-sensor device in accordance with the invention.
  • FIG. 2 is a graphical illustration of the signal output of the transducers in one embodiment of the device of FIG. 1;
  • FIG. 3 is a schematic diagram of a preferred embodiment of a dual-sensor device in accordance with the invention.
  • FIG. 4 is a graphical illustration of the signal output of the transducers in another embodiment of the device of FIG. 3;
  • FIG. 5 is a schematic illustration showing an embodiment in accordance with the invention in which selection between the accelerometer and microphone outputs is based on the detected envelopes of the signals;
  • FIG. 6 is a schematic illustration in which phase correlation of the microphone and accelerometer signals is effected in accordance with the invention
  • FIGS. 7 and 8 are graphical illustrations of the use of signal compression and expansion contemplated by exemplary embodiments in accordance with the invention
  • FIG. 9 is a schematic illustration of an embodiment in accordance with the invention in which the detected signals are processed by frequency band components
  • FIG. 10 is a schematic illustration of an embodiment in accordance with the invention in which variable filtering is used.
  • FIG. 11 is a graphical representation of the operation of the circuit of FIG. 10.
  • FIG. 1 shows a schematic implementation of a dual-sensor device 10 in accordance with the invention.
  • the device 10 uses an acoustic sound transducer, such as a microphone 20, and an accelerometer 30 whose outputs are selectively manipulated by a processing circuit 16 to compensate for obfuscation of the talker's voice due to the presence of ambient acoustic noise.
  • the device is held against the skin of the talker's skull and may take the form of an ear level pickup worn behind the ear, or a telephone handset held against the ear during telephonic communication. Other possibilities include placement in a hat band or a face mask worn with, for example, a firefighter's helmet.
  • Microphone 20 may be any conventional acoustic sound transducer such as, e.g. , a standard microphone, directional microphone, omni-directional microphone, or pressure difference/gradient microphone or other microphone.
  • Accelerometer 30 is a transducer device which senses the vibratory displacement of the bones in proximity to the ear as imparted primarily by vibrations during speech.
  • the output of accelerometer 30 is a second derivative representation of bone displacement exhibiting, in the frequency domain, a 12 dB/octave rise which matches the approximately 12 dB/octave roll-off of bone conduction speech signals at the ear. It is contemplated that vibration sensors other than accelerometers may be used, and accordingly, use of the latter is not intended to be limiting.
  • a velocity sensor exhibiting a first derivative rise in the frequency domain of 6 dB/octave can be used preferably with an electronic 6 dB/octave frequency equalization, or similarly, a displacement sensor, may each be used in lieu of accelerometer 30 and also fall within the purview of the invention.
  • Equalization circuits 22, 32 can be designed to provide greater gain at higher frequencies than gain at lower frequencies to thereby decrease the "basiness" of the response, especially of accelerometer 30. It is also known that the bony structure of the head imparts certain resonances to the signal, so that the equalization is also intended to correct for those changes in the signal as well.
  • Variable or adjustable gain pre-amplifiers 24 and 34 are connected to the equalization circuits 22, 32. These pre-amplifiers, along with faders or amplifiers 26 and 36, serve to control the gain of the signals representing the transducer responses.
  • An adder circuit 38 combines the signals from microphone 20 and accelerometer 30 and provides the output signal 40 as the output signal of the pickup device through variable gain amplifier 46.
  • the output from the pre-amplifier 24 is applied to a control circuit 44 comprising for example an amplitude detector which senses the average amplitude of the signal from the microphone 20.
  • a control circuit 44 comprising for example an amplitude detector which senses the average amplitude of the signal from the microphone 20.
  • the control circuit 44 commences a fading effect, decreasing the gain of amplifier 26 while increasing that of amplifier 36.
  • the bone conduction component of the audio signal comprising primarily the talker's speech, is amplified to a greater extent than the ambient noise signal at high ambient noise levels.
  • the microphone signal is permitted to dominate as the output signal.
  • the microphone singal under lower ambient noise levels provides a qualitatively better signal despite a measure of equalization applied to the accelerometer signal by equalization circuit 32 to improve the quality thereof.
  • the value of the threshold will depend on the particular application and may vary anywhere in the range of and beyond 85 - 100 dB SPL (sound pressure level— at 110 dB SPL ambient noise levels, a person is unable to hear his or her own voice), while the transition region is selected to span around 20 dB, although this value is just exemplary, with the actual value being selected based on the particular application. Beyond the cross-over threshold, the signal from the accelerometer 30 will predominate, exceeding the microphone signal by about 20-30 dB, while preceding the threshold, the microphone signal predominates.
  • FIG. 2 A graphical representation of the system signal output is depicted in FIG. 2, which shows the microphone signal dominating the accelerometer signal below a threshold level (shown by the dashed line), while the accelerometer signal dominates above the threshold.
  • the transition region is the region where the transition between these two states occurs.
  • Peak detector 42 is provided to maintain the level of the overall signal constant, operating to adjust the gains of pre-amplifiers 24 and 34 in accordance with the output 40.
  • a dual-sensor device 50 is shown in which gain control is effected only on the output signal of one of the transducers— the microphone 70.
  • Processing circuit 56 uses existing sound processing circuitry, depicted within the dashed region 60, in conjunction with accelerometer 80, equalization circuit 82, pre-amplifier 84, and adder 76 to provide the enhanced voice pickup device of the invention.
  • Circuit 60 comprises equalization circuit 72, AGC (automatic gain control) amplifier circuit 74 having a specific control circuit (not shown) for controlling the gain thereof, and a bandsplit filter device 78 per known multi-band full dynamic range compression sound processing technology used in, e.g. , hearing aid technology and ideally suited for operation at low voltage and current requirements.
  • the dual-sensor 50 operates as follows.
  • the output of microphone 70 is provided to equalization circuit 72 which again serves to flatten the frequency response for improved quality.
  • the signal is then fed to AGC circuit 74, which is designed to have a gain limiting threshold at around 85 dB SPL.
  • Circuit 74 operates as a linear amplifier, limiting amplification beyond the selected threshold, which in this implementation is 85 dB SPL such that the output signal simply will not become louder with an increase of the input signal. It is to be noted that limiting is different from clipping in that clipping, which eliminates the peaks of the signals above a predetermined threshold, consequently introduces undesirable distortions of the sound signal.
  • Signal limiters are well known in the art.
  • the gain of pre-amplifier circuit 84 is not limited, and as ambient noise levels increase, the accelerometer signal, unlike the microphone signal, continues to increase. Although initially set at a lower level than that of the microphone signal, the accelerometer signal eventually surpasses that of the microphone when the ambient noise level reaches a sufficient magnitude because the talker naturally raises his voice in order to hear himself.
  • Adder 76 operates to combine the signals, with output from the microphone predominating at the low ambient noise level and the output from the accelerometer predominating at the high ambient noise level.
  • the accelerometer is exemplary and can be any vibration sensor.
  • the characteristic signal output is graphically represented in FIG. 4. As in the previous embodiment, the microphone signal output dominates at sound levels below the cross-over threshold, while the accelerometer signal dominates at levels above the transition threshold.
  • the dual-sensor 50 is designed with readily available circuitry and off- the-shelf components. It has minimal voltage and current consumption characteristics and is well-suited for miniaturized, in-the-ear applications as contemplated for ear level pickup devices. Inherent advantages afforded thereby include reduced cost and simplicity of design and assembly, as well as enhanced performance provided by the unique application of the two transducers. Also, as noise increases, the ratio of accelerometer signal to microphone signal improves i.e. , noise output becomes progressively a smaller and smaller portion of output giving a progressively better signal-to-noise ratio.
  • FIG. 5 shows an embodiment in accordance with the invention in which selection between the accelerometer and microphone outputs is based on the detected envelopes of the signals.
  • Signals from microphone 92 are provided to pre-amplifier 93 and envelope detector 94, while signals from accelerometer 96 are provided to pre-amplifier 97 and envelope detector 98.
  • a switching circuit 95 selects the more variable of these signals. It should be noted that while the noise envelope is nearly constant, the voice envelope is greatly time-varying as noise increases. Consequently, the microphone signal will vary less but the accelerometer signal will continue to vary. In lower noise, microphone signal will vary more than accelerometer signal due to low-pass nature of bone conduction.
  • the envelope detectors 94 and 98 can have short (- 10 ms) time constants for selection of individual syllables, or long ( ⁇ 1 s) so gradual selection is made as background noise varies. Additionally, the switching circuit 95 can be replaced preferably with a fader circuit to permit gradual rather than abrupt transitions.
  • FIG. 6 alternatively shows phase correlation of the microphone and accelerometer signals in accordance with the invention.
  • Signals from the microphone 100 and accelerometer 130 are applied to correlator 120 via preamplifiers 110 and 140, respectively, after the phase difference of the accelerometer signal is corrected in phase difference correction circuit 150.
  • the phase corrected signals either on an average energy basis, instantaneous signal level basis or in individual multi-frequency bands (via Fast Fourier Transformation implemented by digital signal processing, for example)
  • the correlated components of the signals can be passed to the output, whereas the uncorrelated components can be determined to be noise and prevented from passing to the output.
  • the determination can be made either on a time-basis only or on both a frequency and time-basis simultaneously.
  • AGC amplifier 74 and pre-amp 84 of FIG. 1 In an alternative embodiment, AGC amplifier 74 and pre-amp 84 of FIG.
  • AGC amplifier 74 provides signal compression at a predetermined ratio
  • pre-amp 84 provides signal expansion at a predetermined ratio.
  • Various combinations of amplification characteristics relative to a transition threshold can be used. More particularly, as shown in FIG. 7, using the AGC amplifier 74, the microphone signal can be amplified at a linear, 1 : 1 ratio below the transition threshold and compressed at a N: l ratio (N > 1, e.g., 2.5, 10, etc.) above the transition threshold, while the accelerometer signal, using pre-amplifier 84, is expanded at a 1 :M ratio (M > 1, e.g. , 2 or 1.5, etc.) throughout the operational range.
  • the microphone signal can be amplified at a linear 1:1 ratio below the transition threshold and compressed at a N: l (N > 1) ratio above the transition threshold, while the accelerometer signal is expanded at a 1:M ratio (M > 1) below the transition threshold and linearly amplified at a 1: 1 ratio above the threshold.
  • the amplifiers are designed to produce equal output levels at the transition SPL level.
  • the transition decision, made by the control circuit can be linked to any characteristic of one or both detected signals, including RMS values, peak values, average values, signal energy, correlation and variability, again depending on the particular application contemplated and the physical and economic constraints imposed.
  • the signals from the microphone and vibration sensor can be split into high and low frequency bands and then faded separately.
  • the signal from microphone 160 is amplified by amplifier 162 and then applied to bandsplit filter 164 serving to split the signal into a high frequency band and a low frequency band.
  • the signal from vibration sensor 166 is amplified by amplifier 168 and then applied to bandsplit filter 170 serving to split the signal into high and low frequency bands.
  • Bandsplit filters 164 and 170 preferably have the same critical frequency f c .
  • the signals are then respectively applied to high frequency and low frequency fader circuits 172 and 174, the outputs of which are summed in adder 176 and provided as the system output. Fader circuits 172 and 174 comprise any circuit discussed above designed to implement the fade over functions between the microphone and the vibration sensor.
  • the circuit of FIG. 9 is well-suited to exploit the recognized phenomenon that most background acoustic noise—the drone of machinery, the roar of a blazing fire, etc.— lies at the low frequency end of the noise spectrum.
  • fading is effected earlier— i.e. , at a lower overall noise intensity level-for the low frequency band component than for the high frequency band component simply because the high frequency band contains less energy than the low frequency band.
  • This maintains more of the high frequency intelligibility which passes through the acoustic path to the microphone 160 while still removing much of the excess bass noise due to the earlier fade over in the low frequency band to the cleaner bone conduction signal of vibration sensor 166, which preferably comprises an accelerometer.
  • the detected sound spectrum can be split into more than two bands, each of which is faded at a corresponding intensity level.
  • Digital signal processing techniques readily lend themselves to such an implementation, with Fast Fourier Transform techniques being used in conjunction with suitable algorithms for effecting the fading when the signal intensity at a band exceeds a threshold intensity.
  • the threshold intensity would be determined at the level at which the microphone 160 signal indicates there is so much noise that the voicing is no longer passing through, and the signal in that band would then be faded over to bone conduction from vibration sensor 166.
  • the threshold level would be predetermined and device-specific, depending on the characteristics of the overall system design and the particular application for the device, but would generally be set at about 95 -100 dB SPL noise level.
  • a particular advantage of such a multi-band processing system is realized when the interfering noise is at specific regions in the noise sound spectrum. Such a case occurs from the whine of an electric motor or the whistle of an air conditioning system for example. In such narrow band situations the noise is not dominated by low frequencies, and thus only the bands in which the narrow band interference occurs are affected.
  • the digital signal processing in accordance with the invention permits only the affected band, or bands, to responsively fade over to the bone conduction signal, without fading other, unaffected bands, thus maintaining the microphone path signal for the band signals where the microphone signal is the cleaner signal.
  • Another modification either to the dual band or the multi-band systems described above, relies on the use of variable thresholds in each of the bands, rather than on the fixed threshold described.
  • the fader moves the cross-over point, using known sliding filters, so that the fading successively slides higher and higher in frequency with increased noise sound level intensity.
  • a critical frequency f c control circuit 178 dynamically changes the critical frequencies of sliding high pass filter 180 and sliding low pass filter 182 based on a detected intensity level, in this case the intensity level from the microphone signal from microphone 184.
  • the system is designed with the presumption that the background noise is louder in the low frequency region than in the high frequency region and has a fairly uniform slope to it, and this characteristic is exploited using a fader circuit which, as the noise level increases, moves the frequency cross-over point of the fade commensurately higher.

Abstract

A pickup system (10) utilizes both sound pressure disturbances sensed by a microphone (20) and bone conduction sensed by a vibration sensor (30) such as an accelerometer, velocity sensor or displacement sensor to pick up and faithfully transmit a talker's voice despite acoustically noisy ambient environments. The system senses the microphone signal amplitude and adjusts the gain of associated amplification circuitry (23, 26) to effectively and smoothly fade in the microphone signal and fade out the vibration sensor signal as ambient noise increases. The pickup system (10) may be part of a stand alone ear level communication transmitter worn behind the talker's ear, or it may be implemented in a telephone system or other communication systems.

Description

DUAL-SENSOR VOICE TRANSMISSION SYSTEM
BACKGROUND OF THE INVENTION FIELD OF THE INVENTION
The invention relates to two-way communication devices, and more particularly, to ear level voice pickups used in two-way communications.
DESCRIPTION OF RELATED ART
Existing ear level pickup technology uses a conventional microphone to sense and relay the wearer's voice to a listener at a remote location. The microphone responds to sound signal air pressure disturbances and converts these to a representative electrical signal which is then provided to a transmitter. The transmitter relays the electrical signal, via any appropriate means such as radio waves or telephone lines, to a receiver at a remote location, where the electrical signal is reconverted, using a loud speaker, to an audible sound signal for hearing by the listener at the remote location.
A problem afflicting such systems is their susceptibility to degradation by high ambient noise levels present in the environment. When the pickup wearer is immersed in a noisy environment, the environmental sounds interfere with the talker's voice and, above certain levels, overshadow that voice and frustrate intelligibility at the remote location. As these systems are often targeted for wearers whose lines of work necessitate exposure to high ambient noise levels— e.g. , government field agents such as secret service employees assigned to protect officials and to communicate in the midst of large crowds or during assassination attempts or other crises where panic-struck crowds are particularly noisy or firefighters dispatched to sites of roaring conflagrations, or rescuers beneath a hovering helicopter— it is particularly important that the systems be designed to adequately reject unwanted noise and maintain the requisite functionality permitting clear communication between talker and listener.
One obvious approach to the suppression of ambient noise pickup has been to use a near-field microphone, such as a boom microphone, disposed in close proximity to the talker's mouth. With an appropriate mechanical or electronic microphone configuration, sounds that originate at a distance to the microphone can be de-emphasized, such that only sounds originating in the vicinity of the microphone— i.e. , those from the talker's mouth— are picked up for transmission. However, there are disadvantages to such configurations, including the need to place the microphone close to the mouth, thereby detracting from the discreetness and convenience of the device while increasing its size and cost and the number of components used. Additionally, the user of the microphone must rotate the microphone into place before speaking and must be careful not to knock the headset supporting the microphone out of position or off his head during wear. All these problems detract from the utility of the device.
Another known type of sound pickup device relies on bone conduction. It is recognized that a talker's voice reaches the talker's own ear not only through the normal pathway of air pressure waves, but also through conduction by the jaw and other bones of the skull as induced by vibrations from the talker's sound- generating apparatus during speech. Bone conduction involves conveyance of speech-induced vibrations directly to the ear canal, or, for that matter, to any part of a person's body supported by bone structure. Since the bones provide a direct pathway from the sound source— that is, from the larynx and vocal chords for example— a device which is disposed against suitable bone structures of the talker so as to detect the bone vibrations is able to circumvent a noisy acoustic environment.
Another advantage to bone conduction derives from a talker's natural reaction to high ambient noise levels. Under normal circumstances, bone conduction is a poor medium for high frequency, low energy sound transmission, but is an excellent medium for low frequency, high energy transmission. Raising the level of speech, by louder speaking in reaction to a noisy environment, imparts greater energy to the bones— especially information-bearing consonant energy which normally lies at the higher frequencies and to which, all things being equal, air pressure conduction is normally better suited than bone conduction. The talker, by raising his or her voice, is disproportionately raising the sound level of the consonant component of speech, to which is attributed 80- 90% of intelligibility, and is thus naturally compensating for the normally consonant-unfriendly nature of bone conduction by imparting greater energy precisely at the sound frequencies at which bone conduction is poor.
The prior art has used vibration transducers and even accelerometers to detect bone vibration. To that end, throat microphones were first introduced in WWII and worn by bomber pilots who needed to communicate in extremely noisy environments. The throat microphones picked up vibrations at the talker's throat and provided a clearer representation of his voice than a sound pressure microphone, which would have picked up the overshadowing ambient acoustic noise as well. Such devices were crude and the quality of their signal poor. A device known as the Voiceducer™ , manufactured by Tempco™ , also takes advantage of bone conduction. It is placed in the user's ear canal and operates to pick up vibrations induced therein by the user's speech apparatus as described above. Similarly, Andrea Electronics™ manufactures a device having two conventional air pressure microphones, each oriented differently from the other, and one of which is designed to pick up air pressure vibrations in the ear canal whose source is in fact due to bone vibration. The dual conversion relied upon by this device produces a further degradation of performance. Other prior art bone conduction pickup apparatus include bone vibration sensors disposed directly atop the user's head, inertial microphones, and ceramic microphones operating without diaphragms.
SUMMARY OF THE INVENTION
The present invention exploits the phenomenon of bone conduction to suppress the deleterious effects of high acoustic noise environments without suffering the degradation of voice pickup quality in low noise environments demonstrated by prior art bone conduction pickup systems. The above-described human speech characteristics are exploited by providing a device which adaptively senses sound through bone conduction when acoustic noise levels are high. As discussed above, bone conduction alone is generally a poor expedient for transfer of intelligible speech, and in an acoustically quiet environment an air pressure microphone is much better suited to picking up a talker's voice. It is only when acoustic noise levels are high that the quality of air pressure microphone detection is degraded enough to make bone conduction a better alternative. The device in accordance with the invention accordingly relies on both bone conduction and air pressure vibration to pick up the talker's voice. The bone conducted speech is detected using a motion transducer comprising a vibration sensor, and specifically, in a preferred embodiment, an accelerometer, although use of other types of vibration sensors, such as displacement and velocity sensors, is also contemplated. When ambient noise levels are low, sound detection is effected primarily through a conventional microphone, which may comprise, inter alia, a directional microphone, a pressure difference microphone or an omni-directional microphone. An adjustable transition circuit is provided for fading out the conventional microphone signal and fading in the bone vibration sensor signal with an increase in ambient noise levels.
The transition circuit, in one embodiment, combines the signals from the microphone and vibration sensor, sensing the average amplitude of the microphone signal and inversely adjusting the gain of pre-amplifiers and/or amplifiers connected to the outputs of the vibration sensor and microphone. As the average amplitude of the microphone signal increases relative to a predetermined threshold, due to increased ambient noise levels, the gain of the vibration sensor is increased while that of the microphone is decreased. Alternatively, the gain adjustment can be controlled by sensing the overall noise level detected by the vibration sensor rather than the microphone. Gain adjustment is effected such that the amplitude of the overall output signal is maintained at a smoothly changing level throughout the transition range.
In a second, preferred embodiment, the signal from the microphone is amplified linearly up to a predetermined level only, after which the signal level is limited and prevented from further increase. The signal from the vibration sensor, initially amplified at a lower gain level than that of the microphone, is not limited and eventually, after the predetermined level, surpasses the limited microphone signal and comes to dominate, thereby improving system intelligibility and increasing immunity to ambient noise effects. Such a configuration can be implemented using available sound processing circuitry and off-the-shelf technology, at reduced production costs and power consumption requirements.
An additional advantage of the invention is its particular suitability to voice-activated applications, as the bone conduction expedient improves system immunity to false triggering due to noisy environments. The invention realizes power conservation advantages by for example turning off a radio transmitter in the absence of the user's voice.
Yet another advantage of the invention derives from the use of the two modalities of sound transfer— namely, air pressure waves and bone conduction. The two modalities permit the use of signal correlation techniques to enhance the voice signal, thus improving overall system performance. Characteristics of the loudness of speech derived from signals from either or both modalities can be used to control the system of the invention. For example the amplitude envelope of the bone conducted signal can be used to dynamically vary the gain of the microphone amplifier, so that speech portions of the microphone signal are enhanced while noise portions are suppressed. These characteristics include signal RMS value, peak value, average value, and energy level.
The invention finds utility in numerous voice pickup applications, including but not limited to ear level pickups, telephony and dual microphone directional pickup systems. BRIEF DESCRIPTION OF THE DRAWINGS
Many advantages of the present invention will be apparent to those skilled in the art with a reading of this specification in conjunction with the attached drawings, wherein like reference numerals are applied to like elements and wherein:
FIG. 1 is a schematic diagram of a dual-sensor device in accordance with the invention;
FIG. 2 is a graphical illustration of the signal output of the transducers in one embodiment of the device of FIG. 1;
FIG. 3 is a schematic diagram of a preferred embodiment of a dual-sensor device in accordance with the invention;
FIG. 4 is a graphical illustration of the signal output of the transducers in another embodiment of the device of FIG. 3;
FIG. 5 is a schematic illustration showing an embodiment in accordance with the invention in which selection between the accelerometer and microphone outputs is based on the detected envelopes of the signals;
FIG. 6 is a schematic illustration in which phase correlation of the microphone and accelerometer signals is effected in accordance with the invention; FIGS. 7 and 8 are graphical illustrations of the use of signal compression and expansion contemplated by exemplary embodiments in accordance with the invention;
FIG. 9 is a schematic illustration of an embodiment in accordance with the invention in which the detected signals are processed by frequency band components;
FIG. 10 is a schematic illustration of an embodiment in accordance with the invention in which variable filtering is used; and
FIG. 11 is a graphical representation of the operation of the circuit of FIG. 10.
DESCRIPTION OF THE PREFERRED EMBODIMENTS
FIG. 1 shows a schematic implementation of a dual-sensor device 10 in accordance with the invention. The device 10 uses an acoustic sound transducer, such as a microphone 20, and an accelerometer 30 whose outputs are selectively manipulated by a processing circuit 16 to compensate for obfuscation of the talker's voice due to the presence of ambient acoustic noise. The device is held against the skin of the talker's skull and may take the form of an ear level pickup worn behind the ear, or a telephone handset held against the ear during telephonic communication. Other possibilities include placement in a hat band or a face mask worn with, for example, a firefighter's helmet. Microphone 20 may be any conventional acoustic sound transducer such as, e.g. , a standard microphone, directional microphone, omni-directional microphone, or pressure difference/gradient microphone or other microphone. Accelerometer 30 is a transducer device which senses the vibratory displacement of the bones in proximity to the ear as imparted primarily by vibrations during speech. The output of accelerometer 30 is a second derivative representation of bone displacement exhibiting, in the frequency domain, a 12 dB/octave rise which matches the approximately 12 dB/octave roll-off of bone conduction speech signals at the ear. It is contemplated that vibration sensors other than accelerometers may be used, and accordingly, use of the latter is not intended to be limiting. For instance, a velocity sensor exhibiting a first derivative rise in the frequency domain of 6 dB/octave can be used preferably with an electronic 6 dB/octave frequency equalization, or similarly, a displacement sensor, may each be used in lieu of accelerometer 30 and also fall within the purview of the invention.
Signals from the transducers 20 and 30 are fed through corresponding equalization circuits 22 and 32, which operate to adjust the frequency response of the transducer signals and provide a flatter frequency response. Equalization circuits 22, 32 can be designed to provide greater gain at higher frequencies than gain at lower frequencies to thereby decrease the "basiness" of the response, especially of accelerometer 30. It is also known that the bony structure of the head imparts certain resonances to the signal, so that the equalization is also intended to correct for those changes in the signal as well.
Variable or adjustable gain pre-amplifiers 24 and 34 are connected to the equalization circuits 22, 32. These pre-amplifiers, along with faders or amplifiers 26 and 36, serve to control the gain of the signals representing the transducer responses. An adder circuit 38 combines the signals from microphone 20 and accelerometer 30 and provides the output signal 40 as the output signal of the pickup device through variable gain amplifier 46.
In operation, the output from the pre-amplifier 24 is applied to a control circuit 44 comprising for example an amplitude detector which senses the average amplitude of the signal from the microphone 20. When this average amplitude reaches or exceeds a predetermined threshold with a rise in ambient noise levels, the control circuit 44 commences a fading effect, decreasing the gain of amplifier 26 while increasing that of amplifier 36. In this manner, the bone conduction component of the audio signal, comprising primarily the talker's speech, is amplified to a greater extent than the ambient noise signal at high ambient noise levels. At low ambient noise levels, the microphone signal is permitted to dominate as the output signal. The microphone singal under lower ambient noise levels provides a qualitatively better signal despite a measure of equalization applied to the accelerometer signal by equalization circuit 32 to improve the quality thereof. Typically, the value of the threshold will depend on the particular application and may vary anywhere in the range of and beyond 85 - 100 dB SPL (sound pressure level— at 110 dB SPL ambient noise levels, a person is unable to hear his or her own voice), while the transition region is selected to span around 20 dB, although this value is just exemplary, with the actual value being selected based on the particular application. Beyond the cross-over threshold, the signal from the accelerometer 30 will predominate, exceeding the microphone signal by about 20-30 dB, while preceding the threshold, the microphone signal predominates. A graphical representation of the system signal output is depicted in FIG. 2, which shows the microphone signal dominating the accelerometer signal below a threshold level (shown by the dashed line), while the accelerometer signal dominates above the threshold. The transition region is the region where the transition between these two states occurs. Peak detector 42 is provided to maintain the level of the overall signal constant, operating to adjust the gains of pre-amplifiers 24 and 34 in accordance with the output 40.
In the preferred embodiment in accordance with FIG. 3, a dual-sensor device 50 is shown in which gain control is effected only on the output signal of one of the transducers— the microphone 70. Processing circuit 56 uses existing sound processing circuitry, depicted within the dashed region 60, in conjunction with accelerometer 80, equalization circuit 82, pre-amplifier 84, and adder 76 to provide the enhanced voice pickup device of the invention. Circuit 60 comprises equalization circuit 72, AGC (automatic gain control) amplifier circuit 74 having a specific control circuit (not shown) for controlling the gain thereof, and a bandsplit filter device 78 per known multi-band full dynamic range compression sound processing technology used in, e.g. , hearing aid technology and ideally suited for operation at low voltage and current requirements.
The dual-sensor 50 operates as follows. The output of microphone 70 is provided to equalization circuit 72 which again serves to flatten the frequency response for improved quality. The signal is then fed to AGC circuit 74, which is designed to have a gain limiting threshold at around 85 dB SPL. Circuit 74 operates as a linear amplifier, limiting amplification beyond the selected threshold, which in this implementation is 85 dB SPL such that the output signal simply will not become louder with an increase of the input signal. It is to be noted that limiting is different from clipping in that clipping, which eliminates the peaks of the signals above a predetermined threshold, consequently introduces undesirable distortions of the sound signal. Signal limiters are well known in the art.
By contrast, the gain of pre-amplifier circuit 84 is not limited, and as ambient noise levels increase, the accelerometer signal, unlike the microphone signal, continues to increase. Although initially set at a lower level than that of the microphone signal, the accelerometer signal eventually surpasses that of the microphone when the ambient noise level reaches a sufficient magnitude because the talker naturally raises his voice in order to hear himself. Adder 76 operates to combine the signals, with output from the microphone predominating at the low ambient noise level and the output from the accelerometer predominating at the high ambient noise level. Again, the accelerometer is exemplary and can be any vibration sensor.
The characteristic signal output is graphically represented in FIG. 4. As in the previous embodiment, the microphone signal output dominates at sound levels below the cross-over threshold, while the accelerometer signal dominates at levels above the transition threshold.
The dual-sensor 50 is designed with readily available circuitry and off- the-shelf components. It has minimal voltage and current consumption characteristics and is well-suited for miniaturized, in-the-ear applications as contemplated for ear level pickup devices. Inherent advantages afforded thereby include reduced cost and simplicity of design and assembly, as well as enhanced performance provided by the unique application of the two transducers. Also, as noise increases, the ratio of accelerometer signal to microphone signal improves i.e. , noise output becomes progressively a smaller and smaller portion of output giving a progressively better signal-to-noise ratio.
FIG. 5 shows an embodiment in accordance with the invention in which selection between the accelerometer and microphone outputs is based on the detected envelopes of the signals. Signals from microphone 92 are provided to pre-amplifier 93 and envelope detector 94, while signals from accelerometer 96 are provided to pre-amplifier 97 and envelope detector 98. A switching circuit 95 selects the more variable of these signals. It should be noted that while the noise envelope is nearly constant, the voice envelope is greatly time-varying as noise increases. Consequently, the microphone signal will vary less but the accelerometer signal will continue to vary. In lower noise, microphone signal will vary more than accelerometer signal due to low-pass nature of bone conduction. The envelope detectors 94 and 98 can have short (- 10 ms) time constants for selection of individual syllables, or long (~1 s) so gradual selection is made as background noise varies. Additionally, the switching circuit 95 can be replaced preferably with a fader circuit to permit gradual rather than abrupt transitions.
FIG. 6 alternatively shows phase correlation of the microphone and accelerometer signals in accordance with the invention. Signals from the microphone 100 and accelerometer 130 are applied to correlator 120 via preamplifiers 110 and 140, respectively, after the phase difference of the accelerometer signal is corrected in phase difference correction circuit 150. By thus correlating the phase corrected signals, either on an average energy basis, instantaneous signal level basis or in individual multi-frequency bands (via Fast Fourier Transformation implemented by digital signal processing, for example), the correlated components of the signals can be passed to the output, whereas the uncorrelated components can be determined to be noise and prevented from passing to the output. In other words, the determination can be made either on a time-basis only or on both a frequency and time-basis simultaneously.
In an alternative embodiment, AGC amplifier 74 and pre-amp 84 of FIG.
3 can be configured to provide non-linear amplification, controlled by an appropriate control circuit, such that AGC amplifier 74 provides signal compression at a predetermined ratio, while pre-amp 84 provides signal expansion at a predetermined ratio. Various combinations of amplification characteristics relative to a transition threshold can be used. More particularly, as shown in FIG. 7, using the AGC amplifier 74, the microphone signal can be amplified at a linear, 1 : 1 ratio below the transition threshold and compressed at a N: l ratio (N > 1, e.g., 2.5, 10, etc.) above the transition threshold, while the accelerometer signal, using pre-amplifier 84, is expanded at a 1 :M ratio (M > 1, e.g. , 2 or 1.5, etc.) throughout the operational range. Alternatively, as shown in FIG. 8, the microphone signal can be amplified at a linear 1:1 ratio below the transition threshold and compressed at a N: l (N > 1) ratio above the transition threshold, while the accelerometer signal is expanded at a 1:M ratio (M > 1) below the transition threshold and linearly amplified at a 1: 1 ratio above the threshold. The amplifiers are designed to produce equal output levels at the transition SPL level. Those skilled in the art will recognize that many different combinations of linear amplification, compression and expansion of the detected signals are possible and would depend on the particular application. Moreover, the transition decision, made by the control circuit, can be linked to any characteristic of one or both detected signals, including RMS values, peak values, average values, signal energy, correlation and variability, again depending on the particular application contemplated and the physical and economic constraints imposed.
In another embodiment in accordance with the invention and schematically shown in FIG. 9, the signals from the microphone and vibration sensor can be split into high and low frequency bands and then faded separately. In FIG. 9, the signal from microphone 160 is amplified by amplifier 162 and then applied to bandsplit filter 164 serving to split the signal into a high frequency band and a low frequency band. Similarly, the signal from vibration sensor 166 is amplified by amplifier 168 and then applied to bandsplit filter 170 serving to split the signal into high and low frequency bands. Bandsplit filters 164 and 170 preferably have the same critical frequency fc. The signals are then respectively applied to high frequency and low frequency fader circuits 172 and 174, the outputs of which are summed in adder 176 and provided as the system output. Fader circuits 172 and 174 comprise any circuit discussed above designed to implement the fade over functions between the microphone and the vibration sensor.
The circuit of FIG. 9 is well-suited to exploit the recognized phenomenon that most background acoustic noise— the drone of machinery, the roar of a blazing fire, etc.— lies at the low frequency end of the noise spectrum. By splitting the input signal into low and high frequency band components, fading is effected earlier— i.e. , at a lower overall noise intensity level-for the low frequency band component than for the high frequency band component simply because the high frequency band contains less energy than the low frequency band. This maintains more of the high frequency intelligibility which passes through the acoustic path to the microphone 160 while still removing much of the excess bass noise due to the earlier fade over in the low frequency band to the cleaner bone conduction signal of vibration sensor 166, which preferably comprises an accelerometer.
The detected sound spectrum can be split into more than two bands, each of which is faded at a corresponding intensity level. Digital signal processing techniques readily lend themselves to such an implementation, with Fast Fourier Transform techniques being used in conjunction with suitable algorithms for effecting the fading when the signal intensity at a band exceeds a threshold intensity. The threshold intensity would be determined at the level at which the microphone 160 signal indicates there is so much noise that the voicing is no longer passing through, and the signal in that band would then be faded over to bone conduction from vibration sensor 166. The threshold level would be predetermined and device-specific, depending on the characteristics of the overall system design and the particular application for the device, but would generally be set at about 95 -100 dB SPL noise level.
A particular advantage of such a multi-band processing system is realized when the interfering noise is at specific regions in the noise sound spectrum. Such a case occurs from the whine of an electric motor or the whistle of an air conditioning system for example. In such narrow band situations the noise is not dominated by low frequencies, and thus only the bands in which the narrow band interference occurs are affected. The digital signal processing in accordance with the invention permits only the affected band, or bands, to responsively fade over to the bone conduction signal, without fading other, unaffected bands, thus maintaining the microphone path signal for the band signals where the microphone signal is the cleaner signal. Another modification, either to the dual band or the multi-band systems described above, relies on the use of variable thresholds in each of the bands, rather than on the fixed threshold described. In each band, the fader moves the cross-over point, using known sliding filters, so that the fading successively slides higher and higher in frequency with increased noise sound level intensity. A similar implementation is shown schematically in FIG. 10, in which a critical frequency fc control circuit 178 dynamically changes the critical frequencies of sliding high pass filter 180 and sliding low pass filter 182 based on a detected intensity level, in this case the intensity level from the microphone signal from microphone 184. As the background level increases, the system is designed with the presumption that the background noise is louder in the low frequency region than in the high frequency region and has a fairly uniform slope to it, and this characteristic is exploited using a fader circuit which, as the noise level increases, moves the frequency cross-over point of the fade commensurately higher. In other words, at low levels of background noise where the fader circuit begins to operate, only the lowest frequencies are faded over. As the noise level increases, fading gradually begins at the higher frequencies, and with further increases in the noise level the fading operation slides up in frequency, always passing the microphone signal at the highest frequencies and the bone conduction signal at the lowest frequencies until crossover region frequency exceeds the audio band. The operation of such a system is represented graphically in FIG. 11. The signal from microphone 184 is amplified by amplifier 188, while the signal from vibration sensor 186 is amplified by amplifier 190. The signal outputs from sliding filters 180 and 182 are combined in adder 192 to generate the circuit output. The above are exemplary modes of carrying out the invention and are not intended to be limiting. It will be apparent to those skilled in the art that modifications thereto can be made without departure from the spirit and scope of the invention as set forth by the following claims.

Claims

WHAT IS CLAIMED IS:
1. A pickup device for converting a sound signal to an electrical output signal, the pickup device comprising:
an acoustic sound transducer for generating a first electrical signal representative of the sound signal;
a vibration sensor for generating a second electrical signal representative of the sound signal; and
a processing circuit for generating the electrical output signal from the first and second electrical signals.
2. The device of Claim 1, wherein the vibration sensor comprises an accelerometer.
3. The device of Claim 2, wherein the accelerometer exhibits a second derivative rise of about 12 dB/octave.
4. The device of Claim 1, wherein the vibration sensor comprises a velocity sensor.
5. The device of Claim 4, wherein the velocity sensor exhibits a first derivative rise of about 6 dB/octave.
6. The device of Claim 1, wherein the vibration sensor comprises a displacement sensor.
7. The device of Claim 1, wherein the sound transducer comprises a microphone.
8. The device of Claim 1, wherein the processing circuit comprises:
a first amplifier adapted to amplify the first electrical signal to thereby generate a first signal output;
a second amplifier adapted to amplify the second electrical signal to thereby generate a second signal output; and
a control circuit for controlling the gain of at least one of the first and second amplifiers based on a detected characteristic of at least one of the first and second electrical signals.
9. The device of Claim 8, wherein the detected characteristic is average amplitude and wherein the control circuit comprises an average amplitude detector for detecting the average amplitude of at least one of the first and second electrical signals and adjusting the gain of at least one of the first and second amplifiers in accordance with said detected average amplitude.
10. The device of Claim 8, wherein the detected characteristic is signal peak value and wherein the control circuit comprises a peak value detector for detecting the peak value of at least one of the first and second electrical signals and adjusting the gain of at least one of the first and second amplifiers in accordance with said detected peak value.
11. The device of Claim 8, wherein the detected characteristic is signal RMS value and wherein the control circuit comprises a signal RMS value detector for detecting the RMS value of at least one of the first and second electrical signals and adjusting the gain of at least one of the first and second amplifiers in accordance with said detected RMS value.
12. The device of Claim 8, wherein the detected characteristic is signal energy and wherein the control circuit comprises a signal energy detector for detecting the energy of at least one of the first and second electrical signals and adjusting the gain of at least one of the first and second amplifiers in accordance with said detected energy.
13. The device of Claim 8, wherein the control circuit is adapted, over a cross-over ambient noise level range, to decrease the gain of the first amplifier and increase the gain of the second amplifier.
14. The device of Claim 13, wherein said increase and decrease are effected when a predetermined threshold is reached.
15. The device of Claim 14, wherein the first signal output is at a level which exceeds a level of the second signal output below said cross-over range and wherein said second signal output is at a level which exceeds the level of the first signal output above said cross-over range.
16. The device of Claim 15, wherein the cross-over range spans about 20 dB.
17. The device of Claim 8, wherein the processing circuit comprises:
first and second adjustable gain pre-amplifiers respectively providing the first and second electrical signals to the first and second amplifiers; and
a peak detector for maintaining the electrical output signal below a predetermined level, the peak detector adjusting the gain of the first and second pre-amplifiers in accordance with a detected peak of the electrical output signal.
18. The device of Claim 17, wherein the peak detector effects said maintaining by comparing the electrical output signal with a predetermined peak threshold.
19. The device of Claim 1, wherein the processing circuit comprises first and second equalization circuits for respectively equalizing frequency responses of the sound transducer and the vibration sensor.
20. The device of Claim 8, wherein the processing circuit comprises an adder for combining the first and second signal outputs to thereby generate said electrical output signal.
21. The device of Claim 8, wherein the control circuit is adapted, over a cross-over ambient noise level range, to decrease the gain of the first amplifier and increase the gain of the second amplifier when a predetermined average amplitude threshold is reached, said average amplitude threshold being in the range of about 85-100 SPL.
22. The device of Claim 1, wherein the processing circuit comprises:
a first amplifier adapted to amplify the first electrical signal to thereby generate a first signal output, the first amplifier having a predetermined gain limit; a second amplifier adapted to amplify the second electrical signal to thereby generate a second signal output; and
an adder for combining the first and second signal outputs to thereby generate a combined signal.
23. The device of Claim 22, wherein the first and second electrical signals are respectively provided to the first and second amplifiers by first and second equalization circuits adapted to respectively equalize frequency responses of the sound transducer and the vibration sensor.
24. The device of Claim 23, wherein the combined signal is provided to a bandsplit filter.
25. The device of Claim 24, wherein the first amplifier, the equalization circuit and the bandsplit filter are components of a unitary sound processing device.
26. The device of Claim 21, wherein the predetermined gain limit represents a selected ambient noise level.
27. The device of Claim 26, wherein the selected ambient noise level lies within a predetermined transition range containing 85 dB SPL.
28. The device of Claim 27, wherein, below the predetermined transition range, the first signal output exceeds the second signal output, and wherein, above the predetermined transition range, the second signal output exceeds the first signal output.
29. The device of Claim 1, wherein the processing circuit comprises:
a first envelope detector adapted to detect an envelope of the first electrical signal;
a second envelope detector adapted to detect an envelope of the second electrical signal; and
a switching circuit for actively selecting the first and second electrical signals based on outputs of the first and second envelope detectors.
30. The device of Claim 29 wherein the switching circuit comprises a fader.
31. The device of Claim 29, wherein the first and second envelope detectors have time constants of about 10 ms.
32. The device of Claim 29, wherein the first and second envelope detectors have time constants of about 1 s.
33. The device of Claim 1, wherein the processing circuit comprises:
a phase difference correction circuit for correcting a phase difference between the first and second electrical signals to thereby convert the first and second electrical signal into phase corrected signals; and
a correlator circuit for correlating the phase corrected signals.
34. The device of Claim 33, wherein the correlator circuit correlates the phase corrected signals based on the average energy of the phase corrected signals.
35. The device of Claim 33, wherein the correlator circuit correlates the phase corrected signals based on instantaneous signal levels of the phase corrected signals.
36. The device of Claim 33, wherein the correlator circuit correlates the phase corrected signals based on individual frequency bands of the phase corrected signals.
37. The device of Claim 1, wherein the processing circuit comprises:
means for splitting the first electrical signal into a plurality of first frequency band components;
means for splitting the second electrical signal into a plurality of second frequency band components each corresponding to an associated first frequency band component;
means for selectively fading between first frequency band components and corresponding second frequency band components; and
an adder for combining the faded frequency band components to thereby generate the electrical output signal.
38. The device of Claim 1, wherein the processing circuit comprises:
first and second sliding filters for respectively filtering the first and second electrical signals, the first and second sliding filters each having a controllable critical frequency fc;
a critical frequency controller for controlling the critical frequency fc of each of the first and second sliding filters based on detected sound signal intensity; and
an adder for combining the filtered first and second electrical signals to thereby generate the electrical output signal.
39. The device of Claim 38, wherein the first sliding filter comprises a sliding highpass filter and the second sliding filter comprises a sliding lowpass filter.
40. The device of Claim 39, wherein the detected sound signal intensity is derived from the first electrical signal.
41. The device of Claim 39, wherein the detected sound signal intensity is derived from the second electrical signal.
42. A method for generating an electrical output signal representative of a talker's voice, said method comprising:
generating a first electrical signal responsive to air pressure disturbances caused by the voice and ambient acoustic noise;
generating a second electrical signal responsive to bone-conducted sounds caused by the voice; and
combining the first and second electrical signals to thereby generate the electrical output signal.
43. The method of Claim 42, further comprising the steps of:
detecting a predetermined characteristic of at least one of the first and second electrical signals;
amplifying the first and second electrical signals at corresponding first and second amplification gain levels to respectively generate first and second signal outputs; and
selectively controlling the first and second amplification gain levels in accordance with the predetermined characteristic.
44. The method of Claim 43, wherein the predetermined characteristic is the average amplitude of at least one of the first and second electrical signals and wherein the selectively controlling step comprises:
comparing the detected average amplitude with a predetermined amplitude threshold representative of an ambient noise level falling within a predetermined transition range; and limiting the first amplification gain level to thereby maintain the level of the first signal output above the level of the second signal output when the ambient noise level is below said predetermined cross-over range, and to maintain the level of the second signal output above the level of the first signal output when the ambient noise level is above said predetermined transition range.
45. The device of Claim 44, wherein the ambient noise level is 85 dB SPL and wherein the cross-over range spans 20 dB.
46. The method of Claim 43, wherein the predetermined characteristic is the RMS value of at least one of the first and second electrical signals, and wherein the selectively controlling step comprises:
comparing the detected RMS value with a predetermined threshold representative of an ambient noise level falling within a predetermined transition range; and
adjusting the first and second amplification gain levels to thereby maintain the level of the first signal output above the level of the second signal output when the ambient noise level is below said predetermined cross-over range, and to maintain the level of the second signal output above the level of the first signal output when the ambient noise level is above said predetermined transition range.
47. The method of Claim 43, wherein the predetermined characteristic is the peak value of at least one of the first and second electrical signals, and wherein the selectively controlling step comprises:
comparing the detected peak value with a predetermined amplitude threshold representative of an ambient noise level falling within a predetermined transition range; and
adjusting the first and second amplification gain levels to thereby maintain the level of the first signal output above the level of the second signal output when the ambient noise level is below said predetermined cross-over range, and to maintain the level of the second signal output above the level of the first signal output when the ambient noise level is above said predetermined transition range.
48. The method of Claim 43, wherein the predetermined characteristic is the energy of at least one of the first and second electrical signals, and wherein the selectively controlling step comprises:
comparing the detected energy with a predetermined amplitude threshold representative of an ambient noise level falling within a predetermined transition range; and
adjusting the first and second amplification gain levels to thereby maintain the level of the first signal output above the level of the second signal output when the ambient noise level is below said predetermined cross-over range, and to maintain the level of the second signal output above the level of the first signal output when the ambient noise level is above said predetermined transition range.
49. The method of Claim 42, wherein the step of generating a first electrical signal is effected using a microphone.
50. The method of Claim 42, wherein the step of generating a second electrical signal is effected using an accelerometer.
51. The method of Claim 42, wherein the step of generating a second electrical signal is effected using a velocity.
52. The method of Claim 42, wherein the step of generating a second electrical signal is effected using a displacement sensor.
53. The method of Claim 43, further comprising the steps of:
generating first and second equalized signals respectively representing the first and second electrical signals; providing the first and second equalized signals to corresponding first and second adjustable gain pre-amplifiers;
detecting a peak level of the electrical output signal; and
adjusting the gain of the first and second adjustable gain pre- amplifiers to maintain the electrical output signal below a predetermined level.
54. The method of Claim 42, wherein said step of combining comprises using an adder to generate a combined signal.
55. The method of Claim 54, wherein the combined signal is provided to a bandsplit filter.
56. The method of Claim 49, wherein the microphone is a telephone microphone.
57. The method of Claim 49, wherein the microphone is a hearing aid microphone.
58. The method of Claim 42, wherein the first and second electrical signals respectively have first and second characteristic envelopes, the step of combining comprising actively selecting between the first and second signals based on the characteristic envelopes.
59. The method of Claim 58, wherein the step of actively selecting is effected using a fader.
60. The method of Claim 58, wherein the step of combining comprises use of envelope detectors having time constants of about 10 ms.
61. The method of Claim 58, wherein the step of combining comprises use of envelope detectors having time constants of about 1 s.
62. The method of Claim 42, wherein the step of combining comprises:
correcting a phase difference between the first and second electrical signals; and
correlating the phase difference corrected first and second electrical signals.
63. The method of Claim 62, wherein the step of correlating is based on the average energy of the phase corrected signals.
64. The method of Claim 62, wherein the step of correlating is based on instantaneous signal levels of the phase corrected signals.
65. The method of Claim 62, wherein the step of correlating is based on individual frequency bands of the phase corrected signals.
66. The method of Claim 42, wherein the step of combining comprises:
splitting the first electrical signal into a plurality of first frequency band components;
splitting the second electrical signal into a plurality of second frequency band components each corresponding to an associated first frequency band component; and
selectively fading between the first frequency band components and corresponding second frequency band components.
67. The method of Claim 42, further comprising the steps of: detecting noise intensity; and
dynamically filtering the first and second electrical signals based on the detected noise intensity.
68. The method of Claim 42, wherein the step of detecting comprises detecting the intensity of the first electrical signal, and wherein the step of dynamically filtering comprises applying the first electrical signal to a sliding low pass filter and applying the second electrical signal to a sliding high pass filter.
69. The method of Claim 42, wherein the step of detecting comprises detecting the intensity of the second electrical signal, and wherein the step of dynamically filtering comprises applying the first electrical signal to a sliding low pass filter and applying the second electrical signal to a sliding high pass filter.
PCT/US1999/023234 1998-10-08 1999-10-05 Dual-sensor voice transmission system WO2000021194A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU62915/99A AU6291599A (en) 1998-10-08 1999-10-05 Dual-sensor voice transmission system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US16810198A 1998-10-08 1998-10-08
US09/168,101 1998-10-08

Publications (1)

Publication Number Publication Date
WO2000021194A1 true WO2000021194A1 (en) 2000-04-13

Family

ID=22610131

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1999/023234 WO2000021194A1 (en) 1998-10-08 1999-10-05 Dual-sensor voice transmission system

Country Status (2)

Country Link
AU (1) AU6291599A (en)
WO (1) WO2000021194A1 (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7120477B2 (en) 1999-11-22 2006-10-10 Microsoft Corporation Personal mobile computing device having antenna microphone and speech detection for improved speech recognition
WO2006117032A1 (en) * 2005-04-29 2006-11-09 Harman Becker Automotive Systems Gmbh Detection and surpression of wind noise in microphone signals
US7283850B2 (en) 2004-10-12 2007-10-16 Microsoft Corporation Method and apparatus for multi-sensory speech enhancement on a mobile device
US7406303B2 (en) 2005-07-05 2008-07-29 Microsoft Corporation Multi-sensory speech enhancement using synthesized sensor signal
US7680656B2 (en) 2005-06-28 2010-03-16 Microsoft Corporation Multi-sensory speech enhancement using a speech-state model
WO2010133812A1 (en) * 2009-05-20 2010-11-25 Elno Société Nouvelle Acoustic device
US7930178B2 (en) 2005-12-23 2011-04-19 Microsoft Corporation Speech modeling and enhancement based on magnitude-normalized spectra
EP2460156A1 (en) * 2009-07-29 2012-06-06 BYD Company Limited Method and device for eliminating background noise
EP2518724A1 (en) * 2011-04-26 2012-10-31 Parrot Microphone/headphone audio headset comprising a means for suppressing noise in a speech signal, in particular for a hands-free telephone system
EP2945399A1 (en) * 2014-05-16 2015-11-18 Parrot Audio headset with active noise control anc with prevention of the effects of saturation of a microphone signal feedback
EP3163572A1 (en) * 2015-10-29 2017-05-03 BlackBerry Limited Method and device for supressing ambient noise in a speech signal generated at a microphone of the device
EP2393309B1 (en) * 2010-06-07 2019-10-09 Oticon Medical A/S Device and method for applying a vibration signal to a human skull bone
EP3188508B1 (en) 2015-12-30 2020-03-11 GN Hearing A/S Method and device for streaming communication between hearing devices
US20220132245A1 (en) * 2020-10-23 2022-04-28 Knowles Electronics, Llc Wearable audio device having improved output

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4442323A (en) * 1980-07-19 1984-04-10 Pioneer Electronic Corporation Microphone with vibration cancellation
US5363452A (en) * 1992-05-19 1994-11-08 Shure Brothers, Inc. Microphone for use in a vibrating environment
US5692059A (en) * 1995-02-24 1997-11-25 Kruger; Frederick M. Two active element in-the-ear microphone system

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4442323A (en) * 1980-07-19 1984-04-10 Pioneer Electronic Corporation Microphone with vibration cancellation
US5363452A (en) * 1992-05-19 1994-11-08 Shure Brothers, Inc. Microphone for use in a vibrating environment
US5692059A (en) * 1995-02-24 1997-11-25 Kruger; Frederick M. Two active element in-the-ear microphone system

Cited By (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7120477B2 (en) 1999-11-22 2006-10-10 Microsoft Corporation Personal mobile computing device having antenna microphone and speech detection for improved speech recognition
US7283850B2 (en) 2004-10-12 2007-10-16 Microsoft Corporation Method and apparatus for multi-sensory speech enhancement on a mobile device
US8194881B2 (en) 2005-04-29 2012-06-05 Nuance Communications, Inc. Detection and suppression of wind noise in microphone signals
EP1732352A1 (en) * 2005-04-29 2006-12-13 Harman Becker Automotive Systems GmbH Detection and suppression of wind noise in microphone signals
WO2006117032A1 (en) * 2005-04-29 2006-11-09 Harman Becker Automotive Systems Gmbh Detection and surpression of wind noise in microphone signals
US7680656B2 (en) 2005-06-28 2010-03-16 Microsoft Corporation Multi-sensory speech enhancement using a speech-state model
US7406303B2 (en) 2005-07-05 2008-07-29 Microsoft Corporation Multi-sensory speech enhancement using synthesized sensor signal
US7930178B2 (en) 2005-12-23 2011-04-19 Microsoft Corporation Speech modeling and enhancement based on magnitude-normalized spectra
WO2010133812A1 (en) * 2009-05-20 2010-11-25 Elno Société Nouvelle Acoustic device
FR2945904A1 (en) * 2009-05-20 2010-11-26 Elno Soc Nouvelle ACOUSTIC DEVICE
EP2460156A4 (en) * 2009-07-29 2012-12-26 Byd Co Ltd Method and device for eliminating background noise
EP2460156A1 (en) * 2009-07-29 2012-06-06 BYD Company Limited Method and device for eliminating background noise
EP2393309B1 (en) * 2010-06-07 2019-10-09 Oticon Medical A/S Device and method for applying a vibration signal to a human skull bone
FR2974655A1 (en) * 2011-04-26 2012-11-02 Parrot MICRO / HELMET AUDIO COMBINATION COMPRISING MEANS FOR DEBRISING A NEARBY SPEECH SIGNAL, IN PARTICULAR FOR A HANDS-FREE TELEPHONY SYSTEM.
US8751224B2 (en) 2011-04-26 2014-06-10 Parrot Combined microphone and earphone audio headset having means for denoising a near speech signal, in particular for a “hands-free” telephony system
EP2518724A1 (en) * 2011-04-26 2012-10-31 Parrot Microphone/headphone audio headset comprising a means for suppressing noise in a speech signal, in particular for a hands-free telephone system
CN105100990A (en) * 2014-05-16 2015-11-25 鹦鹉股份有限公司 Audio headset with active noise control ANC with prevention of effects of saturation of microphone signal feedback
FR3021180A1 (en) * 2014-05-16 2015-11-20 Parrot AUDIO ACTIVE ANC CONTROL AUDIO HELMET WITH PREVENTION OF THE EFFECTS OF A SATURATION OF THE MICROPHONE SIGNAL "FEEDBACK"
US9466281B2 (en) 2014-05-16 2016-10-11 Parrot ANC noise active control audio headset with prevention of the effects of a saturation of the feedback microphone signal
EP2945399A1 (en) * 2014-05-16 2015-11-18 Parrot Audio headset with active noise control anc with prevention of the effects of saturation of a microphone signal feedback
EP3163572A1 (en) * 2015-10-29 2017-05-03 BlackBerry Limited Method and device for supressing ambient noise in a speech signal generated at a microphone of the device
EP3188508B1 (en) 2015-12-30 2020-03-11 GN Hearing A/S Method and device for streaming communication between hearing devices
EP3188508B2 (en) 2015-12-30 2024-01-10 GN Advanced Hearing Protection A/S Method and device for streaming communication between hearing devices
US20220132245A1 (en) * 2020-10-23 2022-04-28 Knowles Electronics, Llc Wearable audio device having improved output
US11337000B1 (en) * 2020-10-23 2022-05-17 Knowles Electronics, Llc Wearable audio device having improved output
US20220240010A1 (en) * 2020-10-23 2022-07-28 Knowles Electronics, Llc Wearable audio device having improved output
US11659328B2 (en) * 2020-10-23 2023-05-23 Knowles Electronics, Llc Wearable audio device having improved output

Also Published As

Publication number Publication date
AU6291599A (en) 2000-04-26

Similar Documents

Publication Publication Date Title
US10327071B2 (en) Head-wearable hearing device
CN103959813B (en) Earhole Wearable sound collection device, signal handling equipment and sound collection method
US5937070A (en) Noise cancelling systems
US9508335B2 (en) Active noise control and customized audio system
Hamacher et al. Signal processing in high-end hearing aids: State of the art, challenges, and future trends
CA2268918C (en) Compression systems for hearing aids
US8019103B2 (en) Hearing aid with suppression of wind noise
US5396560A (en) Hearing aid incorporating a novelty filter
EP3566469B1 (en) Speech intelligibility enhancing system
EP1111960B1 (en) Digital hearing device and method
CA2197661A1 (en) Directional ear device with adaptive bandwidth and gain control
WO2000021194A1 (en) Dual-sensor voice transmission system
JP2010513987A (en) Near-field vector signal amplification
CN111757231A (en) Hearing device with active noise control based on wind noise
CN215935067U (en) Wearable audio equipment
JP2002125298A (en) Microphone device and earphone microphone device
JPH0396199A (en) Noise reduction headphone
JP2005520367A (en) Hearing aid and audio signal processing method
AU2004202682B2 (en) Method for Operating a Hearing Aid Device and Hearing Aid Device with a Microphone System in which Different Directional Characteristics can be Set
WO2009081184A1 (en) Noise cancellation system and method with adjustment of high pass filter cut-off frequency
JP4214607B2 (en) Microphone device
EP4198976B1 (en) Wind noise suppression system
WO2000001198A1 (en) System for reducing the effects of acoustically noisy environments on detected sound signals
CN113645533A (en) Active noise reduction earphone and noise reduction method thereof
CN117295000A (en) Hearing aid comprising an active occlusion removal system

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AU CA JP

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
122 Ep: pct application non-entry in european phase