US4532648A - Speech recognition system for an automotive vehicle - Google Patents

Speech recognition system for an automotive vehicle

Info

Publication number
US4532648A
Authority
US
United States
Prior art keywords
signal
spoken instruction
spoken
engine
level
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US06/428,230
Inventor
Kazunori Noso
Norimasa Kishi
Toru Futami
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nissan Motor Co Ltd
AT&T Corp
Original Assignee
Nissan Motor Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nissan Motor Co Ltd
Assigned to NISSAN MOTOR COMPANY, LIMITED reassignment NISSAN MOTOR COMPANY, LIMITED ASSIGNMENT OF ASSIGNORS INTEREST. Assignors: FUTAMI, TORU, KISHI, NORIMASA, NOSO, KAZUNORI
Assigned to AT & T TECHNOLOGIES, INC., reassignment AT & T TECHNOLOGIES, INC., CHANGE OF NAME (SEE DOCUMENT FOR DETAILS). EFFECTIVE JAN. 3,1984 Assignors: WESTERN ELECTRIC COMPANY, INCORPORATED
Application granted
Publication of US4532648A
Anticipated expiration
Expired - Lifetime

Classifications

    • BPERFORMING OPERATIONS; TRANSPORTING
    • B60VEHICLES IN GENERAL
    • B60RVEHICLES, VEHICLE FITTINGS, OR VEHICLE PARTS, NOT OTHERWISE PROVIDED FOR
    • B60R16/00Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for
    • B60R16/02Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for electric constitutive elements
    • B60R16/037Electric or fluid circuits specially adapted for vehicles and not otherwise provided for; Arrangement of elements of electric or fluid circuits specially adapted for vehicles and not otherwise provided for electric constitutive elements for occupant comfort, e.g. for automatic adjustment of appliances according to personal settings, e.g. seats, mirrors, steering wheel
    • B60R16/0373Voice control
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/87Detection of discrete points within a voice signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
    • G10L2015/228Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context

Definitions

  • the present invention relates generally to a speech recognition system for an automotive vehicle, and more particularly to a speech recognition system by which a driver's spoken instructions can be reliably recorded or recognized even when engine noise increases within the passenger compartment after the vehicle engine begins to operate.
  • the speech recognizer is used in a relatively quiet environment; however, the speech recognition system for an automotive vehicle is usually used within a relatively noisy passenger compartment and additionally the noise fluctuates intensely therewithin. Therefore, one of the major problems is how to cope with erroneous spoken phrase recordings or recognitions caused by fluctuating engine noise within the passenger compartment.
  • a voice detector in the speech recognizer by which the start and the end of a spoken instruction are determined by detecting whether the magnitude of a spoken instruction signal exceeds a predetermined reference threshold voltage level for a predetermined period of time or whether the magnitude of the spoken instruction signal drops below the predetermined reference threshold voltage level for another predetermined period of time, respectively.
  • In the prior-art speech recognizer, since the reference threshold voltage level is fixed, when the noise signal level is high, for instance, when the vehicle is running and the noise level therefore exceeds the reference threshold voltage level for a long time, there exists a problem in that the voice detector can erroneously consider this state to represent the beginning of a spoken instruction. In other words, the prior-art speech recognizer is prone to erroneous recognition due to intense noise within the passenger compartment.
  • the primary object of the present invention is to provide a speech recognition system for an automotive vehicle which can record or recognize spoken instruction phrases reliably even when engine noise increases within the passenger compartment.
  • the gain of an amplifier in the speech input section or the threshold level of the voice detector section is so switched as to reduce the sensitivity to spoken instructions.
  • the driver must necessarily utter a spoken instruction in a louder voice and thus the proportion of noise level to spoken instruction signal level is reduced.
  • the speech recognition system for an automotive vehicle comprises an engine operation detector or a speed sensor, an analog switch, two amplifiers having different gain factors respectively or two multipliers having different multiplication ratio respectively, etc., in addition to or in place of the elements or sections of the conventional speech recognizer.
  • FIG. 1 is a schematic block diagram of a typical prior-art speech recognizer for assistance in explaining the operations thereof;
  • FIG. 2 is a schematic block diagram of a detailed portion of the voice detecting means of the prior-art speech recognizer shown in FIG. 1;
  • FIG. 3(A) is a graphical representation of the waveforms of a spoken instruction signal including noise as measured at point (A) in FIG. 2;
  • FIG. 3(B) is a graphical representation of the waveforms of the spoken instruction signal including noise and a reference threshold voltage level as measured at point (B) in FIG. 2;
  • FIG. 3(C) is a graphical representation of the waveform of the spoken instruction pulse signal as measured at point (C) in FIG. 2;
  • FIG. 3(D) is a graphical representation of the waveform of the spoken instruction start/end signal as measured at point (D) in FIG. 2;
  • FIG. 4 is a schematic block diagram of a first embodiment of the speech recognition system according to the present invention, in which only an essential portion of the system is shown together with an engine operation detector and in which two amplifiers are switched in the speech input section;
  • FIG. 5 is a schematic block diagram of a second embodiment of the speech recognition system according to the present invention, in which only an essential portion of the system is shown and in which two feedback resistors are switched in the speech input section;
  • FIG. 6 is a schematic block diagram of a third embodiment of the speech recognition system according to the present invention, in which only an essential portion of the system is shown together with an engine operation detector and in which two multipliers are switched in the speech input section.
  • FIG. 1 shows a schematic block diagram of a typical speech recognizer 100.
  • To use the speech recognizer, the user must first record a plurality of predetermined spoken instructions. Specifically, in this spoken instruction recording mode (reference mode), the user first depresses a record switch 1 disposed near the user. When the record switch 1 is depressed, a switch input interface 4 detects the depression of the record switch 1 and outputs a signal to a controller 5 via a wire 4a. In response to this signal, the controller 5 outputs a recording mode command signal to other sections in order to preset the entire speech recognizer to the recording mode.
  • In the spoken instruction recording mode, when the user says a phrase to be used as a spoken instruction, such as "open doors", near a microphone 2, the spoken phrase is transduced into a corresponding electric signal through the microphone 2, amplified through a speech input interface 6 consisting mainly of a spectrum-normalizing amplifier, smoothed through a root-mean-square (RMS) smoother 15 including a rectifier and a smoother, and finally inputted to a voice detector 7.
  • the spectrum-normalizing amplifier amplifies the input at different gain levels at different frequencies, so as to adjust the naturally frequency-dependent power spectrum of human speech to a more nearly flat power spectrum.
  • This voice detector 7 detects whether or not the magnitude of the spoken phrase signal exceeds a predetermined level for a predetermined period of time (150 to 250 ms) in order to recognize the start of the spoken phrase input signal and whether or not the magnitude of the signal drops below a predetermined level for a predetermined period of time (about 300 ms) in order to recognize the end of the signal. Upon detection of the start of the signal, this voice detector 7 outputs another recording mode command signal to the controller 5.
  • the controller 5 activates a group of bandpass filters 8, so that the spoken phrase signal from the microphone 2 is divided into a number of predetermined frequency bands.
  • the frequency-divided spoken phrase signals are squared or rectified therein in order to derive the voice power spectrum across the frequency bands and then converted into corresponding digital time-series matrix-phonetic pattern data (explained later).
  • These data are then stored in a memory unit 10.
  • Since the speech recognizer is set to the spoken instruction recording mode by the depression of the record switch 1, the time-series matrix-phonetic pattern data are transferred to a reference pattern memory unit 11 and stored therein as reference data for use in recognizing the speech instructions.
  • the user can input speech instructions, such as "open doors", to the speech recognizer through the microphone 2 while depressing a recognition switch 3.
  • the switch input interface 4 detects the depression of the recognition switch 3 and outputs a signal to the controller 5 via a wire 4b.
  • the controller 5 outputs a recognition mode command signal to other sections in order to preset the entire speech recognizer to the recognition mode.
  • In the spoken phrase recognition mode, when the user says an instruction phrase similar to the one recorded previously near the microphone 2 and when the voice detector 7 outputs a signal, the spoken instruction is transduced into a corresponding electric signal through the microphone 2, amplified through the speech input interface 6, filtered and divided into voice power spectra across the frequency bands through the bandpass filters 8, squared or rectified and further converted into corresponding digital time-series matrix-phonetic pattern data through the parameter extraction section 9, and then stored in the memory unit 10, in the same manner as in the recording mode.
  • the time-series matrix-phonetic pattern data stored in the memory unit 10 in the recognition mode are sequentially compared with the time-series matrix-phonetic pattern data stored in the reference pattern memory unit 11 in the recording mode by a resemblance comparator 12.
  • the resemblance comparator 12 calculates the level of correlation of the inputted speech instruction to the reference speech instruction after time normalization and level normalization to compensate for variable speaking rate (because the same person might speak quickly and loudly at one time but slowly and in a whisper at some other time).
  • the correlation factor is usually obtained by calculating the Tchebycheff distance (explained later) between recognition-mode time-series matrix-phonetic pattern data and recording-mode time-series matrix-phonetic pattern data.
  • the correlation factor calculated by the resemblance comparator 12 is next given to a resemblance determination section 13 to determine whether or not the calculated values lie within a predetermined range, that is, to evaluate this cross-correlation. If within the range, a command signal, indicating that a recognition-mode spoken instruction has adequate resemblance to one of the recorded instruction phrases, is outputted to one of actuators 14 in order to open the vehicle doors, for instance.
  • the above-mentioned operations are all executed in accordance with command signals outputted from the controller 5.
  • the speech recognizer 100 comprises various discrete elements or sections; however, it is of course possible to embody the speech recognizer 100 with a microcomputer including a central processing unit, a read-only memory, a random-access memory, a clock oscillator, etc.
  • the voice detector 7, the parameter extraction section 9, the memory 10, the reference pattern memory 11, the resemblance comparator 12 and the resemblance determination section 13 can all be incorporated within the microcomputer, executing the same or similar processes, calculations and/or operations as explained hereinabove.
  • the digital time-series matrix-phonetic pattern data and the Tchebycheff distance are defined as follows:
  • the digital recording-mode time-series matrix-phonetic pattern data can be expressed as ##EQU1## where A designates a first recording-mode speech instruction (reference) (e.g. OPEN DOORS), i denotes the filter index, and j denotes the time-series increment index.
  • FIG. 2 shows in more detail the speech detection section of the voice detecting means of the prior-art speech recognizer shown in FIG. 1, which is closely relevant to the present invention.
  • a spoken phrase inputted via a microphone and transduced into a corresponding electric signal (100) first passes through the speech input interface 6.
  • the interface 6 is mainly made up of a spectrum-normalizing amplifier by which the electric signal is amplified to a greater degree at higher frequencies. This is because speech sounds tend to be attenuated greatly in the higher frequency range.
  • the waveform of the spoken instruction signal (200) including noise outputted from the spectrum-normalizing amplifier 6 may appear as shown in FIG. 3(A).
  • the amplified spoken instruction signal (200) is next applied to the bandpass filters 8 to begin the process of recognizing whether the signal is a correctly spoken instruction and to the RMS smoother 15, consisting mainly of a rectifier 15-1 and a smoother 15-2, to begin the process of detecting the start and end of the spoken phrase.
  • the rectified and smoothed spoken instruction signal (400) may appear as shown in FIG. 3(B), in which T f denotes a constant reference threshold voltage level.
  • the smoothed signal (400) is then conducted to the voice detector 7 including a voltage level comparator 7-1 and a pulse duration comparator 7-2.
  • the voltage level comparator 7-1 compares the voltage level of the smoothed signal with the predetermined reference threshold voltage level T f and outputs a H-voltage level pulse signal (600) only while the voltage level of the speech instruction signal exceeds the reference threshold level T f as depicted in FIG. 3(C).
  • the pulse duration comparator 7-2 compares the pulse width of the H-voltage level pulse signal (600) with a predetermined reference spoken instruction start time t s and the pulse width of the L-voltage level pulse signal (600) with another predetermined reference end time t e and outputs a H-voltage level signal (700) only when the H-voltage level pulse width exceeds the reference start time t s and a L-voltage level signal (700) only when the L-voltage level pulse width exceeds the reference end time t e .
  • the pulse duration comparator 7-2 outputs no H-voltage level signal.
  • the pulse width of the second H-voltage level pulse signal is labeled t 2
  • the pulse duration comparator 7-2 outputs a H-voltage level signal, indicating the start of a spoken instruction.
  • the H-voltage level start signal (700) from the pulse duration comparator 7-2 is delayed by the reference start time t s after the actual start time P s of the spoken instruction. Thereafter, this H-voltage level start signal is outputted until the duration comparator 7-2 detects the end of speech instruction.
  • the pulse duration comparator 7-2 outputs no L-voltage level signal, that is, the duration comparator 7-2 sustains the H-voltage level signal.
  • the pulse duration comparator 7-2 outputs a L-voltage level signal, indicating the end of speech instruction.
  • the L-voltage level end signal from the duration comparator 7-2 is delayed by the reference end time t e after the actual end time P e of speech instruction. Thereafter, the end signal is outputted until the duration comparator 7-2 detects the start of another speech instruction.
  • In response to the H-voltage level signal from the duration comparator 7-2 as shown in FIG. 3(D), the controller 5 outputs a command signal to activate a group of bandpass filters 8 and other sections to recognize the spoken instruction signal outputted from the spectrum-normalizing amplifier 6, as already explained.
  • the speech recognizer cannot cope well with the fluctuations of noise level within the passenger compartment, with the result that accurate detection of speech instruction start and end is compromised, so that noise may be interpreted as attempts at speech and/or spoken instructions may be ignored.
  • FIG. 4 is a block diagram showing a first embodiment of the present invention.
  • the gain factor of the speech input interface is adjusted according to the engine operation; that is, the gain factor is reduced while the engine is operating.
  • As in the conventional speech recognizer 100, there are provided a record switch 1, a microphone 2, a recognition switch 3, a switch input interface 4, and a controller 5.
  • first and second amplifiers 61 and 62 having first and second gains G 1 and G 2 in the speech input interface 6 for amplifying and outputting the spoken instruction signal from the microphone 2, the first gain G 1 being determined to be higher than the second gain G 2 .
  • the outputs of the amplifiers 61 and 62 are inputted to an analog switch 63.
  • the fixed contact a of this analog switch 63 is switched to the contact b as shown in FIG. 4 when the engine stops operating, but switched to the contact c when the engine is operating. Therefore, when the engine stops, the gain factor for the spoken instruction signal is high; when the engine is running, the gain factor for the spoken instruction signal is switched into being lower.
  • an engine operation detector 20, an ignition relay 16 having a relay contact closed when the ignition switch is turned on, and an alternator 17, in order to detect the engine condition and to switch the analog switch 63.
  • the engine operation detector 20 detects that the engine is operating and outputs a signal to the controller 5 when the ignition relay 16 is energized and also the alternator 17 outputs a signal. In response to this signal, the controller 5 sets the analog switch 63 to the amplifier 62 side.
  • the engine operation detector 20 detects that the engine stops and outputs no signal to the controller 5. In response to this, the controller 5 sets the analog switch 63 to the amplifier 61 side as shown.
  • the reason why the alternator output signal is given to the engine operation detector 20 is that the alternator output is indicative of the engine operation condition because the alternator outputs a signal when the ignition switch is set to the starter position and therefore the engine starts rotating.
  • the controller 5 sets the analog switch 63 to the amplifier 61 side. Therefore, the spoken instruction signal from the microphone 2 is amplified on the basis of the first gain G 1 preset in the first amplifier 61.
  • the controller 5 switches the analog switch 63 to the second amplifier 62 side. Therefore, the spoken instruction signal from the microphone 2 is amplified on the basis of the second gain G 2 lower than the first gain G 1 preset in the first amplifier 61.
  • Since the second gain G 2 is relatively low as compared with the first gain G 1 , in order to obtain a spoken instruction signal exceeding a predetermined level necessary for recording or recognizing, the driver must necessarily utter a spoken instruction toward the microphone 2 in a louder voice as compared with the case where the engine is stopped. Therefore, even if the noise level within the passenger compartment is high due to engine operation, the level of a spoken instruction naturally becomes high in comparison with the rise in noise level. As a result, the proportion of noise level to spoken instruction signal level is reduced, thus improving the recording or recognition rate of a spoken instruction in the speech recognition system.
  • FIG. 5 is a block diagram showing a second embodiment of the present invention, in which a feedback resistor for the amplifier is switched according to the engine operation; that is, the gain factor is reduced while the engine is operating.
  • the engine operation detector 20 detects that the engine stops and outputs no signal to the controller 5.
  • the analog switch 63 is set to the first resistor side R 1 , as shown by a solid line in FIG. 5.
  • the larger the feedback resistor the higher the gain of the amplifier. Since the first resistor R 1 is predetermined to be larger than the second resistor R 2 , the gain factor G of the amplifier 60 becomes high when the resistor R 1 is connected between input and output terminals thereof. Further, in this case, the gain factor is determined as a function of the input resistor Ro and the feedback resistor R 1 or R 2 .
  • Since the analog switch 63 is set to the first resistor side R 1 , the spoken instruction signal from the microphone 2 is amplified on the basis of a higher gain.
  • the inputted spoken instruction phrase can be recorded or recognized reliably.
  • When the controller 5 sets the analog switch 63 to the second resistor side R 2 , which is smaller than R 1 , the spoken instruction is amplified on the basis of a lower gain.
  • Since the driver must necessarily utter a spoken instruction in a louder voice, even if the noise level is high due to engine operation, the inputted spoken instruction phrase can be recorded or recognized reliably.
  • FIG. 6 is a block diagram showing a third embodiment of the present invention, in which the reference threshold level of the voltage level comparator 7-1 in the voice detector is switched according to the engine operation, that is, the multiplication ratio is increased when the engine is operating.
  • As in the conventional speech recognizer 100, there are provided a record switch 1, a microphone 2, a recognition switch 3, a switch input interface 4, a controller, an RMS smoother (a first smoother), a voice detector etc.
  • In addition to these elements, there are provided a second smoother 72, first and second multipliers 73a and 73b, an analog switch 74 and a holding circuit 75.
  • the voice detector 7 for detecting the start point and the end point of a spoken instruction signal from the speech input interface 6 and for outputting a start command signal and an end command signal comprises a rectifier 15-1 for rectifying the spoken instruction signal, a first smoother 15-2 for smoothing the output signal from the rectifier 15-1 at a time constant of 10 to 20 milliseconds and for outputting a DC voltage roughly corresponding to the spoken instruction, a second smoother 72 for smoothing the output signal from the rectifier 15-1 at a time constant of about one to second and for outputting a DC voltage roughly corresponding to noise included in the spoken signal, first and second multipliers 73a and 73b for multiplying the output signal from the second smoother 72 at multiplication ratios of K 1 and K 2 (K 1 < K 2 ), an analog switch 74 for selecting the output from the multipliers 73a and 73b in response to the signal
  • the analog switch 74 is switched to the first multiplier 73a side when the engine operation detector 20 detects that the engine is stopped, as shown by a solid line, and to the second multiplier 73b side when the engine operation detector 20 detects that the engine is operating, as shown by a broken line in FIG. 6. Further, the holding circuit 75 first outputs the signal obtained from the multiplier 73a or 73b via the analog switch 74 to the level comparator 7-1 as the reference signal e r ; however, once the pulse duration comparator 7-2 outputs a start signal e s , since the signal from the multiplier 73a or 73b is held in response to this start signal e s (i.e. holding signal), the holding circuit keeps outputting the held signal as the reference signal e r to the level comparator 7-1, until an end signal e e is outputted to the holding circuit 75.
  • the reason why such a holding circuit 75 as described above is additionally provided is as follows: unless there is provided the holding circuit 75, when the reference end threshold level increases, the smoothed signal (400) drops below the threshold level before the end of spoken instruction, thus resulting in an erroneous spoken instruction end detection. In other words, since the time constant of the second smoother 72 is larger than that of the first smoother 15-2, the reference threshold level increases gradually with a time delay as the smoothed spoken instruction signal (400) increases gradually; that is, the timing of two signals does not match.
  • the engine operation detector 20 detects that the engine is stopped, and therefore the analog switch 74 is set to the first multiplier 73a side via the controller 5.
  • the spoken instruction transduced into an electric signal through the microphone 2 is rectified through the rectifier 15-1 after amplification by the speech input interface 6.
  • the first smoother 15-2 outputs a DC voltage corresponding to the power component of the spoken instruction signal.
  • the second smoother 72 applies a DC voltage proportional to the power level of background noise included in the spoken instruction signal to the multipliers 73a and 73b.
  • the output signal from the second smoother 72 is multiplied by K 1 in the first multiplier 73a, and is given to the level comparator 7-1 via the analog switch 74 and the holding circuit 75 as the reference signal e r .
  • the level comparator 7-1 outputs a H-level output signal when the output of the first smoother 15-2 exceeds the reference signal e r .
  • When this H-level output signal is sustained, for instance, for about 150 milliseconds, a start signal e s for spoken instruction recognition or recording is applied to the controller 5; the spoken instruction signal branched from the output of the speech input interface 6 is inputted to the bandpass filters 8; the spoken instruction is recorded or recognized by the same circuit sections as in the prior-art system. Further, when the pulse duration comparator 7-2 outputs the start signal e s , the output signal of the first multiplier 73a is held by the holding circuit 75 and the reference signal e r for the level comparator 7-1 is fixed. Next, when the input of the spoken instruction has been completed, the output of the level comparator 7-1 returns to a L-level.
  • the pulse duration comparator 7-2 outputs an end signal e e to the controller 5.
  • the controller 5 determines that the input of the spoken instruction has been completed and controls the entire system so as to begin to process the recording or recognition of the spoken instruction.
  • the analog switch 74 is set to the second multiplier 73b side. Therefore, the DC signal corresponding to background noise power level included in the spoken instruction and given from the second smoother 72 is multiplied at a multiplication ratio K 2 greater than K 1 and is applied to the level comparator 7-1 as a reference signal e r .
  • Since the level of the reference signal e r is adjusted to a higher level than that obtained when the engine is stopped, the level comparator 7-1 generates a H-level output signal only when a spoken instruction exceeds this reference signal e r , that is, only when a relatively loud spoken instruction is inputted. Also, a loud voice makes clear the features of voice parameters. Therefore, in the state where the engine is operating, that is, where the vehicle is running and therefore noise within the passenger compartment is high, unless a relatively loud spoken instruction is uttered toward the microphone 2, no recognition or recording is made. If a spoken instruction having high energy is inputted, even when the ambient noise level is high, the proportion of noise component to the spoken instruction signal is sufficiently reduced, so that it is possible to record or recognize the spoken instruction more reliably.
  • the gain of the speech input interface or the multiplication ratio in the voice detector is switched between two discrete states: one in which the engine is stopped and one in which the engine is operating.
  • the speech recognition system comprises various discrete elements or sections; however, it is of course possible to embody the system with a microcomputer including a central processing unit, a read-only memory, a random-access memory, a clock oscillator, etc.
  • the engine operation detector, the second smoother, the first and second multipliers, the analog switch, the holding circuit, etc. can all be incorporated within the microcomputer, executing the same or similar processes, calculations and/or operations as explained hereinabove.
  • the microcomputer also executes various operations necessary for the speech recognizer in accordance with appropriate software stored in the read-only memory.
  • the amplifier gain in the speech input interface or the threshold level in the voice detector is adjusted so that the sensitivity to spoken instructions is reduced while the engine is operating. The driver must therefore utter a spoken instruction in a louder voice, which reduces the proportion of noise level to spoken instruction signal level. As a result, even if the noise level within the passenger compartment rises sharply, the reliability of recording or recognition of a spoken instruction in the speech recognition system is improved.
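
The voice detection of the third embodiment can be illustrated with a minimal Python sketch. It is not the patented circuit: it assumes the two smoother outputs are already available as per-frame lists, and the frame length, the multiplication ratios `k_stopped` and `k_running`, and the function name `detect_instructions` are hypothetical values chosen only for illustration. The 150 ms start time and 300 ms end time follow the figures quoted in the text.

```python
def detect_instructions(levels, noise, engine_running,
                        frame_ms=10, k_stopped=2.0, k_running=4.0,
                        start_ms=150, end_ms=300):
    """Return (start_frame, end_frame) pairs, one per detected instruction.

    levels: output of the fast (first) smoother, one value per frame.
    noise:  output of the slow (second) smoother, one value per frame.
    k_stopped / k_running stand in for K1 / K2 (assumed values, K1 < K2).
    """
    k = k_running if engine_running else k_stopped  # role of analog switch 74
    detected = []
    in_speech = False
    held_ref = None  # role of holding circuit 75
    run = 0          # consecutive frames counted by the duration comparator
    start = None
    for i, (level, n) in enumerate(zip(levels, noise)):
        # While an instruction is in progress the reference stays frozen.
        ref = held_ref if in_speech else k * n
        above = level > ref  # role of level comparator 7-1
        if not in_speech:
            run = run + 1 if above else 0
            if run * frame_ms >= start_ms:
                in_speech, held_ref, start, run = True, ref, i, 0
        else:
            run = run + 1 if not above else 0
            if run * frame_ms >= end_ms:
                detected.append((start, i))
                in_speech, held_ref, run = False, None, 0
    return detected


noise = [1.0] * 90
loud = [1.0] * 20 + [5.0] * 30 + [1.0] * 40
quiet = [1.0] * 20 + [3.0] * 30 + [1.0] * 40
print(detect_instructions(loud, noise, engine_running=True))
print(detect_instructions(quiet, noise, engine_running=True))
print(detect_instructions(quiet, noise, engine_running=False))
```

With these assumed numbers the quiet utterance is detected only when the engine is stopped, while the loud one is detected in both states. The reported start and end frames lag the true boundaries by the start and end times, as the text describes for the pulse duration comparator.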

Abstract

A speech recognition system for an automotive vehicle which can record or recognize spoken instruction phrases reliably even when engine noise rises within the passenger compartment. When the engine begins to operate, the gain of an amplifier of a speech input section or the threshold level of a voice detector is so switched as to reduce the sensitivity to spoken instructions. As a result, the driver must necessarily utter a spoken instruction in a louder voice and thus the proportion of noise level to spoken instruction signal level is reduced. The system according to the present invention comprises engine operation detecting means, an analog switch, two amplifiers or two multipliers, etc. in addition to the conventional speech recognizer.

Description

BACKGROUND OF THE INVENTION
1. Field of the Invention
The present invention relates generally to a speech recognition system for an automotive vehicle, and more particularly to a speech recognition system by which a driver's spoken instructions can be reliably recorded or recognized even when engine noise increases within the passenger compartment after the vehicle engine begins to operate.
2. Description of the Prior Art
There is a well-known speech recognizer which can activate various actuators in response to human spoken instructions. When this speech recognizer is mounted on a vehicle, the headlight, for instance, can be turned on or off in response to spoken instructions such as "Headlight on" or "Headlight off". Such a speech recognizer usually can recognize various spoken instructions in order to control various actuators; however, there are some problems involved in applying this system to an automotive vehicle.
Usually, the speech recognizer is used in a relatively quiet environment; however, the speech recognition system for an automotive vehicle is usually used within a relatively noisy passenger compartment and additionally the noise fluctuates intensely therewithin. Therefore, one of the major problems is how to cope with erroneous spoken phrase recordings or recognitions caused by fluctuating engine noise within the passenger compartment.
In the prior-art speech recognizer, a spoken instruction signal including noise is always amplified at a constant gain factor. Therefore, when the noise level within the passenger compartment increases, especially when the engine begins to operate and engine noise is inputted to the speech recognizer at random, the noise mixed with the spoken instruction signal at a relatively high ratio is amplified together with the spoken instruction signal. As a result, the spoken instruction cannot be recognized reliably, or is recognized erroneously so that a wrong vehicle device actuator is operated.
Furthermore, in order to distinguish a spoken instruction from noise, conventionally there is provided a voice detector in the speech recognizer, by which the start and the end of a spoken instruction are determined by detecting whether the magnitude of a spoken instruction signal exceeds a predetermined reference threshold voltage level for a predetermined period of time or whether the magnitude of the spoken instruction signal drops below the predetermined reference threshold voltage level for another predetermined period of time, respectively.
In the prior-art speech recognizer, however, since the reference threshold voltage level is fixed, when noise signal level is high, for instance, when the vehicle is running and therefore the noise level exceeds the reference threshold voltage level for a long time, there exists a problem in that the voice detector can erroneously consider this state to represent the beginning of a spoken instruction. In other words, the prior-art speech recognizer is prone to erroneous recognition due to intense noise within the passenger compartment.
A more detailed description of a typical prior-art speech recognizer and a typical prior-art voice detector will be made with reference to the attached drawings in conjunction with the present invention under DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS.
SUMMARY OF THE INVENTION
With these problems in mind, therefore, it is the primary object of the present invention to provide a speech recognition system for an automotive vehicle which can record or recognize spoken instruction phrases reliably even when engine noise increases within the passenger compartment. In more detail, when the engine begins to operate, the gain of an amplifier in the speech input section or the threshold level of the voice detector section is so switched as to reduce the sensitivity to spoken instructions. As a result, the driver must necessarily utter a spoken instruction in a louder voice and thus the proportion of noise level to spoken instruction signal level is reduced.
To achieve the above mentioned object, the speech recognition system for an automotive vehicle according to the present invention comprises an engine operation detector or a speed sensor, an analog switch, two amplifiers having different gain factors respectively or two multipliers having different multiplication ratio respectively, etc., in addition to or in place of the elements or sections of the conventional speech recognizer.
BRIEF DESCRIPTION OF THE DRAWINGS
The features and advantages of the speech recognition system for an automotive vehicle according to the present invention will be more clearly appreciated from the following description taken in conjunction with the accompanying drawings in which like reference numerals designate corresponding elements or sections throughout the drawings and in which:
FIG. 1 is a schematic block diagram of a typical prior-art speech recognizer for assistance in explaining the operations thereof;
FIG. 2 is a schematic block diagram of a detailed portion of the voice detecting means of the prior-art speech recognizer shown in FIG. 1;
FIG. 3(A) is a graphical representation of the waveforms of a spoken instruction signal including noise as measured at point (A) in FIG. 2;
FIG. 3(B) is a graphical representation of the waveforms of the spoken instruction signal including noise and a reference threshold voltage level as measured at point (B) in FIG. 2;
FIG. 3(C) is a graphical representation of the waveform of the spoken instruction pulse signal as measured at point (C) in FIG. 2;
FIG. 3(D) is a graphical representation of the waveform of the spoken instruction start/end signal as measured at point (D) in FIG. 2;
FIG. 4 is a schematic block diagram of a first embodiment of the speech recognition system according to the present invention, in which only an essential portion of the system is shown together with an engine operation detector and in which two amplifiers are switched in the speech input section;
FIG. 5 is a schematic block diagram of a second embodiment of the speech recognition system according to the present invention, in which only an essential portion of the system is shown and in which two feedback resistors are switched in the speech input section; and
FIG. 6 is a schematic block diagram of a third embodiment of the speech recognition system according to the present invention, in which only an essential portion of the system is shown together with an engine operation detector and in which two multipliers are switched in the voice detector.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS
To facilitate understanding of the present invention, a brief reference will be made to the principle or operation of a typical prior-art speech recognizer, with reference to FIG. 1.
FIG. 1 shows a schematic block diagram of a typical speech recognizer 100. To use the speech recognizer, the user must first record a plurality of predetermined spoken instructions. Specifically, in this spoken instruction recording mode (reference mode), the user first depresses a record switch 1 disposed near the user. When the record switch 1 is depressed, a switch input interface 4 detects the depression of the record switch 1 and outputs a signal to a controller 5 via a wire 4a. In response to this signal, the controller 5 outputs a recording mode command signal to other sections in order to preset the entire speech recognizer to the recording mode. In the spoken instruction recording mode, when the user says a phrase to be used as a spoken instruction, such as "open doors", near a microphone 2, the spoken phrase is transduced into a corresponding electric signal through the microphone 2, amplified through a speech input interface 6 consisting mainly of a spectrum-normalizing amplifier, smoothed through a root-mean-square (RMS) smoother 15 including a rectifier and a smoother, and finally inputted to a voice detector 7.
The spectrum-normalizing amplifier amplifies the input at different gain levels at different frequencies, so as to adjust the naturally frequency-dependent power spectrum of human speech to a more nearly flat power spectrum. This voice detector 7 detects whether or not the magnitude of the spoken phrase signal exceeds a predetermined level for a predetermined period of time (150 to 250 ms) in order to recognize the start of the spoken phrase input signal and whether or not the magnitude of the signal drops below a predetermined level for a predetermined period of time (about 300 ms) in order to recognize the end of the signal. Upon detection of the start of the signal, this voice detector 7 outputs another recording mode command signal to the controller 5. In response to this command signal, the controller 5 activates a group of bandpass filters 8, so that the spoken phrase signal from the microphone 2 is divided into a number of predetermined frequency bands. Given to a parameter extraction section 9, the frequency-divided spoken phrase signals are squared or rectified therein in order to derive the voice power spectrum across the frequency bands and then converted into corresponding digital time-series matrix-phonetic pattern data (explained later). These data are then stored in a memory unit 10. In this case, however, since the speech recognizer is set to the spoken instruction recording mode by the depression of the record switch 1, the time-series matrix-phonetic pattern data are transferred to a reference pattern memory unit 11 and stored therein as reference data for use in recognizing the speech instructions.
After having recorded the reference spoken instructions, the user can input speech instructions, such as "open doors", to the speech recognizer through the microphone 2 while depressing a recognition switch 3.
When this recognition switch 3 is depressed, the switch input interface 4 detects the depression of the recognition switch 3 and outputs a signal to the controller 5 via a wire 4b. In response to this signal, the controller 5 outputs a recognition mode command signal to other sections in order to preset the entire speech recognizer to the recognition mode. In this spoken phrase recognition mode, when the user says an instruction phrase similar to the one recorded previously near the microphone 2 and when the voice detector 7 outputs a signal, the spoken instruction is transduced into a corresponding electric signal through the microphone 2, amplified through the speech input interface 6, filtered and divided into voice power spectra across the frequency bands through the bandpass filters 8, squared or rectified and further converted into corresponding digital time-series matrix-phonetic pattern data through the parameter extraction section 9, and then stored in the memory unit 10, in the same manner as in the recording mode.
Next, the time-series matrix-phonetic pattern data stored in the memory unit 10 in the recognition mode are sequentially compared with the time-series matrix-phonetic pattern data stored in the reference pattern memory unit 11 in the recording mode by a resemblance comparator 12. The resemblance comparator 12 calculates the level of correlation of the inputted speech instruction to the reference speech instruction after time normalization and level normalization to compensate for variable speaking rate and loudness (because the same person might speak quickly and loudly at one time but slowly and in a whisper at some other time). The correlation factor is usually obtained by calculating the Tchebycheff distance (explained later) between recognition-mode time-series matrix-phonetic pattern data and recording-mode time-series matrix-phonetic pattern data. The correlation factor calculated by the resemblance comparator 12 is next given to a resemblance determination section 13 to determine whether or not the calculated values lie within a predetermined range, that is, to evaluate this cross-correlation. If within the range, a command signal, indicating that a recognition-mode spoken instruction has adequate resemblance to one of the recorded instruction phrases, is outputted to one of actuators 14 in order to open the vehicle doors, for instance. The above-mentioned operations are all executed in accordance with command signals outputted from the controller 5.
Description has been made hereinabove of the case where the speech recognizer 100 comprises various discrete elements or sections; however, it is of course possible to embody the speech recognizer 100 with a microcomputer including a central processing unit, a read-only memory, a random-access memory, a clock oscillator, etc. In this case, the voice detector 7, the parameter extraction section 9, the memory 10, the reference pattern memory 11, the resemblance comparator 12 and the resemblance determination section 13 can all be incorporated within the microcomputer, executing the same or similar processes, calculations and/or operations as explained hereinabove.
The digital time-series matrix-phonetic pattern data and the Tchebycheff distance are defined as follows:
In the case where the number of the bandpass filters is four and the number of time-series increments for each is 32, the digital recording-mode time-series matrix-phonetic pattern data can be expressed as ##EQU1## where A designates a first recording-mode speech instruction (reference) (e.g. OPEN DOORS), i denotes the filter index, and j denotes the time-series increment index.
If a first recognition-mode speech instruction (e.g. OPEN DOORS) is denoted by the character "B", the Tchebycheff distance can be obtained from the following expression: ##EQU2##
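The equation images for the pattern matrix and the distance are not reproduced in this text, so the following is only an illustrative sketch. It assumes that each pattern is a matrix of 4 filter bands by 32 time increments and that the distance is the sum of absolute element-wise differences between the two matrices; the actual expression in the patent may differ. The names `pattern_distance`, `best_match` and `max_distance` are hypothetical.

```python
from typing import Dict, List, Optional

# A pattern is a matrix: one row per bandpass filter, one column per time increment.
Pattern = List[List[float]]


def pattern_distance(a: Pattern, b: Pattern) -> float:
    """Sum of absolute element-wise differences (ASSUMED form of the distance)."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("patterns must have the same shape")
    return sum(abs(x - y) for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def best_match(candidate: Pattern,
               references: Dict[str, Pattern],
               max_distance: float) -> Optional[str]:
    """Return the name of the closest reference pattern, or None when even
    the closest one lies outside the accepted range (hypothetical rule)."""
    best_name: Optional[str] = None
    best_value: Optional[float] = None
    for name, reference in references.items():
        value = pattern_distance(candidate, reference)
        if best_value is None or value < best_value:
            best_name, best_value = name, value
    if best_value is not None and best_value <= max_distance:
        return best_name
    return None
```

Under this reading, a smaller distance means a closer resemblance, and the range check plays the role of the resemblance determination section 13.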
FIG. 2 shows in more detail the speech detection section of the voice detecting means of the prior-art speech recognizer shown in FIG. 1, which is closely relevant to the present invention.
In the figure, a spoken phrase inputted via a microphone and transduced into a corresponding electric signal (100) first passes through the speech input interface 6. The interface 6 is mainly made up of a spectrum-normalizing amplifier by which the electric signal is amplified to a greater degree at higher frequencies. This is because speech sounds tend to be attenuated greatly in the higher frequency range. The waveform of the spoken instruction signal (200) including noise outputted from the spectrum-normalizing amplifier 6 may appear as shown in FIG. 3(A).
The amplified spoken instruction signal (200) is next applied to the bandpass filters 8 to begin the process of recognizing whether the signal is a correctly spoken instruction and to the RMS smoother 15, consisting mainly of a rectifier 15-1 and a smoother 15-2, to begin the process of detecting the start and end of the spoken phrase. The rectified and smoothed spoken instruction signal (400) may appear as shown in FIG. 3(B), in which Tf denotes a constant reference threshold voltage level.
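The rectify-and-smooth stage can be sketched in discrete time as follows. This is a minimal illustration, assuming a full-wave rectifier followed by a first-order low-pass filter; the sample rate, the time constant and the function name `rectify_and_smooth` are illustrative assumptions and are not taken from the circuit of FIG. 2.

```python
import math
from typing import List, Sequence


def rectify_and_smooth(samples: Sequence[float],
                       sample_rate_hz: float,
                       time_constant_s: float) -> List[float]:
    """Full-wave rectify the signal, then smooth it with a first-order
    low-pass filter (an assumed stand-in for rectifier 15-1 and smoother 15-2)."""
    if sample_rate_hz <= 0 or time_constant_s <= 0:
        raise ValueError("sample rate and time constant must be positive")
    # Per-sample smoothing coefficient of a first-order RC low-pass filter.
    alpha = 1.0 - math.exp(-1.0 / (sample_rate_hz * time_constant_s))
    level = 0.0
    smoothed = []
    for sample in samples:
        level += alpha * (abs(sample) - level)  # abs() is the rectifier
        smoothed.append(level)
    return smoothed
```

A shorter time constant lets the output follow the speech envelope more closely; a longer one averages over more of the signal.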
The smoothed signal (400) is then conducted to the voice detector 7 including a voltage level comparator 7-1 and a pulse duration comparator 7-2. The voltage level comparator 7-1 compares the voltage level of the smoothed signal with the predetermined reference threshold voltage level Tf and outputs a H-voltage level pulse signal (600) only while the voltage level of the speech instruction signal exceeds the reference threshold level Tf as depicted in FIG. 3(C).
The pulse duration comparator 7-2 compares the pulse width of the H-voltage level pulse signal (600) with a predetermined reference spoken instruction start time ts and the pulse width of the L-voltage level pulse signal (600) with another predetermined reference end time te and outputs a H-voltage level signal (700) only when the H-voltage level pulse width exceeds the reference start time ts and a L-voltage level signal (700) only when the L-voltage level pulse width exceeds the reference end time te.
To explain in more detail with reference to FIGS. 3(C) and (D), if the pulse width of the first H-voltage level pulse signal is labeled t1, since t1 is shorter than the reference start time ts, the pulse duration comparator 7-2 outputs no H-voltage level signal. On the other hand, if the pulse width of the second H-voltage level pulse signal is labeled t2, since t2 is longer than the reference start time ts, the pulse duration comparator 7-2 outputs a H-voltage level signal, indicating the start of a spoken instruction. In this case, the H-voltage level start signal (700) from the pulse duration comparator 7-2 is delayed by the reference start time ts after the actual start time Ps of the spoken instruction. Thereafter, this H-voltage level start signal is outputted until the duration comparator 7-2 detects the end of speech instruction.
Next, when the H-voltage level pulse signal t2 changes to a L-voltage level for a period of time t3, since t3 is shorter than the reference end time te, the pulse duration comparator 7-2 outputs no L-voltage level signal; that is, the duration comparator 7-2 sustains the H-voltage level signal.
Thereafter in this case, even if a third pulse signal having a pulse width t4 is outputted again from the voltage level comparator 7-1, since the pulse duration comparator 7-2 is still outputting a H-voltage level signal, the operation of the duration comparator 7-2 is not affected.
Next, when the H-voltage level pulse signal t4 changes to a L-voltage level for a period of time t5, since t5 is longer than the reference end time te, the pulse duration comparator 7-2 outputs a L-voltage level signal, indicating the end of speech instruction. In this case, the L-voltage level end signal from the duration comparator 7-2 is delayed by the reference end time te after the actual end time Pe of speech instruction. Thereafter, the end signal is outputted until the duration comparator 7-2 detects the start of another speech instruction.
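The behavior of the level comparator 7-1 and the pulse duration comparator 7-2 described above can be summarized as a small state machine. The sketch below is illustrative only: it assumes the smoothed signal has been sampled into equally spaced frames, and the names `detect_speech`, `start_frames` and `end_frames` are hypothetical stand-ins for the reference start time ts and reference end time te.

```python
from typing import List, Sequence, Tuple


def detect_speech(levels: Sequence[float],
                  threshold: float,
                  start_frames: int,
                  end_frames: int) -> List[Tuple[str, int]]:
    """Return ("start", index) and ("end", index) events.

    A start is reported once the level has stayed above the threshold for
    start_frames consecutive frames; an end is reported once it has stayed
    at or below the threshold for end_frames consecutive frames.
    """
    events = []
    speaking = False
    run = 0  # length of the current qualifying run, in frames
    for index, level in enumerate(levels):
        above = level > threshold  # level comparator output (H or L)
        if not speaking:
            run = run + 1 if above else 0
            if run >= start_frames:
                speaking, run = True, 0
                events.append(("start", index))
        else:
            run = run + 1 if not above else 0
            if run >= end_frames:
                speaking, run = False, 0
                events.append(("end", index))
    return events
```

As in FIG. 3(D), a short burst such as t1 produces no start event, a short dropout such as t3 produces no end event, and each reported event lags the actual start or end by the corresponding reference time.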
In response to the H-voltage level signal from the duration comparator 7-2 as shown in FIG. 3(D), the controller 5 outputs a command signal to activate a group of bandpass filters 8 and other sections to recognize the spoken instruction signal outputted from the spectrum-normalizing amplifier 6, as already explained.
In the prior-art voice detecting means connected to the microphone as described above, since the reference threshold level in the voltage level comparator 7-1 is fixed at a predetermined level, the speech recognizer cannot cope well with the fluctuations of noise level within the passenger compartment, with the result that accurate detection of speech instruction start and end is compromised, so that noise may be interpreted as attempts at speech and/or spoken instructions may be ignored.
In view of the above description and with reference to the attached drawings, the embodiments of the speech recognition system for an automotive vehicle according to the present invention will be described hereinbelow.
FIG. 4 is a block diagram showing a first embodiment of the present invention. In brief summation of this embodiment, the gain factor of the speech input interface is adjusted according to the engine operation; that is, the gain factor is reduced while the engine is operating.
As in the conventional speech recognizer 100, there are provided a record switch 1, a microphone 2, a recognition switch 3, a switch input interface 4, and a controller 5. In addition to these elements, there are provided first and second amplifiers 61 and 62 having first and second gains G1 and G2 in the speech input interface 6 for amplifying and outputting the spoken instruction signal from the microphone 2, the first gain G1 being determined to be higher than the second gain G2. The outputs of the amplifiers 61 and 62 are inputted to an analog switch 63. As will be understood later, the fixed contact a of this analog switch 63 is switched to the contact b as shown in FIG. 4 when the engine stops operating, but switched to the contact c when the engine is operating. Therefore, when the engine is stopped, the gain factor for the spoken instruction signal is high; when the engine is running, the gain factor for the spoken instruction signal is switched to a lower value.
On the other hand, there are provided an engine operation detector 20, an ignition relay 16 having a relay contact closed when the ignition switch is turned on, and an alternator 17, in order to detect the engine condition and to switch the analog switch 63. The engine operation detector 20 detects that the engine is operating and outputs a signal to the controller 5 when the ignition relay 16 is energized and also the alternator 17 outputs a signal. In response to this signal, the controller 5 sets the analog switch 63 to the amplifier 62 side. On the other hand, when the ignition relay 16 is deenergized and the alternator 17 outputs no signal, the engine operation detector 20 detects that the engine stops and outputs no signal to the controller 5. In response to this, the controller 5 sets the analog switch 63 to the amplifier 61 side as shown.
Further, the reason why the alternator output signal is given to the engine operation detector 20 is that the alternator output is indicative of the engine operation condition because the alternator outputs a signal when the ignition switch is set to the starter position and therefore the engine starts rotating.
In the system according to the present invention, however, it is also possible to apply an output signal generated from a vehicle speed sensor 171 or a speedometer 172 to the controller 5, because the sensor or meter can also represent whether or not the engine or vehicle is running.
Next, the operation of the first embodiment of FIG. 4 will be described. When a spoken instruction is uttered toward the microphone 2 with the record switch 1 or the recognition switch 3 turned on when the engine stops, since the engine operation detector 20 detects that the engine is stopped, the controller 5 sets the analog switch 63 to the amplifier 61 side. Therefore, the spoken instruction signal from the microphone 2 is amplified on the basis of the first gain G1 preset in the first amplifier 61.
In contrast with this, in the state where the engine is operating and therefore the vehicle is running, since the engine operation detector 20 detects that the vehicle is running and outputs a signal, the controller 5 switches the analog switch 63 to the second amplifier 62 side. Therefore, the spoken instruction signal from the microphone 2 is amplified on the basis of the second gain G2 preset in the second amplifier 62, which is lower than the first gain G1 preset in the first amplifier 61.
Since the second gain G2 is relatively low as compared with the first gain G1, in order to obtain a spoken instruction signal exceeding a predetermined level necessary for recording or recognizing, the driver must necessarily utter a spoken instruction toward the microphone 2 in a louder voice as compared with the case where the engine stops. Therefore, even if noise level within the passenger compartment is high due to engine operation, the level of a spoken instruction becomes naturally high in comparison with a rise in noise level. As a result, the proportion of noise level to spoken instruction signal level is reduced, thus improving the recording or recognition rate of a spoken instruction in the speech recognition system.
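The selection logic of the first embodiment can be illustrated with a short sketch. It assumes that the engine is treated as operating only when the ignition relay is energized and the alternator outputs a signal (the text does not specify the mixed cases, which are treated here as "stopped"); the function names and the numeric gains used in the usage note are hypothetical.

```python
def engine_is_operating(ignition_relay_energized: bool,
                        alternator_output_present: bool) -> bool:
    """Assumed logic of the engine operation detector 20: both conditions
    must hold. Mixed cases are not specified in the text and count as stopped."""
    return ignition_relay_energized and alternator_output_present


def select_gain(engine_operating: bool, g1: float, g2: float) -> float:
    """Pick the first gain G1 (engine stopped) or the lower second gain G2
    (engine operating), as the analog switch 63 does."""
    if g1 <= g2:
        raise ValueError("G1 must be higher than G2")
    return g2 if engine_operating else g1


def min_input_level(required_output_level: float, gain: float) -> float:
    """Input level needed for the amplified signal to reach the level
    required for recording or recognition."""
    return required_output_level / gain
```

For example, with illustrative gains of 20 and 5, the input level needed to reach the same output rises by a factor of four once the engine is running, which is why the driver must speak louder.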
FIG. 5 is a block diagram showing a second embodiment of the present invention, in which a feedback resistor for the amplifier is switched according to the engine operation; that is, the gain factor is reduced while the engine is operating.
In this second embodiment, only one amplifier 60 is provided for the speech input interface 6; however, two feedback resistors R1 and R2 are connected to the input terminal of the amplifier 60 and selectively connected to the output terminal of the amplifier 60 via the analog switch 63 in response to the signal from the controller 5.
In more detail, when the ignition relay 16 is deenergized and the alternator 17 outputs no signal, the engine operation detector 20 detects that the engine stops and outputs no signal to the controller 5.
Therefore, the analog switch 63 is set to the first resistor side R1, as shown by a solid line in FIG. 5. The larger the feedback resistor, the higher the gain of the amplifier. Since the first resistor R1 is predetermined to be larger than the second resistor R2, the gain factor G of the amplifier 60 becomes high when the resistor R1 is connected between input and output terminals thereof. Further, in this case, the gain factor is determined as a function of the input resistor Ro and the feedback resistor R1 or R2. When the analog switch 63 is set to the first resistor side R1, the spoken instruction signal from the microphone 2 is amplified on the basis of a higher gain.
Accordingly, even if the driver utters a spoken instruction in a low voice, the inputted spoken instruction phrase can be recorded or recognized reliably.
In contrast with this, where the engine is operating, since the controller 5 sets the analog switch 63 to the second resistor side R2 which is smaller than R1, the spoken instruction is amplified on the basis of a lower gain.
Accordingly, since the driver must necessarily utter a spoken instruction in a louder voice, even if noise level is high due to engine operation, the inputted spoken instruction phrase can be recorded or recognized reliably.
In other words, by changing the gain factor of the amplifier in dependence upon selection of feedback resistors, it is possible to reduce the proportion of noise level to spoken instruction signal level in the case where the engine is operating, in the same way as in the first embodiment.
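The text states only that the gain is a function of the input resistor Ro and the selected feedback resistor. As a hedged illustration, the sketch below assumes an ideal inverting operational amplifier, for which the gain magnitude is the feedback resistance divided by the input resistance; the actual circuit of FIG. 5 may use a different configuration. The function names and resistor values are hypothetical.

```python
def amplifier_gain(feedback_ohms: float, input_ohms: float) -> float:
    """Gain magnitude of an ideal inverting amplifier (ASSUMED topology):
    |G| = R_feedback / R_input."""
    if input_ohms <= 0 or feedback_ohms < 0:
        raise ValueError("resistances must be positive")
    return feedback_ohms / input_ohms


def select_feedback(engine_operating: bool, r1_ohms: float, r2_ohms: float) -> float:
    """Analog switch 63: the larger R1 when the engine is stopped,
    the smaller R2 when it is operating."""
    if r1_ohms <= r2_ohms:
        raise ValueError("R1 must be larger than R2")
    return r2_ohms if engine_operating else r1_ohms
```

With this assumption, switching from R1 to the smaller R2 lowers the gain in direct proportion to the resistance ratio.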
FIG. 6 is a block diagram showing a third embodiment of the present invention, in which the reference threshold level of the voltage level comparator 7-1 in the voice detector is switched according to the engine operation, that is, the multiplication ratio is increased when the engine is operating.
As in the conventional speech recognizer 100, there are provided a record switch 1, a microphone 2, a recognition switch 3, a switch input interface 4, a controller 5, an RMS smoother (a first smoother), a voice detector, etc. In addition to these elements, there are provided a second smoother 72, first and second multipliers 73a and 73b, an analog switch 74 and a holding circuit 75.
In the same way as in the first embodiment of FIG. 4, the output signal of the engine operation detector 20 is given to the controller 5 so as to detect the engine conditions. The voice detector 7 for detecting the start point and the end point of a spoken instruction signal from the speech input interface 6 and for outputting a start command signal and an end command signal comprises a rectifier 15-1 for rectifying the spoken instruction signal, a first smoother 15-2 for smoothing the output signal from the rectifier 15-1 at a time constant of 10 to 20 milliseconds and for outputting a DC voltage roughly corresponding to the spoken instruction, a second smoother 72 for smoothing the output signal from the rectifier 15-1 at a time constant of about one to second and for outputting a DC voltage roughly corresponding to noise included in the spoken signal, first and second multipliers 73a and 73b for multiplying the output signal from the second smoother 72 at multiplication ratios of K1 and K2 (K1 < K2), an analog switch 74 for selecting the output from the multipliers 73a and 73b in response to the signal from the engine operation detector 20, a holding circuit 75 for holding the output signal from either of two multipliers 73a and 73b via the analog switch 74 when a start of a spoken instruction is detected, a level comparator 7-1 for comparing the DC voltage signal corresponding to the spoken instruction signal from the first smoother 15-2 with a reference signal er of the DC voltage corresponding to noise level given via the holding circuit 75 and for outputting a H-level output signal when the output level from the first smoother 15-2 exceeds the reference signal er, and a pulse duration comparator 7-2 for outputting a spoken instruction start signal es indicative of presence of a spoken instruction signal to the controller 5 when the H-level output from the level comparator 7-1 is kept outputted, for instance, for more than 150 milliseconds and a spoken instruction end signal ee indicative of absence of a spoken instruction signal to the controller 5 when the output signal from the level comparator 7-1 drops to a L-level and is kept dropped for about 250 milliseconds.
In this voice detector 7, the analog switch 74 is switched to the first multiplier 73a side when the engine operation detector 20 detects that the engine is stopped, as shown by a solid line, and to the second multiplier 73b side when the engine operation detector 20 detects that the engine is operating, as shown by a broken line in FIG. 6. Further, the holding circuit 75 first outputs the signal obtained from the multiplier 73a or 73b via the analog switch 74 to the level comparator 7-1 as the reference signal er; however, once the pulse duration comparator 7-2 outputs a start signal es, since the signal from the multiplier 73a or 73b is held in response to this start signal es (i.e. holding signal), the holding circuit keeps outputting the held signal as the reference signal er to the level comparator 7-1, until an end signal ee is outputted to the holding circuit 75.
The reason why such a holding circuit 75 as described above is additionally provided is as follows: unless there is provided the holding circuit 75, when the reference end threshold level increases, the smoothed signal (400) drops below the threshold level before the end of spoken instruction, thus resulting in an erroneous spoken instruction end detection. In other words, since the time constant of the second smoother 72 is larger than that of the first smoother 15-2, the reference threshold level increases gradually with a time delay as the smoothed spoken instruction signal (400) increases gradually; that is, the timing of two signals does not match.
Next, there will be described the operation of the third embodiment of FIG. 6.
When a spoken instruction is uttered toward the microphone with the record switch 1 or recognition switch 3 turned on in the state where the engine is stopped, the engine operation detector 20 detects that the engine is stopped, and therefore the analog switch 74 is set to the first multiplier 73a side via the controller 5. The spoken instruction transduced into an electric signal through the microphone 2 is rectified through the rectifier 15-1 after amplification by the speech input interface 6. The first smoother 15-2 outputs a DC voltage corresponding to the power component of the spoken instruction signal. On the other hand, the second smoother 72 applies a DC voltage proportional to the power level of background noise included in the spoken instruction signal to the multipliers 73a and 73b. At this time, since the analog switch 74 is closed to the first multiplier 73a side, the output signal from the second smoother 72 is multiplied by K1 through the first multiplier 73a, and is given to the level comparator 7-1 via the analog switch 74 and the holding circuit 75 as the reference signal er. The level comparator 7-1 outputs a H-level output signal when the output of the first smoother 15-2 exceeds the reference signal er. If this H-level output signal is kept outputted, for instance, for about 150 milliseconds, a start signal es for spoken instruction recognition or recording is applied to the controller 5; the spoken instruction signal branched from the output of the speech input interface 6 is inputted to the bandpass filters 8; the spoken instruction is recorded or recognized by the same circuit sections as in the prior-art system. Further, when the pulse duration comparator 7-2 outputs the start signal es, the output signal of the first multiplier 73a is held by the holding circuit 75 and the reference signal er for the level comparator 7-1 is fixed.
Next, when the input of the spoken instruction has been completed, the output of the level comparator 7-1 returns to an L-level. If this L-level state continues for, for instance, about 250 milliseconds, the pulse duration comparator 7-2 outputs an end signal ee to the controller 5. The controller 5 then determines that the input of the spoken instruction has been completed and controls the entire system so as to begin processing the recording or recognition of the spoken instruction.
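The start and end timing described above can be sketched as a small state machine over the sampled output of the level comparator. This is a minimal illustration, assuming the comparator output is sampled at a fixed period `step_ms`; the function name, the sampling scheme, and the use of "greater than or equal" for the duration test are assumptions, while the 150 ms and 250 ms figures are the example values given in the text.

```python
def detect_start_end(levels, step_ms, start_ms=150, end_ms=250):
    """Return (start_index, end_index): the sample indices at which the start
    signal es and the end signal ee would be issued, or None where no such
    signal occurs.

    levels is a sequence of booleans (True = H-level, False = L-level).
    """
    start = end = None
    run_ms = 0
    for i, high in enumerate(levels):
        if start is None:
            # Waiting for a sufficiently long continuous H-level run.
            run_ms = run_ms + step_ms if high else 0
            if run_ms >= start_ms:
                start = i
                run_ms = 0
        else:
            # After the start, wait for a sufficiently long continuous L-level run.
            run_ms = run_ms + step_ms if not high else 0
            if run_ms >= end_ms:
                end = i
                break
    return start, end


# 10 ms sampling: a 100 ms burst (too short), then a 400 ms utterance.
levels = [False] * 5 + [True] * 10 + [False] * 3 + [True] * 40 + [False] * 30
print(detect_start_end(levels, step_ms=10))  # (32, 82)
```

The short burst is ignored because it lasts less than the start time, and a pause shorter than the end time inside an utterance would likewise not terminate it.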
On the other hand, when a spoken instruction is inputted in the same way while the engine is operating, the engine operation detector 20 detects that the vehicle is running, and the analog switch 74 is set to the second multiplier 73b side. The DC signal from the second smoother 72, corresponding to the background noise power level included in the spoken instruction, is therefore multiplied by a multiplication ratio K2 greater than K1 and applied to the level comparator 7-1 as the reference signal er. Since the reference signal er is thus set to a higher level than when the engine is stopped, the level comparator 7-1 generates an H-level output signal only when the spoken instruction exceeds this reference signal er, that is, only when a relatively loud spoken instruction is inputted. A loud voice also makes the features of the voice parameters clearer. Therefore, while the engine is operating, that is, while the vehicle is running and the noise within the passenger compartment is high, no recognition or recording is made unless a relatively loud spoken instruction is uttered toward the microphone 2. If a spoken instruction having high energy is inputted, the proportion of the noise component to the spoken instruction signal is sufficiently reduced even when the ambient noise level is high, so that the spoken instruction can be recorded or recognized more reliably.
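The switching of the reference signal er between the two multipliers can be sketched as follows. The numerical values of K1 and K2 are purely illustrative assumptions (the text only requires K2 to be greater than K1), and the function names are hypothetical.

```python
K1 = 2.0  # assumed multiplication ratio used while the engine is stopped
K2 = 4.0  # assumed multiplication ratio used while the engine is operating (K2 > K1)


def reference_level(noise_level, engine_operating):
    """Reference signal er: smoothed noise level scaled by K1 or K2."""
    return (K2 if engine_operating else K1) * noise_level


def comparator_output(speech_level, noise_level, engine_operating):
    """True (H-level) when the smoothed speech level exceeds the reference er."""
    return speech_level > reference_level(noise_level, engine_operating)


# The same utterance passes with the engine stopped but not with it operating.
print(comparator_output(0.6, 0.2, engine_operating=False))  # True
print(comparator_output(0.6, 0.2, engine_operating=True))   # False
# A louder utterance passes even with the engine operating.
print(comparator_output(1.0, 0.2, engine_operating=True))   # True
```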
In the embodiments described above, the gain of the speech input interface or the multiplication ratio in the voice detector is switched digitally between two states: one in which the engine is stopped and one in which the engine is operating. However, it is also possible to adjust the gain or the multiplication ratio continuously, in analog fashion, according to the magnitude of the noise level.
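One way to picture such a continuous adjustment is a multiplication ratio that rises smoothly with the measured noise level instead of jumping between two values. The sketch below is only an assumed mapping (a clamped linear interpolation); the breakpoints `quiet` and `loud` and the limits `k_min` and `k_max` are hypothetical and do not come from the patent.

```python
def multiplication_ratio(noise_level, quiet=0.1, loud=0.5, k_min=2.0, k_max=4.0):
    """Map a noise level to a multiplication ratio between k_min and k_max.

    Below `quiet` the ratio stays at k_min, above `loud` it stays at k_max,
    and in between it is interpolated linearly. All parameters are assumed.
    """
    if noise_level <= quiet:
        return k_min
    if noise_level >= loud:
        return k_max
    fraction = (noise_level - quiet) / (loud - quiet)
    return k_min + fraction * (k_max - k_min)


for noise in (0.05, 0.2, 0.3, 0.8):
    print(noise, multiplication_ratio(noise))
```

The same shape of mapping could be applied, inverted, to the amplifier gain, which would fall as the noise level rises.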
The foregoing description concerns the case where the speech recognition system according to the present invention comprises various discrete elements or sections; however, it is of course possible to embody the system with a microcomputer including a central processing unit, a read-only memory, a random-access memory, a clock oscillator, etc. In this case, the engine operation detector, the second smoother, the first and second multipliers, the analog switch, the holding circuit, etc. can all be incorporated within the microcomputer, which executes the same or similar processes, calculations and/or operations as explained hereinabove. In such a case, the microcomputer also executes the various operations necessary for the speech recognizer in accordance with appropriate software stored in the read-only memory.
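As a rough idea of what such a software embodiment might look like, the sketch below folds the rectifier, both smoothers, the multiplier selection, the holding function, and the duration timing into one sample-by-sample routine. It is an assumption-laden illustration rather than the patented implementation: the class name, the exponential smoothers, the coefficients, the sampling period, and the ratios `k1` and `k2` are all hypothetical.

```python
class VoiceDetector:
    """Sample-by-sample start/end detector with an engine-dependent threshold."""

    def __init__(self, step_ms=10, k1=2.0, k2=4.0,
                 fast_alpha=0.5, slow_alpha=0.01, start_ms=150, end_ms=250):
        self.step_ms, self.k1, self.k2 = step_ms, k1, k2
        self.fast_alpha, self.slow_alpha = fast_alpha, slow_alpha
        self.start_ms, self.end_ms = start_ms, end_ms
        self.fast = self.slow = None   # outputs of the two smoothers
        self.held = None               # frozen reference level after a start
        self.run_ms = 0                # duration of the current H or L run

    def process(self, sample, engine_operating):
        """Feed one amplified sample; return 'start', 'end', or None."""
        level = abs(sample)            # rectification
        if self.fast is None:
            self.fast = self.slow = level
        else:
            self.fast += self.fast_alpha * (level - self.fast)
            self.slow += self.slow_alpha * (level - self.slow)
        k = self.k2 if engine_operating else self.k1
        reference = self.held if self.held is not None else k * self.slow
        high = self.fast > reference
        if self.held is None:
            self.run_ms = self.run_ms + self.step_ms if high else 0
            if self.run_ms >= self.start_ms:
                self.held = reference  # hold the reference from the start on
                self.run_ms = 0
                return "start"
        else:
            self.run_ms = self.run_ms + self.step_ms if not high else 0
            if self.run_ms >= self.end_ms:
                self.held = None       # release the hold for the next utterance
                self.run_ms = 0
                return "end"
        return None


def run(samples, engine_operating):
    """Return the list of events produced for a whole sample sequence."""
    detector = VoiceDetector()
    events = (detector.process(s, engine_operating) for s in samples)
    return [e for e in events if e is not None]


# A moderately quiet utterance between two stretches of background noise.
samples = [0.1] * 50 + [0.5] * 100 + [0.1] * 60
print(run(samples, engine_operating=False))
print(run(samples, engine_operating=True))
```

With these assumed parameters, the utterance is detected while the engine is stopped and ignored while it is operating, mirroring the behavior described for the discrete circuit.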
As described above, in the speech recognition system according to the present invention, when the engine is operating, the amplifier gain in the speech input interface or the threshold level in the voice detector is adjusted so as to reduce the sensitivity to spoken instructions. The driver must therefore utter a spoken instruction in a louder voice, which reduces the proportion of noise level to spoken instruction signal level. As a result, even if the noise level within the passenger compartment rises sharply, it is possible to improve the reliability of recording and the recognition rate of spoken instructions in the speech recognition system.
It will be understood by those skilled in the art that the foregoing description is in terms of preferred embodiments of the present invention wherein various changes and modifications may be made without departing from the spirit and scope of the invention, as is set forth in the appended claims.

Claims (7)

What is claimed is:
1. A speech recognition system for an automotive vehicle for recording or recognizing a spoken instruction received through a microphone and for activating a vehicle actuator in response to a recognized spoken instruction, which comprises:
(a) means for detecting whether or not an automotive vehicle engine is operating and outputting one of an engine operating signal and an engine stopped signal;
(b) a speech input and voice detection section connected to the microphone for amplifying a spoken instruction signal transduced through the microphone, detecting the start and end of a spoken instruction, and outputting an instruction start command signal and an instruction end command signal, respectively, in response to detection thereof, at least one of the gain factor in said speech input section and the threshold level in said voice detection section being so adjusted that the sensitivity to spoken instructions can be reduced and that the driver must necessarily utter a spoken instruction in a louder voice to reduce the proportion of noise level to spoken instruction signal level, when said engine operation detecting means outputs said engine operating signal; and
(c) a voice analysis section connected to said speech input and voice detection section and responsive to instruction start and instruction end command signals for analyzing the spoken instruction signal from said speech input section, comparing the results of analysis with predetermined reference values corresponding to at least one spoken instruction, and activating at least one actuator when the results of analysis match predetermined reference values associated with the actuator.
2. A speech recognition system for an automotive vehicle as set forth in claim 1, wherein said engine operation detecting means comprises:
(a) an ignition relay switch closed when an ignition switch is turned on and for outputting an ignition signal;
(b) an alternator for outputting an alternator signal when an engine is operating; and
(c) an engine operation detector connected to said ignition relay switch and said alternator for outputting an engine operation signal in response to the ignition signal and the alternator signal.
3. A speech recognition system for an automotive vehicle as set forth in claim 1, wherein said engine operation detecting means is a speed sensor.
4. A speech recognition system for an automotive vehicle as set forth in claim 1, wherein said engine operation detecting means is a speedometer.
5. A speech recognition system for an automotive vehicle as set forth in claim 1, wherein said speech input section comprises:
(a) a first amplifier connected to the microphone;
(b) a second amplifier connected to the microphone for amplifying the spoken instruction signal transduced via the microphone, the gain factor of which is smaller than that of said first amplifier; and
(c) an analog switch for connecting said first amplifier to said voice detection section in response to the engine stop signal from said engine operation detecting means and for connecting said second amplifier to said voice detection section in response to the engine operation signal from said engine operation detecting means.
6. A speech recognition system for an automotive vehicle as set forth in claim 1, wherein said speech input section comprises:
(a) an amplifier connected between the microphone and said voice detection section for amplifying the spoken instruction signal transduced via the microphone;
(b) a first feedback resistor R1 connected to the input terminal of said amplifier;
(c) a second feedback resistor R2 connected to the input terminal of said amplifier, the resistance value of which is smaller than that of said first feedback resistor; and
(d) an analog switch for connecting said first feedback resistor R1 to the output of said amplifier in response to the engine stop signal from said engine operation detecting means and for connecting said second feedback resistor R2 to the output of said amplifier in response to the engine operation signal from said engine operation detecting means.
7. A speech recognition system for an automotive vehicle as set forth in claim 1, wherein said voice detection section comprises:
(a) a first smoother connected to said speech input section for smoothing the spoken instruction signal amplified via said speech input section;
(b) a second smoother connected to said speech input section for smoothing the spoken instruction signal amplified via said speech input section, the time constant of which is larger than that of said first smoother;
(c) a first multiplier connected to said second smoother;
(d) a second multiplier connected to said second smoother, the multiplication ratio of which is greater than that of said first multiplier;
(e) an analog switch connected to said first multiplier for outputting the smoothed spoken instruction signal multiplied by said first multiplier in response to the engine stop signal from said engine operating detecting means and connected to said second multiplier for outputting the smoothed spoken instruction signal multiplied by said second multiplier in response to the engine operation signal from said engine operating detecting means;
(f) a holding circuit connected to said analog switch for holding the smoothed spoken instruction signal passed through said analog switch as a reference threshold level in response to an instruction start command signal;
(g) a voltage level comparator one input terminal of which is connected to said first smoother and the other input terminal of which is connected to said holding circuit, for comparing the spoken instruction signal voltage level smoothed by said first smoother with the reference threshold level outputted from said holding circuit and outputting a H-level signal when the signal voltage level smoothed by said first smoother exceeds the reference threshold level; and
(h) a pulse duration comparator connected to said voltage level comparator for comparing the pulse width of the H-level signal from said level comparator with a reference start time and outputting a spoken instruction start command signal when the pulse width exceeds the reference start time and for comparing the pulse width of the L-level signal from said level comparator with a reference end time and outputting a spoken instruction end command signal when the pulse width exceeds the reference end time, the spoken instruction start command signal being applied to said holding circuit as a holding signal.
US06/428,230 1981-10-22 1982-09-29 Speech recognition system for an automotive vehicle Expired - Lifetime US4532648A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP56169176A JPS5870292A (en) 1981-10-22 1981-10-22 Voice recognition equipment for vehicle
JP56-169176 1981-10-22

Publications (1)

Publication Number Publication Date
US4532648A true US4532648A (en) 1985-07-30

Family

ID=15881649

Family Applications (1)

Application Number Title Priority Date Filing Date
US06/428,230 Expired - Lifetime US4532648A (en) 1981-10-22 1982-09-29 Speech recognition system for an automotive vehicle

Country Status (4)

Country Link
US (1) US4532648A (en)
EP (1) EP0078014B1 (en)
JP (1) JPS5870292A (en)
DE (1) DE3272921D1 (en)

Cited By (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3732394A1 (en) * 1987-09-25 1989-04-06 Siemens Ag Method for compensating disturbance noises for speech-recognition systems depending on speakers and installed in motor vehicles
US4827520A (en) * 1987-01-16 1989-05-02 Prince Corporation Voice actuated control system for use in a vehicle
US4882685A (en) * 1985-08-26 1989-11-21 Lely Cornelis V D Voice activated compact electronic calculator
US4984274A (en) * 1988-07-07 1991-01-08 Casio Computer Co., Ltd. Speech recognition apparatus with means for preventing errors due to delay in speech recognition
US5014317A (en) * 1987-08-07 1991-05-07 Casio Computer Co., Ltd. Recording/reproducing apparatus with voice recognition function
US5450525A (en) * 1992-11-12 1995-09-12 Russell; Donald P. Vehicle accessory control with manual and voice response
US5584052A (en) * 1992-11-16 1996-12-10 Ford Motor Company Integrated microphone/pushbutton housing for voice activated cellular phone
US5630014A (en) * 1993-10-27 1997-05-13 Nec Corporation Gain controller with automatic adjustment using integration energy values
US5727121A (en) * 1994-02-10 1998-03-10 Fuji Xerox Co., Ltd. Sound processing apparatus capable of correct and efficient extraction of significant section data
US5764852A (en) * 1994-08-16 1998-06-09 International Business Machines Corporation Method and apparatus for speech recognition for distinguishing non-speech audio input events from speech audio input events
US5806040A (en) * 1994-01-04 1998-09-08 Itt Corporation Speed controlled telephone credit card verification system
US5852804A (en) * 1990-11-30 1998-12-22 Fujitsu Limited Method and apparatus for speech recognition
GB2327835A (en) * 1997-07-02 1999-02-03 Simoco Int Ltd Improving speech intelligibility in noisy enviromnment
DE19735254A1 (en) * 1997-08-14 1999-02-18 Cohausz Helge B Vehicle video signal recorder for situation in front of car
WO1999024296A1 (en) 1993-12-13 1999-05-20 Lojack Corporation Inc. Method of and apparatus for motor vehicle security assurance employing voice recognition control of vehicle operation
US5995924A (en) * 1997-05-05 1999-11-30 U.S. West, Inc. Computer-based method and apparatus for classifying statement types based on intonation analysis
WO2001059763A1 (en) * 2000-02-11 2001-08-16 BSH Bosch und Siemens Hausgeräte GmbH Electrical device with voice input unit and method for voice input
US6311156B1 (en) * 1989-09-22 2001-10-30 Kit-Fun Ho Apparatus for determining aerodynamic wind of utterance
DE19822989C2 (en) * 1997-05-21 2002-06-06 Trw Inc Keyless vehicle entry system using voice signals
US6757656B1 (en) * 2000-06-15 2004-06-29 International Business Machines Corporation System and method for concurrent presentation of multiple audio information sources
US6862567B1 (en) * 2000-08-30 2005-03-01 Mindspeed Technologies, Inc. Noise suppression in the frequency domain by adjusting gain according to voicing parameters
KR100501919B1 (en) * 2002-09-06 2005-07-18 주식회사 보이스웨어 Voice Recognizer Provided with Two Amplifiers and Voice Recognizing Method thereof
US20050273323A1 (en) * 2004-06-03 2005-12-08 Nintendo Co., Ltd. Command processing apparatus
US20060287859A1 (en) * 2005-06-15 2006-12-21 Harman Becker Automotive Systems-Wavemakers, Inc Speech end-pointer
US7346374B2 (en) 1999-05-26 2008-03-18 Johnson Controls Technology Company Wireless communications system and method
US7349722B2 (en) 1999-05-26 2008-03-25 Johnson Controls Technology Company Wireless communications system and method
US20080228478A1 (en) * 2005-06-15 2008-09-18 Qnx Software Systems (Wavemakers), Inc. Targeted speech
US20090124280A1 (en) * 2005-10-25 2009-05-14 Nec Corporation Cellular phone, and codec circuit and receiving call sound volume automatic adjustment method for use in cellular phone
US8200214B2 (en) 2006-10-11 2012-06-12 Johnson Controls Technology Company Wireless network selection
CN103869971A (en) * 2012-12-10 2014-06-18 三星电子株式会社 Method and user device for providing context awareness service using speech recognition
US9503041B1 (en) * 2015-05-11 2016-11-22 Hyundai Motor Company Automatic gain control module, method for controlling the same, vehicle including the automatic gain control module, and method for controlling the vehicle
US9875583B2 (en) * 2015-10-19 2018-01-23 Toyota Motor Engineering & Manufacturing North America, Inc. Vehicle operational data acquisition responsive to vehicle occupant voice inputs
US9928833B2 (en) 2016-03-17 2018-03-27 Toyota Motor Engineering & Manufacturing North America, Inc. Voice interface for a vehicle
US10341442B2 (en) 2015-01-12 2019-07-02 Samsung Electronics Co., Ltd. Device and method of controlling the device

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63500126A (en) * 1985-07-01 1988-01-14 エッコ・インダストリ−ズ・インコ−ポレ−テッド speaker verification device
DE3766124D1 (en) * 1986-02-15 1990-12-20 Smiths Industries Plc METHOD AND DEVICE FOR VOICE PROCESSING.
KR100217734B1 (en) * 1997-02-26 1999-09-01 윤종용 Method and apparatus for controlling voice recognition threshold level for voice actuated telephone
WO1999057938A1 (en) 1998-05-06 1999-11-11 Volkswagen Aktiengesellschaft Method and device for operating voice-controlled systems in motor vehicles
FR2802690A1 (en) * 1999-12-17 2001-06-22 Thomson Multimedia Sa VOICE RECOGNITION METHOD AND DEVICE, RELATED REMOTE CONTROL DEVICE
US7467084B2 (en) 2003-02-07 2008-12-16 Volkswagen Ag Device and method for operating a voice-enhancement system
US7912228B2 (en) 2003-07-18 2011-03-22 Volkswagen Ag Device and method for operating voice-supported systems in motor vehicles
EP1625973B1 (en) 2004-08-10 2007-08-01 Volkswagen Aktiengesellschaft Speech support system for motor vehicle

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4100370A (en) * 1975-12-15 1978-07-11 Fuji Xerox Co., Ltd. Voice verification system based on word pronunciation
US4158750A (en) * 1976-05-27 1979-06-19 Nippon Electric Co., Ltd. Speech recognition system with delayed output
US4239936A (en) * 1977-12-28 1980-12-16 Nippon Electric Co., Ltd. Speech recognition system
GB2056732A (en) * 1979-07-16 1981-03-18 Nissan Motor Voice warning system for an automotive vehicle
US4380824A (en) * 1980-04-18 1983-04-19 Hitachi, Ltd. Receiving reproducing system

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5491007A (en) * 1977-12-28 1979-07-19 Nec Corp Audio recognition unit
JPS6060080B2 (en) * 1977-12-28 1985-12-27 日本電気株式会社 voice recognition device

Cited By (57)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4882685A (en) * 1985-08-26 1989-11-21 Lely Cornelis V D Voice activated compact electronic calculator
US4827520A (en) * 1987-01-16 1989-05-02 Prince Corporation Voice actuated control system for use in a vehicle
US5014317A (en) * 1987-08-07 1991-05-07 Casio Computer Co., Ltd. Recording/reproducing apparatus with voice recognition function
DE3732394A1 (en) * 1987-09-25 1989-04-06 Siemens Ag Method for compensating disturbance noises for speech-recognition systems depending on speakers and installed in motor vehicles
US4984274A (en) * 1988-07-07 1991-01-08 Casio Computer Co., Ltd. Speech recognition apparatus with means for preventing errors due to delay in speech recognition
US6311156B1 (en) * 1989-09-22 2001-10-30 Kit-Fun Ho Apparatus for determining aerodynamic wind of utterance
US5852804A (en) * 1990-11-30 1998-12-22 Fujitsu Limited Method and apparatus for speech recognition
US5450525A (en) * 1992-11-12 1995-09-12 Russell; Donald P. Vehicle accessory control with manual and voice response
US5584052A (en) * 1992-11-16 1996-12-10 Ford Motor Company Integrated microphone/pushbutton housing for voice activated cellular phone
US5630014A (en) * 1993-10-27 1997-05-13 Nec Corporation Gain controller with automatic adjustment using integration energy values
WO1999024296A1 (en) 1993-12-13 1999-05-20 Lojack Corporation Inc. Method of and apparatus for motor vehicle security assurance employing voice recognition control of vehicle operation
US5806040A (en) * 1994-01-04 1998-09-08 Itt Corporation Speed controlled telephone credit card verification system
US5727121A (en) * 1994-02-10 1998-03-10 Fuji Xerox Co., Ltd. Sound processing apparatus capable of correct and efficient extraction of significant section data
US5764852A (en) * 1994-08-16 1998-06-09 International Business Machines Corporation Method and apparatus for speech recognition for distinguishing non-speech audio input events from speech audio input events
US5995924A (en) * 1997-05-05 1999-11-30 U.S. West, Inc. Computer-based method and apparatus for classifying statement types based on intonation analysis
DE19822989C2 (en) * 1997-05-21 2002-06-06 Trw Inc Keyless vehicle entry system using voice signals
GB2327835A (en) * 1997-07-02 1999-02-03 Simoco Int Ltd Improving speech intelligibility in noisy enviromnment
GB2327835B (en) * 1997-07-02 2000-04-19 Simoco Int Ltd Method and apparatus for speech enhancement in a speech communication system
DE19735254A1 (en) * 1997-08-14 1999-02-18 Cohausz Helge B Vehicle video signal recorder for situation in front of car
US7349722B2 (en) 1999-05-26 2008-03-25 Johnson Controls Technology Company Wireless communications system and method
US8897708B2 (en) 1999-05-26 2014-11-25 Johnson Controls Technology Company Wireless communications system and method
US8494449B2 (en) 1999-05-26 2013-07-23 Johnson Controls Technology Company Wireless communications system and method
US7970446B2 (en) 1999-05-26 2011-06-28 Johnson Controls Technology Company Wireless control system and method
US8634888B2 (en) 1999-05-26 2014-01-21 Johnson Controls Technology Company Wireless control system and method
US9370041B2 (en) 1999-05-26 2016-06-14 Visteon Global Technologies, Inc. Wireless communications system and method
US9318017B2 (en) 1999-05-26 2016-04-19 Visteon Global Technologies, Inc. Wireless control system and method
US7346374B2 (en) 1999-05-26 2008-03-18 Johnson Controls Technology Company Wireless communications system and method
US8380251B2 (en) 1999-05-26 2013-02-19 Johnson Controls Technology Company Wireless communications system and method
US6778964B2 (en) 2000-02-11 2004-08-17 Bsh Bosch Und Siemens Hausgerate Gmbh Electrical appliance voice input unit and method with interference correction based on operational status of noise source
WO2001059763A1 (en) * 2000-02-11 2001-08-16 BSH Bosch und Siemens Hausgeräte GmbH Electrical device with voice input unit and method for voice input
US6757656B1 (en) * 2000-06-15 2004-06-29 International Business Machines Corporation System and method for concurrent presentation of multiple audio information sources
US6862567B1 (en) * 2000-08-30 2005-03-01 Mindspeed Technologies, Inc. Noise suppression in the frequency domain by adjusting gain according to voicing parameters
KR100501919B1 (en) * 2002-09-06 2005-07-18 주식회사 보이스웨어 Voice Recognizer Provided with Two Amplifiers and Voice Recognizing Method thereof
US20050273323A1 (en) * 2004-06-03 2005-12-08 Nintendo Co., Ltd. Command processing apparatus
US8447605B2 (en) * 2004-06-03 2013-05-21 Nintendo Co., Ltd. Input voice command recognition processing apparatus
US20080228478A1 (en) * 2005-06-15 2008-09-18 Qnx Software Systems (Wavemakers), Inc. Targeted speech
US20060287859A1 (en) * 2005-06-15 2006-12-21 Harman Becker Automotive Systems-Wavemakers, Inc Speech end-pointer
US8311819B2 (en) 2005-06-15 2012-11-13 Qnx Software Systems Limited System for detecting speech with background voice estimates and noise estimates
US8170875B2 (en) * 2005-06-15 2012-05-01 Qnx Software Systems Limited Speech end-pointer
US8457961B2 (en) 2005-06-15 2013-06-04 Qnx Software Systems Limited System for detecting speech with background voice estimates and noise estimates
US8165880B2 (en) * 2005-06-15 2012-04-24 Qnx Software Systems Limited Speech end-pointer
US8554564B2 (en) 2005-06-15 2013-10-08 Qnx Software Systems Limited Speech end-pointer
US20070288238A1 (en) * 2005-06-15 2007-12-13 Hetherington Phillip A Speech end-pointer
US20090124280A1 (en) * 2005-10-25 2009-05-14 Nec Corporation Cellular phone, and codec circuit and receiving call sound volume automatic adjustment method for use in cellular phone
US7933548B2 (en) * 2005-10-25 2011-04-26 Nec Corporation Cellular phone, and codec circuit and receiving call sound volume automatic adjustment method for use in cellular phone
US8200214B2 (en) 2006-10-11 2012-06-12 Johnson Controls Technology Company Wireless network selection
US10395639B2 (en) 2012-12-10 2019-08-27 Samsung Electronics Co., Ltd. Method and user device for providing context awareness service using speech recognition
US11721320B2 (en) 2012-12-10 2023-08-08 Samsung Electronics Co., Ltd. Method and user device for providing context awareness service using speech recognition
US11410640B2 (en) 2012-12-10 2022-08-09 Samsung Electronics Co., Ltd. Method and user device for providing context awareness service using speech recognition
US10832655B2 (en) 2012-12-10 2020-11-10 Samsung Electronics Co., Ltd. Method and user device for providing context awareness service using speech recognition
CN103869971B (en) * 2012-12-10 2018-03-30 三星电子株式会社 For providing the method and user's set of context-aware services using speech recognition
US9940924B2 (en) 2012-12-10 2018-04-10 Samsung Electronics Co., Ltd. Method and user device for providing context awareness service using speech recognition
CN103869971A (en) * 2012-12-10 2014-06-18 三星电子株式会社 Method and user device for providing context awareness service using speech recognition
US10341442B2 (en) 2015-01-12 2019-07-02 Samsung Electronics Co., Ltd. Device and method of controlling the device
US9503041B1 (en) * 2015-05-11 2016-11-22 Hyundai Motor Company Automatic gain control module, method for controlling the same, vehicle including the automatic gain control module, and method for controlling the vehicle
US9875583B2 (en) * 2015-10-19 2018-01-23 Toyota Motor Engineering & Manufacturing North America, Inc. Vehicle operational data acquisition responsive to vehicle occupant voice inputs
US9928833B2 (en) 2016-03-17 2018-03-27 Toyota Motor Engineering & Manufacturing North America, Inc. Voice interface for a vehicle

Also Published As

Publication number Publication date
DE3272921D1 (en) 1986-10-02
JPS5870292A (en) 1983-04-26
JPS6367198B2 (en) 1988-12-23
EP0078014B1 (en) 1986-08-27
EP0078014A1 (en) 1983-05-04

Similar Documents

Publication Publication Date Title
US4532648A (en) Speech recognition system for an automotive vehicle
US4558459A (en) Speech recognition system for an automotive vehicle
US4531228A (en) Speech recognition system for an automotive vehicle
US4610023A (en) Speech recognition system and method for variable noise environment
US4597098A (en) Speech recognition system in a variable noise environment
US4538295A (en) Speech recognition system for an automotive vehicle
US4912766A (en) Speech processor
US9026438B2 (en) Detecting barge-in in a speech dialogue system
EP0763812B1 (en) Speech signal processing apparatus for detecting a speech signal from a noisy speech signal
EP0438174B1 (en) Signal processing device
US4625083A (en) Voice operated switch
US4677389A (en) Noise-dependent volume control having a reduced sensitivity to speech signals
US5583969A (en) Speech signal processing apparatus for amplifying an input signal based upon consonant features of the signal
JPS6329754B2 (en)
EP0459384B1 (en) Speech signal processing apparatus for cutting out a speech signal from a noisy speech signal
EP0100773B1 (en) Speech recognition system for an automotive vehicle
JP3350106B2 (en) Voice recognition device
JPH02232697A (en) Voice recognition device
JPS5868097A (en) Voice recognition equipment for vehicle
JPH04230800A (en) Voice signal processor
JP2000039900A (en) Speech interaction device with self-diagnosis function
JPH07101853B2 (en) Noise reduction method
JPH09127982A (en) Voice recognition device
JP3299574B2 (en) Recognition device
JP3294286B2 (en) Speech recognition system

Legal Events

Date Code Title Description
AS Assignment

Owner name: NISSAN MOTOR COMPANY, LIMITED, 2, TAKARA-CHO, KANA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST.;ASSIGNORS:NOSO, KAZUNORI;KISHI, NORIMASA;FUTAMI, TORU;REEL/FRAME:004055/0400

Effective date: 19820904

Owner name: NISSAN MOTOR COMPANY, LIMITED, JAPAN

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:NOSO, KAZUNORI;KISHI, NORIMASA;FUTAMI, TORU;REEL/FRAME:004055/0400

Effective date: 19820904

AS Assignment

Owner name: AT & T TECHNOLOGIES, INC.,

Free format text: CHANGE OF NAME;ASSIGNOR:WESTERN ELECTRIC COMPANY, INCORPORATED;REEL/FRAME:004251/0868

Effective date: 19831229

STCF Information on status: patent grant

Free format text: PATENTED CASE

FEPP Fee payment procedure

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 4

FEPP Fee payment procedure

Free format text: PAYER NUMBER DE-ASSIGNED (ORIGINAL EVENT CODE: RMPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Free format text: PAYOR NUMBER ASSIGNED (ORIGINAL EVENT CODE: ASPN); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

FPAY Fee payment

Year of fee payment: 8

FPAY Fee payment

Year of fee payment: 12