CN101419795B - Audio signal detection method and device, and auxiliary oral language examination system - Google Patents

Audio signal detection method and device, and auxiliary oral language examination system Download PDF

Info

Publication number
CN101419795B
CN101419795B CN2008102392007A CN200810239200A CN101419795B CN 101419795 B CN101419795 B CN 101419795B CN 2008102392007 A CN2008102392007 A CN 2008102392007A CN 200810239200 A CN200810239200 A CN 200810239200A CN 101419795 B CN101419795 B CN 101419795B
Authority
CN
China
Prior art keywords
information
signal
sound signal
volume
ratio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN2008102392007A
Other languages
Chinese (zh)
Other versions
CN101419795A (en
Inventor
李伟
徐波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin Xunfei Information Technology Co ltd
Original Assignee
Beijing Zhichengzhuosheng Technology Dev Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Zhichengzhuosheng Technology Dev Co ltd filed Critical Beijing Zhichengzhuosheng Technology Dev Co ltd
Priority to CN2008102392007A priority Critical patent/CN101419795B/en
Publication of CN101419795A publication Critical patent/CN101419795A/en
Application granted granted Critical
Publication of CN101419795B publication Critical patent/CN101419795B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Electrically Operated Instructional Devices (AREA)

Abstract

The invention discloses a method used for detecting voice frequency signal and a device thereof, as well as a system for assisting oral exam; the method comprises: the voice frequency signal is recorded; when the voice signal existing in the recorded voice frequency signal is determined, the amplitude information and the noise-signal ratio information in the voice frequency signal are acquired; when the volume of the voice signal in the voice frequency signal is judged to be unnormal according to the acquired amplitude information and noise-signal ratio information, the prompt is sent out aiming at the unnormal volume of the voice signal. By adopting the technical proposal, the problems of lower reliability and flexibility of a computer system for assisting the oral exam in the prior art are solved.

Description

Audio signal detection method and device and auxiliary oral language examination system
Technical field
The present invention relates to the signal detection technique field, particularly a kind of audio signal detection method and device and auxiliary oral language examination system.
Background technology
At present, the SET of class of languages has been brought into use computer system to assist and has been carried out, the place operated by rotary motion of SET is at computer room, wherein a computing machine is invigilator's computing machine, all the other are taken an examination with computing machine (below be called the auxiliary oral language examination client) for the examinee, the supervisor monitors whole examination process by the invigilator with computing machine (following abbreviation monitoring server), after the examinee logs on oral language examination system, directly with virtual examination scene in the personage open a dialogue, thereby finish the examination task, examinee's voice under the auxiliary oral language examination client records are unified scoring to the voice of record after examination is finished.By above-mentioned this interactive examination mode, thereby having shortened the examinee simultaneously greatly yet and having waited the time of examining because of the phenomenon in the face of the normal performance of the nervous influence of supervisor can not appear in the examinee; Owing to adopting the networking scoring, realized in addition reexamining that the examinee shows, and to the dynamic tracking of teacher's performance of marking, thereby error score reduced.
But adopt the computer system auxiliary oral language examination very high to the requirement of computing machine sound pick-up outfit, because the difference of hardware and software, the situation that the speech volume that may occur recording is excessive or too small, examinee's examination recording is the fault recording under the above-mentioned situation, if just finding examinee's recording in the scoring process after examination is the fault recording, this examinee's mark just is difficult to determine so, and this just makes the reliability of computer system auxiliary oral language examination and dirigibility lower.
Summary of the invention
The embodiment of the invention provides a kind of audio signal detection method and device, in order to solve the reliability and all lower problem of dirigibility of the computer system auxiliary oral language examination that exists in the prior art.
Accordingly, the embodiment of the invention also provides a kind of auxiliary oral language examination system.
Technical solution of the present invention is as follows:
A kind of audio signal detection method, the method comprising the steps of: the recording audio signal; When in determining described sound signal, having voice signal, obtain amplitude information and signal to noise ratio (S/N ratio) information in the described sound signal; And, judge the volume of the voice signal in the described sound signal when undesired according to amplitude information that obtains and signal to noise ratio (S/N ratio) information, send prompting at the voice signal volume is undesired.
A kind of sound signal pick-up unit comprises: recording elements is used for the recording audio signal; Determining unit is used for determining whether the described sound signal that recording elements is recorded exists voice signal; First acquiring unit is used for obtaining amplitude information and signal to noise ratio (S/N ratio) information in the described sound signal when determining unit is determined described sound signal and had voice signal; First judging unit is used for amplitude information and the signal to noise ratio (S/N ratio) information obtained according to first acquiring unit, judges whether the volume of the voice signal in the described sound signal is normal; Tip element is used in the judged result of first judging unit for not the time, sends prompting at the voice signal volume is undesired.
A kind of auxiliary oral language examination system, comprise auxiliary oral language examination client and monitoring server, the auxiliary oral language examination client, be used for the recording audio signal, and when in determining described sound signal, having voice signal, obtain amplitude information and signal to noise ratio (S/N ratio) information in the described sound signal, and according to described amplitude information and signal to noise ratio (S/N ratio) information, judge the volume of the voice signal in the described sound signal when undesired, send prompting at the voice signal volume is undesired; And be used for obtaining second characteristic information of the sound signal of recording in the stipulated time length, and according to described second characteristic information that obtains, judge sound signal in this stipulated time length when undesired, with the sound signal in this stipulated time length and should stipulated time length in sound signal be that the information of abnormal signal sends to monitoring server; Monitoring server, be used to show in the stipulated time length that the auxiliary oral language examination client sends sound signal and should stipulated time length in sound signal be the information of abnormal signal.
Technical solution of the present invention is by in the computer system auxiliary oral language examination, sound signal by auxiliary oral language examination client recording examinee, when in determining the sound signal of recording, having voice signal, obtain amplitude information and signal to noise ratio (S/N ratio) information in the above-mentioned sound signal, according to amplitude information that obtains and signal to noise ratio (S/N ratio) information, whether the volume of judging the voice signal in the above-mentioned sound signal is normal, in judged result for not the time, send prompting at the voice signal volume is undesired, this has just been avoided just finding in the scoring process after examination that examinee's recording is the fault recording, make examinee's mark be difficult to determine, realized that the examination recording to the examinee detects in computer system auxiliary oral language examination process, and at detected fault recording is pointed out, thereby effectively raise the reliability and the dirigibility of computer system auxiliary oral language examination.
Description of drawings
Fig. 1 is in the embodiment of the invention, the audio signal detection method schematic flow sheet;
Fig. 2 is in the embodiment of the invention, periodically sound signal is carried out the testing process synoptic diagram;
Fig. 3 is in the embodiment of the invention, sound signal pick-up unit structural representation.
Embodiment
The embodiment of the invention proposes, in the computer system auxiliary oral language examination, by the voice signal volume of auxiliary oral language examination client in the sound signal that the examinee that judgement obtains recording sends when undesired, send prompting at the voice signal volume is undesired, thereby realized that the examination recording to the examinee detects in computer system auxiliary oral language examination process, and at detected fault recording is pointed out, thereby the reliability and the dirigibility that have improved the computer system auxiliary oral language examination.
Below in conjunction with Figure of description the embodiment of the invention is elaborated.
As shown in Figure 1, be embodiment of the invention sound intermediate frequency signal detecting method process flow diagram, its processing procedure is as follows:
Step 101, the recording audio signal.
Whether step 102 exists voice signal in the sound signal of determining to record, wherein the specific implementation of this process can be as follows:
At first obtain first characteristic information in the above-mentioned sound signal, according to first characteristic information that obtains, determine whether there is voice signal in the above-mentioned sound signal, first characteristic information that wherein obtains can but be not limited to fundamental frequency information or Mel cepstrum coefficient (MFCC, Mel-Frequency Cepstral Coefficient) information etc., preferable, first characteristic information that obtains can also be fundamental frequency information and MFCC information.
There is voice signal in step 103 if determine in the above-mentioned sound signal in the step 102, then obtains amplitude information and signal to noise ratio (S/N ratio) information in the above-mentioned sound signal.
Step 104 according to amplitude information that obtains in the step 103 and signal to noise ratio (S/N ratio) information, judges whether the volume of the voice signal in the above-mentioned sound signal is normal, and wherein the specific implementation process of this process can be as follows:
At first according to amplitude information and the signal to noise ratio (S/N ratio) information obtained, determine the volume value of voice signal in the above-mentioned sound signal, whether the above-mentioned volume value of judge determining then is between first defined threshold and second defined threshold, if, the volume of then determining voice signal is normal, if not, determine that then the volume of voice signal is unusual, stipulate here that wherein above-mentioned first defined threshold is less than second defined threshold.
The voice signal volume is further unusually may to comprise two kinds of situations, be the voice signal volume less than normal quantity and voice signal volume greater than normal quantity, concrete definite mode is: obtain volume value less than first defined threshold if judge, determine that then the volume of voice signal is less than normal quantity, if judge to obtain described volume value, determine that then the volume of voice signal is greater than normal quantity greater than second defined threshold.
Step 105 if the judged result in the step 104 is not for, is then sent prompting at the voice signal volume is undesired, wherein Ti Shi specific implementation can but be not limited to following:
If determine to obtain the volume of voice signal less than normal quantity, whether the ratio of intensity level of then judging the intensity level of the air-flow composition in the above-mentioned sound signal and voice signal is less than the 3rd defined threshold, when judgement obtains above-mentioned ratio less than the 3rd defined threshold, then send and reduce and the information of the distance between the input equipment of recording, when judgement obtains above-mentioned ratio and is not less than the 3rd defined threshold, then send the information that increases the pronunciation volume.
If determine to obtain the volume of voice signal greater than normal quantity, whether the ratio of intensity level of then judging the intensity level of the air-flow composition in the above-mentioned sound signal and voice signal is greater than the 4th defined threshold, when judgement obtains above-mentioned ratio greater than the 4th defined threshold, then send the information of distance between increase and the recording input media, when judgement obtains above-mentioned ratio and is not more than the 4th defined threshold, then send the information that reduces to pronounce volume.
When wherein the audio signal detection method of above-mentioned introduction is implemented in the auxiliary oral language examination client, just can realize that the auxiliary oral language examination client detects whether fault of the sound signal of examinee by microphone records, and when fault, in time point out the examinee to make the corresponding action adjustment, for example when the sound signal volume that detects the examinee is too small, the prompting examinee increase the pronunciation volume or reduce and microphone between distance, and when the sound signal volume that detects the examinee was excessive, the prompting examinee reduced the distance between volume or increase and the microphone or the like of pronouncing.
In addition, at the voice signal volume is undesired send prompting after, can also further obtain second characteristic information in the sound signal of recording in the stipulated time length, according to second characteristic information that obtains, judge whether the sound signal in this stipulated time length is normal, and in judged result for not the time, with the sound signal in this stipulated time length and should stipulated time length in sound signal be that the information of abnormal signal sends to monitoring server.Wherein second characteristic information can and overflow at least a information in the energy information for energy information, fundamental frequency information, pulse energy information, energy hunting information.So just can realize that the auxiliary oral language examination client can be in each stipulated time length (for example should stipulated time length can be time of having recorded one examination paper etc.), detect the sound signal fault whether in this stipulated time length, if the fault of detecting then in time sound signal in this section period and the information that breaks down are sent to monitoring server, thereby make the supervisor who is sitting in the monitoring server front can in time learn the auxiliary oral language examination client that sound signal breaks down, thereby in time make corresponding actions, guaranteed the stability of whole SET preferably.
Provide more specifically embodiment below.
In embodiments of the present invention, can periodically detect the sound signal of recording, for example establishing sense cycle is 1 second, so whenever, record 1 second sound signal, just the sound signal in this second is carried out relevant detection, also can the sound signal in the stipulated time section be detected, for example the sound signal in 2 seconds to 5 seconds is detected, again the sound signal in 10 seconds to 12 seconds is detected, whether the above-mentioned sound signal recorded of either way can detecting in the SET process is the fault recording, and sends prompting when detecting the fault recording.
As shown in Figure 2, for whenever recording 1 second sound signal, just the sound signal in this second is carried out the concrete implementing procedure figure of respective detection, its processing procedure is as follows:
Step 201, adopting length is 0.02 second, and overlapping 0.01 second rectangular window between window and the window, and the N sound signal of recording second is divided into 99 sections, and every section audio signal is as a frame, and wherein N is a natural number;
Step 201 is obtained the fundamental frequency information of each frame;
Step 203, judge whether to have more than 20 frames in this second and can get access to fundamental frequency information, if judged result is for being, then determine to have voice signal in the sound signal in this second, execution in step 204, if judged result is then determined not have voice signal, execution in step 216 in the sound signal in this second for not; Wherein can from vowel and part consonant, can get access to fundamental frequency information.
Step 204 is obtained the amplitude information and the signal to noise ratio (S/N ratio) information of sound signal in this second;
Step 205, amplitude information that obtains according to step 204 and signal to noise ratio (S/N ratio) information are determined in this second the volume value of voice signal in the sound signal;
Whether step 206, the volume value of judge determining be between first defined threshold and second defined threshold, if judged result determines that then the volume value of voice signal is unusual, execution in step 207 for not; If judged result is for being, then the volume value of definite voice signal is normal, execution in step 216;
Whether step 207 judges the volume value of determining less than first defined threshold, if then the volume of determining voice signal is less than normal quantity, execution in step 208 if not, determines that then the volume of voice signal is greater than normal quantity execution in step 211;
Step 208, whether the ratio of intensity level of judging the intensity level of the air-flow composition in the sound signal in this second and voice signal is less than the 3rd defined threshold, if then execution in step 209, if not, then execution in step 210;
If the signal to noise ratio (S/N ratio) of sound signal is in predesignating scope in this second, can increase the recording gain.
In advance by adding up the sound of speaking of different people, the average and the variance of the MFCC information of pure ground unrest and various air-flow compositions, set up corresponding mixed Gauss model, based on the model of setting up the MFCC information of the sound signal recorded is carried out Classification and Identification, determine the ratio of the intensity level of the intensity level of air-flow composition and voice signal.
Step 209 is sent and is reduced and the information of the distance between the input equipment of recording;
The output device of wherein recording is generally microphone, if the ratio of determining in the step 208 thinks then that less than the 3rd defined threshold examinee's face distance microphone is near excessively, sends the information that the prompting examinee reduces distance between face and the microphone this moment.
Step 210 is sent the information that increases the pronunciation volume;
If the ratio of determining in the step 208 is not less than the 3rd defined threshold, think that then examinee's face distance microphone is suitable, but the volume of examinee's pronunciation is too small, send the information that the prompting examinee increases the pronunciation volume this moment.
Step 211, whether the ratio of intensity level of judging the intensity level of the air-flow composition in the sound signal in this second and voice signal is greater than the 4th defined threshold, if then execution in step 212, if not, then execution in step 213;
Same, if the signal to noise ratio (S/N ratio) of sound signal is in predesignating scope in this second, can increase the recording gain.
Step 212 is sent the information that increases with the distance between the input media of recording;
If above-mentioned ratio greater than the 4th defined threshold, thinks that then examinee's face distance microphone is far away excessively, send the information that the prompting examinee increases distance between face and the microphone this moment.
Step 213 is sent the information that reduces to pronounce volume;
If above-mentioned ratio is not more than the 4th defined threshold, think that then examinee's face distance microphone is suitable, but the volume of examinee's pronunciation is excessive, send prompting examinee reduce the to pronounce information of volume this moment.
Step 214 is obtained energy information, fundamental frequency information, pulse energy information, the energy hunting information in the sound signal of recording in the stipulated time length and is overflowed energy information;
Because breaking down, sound pick-up outfit or computer system also may cause the fault recording, the waveform that comprises sound signal is straight line substantially, that is to say and do not have the record fault recording of voice signal down, the waveform of sound signal is that the fault recording of impact noise and the waveform of sound signal are the intensive fault recording of overflowing noise.
If the examinee need answer 10 road exercise questions when carrying out SET, the stipulated time length of per pass exercise question all is 5 minutes, whether when test taker answers is finished one exercise question, also will further detect this road exercise question corresponding audio signal is the fault recording that is caused by sound pick-up outfit or computer system so.
Step 215, as if the above-mentioned information of obtaining according to step 214, the sound signal of determining in this stipulated time length is unusual, the sound signal in then should stipulated time length and should stipulated time length in sound signal be that the information of abnormal signal sends to monitoring server.
After the supervisor receives information by monitoring server, confirm whether the sound signal that receives is genuine unusual, if confirm as unusually, the supervisor can in time handle, perhaps allow this examinee adopt standby sound pick-up outfit to finish examination this time.
Step 216 after N is set to N+1, goes to step 201.
In the embodiment of the invention, judge whether there is voice signal in the sound signal according to fundamental frequency information, and according to energy information, fundamental frequency information, pulse energy information, energy hunting information with overflow energy information and judge whether sound signal is normal, all be those skilled in the art's common technology means, therefore concrete deterministic process repeats no more here.
By above-mentioned processing procedure as can be known, in the technical solution of the present invention, in the computer system auxiliary oral language examination, when in the sound signal that the examinee that judgement obtains recording sends, having voice signal by the auxiliary oral language examination client, obtain amplitude information and signal to noise ratio (S/N ratio) information in the above-mentioned sound signal, according to amplitude information that obtains and signal to noise ratio (S/N ratio) information, whether the volume of judging the voice signal in the above-mentioned sound signal is normal, in judged result for not the time, send prompting at the voice signal volume is undesired, this has just been avoided just finding in the scoring process after examination that examinee's recording is the fault recording, make examinee's mark be difficult to determine, realized that the examination recording to the examinee detects in area of computer aided SET process, and at detected fault recording is pointed out, thereby effectively raise the reliability and the dirigibility of computer system auxiliary oral language examination.
Accordingly, the present invention also provides a kind of sound signal pick-up unit, as shown in Figure 3, comprises recording elements 301, determining unit 302, first acquiring unit 303, first judging unit 304 and Tip element 305.
Wherein recording elements 301, are used for the recording audio signal;
Determining unit 302 is used for determining whether the sound signal that recording elements 301 is recorded exists voice signal;
First acquiring unit 303 is used for obtaining amplitude information and signal to noise ratio (S/N ratio) information in the above-mentioned sound signal when determining unit 302 is determined above-mentioned sound signal and had voice signal;
First judging unit 304 is used for amplitude information and the signal to noise ratio (S/N ratio) information obtained according to first acquiring unit 303, judges whether the volume of the voice signal in the above-mentioned sound signal is normal;
Tip element 305 is used in the judged result of first judging unit 304 for not the time, sends prompting at the voice signal volume is undesired.
Wherein determining unit 302 comprises that specifically obtaining subelement and first determines subelement, obtains first characteristic information that subelement is used for obtaining the sound signal that recording elements 301 records; First determines that subelement is used for according to obtaining first characteristic information that subelement obtains, and determines whether to have voice signal in the sound signal that recording elements 301 records.
First judging unit comprises that specifically second determines subelement, first judgment sub-unit and the 3rd definite subelement, second determines that subelement is used for amplitude information and the signal to noise ratio (S/N ratio) information of obtaining according to first acquiring unit 303, determines the volume value of voice signal in the sound signal; First judgment sub-unit is used to judge that second determines volume value that subelement determines whether between first defined threshold and second defined threshold, and wherein first defined threshold is less than second defined threshold; The 3rd determines that subelement is used in the judged result of first judgment sub-unit when being, the volume of determining voice signal is normal, and in the judged result of first judgment sub-unit for not the time, determine that the volume of voice signal is unusual, wherein first defined threshold is less than second defined threshold.
The 3rd definite subelement determines that the volume of voice signal comprises two kinds of embodiments unusually, first kind of embodiment: obtain the voice signal volume value less than first defined threshold if first judgment sub-unit is judged, then the 3rd definite subelement determines that the volume of voice signal is less than normal quantity; Second kind of embodiment: obtain the voice signal volume value greater than second defined threshold if first judgment sub-unit is judged, then the 3rd definite subelement determines that the volume of voice signal is greater than normal quantity.
At above-mentioned first kind of embodiment, promptly the 3rd definite subelement determines that the volume of voice signal is less than normal quantity, Tip element 305 specifically comprises second judgment sub-unit and the first prompting subelement, and second judgment sub-unit is used for judging that whether the ratio of intensity level of the intensity level of air-flow composition of sound signal and voice signal is less than the 3rd defined threshold; The first prompting subelement is used for when the judgement of second judgment sub-unit obtains above-mentioned ratio less than the 3rd defined threshold, send and reduce and the information of the distance between the input equipment of recording, and when second judgment sub-unit judges that obtaining above-mentioned ratio is not less than the 3rd defined threshold, send the information that increases the pronunciation volume.
At above-mentioned second kind of embodiment, promptly the 3rd definite subelement determines that the volume of voice signal is greater than normal quantity, Tip element 305 specifically comprises the 3rd judgment sub-unit and the second prompting subelement, and the 3rd judgment sub-unit is used for judging that whether the ratio of intensity level of the intensity level of air-flow composition of sound signal and voice signal is greater than the 4th defined threshold; The second prompting subelement is used for when the judgement of the 3rd judgment sub-unit obtains above-mentioned ratio greater than the 4th defined threshold, send the information of distance between increase and the recording input media, and when the 3rd judgment sub-unit judges that obtaining above-mentioned ratio is not more than the 4th defined threshold, send the information that reduces to pronounce volume.
Further, embodiment of the invention sound intermediate frequency signal supervisory instrument can further include second acquisition unit, second judging unit and transmitting element, and second acquisition unit is used for obtaining second characteristic information of the sound signal of recording in the stipulated time length; Second judging unit is used for described second characteristic information that obtains according to second acquisition unit, judges whether the sound signal in this stipulated time length is normal; Transmitting element is used in the judged result of second judging unit for not the time, with the sound signal in this stipulated time length and should stipulated time length in sound signal be that the information of abnormal signal sends to monitoring server.
Wherein, first characteristic information can but be not limited to fundamental frequency information and/or Mel cepstrum coefficient information; Second characteristic information is energy information, fundamental frequency information, pulse energy information, energy hunting information and overflows at least a information in the energy information.
The embodiment of the invention also provides a kind of auxiliary oral language examination system, comprise auxiliary oral language examination client and monitoring server, auxiliary oral language examination client wherein, be used for the recording audio signal, and when in determining sound signal, having voice signal, obtain amplitude information and signal to noise ratio (S/N ratio) information in the sound signal, and according to amplitude information and signal to noise ratio (S/N ratio) information, whether the volume of judging the voice signal in the sound signal is normal, and in judged result when being undesired, send prompting at the voice signal volume is undesired; And be used for obtaining second characteristic information of the sound signal of recording in the stipulated time length, and according to second characteristic information that obtains, judge whether the sound signal in this stipulated time length is normal, in judged result when being undesired, with the sound signal in this stipulated time length and should stipulated time length in sound signal be that the information of abnormal signal sends to monitoring server;
Monitoring server, be used to show in the stipulated time length that the auxiliary oral language examination client sends sound signal and should stipulated time length in sound signal be the information of abnormal signal.
Obviously, those skilled in the art can carry out various changes and modification to the present invention and not break away from the spirit and scope of the present invention.Like this, if of the present invention these are revised and modification belongs within the scope of claim of the present invention and equivalent technologies thereof, then the present invention also is intended to comprise these changes and modification interior.

Claims (13)

1. an audio signal detection method is characterized in that, comprising:
The recording audio signal;
When in determining described sound signal, having voice signal, obtain amplitude information and signal to noise ratio (S/N ratio) information in the described sound signal; And
According to amplitude information that obtains and signal to noise ratio (S/N ratio) information, judge the volume of the voice signal in the described sound signal when undesired, send prompting at the voice signal volume is undesired;
Obtain second characteristic information in the sound signal of recording in the stipulated time length; And
According to described second characteristic information that obtains, judge sound signal in this stipulated time length when undesired, with the sound signal in this stipulated time length and should stipulated time length in sound signal be that the information of abnormal signal sends to monitoring server.
2. detection method as claimed in claim 1 is characterized in that, determines in the described sound signal to have voice signal, specifically comprises:
Obtain first characteristic information in the described sound signal; And
According to described first characteristic information, determine in the described sound signal and have voice signal.
3. detection method as claimed in claim 1 is characterized in that, according to amplitude information that obtains and signal to noise ratio (S/N ratio) information, the volume of judging the voice signal in the described sound signal is undesired, specifically comprises:
According to amplitude information that obtains and signal to noise ratio (S/N ratio) information, determine the volume value of voice signal in the described sound signal;
If judge the described volume value that obtains determining, determine that then described volume is undesired, and described volume is less than normal quantity less than first defined threshold; And
If judge the described volume value that obtains determining, determine that then described volume is undesired, and described volume is greater than normal quantity greater than second defined threshold;
Described first defined threshold is less than second defined threshold.
4. detection method as claimed in claim 3 is characterized in that, in definite described volume during less than normal quantity, sends prompting at the voice signal volume is undesired, specifically comprises:
Whether the ratio of intensity level of judging the intensity level of the air-flow composition in the described sound signal and voice signal is less than the 3rd defined threshold; And
When judgement obtains described ratio less than the 3rd defined threshold, send and reduce and the information of the distance between the input equipment of recording;
When judgement obtains described ratio and is not less than the 3rd defined threshold, send the information that increases the pronunciation volume.
5. detection method as claimed in claim 3 is characterized in that, in definite described volume during greater than normal quantity, sends prompting at the voice signal volume is undesired, specifically comprises:
Whether the ratio of intensity level of judging the intensity level of the air-flow composition in the described sound signal and voice signal is greater than the 4th defined threshold; And
When judgement obtains described ratio greater than the 4th defined threshold, send the information of distance between increase and the recording input media;
When judgement obtains described ratio and is not more than the 4th defined threshold, send the information that reduces to pronounce volume.
6. detection method as claimed in claim 2 is characterized in that, described first characteristic information is fundamental frequency information and/or Mel cepstrum coefficient information.
7. detection method as claimed in claim 1 is characterized in that, described second characteristic information is energy information, fundamental frequency information, pulse energy information, energy hunting information and overflows at least a information in the energy information.
8. a sound signal pick-up unit is characterized in that, comprising:
Recording elements is used for the recording audio signal;
Determining unit is used for determining whether the described sound signal that recording elements is recorded exists voice signal;
First acquiring unit is used for obtaining amplitude information and signal to noise ratio (S/N ratio) information in the described sound signal when determining unit is determined described sound signal and had voice signal;
First judging unit is used for amplitude information and the signal to noise ratio (S/N ratio) information obtained according to first acquiring unit, judges whether the volume of the voice signal in the described sound signal is normal;
Tip element is used in the judged result of first judging unit for not the time, sends prompting at the voice signal volume is undesired;
Second acquisition unit is used for obtaining second characteristic information of the sound signal of recording in the stipulated time length;
Second judging unit is used for described second characteristic information that obtains according to second acquisition unit, judges whether the sound signal in this stipulated time length is normal;
Transmitting element is used in the judged result of second judging unit for not the time, with the sound signal in this stipulated time length and should stipulated time length in sound signal be that the information of abnormal signal sends to monitoring server.
9. pick-up unit as claimed in claim 8 is characterized in that, described determining unit specifically comprises:
Obtain subelement, be used for obtaining first characteristic information of the described sound signal that recording elements records;
First determines subelement, is used for according to obtaining described first characteristic information that subelement obtains, and determines whether to have voice signal in the described sound signal that recording elements records.
10. pick-up unit as claimed in claim 8 is characterized in that, described first judging unit specifically comprises:
Second determines subelement, is used for amplitude information and the signal to noise ratio (S/N ratio) information obtained according to first acquiring unit, determines the volume value of voice signal in the described sound signal;
First judgment sub-unit is used to judge that second determines described volume value that subelement determines whether between first defined threshold and second defined threshold, and wherein said first defined threshold is less than second defined threshold;
The 3rd determines subelement, be used in the judged result of first judgment sub-unit when being, determine that described volume is normal, when the judgement of first judgment sub-unit obtains described volume value less than first defined threshold, determine that described volume is undesired, and described volume is less than normal quantity, and judges when obtaining described volume value greater than second defined threshold in first judgment sub-unit, determine that described volume is undesired, and described volume is greater than normal quantity.
11. pick-up unit as claimed in claim 10 is characterized in that, when the 3rd determined that subelement is determined described volume less than normal quantity, described Tip element specifically comprised:
Second judgment sub-unit, whether the ratio of intensity level that is used for judging the intensity level of air-flow composition of described sound signal and voice signal is less than the 3rd defined threshold;
The first prompting subelement is used for judging when obtaining described ratio less than the 3rd defined threshold in second judgment sub-unit, sends to reduce and the information of the distance between the input equipment of recording, and
When second judgment sub-unit judges that obtaining described ratio is not less than the 3rd defined threshold, send the information that increases the pronunciation volume.
12. pick-up unit as claimed in claim 10 is characterized in that, when the 3rd determined that subelement is determined described volume greater than normal quantity, described Tip element specifically comprised:
The 3rd judgment sub-unit, whether the ratio of intensity level that is used for judging the intensity level of air-flow composition of described sound signal and voice signal is greater than the 4th defined threshold;
The second prompting subelement is used for when the judgement of the 3rd judgment sub-unit obtains described ratio greater than the 4th defined threshold, sends the information of distance between increase and the recording input media, and
When the 3rd judgment sub-unit judges that obtaining described ratio is not more than the 4th defined threshold, send the information that reduces to pronounce volume.
13. an auxiliary oral language examination system comprises auxiliary oral language examination client and monitoring server, it is characterized in that, wherein:
The auxiliary oral language examination client, be used for the recording audio signal, and when in determining described sound signal, having voice signal, obtain amplitude information and signal to noise ratio (S/N ratio) information in the described sound signal, and according to described amplitude information and signal to noise ratio (S/N ratio) information, judge the volume of the voice signal in the described sound signal when undesired, send prompting at the voice signal volume is undesired; And
Be used for obtaining second characteristic information of the sound signal of recording in the stipulated time length, and according to described second characteristic information that obtains, judge sound signal in this stipulated time length when undesired, with the sound signal in this stipulated time length and should stipulated time length in sound signal be that the information of abnormal signal sends to monitoring server;
Monitoring server, be used to show in the stipulated time length that the auxiliary oral language examination client sends sound signal and should stipulated time length in sound signal be the information of abnormal signal.
CN2008102392007A 2008-12-03 2008-12-03 Audio signal detection method and device, and auxiliary oral language examination system Expired - Fee Related CN101419795B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2008102392007A CN101419795B (en) 2008-12-03 2008-12-03 Audio signal detection method and device, and auxiliary oral language examination system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2008102392007A CN101419795B (en) 2008-12-03 2008-12-03 Audio signal detection method and device, and auxiliary oral language examination system

Publications (2)

Publication Number Publication Date
CN101419795A CN101419795A (en) 2009-04-29
CN101419795B true CN101419795B (en) 2011-04-06

Family

ID=40630560

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2008102392007A Expired - Fee Related CN101419795B (en) 2008-12-03 2008-12-03 Audio signal detection method and device, and auxiliary oral language examination system

Country Status (1)

Country Link
CN (1) CN101419795B (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102044246B (en) * 2009-10-15 2012-05-23 华为技术有限公司 Method and device for detecting audio signal
CN102811386B (en) * 2011-06-01 2017-05-24 中兴通讯股份有限公司 Recording device, media server, recording method and system
CN103578470B (en) * 2012-08-09 2019-10-18 科大讯飞股份有限公司 A kind of processing method and system of telephonograph data
CN103929707B (en) * 2014-04-08 2019-03-01 努比亚技术有限公司 A kind of method and mobile terminal detecting microphone audio tunnel condition
CN107205204B (en) * 2016-03-16 2022-09-13 广州启辰电子科技有限公司 Earphone fault detection method and special earphone for examination
CN107548564B (en) * 2016-04-29 2021-02-26 华为技术有限公司 Method, device, terminal and storage medium for determining voice input abnormity
CN107580113B (en) * 2017-08-18 2019-09-24 Oppo广东移动通信有限公司 Reminding method, device, storage medium and terminal
CN109785683A (en) * 2017-11-13 2019-05-21 上海流利说信息技术有限公司 For simulating method, apparatus, electronic equipment and the medium at speaking test scene
CN109246299A (en) * 2018-08-31 2019-01-18 深圳市万普拉斯科技有限公司 Rapid Speech recording method, device, mobile terminal and computer storage medium
CN109686158A (en) * 2019-01-07 2019-04-26 九江学院 A kind of method of accountant management system ability culture simulation and training
CN110473525B (en) * 2019-09-16 2022-04-05 百度在线网络技术(北京)有限公司 Method and device for acquiring voice training sample
CN111210839A (en) * 2019-12-31 2020-05-29 秒针信息技术有限公司 Method and device for detecting recording equipment
CN113593602B (en) * 2021-07-19 2023-12-05 深圳市雷鸟网络传媒有限公司 Audio processing method and device, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0027066B1 (en) * 1979-09-28 1983-07-06 Thomson-Csf Device for detecting speech signals and transmit-receive switching system comprising such a device
US5459814A (en) * 1993-03-26 1995-10-17 Hughes Aircraft Company Voice activity detector for speech signals in variable background noise
US5991718A (en) * 1998-02-27 1999-11-23 At&T Corp. System and method for noise threshold adaptation for voice activity detection in nonstationary noise environments
CN1753427A (en) * 2004-09-20 2006-03-29 华为技术有限公司 Device and method for automatically regulating mobile terminal loudness
CN1815555A (en) * 2005-02-04 2006-08-09 光宝科技股份有限公司 Electronic radio device and its volume prompting method
CN101197871A (en) * 2006-12-07 2008-06-11 乐金电子(中国)研究开发中心有限公司 Automatic voice measuring and reminding device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0027066B1 (en) * 1979-09-28 1983-07-06 Thomson-Csf Device for detecting speech signals and transmit-receive switching system comprising such a device
US5459814A (en) * 1993-03-26 1995-10-17 Hughes Aircraft Company Voice activity detector for speech signals in variable background noise
US5991718A (en) * 1998-02-27 1999-11-23 At&T Corp. System and method for noise threshold adaptation for voice activity detection in nonstationary noise environments
CN1753427A (en) * 2004-09-20 2006-03-29 华为技术有限公司 Device and method for automatically regulating mobile terminal loudness
CN1815555A (en) * 2005-02-04 2006-08-09 光宝科技股份有限公司 Electronic radio device and its volume prompting method
CN101197871A (en) * 2006-12-07 2008-06-11 乐金电子(中国)研究开发中心有限公司 Automatic voice measuring and reminding device

Also Published As

Publication number Publication date
CN101419795A (en) 2009-04-29

Similar Documents

Publication Publication Date Title
CN101419795B (en) Audio signal detection method and device, and auxiliary oral language examination system
Kahng The effect of pause location on perceived fluency
Walton et al. Speaker race identification from acoustic cues in the vocal signal
Camarata The application of naturalistic conversation training to speech production in children with speech disabilities
Rockwell Vocal features of conversational sarcasm: A comparison of methods
CN101751919B (en) Spoken Chinese stress automatic detection method
CN104464757B (en) Speech evaluating method and speech evaluating device
US20080300874A1 (en) Speech skills assessment
CA2676380A1 (en) System and method for detection and analysis of speech
US20030202007A1 (en) System and method of providing evaluation feedback to a speaker while giving a real-time oral presentation
CN105825852A (en) Oral English reading test scoring method
CN101727900A (en) Method and equipment for detecting user pronunciation
CN103366759A (en) Speech data evaluation method and speech data evaluation device
Snow Imitation of intonation contours by children with normal and disordered language development
Lehner et al. Indicators of communication limitation in dysarthria and their relation to auditory-perceptual speech symptoms: Construct validity of the KommPaS web app
Makagon et al. An acoustic analysis of laughter produced by congenitally deaf and normally hearing college students
Ström et al. Intelligent barge-in in conversational systems.
Finkelstein et al. Investigating the influence of virtual peers as dialect models on students’ prosodic inventory
Onslow et al. Speech timing in children after the Lidcombe Program of early stuttering intervention
CN105719662A (en) Dysarthrosis detection method and dysarthrosis detection system
Duyck et al. Improving accuracy in detecting acoustic onsets.
CN109065024A (en) abnormal voice data detection method and device
Perwitasari Slips of the ears: Study on vowel perception in Indonesian learners of English
Leonard et al. Homonymy and the voiced-voiceless distinction in the speech of children with specific language impairment
Jialin et al. Likelihood ratio-based forensic voice comparison with the Cantonese diphthong/ei/F-pattern

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: BEIJING TAILI TONGLIAN TECHNOLOGY DEVELOPMENT CO.,

Free format text: FORMER OWNER: LI WEI

Effective date: 20110104

Owner name: BEIJING ZHICHENG ZHUOSHENG TECHNOLOGY DEVELOPMENT

Free format text: FORMER OWNER: BEIJING TAILI TONGLIAN TECHNOLOGY DEVELOPMENT CO., LTD.

Effective date: 20110104

Free format text: FORMER OWNER: XU BO

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 100083 602, TOWER B, JINMA BUILDING, NO.38, XUEQING ROAD, HAIDIAN DISTRICT, BEIJING TO: 100083 601-603, TOWER B, JINMA BUILDING, NO.38, XUEQING ROAD, HAIDIAN DISTRICT, BEIJING

TA01 Transfer of patent application right

Effective date of registration: 20110104

Address after: 100083, B building, block 38, Jin Qing Road, 601-603, Beijing, Haidian District

Applicant after: Beijing ZhichengZhuosheng Technology Development Co.,Ltd.

Address before: 100083, B building, block 38, Jin Qing Road, 601-603, Beijing, Haidian District

Applicant before: Beijing Taili Communications Technology Development Co.,Ltd.

Effective date of registration: 20110104

Address after: 100083, B building, block 38, Jin Qing Road, 601-603, Beijing, Haidian District

Applicant after: Beijing Taili Communications Technology Development Co.,Ltd.

Address before: 100083, B building, block 38, Jin Qing Road, 602, Beijing, Haidian District

Applicant before: Li Wei

Co-applicant before: Xu Bo

C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: TIANJIN XUNFEI INFORMATION TECHNOLOGY CO., LTD

Free format text: FORMER OWNER: BEIJING ZHICHENG ZHUOSHENG TECHNOLOGY DEVELOPMENT CO., LTD.

Effective date: 20140303

COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 100083 HAIDIAN, BEIJING TO: 300308 BINHAI NEW DISTRICT, TIANJIN

TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20140303

Address after: 300308, 7 floor, building 3, Crowne Plaza, 55 Central Avenue, Tianjin Airport Economic Zone, 701

Patentee after: TIANJIN XUNFEI INFORMATION TECHNOLOGY Co.,Ltd.

Address before: 100083, B building, block 38, Jin Qing Road, 601-603, Beijing, Haidian District

Patentee before: Beijing ZhichengZhuosheng Technology Development Co.,Ltd.

CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20110406

Termination date: 20171203