CN100511425C

CN100511425C - Method and system for improving ADPCM voice coding step length regulation

Info

Publication number: CN100511425C
Application number: CNB2003101165291A
Authority: CN
Inventors: 林彦士
Original assignee: Elan Microelectronics Corp
Current assignee: Elan Microelectronics Corp
Priority date: 2003-11-14
Filing date: 2003-11-14
Publication date: 2009-07-08
Anticipated expiration: 2023-11-14
Also published as: CN1617227A

Abstract

The method and system for improving ADPCM voice coding step length regulation includes partitioning one voice signal to form several voice frames and pre-coding the voice frames to obtain the optimal voice frame step length varying functions and upper step length limits; and then performing formal coding with the optimal voice frame step length varying functions and upper step length limits of the voice frames to improve voice quality and avoid great error.

Description

Improve the method and system that ADPCM voice coding step-length is regulated

Technical field

The present invention relates to a kind of adaptive differential pulse code modulated (Adaptive DifferentialPulse-Code Modulation; ADPCM) Methods for Coding, particularly a kind of is the method and system of improving the step-length adjusting of ADPCM voice coding.

Technical background

Fig. 1 is the simplified system calcspar of known adpcm encoder 10, and it comprises a quantizer 12, prediction device 14 and step size automatic regulator 16.12 pairs one difference signal Δs of quantizer X quantizes and produces a numerical code C[n] with quantize after difference signal Δ X ' [n], this difference signal Δ X is voice signal X[n] with estimate the difference of signal X ' [n], difference signal Δ X ' [n] after this quantification produces a signal S and inputs to prediction device 14 to produce the new signal X ' [n] that estimates with estimating signal X ' [n] after totalizer combines, step size automatic regulator 16 is according to this numerical code C[n] and output one step change function M (C[n]) to quantizer 12.

Fig. 2 is the system block diagrams of ADPCM decoder 20, and it comprises a de-quantizer 22, a prediction device 24 and step size automatic regulator 26.Step size automatic regulator 26 receives this numerical code C[n], and export step change function M (C[n]) according to this, de-quantizer 22 is according to step change function M (C[n]) this numerical code of de-quantization C[n] produce difference signal Δ X, this difference signal Δ X with estimate signal X ' [n] and after totalizer combines, revert to voice signal X[n], prediction device 24 is according to this voice signal X[n] produce this and estimate signal X ' [n].

The quantizer 12 of known adpcm encoder 10 has regulatory function, it adjusts according to the step-length step_size (n) that step change function M (C[n]) will import quantizer 12, to adapt to the variation of present input difference signal AX, yet, in the step-length renewal process of quantizer 12, be to decide next step value, normally present step-length step_size (n) be multiplied by a step change function M (C[n]) according to present coded data, shown in the following formula (1):

Step_size (n+1)=step_size (n) * M (C[n]) ... formula (1)

Wherein, step_size (n+1) is for giving the used step value of next sampling spot.

Step change function M (C[n]) only with present numerical code C[n] relevant, in general, one step change function M (C[n]) and numerical code C[n are all arranged in step size automatic regulator 16 and 26] the table of comparisons, shown in following table one, and be a default value, it is not regulated at different characteristics of signals, therefore, when the amplitude variations of voice signal is excessive, and pairing step change function M (C[n]) can't do best processing to voice signal, thereby cause distortion.

Table one:

Numerical code C[n]	Step change function M (C[n])
Numerical code C[n]	Step change function M (C[n])	0，1，2，3，8，9，10，11	0.9
4，12	1.2	0，1，2，3，8，9，10，11	0.9
4，12	1.2	5，13	1.6
6，14	2.0	5，13	1.6
6，14	2.0	7，15	2.4

With reference to table, numerical code C[n wherein] be one 4 bit data, as numerical code C[n] be 0,1,2,3,8,9,10 or 11 o'clock, step change function M (C[n]) be 0.9, as numerical code C[n] when being 4 or 12, step change function M (C[n]) be 1.2, as numerical code C[n] when being 5 or 13, step change function M (C[n]) be 1.6, as numerical code C[n] when being 6 or 14, step change function M (C[n]) be 2.0, as numerical code C[n] when being 7 or 15, step change function M (C[n]) be 2.4, as shown in Table 1, different numerical code C[n] and corresponding separately one fixing step change function M (C[n]), have nothing to do with signal itself.

In addition, known adpcm encoder 10 is all set a higher limit to step size, in case halt long excessive and cause distortion, and this higher limit is unique, yet voice signal is all inequality in the amplitude and the dynamic change scope of each time point, needs bigger step-length when scope is big, and scope hour only needs smaller step size, and single fixing step-length higher limit can not satisfy all scopes.

Therefore, a kind of can finding out with the difference of scope can reach signal to noise ratio (S/N ratio) (Signal-to-NoiseRatio; SNR) the step-length higher limit and the ADPCM coding method and the system of step change function are to be the institute Ji.

Summary of the invention

Purpose of the present invention is to propose the method and system that a kind of ADPCM of improvement voice coding step-length is regulated.

Purpose of the present invention, other is to provide a kind of ADPCM voice coding step-length control method and system that finds out the most suitable step change function and step-length higher limit with precoding.

Purpose of the present invention is to provide a kind of again and improves voice coding tonequality and prevent excessive ADPCM coding method and the system that causes error of step-length.

According to the present invention, the method that a kind of ADPCM of improvement voice coding step-length is regulated comprises that cutting apart a voice signal forms a plurality of frames, and each frame of precoding to be obtaining having the step change function and the step-length higher limit of maximum signal to noise ratio, and formally encodes with step change function and step-length higher limit that each frame has a maximum signal to noise ratio.

Because voice signal has the characteristic of slow variation, so the utmost point in the short time signal variation characteristic all very close, therefore the present invention is cut into a plurality of frames that continue with voice signal, and be that unit encodes and regulates with the frame, moreover, the present invention utilizes the mode of precoding according to the maximum upper limit that calculates optimal step change function of each frame and step-length, cause tangible error to improve tonequality and to prevent that step-length is excessive, after the maximum upper limit of optimal step change function that obtains each frame and step-length, again each frame is formally encoded, to obtain best tonequality and to prevent that error from occurring.

After precoding finishes, the optimal step change function of each frame and the maximum upper limit of step-length will be stored in the comparison list, according to this table of comparisons, the step change function and the step size higher limit of ADPCM coded system will be adjusted with frame, therefore, coding of the present invention and decoding system can be made best adjusting at different sound properties, to prevent distortion and to improve tonequality.

Description of drawings

For the personage who has the knack of this skill, cooperate the accompanying drawing follow from following being described in detail, the present invention can more clearly be understood, and its above-mentioned and other purpose and advantage will become more obvious, wherein:

Fig. 1 is known adpcm encoder;

Fig. 2 is known ADPCM decoder;

Fig. 3 is the waveform synoptic diagram of voice signal;

Fig. 4 is the process flow diagram that the present invention improves the ADPCM voice coding;

Fig. 5 is the system block diagrams of adpcm encoder of the present invention; And

Fig. 6 is the system block diagrams of ADPCM decoder of the present invention.

The reference numeral explanation

10 adpcm encoders

12 quantizers

14 prediction devices

20 ADPCM decoders

22 de-quantizer

24 prediction devices

100 voice signals

200 read the into speech data of a frame

202 precodings

20202 make I=1

20204 make J=1

20206 is the step-length upper limit with MaxStepsize (J), is that the step change function is done precoding to entire frame with M (I)

20208 calculate the signal to noise ratio (S/N ratio) after the precoding and write down I and the J value

20210 whether J is more than or equal to k

20212 make J=J+1

20214 whether I is more than or equal to n

20216 make I=I+1

20218 finish precoding, and have found out pairing I of maximum S R value and J value

204 is this step change function with M (I), formally encodes with MaxStepsize (J) step-length higher limit

Whether 206 be last frame

300 adpcm encoders

302 dispensers

304 quantizers

306 prediction devices

308 step size automatic regulators

310 SNR counters

400 adpcm encoders

402 de-quantizer

404 step size automatic regulators

406 prediction devices

Embodiment

Fig. 3 is the waveform synoptic diagram of a voice signal 100, because voice signal 100 has the characteristic of slow variation, therefore the utmost point in the short time signal 100 variation characteristics very close, the present invention utilizes this characteristic that voice signal 100 is divided into a plurality of frames, characteristics of signals in each frame is very close, thereby the signal in same frame can use identical step change function to encode.In this embodiment, the length of each frame is L, then is that unit carries out precoding and formal coding again with the frame, and its flow process as shown in Figure 4.Present embodiment is set the higher limit of k kind step-length, ascending be respectively MaxStepsize (1), MaxStepsize (2) ... to MaxStepsize (k), and n kind step change function, be in regular turn M (1), M (2) ... to M (n), choose optimal step-length higher limit and step change function for each frame.After beginning coding, at first step 200 is read the into speech data of a frame, then in step 202, the frame of being read is into carried out precoding to obtain the optimal step change function of this frame M (I) and step-length upper limit value M axStepsize (J), and then step 204 is formally encoded with this step change function M (I) and step-length upper limit value M axStepsize (J), after finishing formal coding, step 206 judges whether this frame is last frame, then finish coding if yes, otherwise then get back to step 200, continue again next frame is carried out precoding and formal coding.

Refer again to Fig. 4, in precoding step 202, in order to determine optimum stepsize higher limit and the step change function of each frame in k kind step-length upper limit value M axStepsize (J) and n step change function M (I), in

step

20202 and 20204, make I=1 and J=1 respectively, then in step 20206, with MaxStepsize (J) is the step-length higher limit, with M (I) is that the step change function is done precoding to entire frame, follow the signal to noise ratio (S/N ratio) after step 20208 is calculated precoding, and record I and J value, come step 20210 to judge that whether J is more than or equal to k again, if otherwise carry out step 20212, make J=J+1, and repeating step 20206 to 20210, otherwise then carry out step 20214, step 20214 is to judge that whether I is more than or equal to n, if otherwise carry out step 20216, make I=I+1, and repeating step 20204 to 20214 again, otherwise, carry out step 20218 and finish precoding, and found out pairing I of maximum S R value and J value, M (I) and MaxStepsize (J) with maximum S R value, the optimum stepsize that is the frame that reads changes function and step-length higher limit.Method not only can be regulated the step change function according to difference by this, more can regulate step change function and step-length higher limit according to different frames, thereby obtains the ADPCM coding of the most suitable voice signal characteristic.

Fig. 5 is the system block diagrams of adpcm encoder 300 of the present invention, and it comprises a dispenser 302, quantizer 304, prediction device 306, step size automatic regulator 308 and SNR counter 310.Dispenser 302 is cut apart voice signal X[n] form a plurality of frames, can utilize counter to write down the length of frame, 304 couples of difference DELTA X of quantizer quantize and produce a numerical code C[n] with quantize after difference signal Δ X ' [n], this difference signal Δ X is voice signal x[n] with estimate the difference of signal X ' [n], difference signal Δ X ' [n] after the quantification with estimate signal X ' [n] and after totalizer combines, produce a signal S and input to prediction device 306 to produce the new signal X ' [n] that estimates, step size automatic regulator 308 is according to numerical code C[n] and output one step change function M (I, C[n]) to quantizer 304.To voice signal X[n] in each frame when doing precoding, step size automatic regulator 308 provides various step change function and step-length higher limit, and utilize SNR counter 310 to calculate the SNR value of corresponding various step change functions and step-length higher limit, and then obtain the optimal step change function of each frame M (I) and step-length upper limit value M axStepsize (J), therefore, the step change function M (I that final step size automatic regulator 308 is determined, C[n]) and numerical code C[n] the table of comparisons also be the function of frame, because each frame all has its best step change function M (I, C[n]) and step-length upper limit value M axStepsize (J), tonequality improved so when coding, can reduce distortion.The step-length renewal process of system 300 is to decide next step value according to present coded data and frame, shown in the following formula (2):

Step_size (n+1)=step_size (n) * M (I, C[n]) ... formula (2)

Wherein, step_size (n) is present step value, and step_size (n+1) is next step value.

The system 300 of Fig. 5 can utilize the process control of software to realize on existing hardware structure.And, thereby the length L of frame and step change function M (I, C[n]) can revise with the step-length higher limit, to cooperate suitable voice signal X[n].

Fig. 6 is the system block diagrams of ADPCM decoder 400 of the present invention, and it comprises a de-quantizer 402, a prediction device 404 and step size automatic regulator 406.Step size automatic regulator 406 receives this numerical code C[n], and export step change function M (I according to this, C[n]), it is the function of speech data and frame, de-quantizer 402 is according to step change function M (I, C[n]) this numerical code of de-quantization C[n] and produce difference signal Δ X, this difference signal Δ X with estimate signal X ' [n] and after totalizer combines, revert to voice signal X[n], prediction device 404 is according to this voice signal X[n] produce this and estimate signal X ' [n].Similarly, step size automatic regulator 406 employed step change function M (I, C[n]) and numerical code C[n] the table of comparisons be along with the voice signal X[n that is imported] and the difference of frame and changing.

In different embodiment, the length L of frame can adopt revocable size, for example, cuts apart according to the scope and the variation of voice signal 100.

More than the narration done for preferred embodiment of the present invention be the purpose of illustrating, accurately be disclosed form and be not intended to limit the present invention, based on above instruction or to make an amendment or change from embodiments of the invention study be possible, embodiment has the knack of this operator and utilizes the present invention to select in practical application with various embodiment and narrate for explaining orally principle of the present invention and allowing, and technological thought attempt of the present invention is decided by following claim and equalization thereof.

Claims

1. one kind is improved the method that ADPCM voice coding step-length is regulated, and comprises the following steps:

Cut apart a voice signal and form a plurality of frames;

Each this frame of precoding in regular turn is to obtain the step change function with maximum signal to noise ratio and a step-length higher limit of each this frame correspondence; Formal this speech signal of coding and

Wherein, the step of this each this frame of precoding comprises the following steps:

Set a plurality of step-length higher limits;

Set a plurality of step change functions;

Calculate the signal to noise ratio (S/N ratio) of each this frame when different step change functions and step-length higher limit; And

Find out this step change function with maximum signal to noise ratio and this step-length higher limit of each this frame correspondence.

2. the method for claim 1, wherein this step of cutting apart a voice signal comprises with a regular length and cuts apart this voice signal.

3. the method for claim 1, wherein this step of cutting apart a voice signal comprises with an on-fixed length and cuts apart this voice signal.

4. ADPCM coded system comprises:

One dispenser is cut apart a voice signal to form a plurality of frames with a length;

One quantizer is estimated the difference of signal and is produced a numerical code to quantize this voice signal and; And

One step size automatic regulator, providing a step change function and step-length higher limit to this quantizer,

Wherein,

When each frame in the voice signal is done precoding, the step size automatic regulator provides various step change function and step-length higher limit, and utilize the SNR counter to calculate the SNR value of corresponding various step change function and step-length higher limit, and then obtain optimal step change function of each frame and step-length higher limit.

5. coded system as claimed in claim 4, wherein, this length is a fixed value.

6. coded system as claimed in claim 4, wherein, this length is an on-fixed value.

7. coded system as claimed in claim 4 more comprises a SNR counter, calculating the SNR value that a plurality of default step change functions and step-length higher limit are caused, and then finds out for each this frame and to have maximum S R value step change function and step-length higher limit.

8. coded system as claimed in claim 4, wherein, each this frame has the step change function and the step-length higher limit an of the best.

9. an ADPCM decoding system, it receives a numerical code, and to produce a voice signal, this decoding system comprises:

One de-quantizer produces a difference signal with this numerical code of de-quantization;

One totalizer is to estimate signal and to form this voice signal in conjunction with this difference signal and; And

One step size automatic regulator is for each frame provides a step change function and step-length higher limit to this de-quantizer.