|
ISSN 1214-9675 Server vznikl za podpory Grantové agentury ČR. 21. ročník |
Témata
Doporučujeme
Kontakt
|
On Robust Control of Adaptive Algorithms for Acoustic Echo CancellersVydáno dne 31. 12. 2008 (8381 přečtení)The main purpose of the paper was to investigate robust adaptive echo cancellation algorithms in hands-free scenarios to achieve an understanding of its performance and also its effects on Double-talk (DT). 1. IntroductionDT is a major issue present in acoustic echo cancellation (AEC). It occurs when both the far-end and the near-end speakers speak simultaneously. Many attempts have been made to alleviate the isruptive effects of DT; these include the use of DT detection schemes, robust daptive AEC, step-size control, state-space representation ([1], [2]). The scheme f extending robust algorithms to an acoustic hands-free environment as proposed in [3] became the motivation for this paper.
Robust adaptive echo cancellation in particular, is an interesting approach
because if appropriately combined with a good double-talk detector (DTD) it
could offer a noteworthy solution in overcoming the problems of DT [2]. This
approach was used for line echo cancellers (LEC) [3]. Thus, by extending the
method to acoustic hands-free scenarios, it may prove very valuable in the
long term goal of eliminating the problems caused by DT. 2. Adaptive Algorithms and NotationsIn derivations and descriptions the following notations are used (see also Fig. 1). Fig. 1 Block diagram of the echo canceller and DTD [3]. The excitation vector is denoted xn=[xn,...,xn-L+1]T, where xn is the far-end speech signal. vn+ wn is the background noise mixed with the near-end speech. The microphone signal is denoted yn. The residual echo is en=yn-hnTxn, where hn=[h0,n,...,hL-1,n]T is the estimated echo path. Here L is the length of the adaptive filter. 2.1 The Proportionate Algorithms
The step-sizes in the PNLMS algorithm are calculated from the last estimate of the filter coefficients so that a larger coefficient receives a larger weight, thus increasing the convergence rate of that coefficient. This has the effect that active coefficients are adjusted faster than non active coefficients (i.e. small or zero coefficients). This algorithm is described by the following equations:
Gn is a diagonal matrix which adjusts the step-sizes, μ is the overall step-size parameter, and δ is a regularization parameter which prevents division by zero and stabilizes the solution when speech is used as input signal. The diagonal elements of Gn+1 are calculated as follows, [4],
Parameters δp and ρ are positive numbers with typical values δp = 0.01, ρ= 5/L. A variant of this algorithm is called the PNLMS++ [5]. There, for odd-numbered time steps the matrix Gn is derived as above, while for even-numbered steps it is chosen to be the identity matrix ( Gn = I), which results in the NLMS algorithm. The alternation between NLMS and PNLMS iterations has several advantages compared to using just the PNLMS technique (in the case of pure PNLMS, as the large weights adapt, the remaining small coefficients adapt at a rate slower than NLMS), e.g., the PNLMS++ algorithm is much less sensitive to the assumption of a sparse impulse response. 2.2 The Robust NLMS and PNLMS++ Algorithms
The NLMS and PNLMS++ algorithm can both be made robust to large disturbances by modification of the criteria on which they are based [3]. The robust NLMS algorithm is given by,
where Ψ(.) is a limiter function,
and the adaptive scale factor, sn, is defined as,
where λ controls the scale factor. β is a normalization constant, it dependents on k0,
where erfc(.) is a complementary error function defined as,
As with the Geigel DTD (which is an essential component in the robust algorithms) [8], it is useful to introduce a hangover time to control scale updating. When the DTD detects DT, adaptation of sn should be inhibited for a specific time, preferably al long as the DTD hangover time, Thold, [10]:
The PNLMS algorithm can be made robust in an exactly analogous manner, yielding the equation
Alternating the iterations with Gn as given in (3) and the identity matrix then yields the robust PNLMS++ algorithm. 2.3 The Robust Proportionate APA (PAPA)
Let yn=[yn,...,yn-p+1]T, be a vector of samples yn and Xn=[xn,...,xn-L+1]T the excitation matrix, where p is the projection order. A residual echo vector en=yn-XnThn, and PAPA is then given by,
where Gn is as defined in equation (2) and R
is a weighted estimate of the inverse correlation matrix of the input
signal. This matrix “whitens” the input data, Xn, and thus the
convergence rate of the adaptive filter is increased. With Gn = I
and δ=0, the equation (14)
reduces to the standard APA [6].
where ☺ denotes elementwise multiplications and |.| in |en| operates on the individual elements. 3. Investigation of Robust ParametersThis section examines the relationship and function of the robust algorithm. In the robust PAPA Ψ(en) is an P-by-1. The scale factor sn uses the first element in the limiter matrix,
From a glance at the equations, the parameters β and λ act as a weighting on the previous scale factor values and the current error value. This relationship between the parameters and the current error value can be demonstrated through the substitution of equation (14) into (15), yielding,
If the error is small, meaning no double talk is active; the minimum function will favour the first term and this results in following equation,
From equation (17), it can be deduced that when the error is small, the
algorithm uses the previous scale factor and the current error to compute
the scale factor which will be used in the next iteration.
In this case, the scale factor relies on the previous scale factor and k0. This implies that when DT occurs, the scale factor will use a smaller k0 value instead of the larger error value. 4. SimulationsThe simulations were provided using MATLAB environment. The robust PAPA was simulated under user defined acoustic scenarios. The data used comprised of a real car channel (length of 2048 taps), female and male synthetic speech. 6 dB was used for DT as the ratio between the far-end speaker and the near-end speaker and a SNR of 20 dB was used for the noise as illustrated in Figure 2. The two main factors that were considered in the investigation of adaptive algorithms were rate of convergence and robustness.The misalignment (it was the main focus of this paper) is given by,
ε=||h-hep||/||hep||, where
hep is the true echo path. The scale estimate in (7),
sn , is never allowed to become smaller than 2. This inhibits bad behaviour in low noise situations. Fig. 2 Echo signal, local speaker and background noise. Fig. 3 Misalignment for k0. Fig. 4 Misalignment for β. Fig. 5 Misalignment for λ. 5. ConclusionThe parameter k0 was the main robust parameter, which affected the level of divergence in the algorithm DT occurred. This parameter was the key to the limiter function, which limits the magnitude in the filter weight adaptation. The other two robust parameters β and λ could be regarded as fine-tuning parameters, which allowed users to adjust the level of robustness integral to the robust adaptive algorithms. The recommended values suggested in ([3], [10]) were found to be relevant to the acoustic environment used in the simulations. An increase in the values of β and λ was found to increase the memory of the algorithm, meaning that the algorithm demonstrated slower convergence and reacted slower to changes in the input signal. Conversely, decreasing the values of β and λ also decreased the memory of the algorithm thus allowing it to track changes more readily, as it was less robust; the algorithm was found to diverge when double-talk occurred. It was concluded that to configure the robust adaptive echo cancellation algorithms, the required degree of robustness should be known such that one can set the correct parameters. As allowing more robustness would decrease the convergence of the algorithms, it was very important to find a balance between the two key factors. This paper has originated thanks to the support from the Ministry of Education, Youth and Sports of Czech Republic within the project MSM6840770014. Literature
[1] C. Breining, "Control of a hands-free telephone set", Elsevier Signal
Processing, vol. 61, pp. 131-143, 1997. Autor: K. Sakhnov, B. Šimák Pracoviště: České vysoké učení technické v Praze, FEL |
Zprávy
UPOZORNĚNÍ
Činnost serveru byla ukončena.
|
Tento web site byl vytvořen prostřednictvím phpRS - redakčního systému napsaného v PHP jazyce.
Na této stránce použité názvy programových produktů, firem apod. mohou být ochrannými známkami
nebo registrovanými ochrannými známkami příslušných vlastníků.