CMA-FSE ERROR SURFACE EXAMPLES
  
  P. Bert Schniter - 
  
  Cornell University Blind Equalization Research Group
The behavior of the Constant Modulus Algorithm (CMA) as used to 
adapt a Fractionally Spaced Equalizer (FSE) is illustrated 
below using simple examples.  Some important conclusions are then
drawn from these examples: (1) in the presence of noise or channel
undermodelling, proper initialization may be necessary to sucessfully 
equalize the channel, and (2) channels containing nearly reflected
roots exhibit high levels of noise gain.   
Background
In digital communications systems,
the path from source to receiver (i.e. the channel)
generally introduces dispersion 
and multipath, making the received pulses difficult or impossible
to detect without a high probability of error.  The use of equalization 
in the receiver attempts to combat these linear distortions imposed 
by the channel.  As a linear equalizer filtering the received signal,
usually a transversal FIR implementation 
is chosen.  When sampling the received signal at the system 
baud rate, we refer to the equalizer as baud spaced.  When
operating at a sample rate higher than baud (a typical choice being
twice the baud rate), we refer to the equalizer as fractionally 
spaced.  The use of FSEs is nearly ubiquitous in recent 
designs due to advantages in performance over baud spaced 
equalizers.  It can be shown [2] that perfect 
equalization is possible with FSEs longer than the channel 
delay spread for channels not exhibiting reflected 
roots. 
In addition to dispersion and multipath, the channel can introduce 
noise.  In general, the equalizer must make a compromise 
between compensating for linear channel distortions and attenuating noise.
In the presence of noise, the ability to perfectly equalize is lost.
In many cases, the receiver does not know the channel characteristics
(possibly because of a newly initiated connection or because of a 
rapidly varying channel).  To remedy this, it may be possible to 
transmit a known training signal
from which the receiver can extract the current channel characteristics.
In some situations, however, the amount of channel capacity 
used by this scheme
cannot be tolerated, and the receiver must instead determine the channel
characteristics blindly.  CMA is the most widely 
established algorithm for accomplishing blind equalization.  
Motivation
Similar to LMS, CMA can be considered a stochastic
gradient algorithm.  This means that the convergence of the equalizer
parameter vector under CMA can be viewed as a (stochastic) traversal of 
the CMA cost surface with average movement in the direction of
steepest descent.  Thus, great insights into the dynamical behavior of 
CMA can be gained from an understanding of the topology of the CMA cost
surface.  
What follows are simple two-dimensional examples that help to illustrate 
the salient characteristics of the CMA cost surface in the case of 
fractionally spaced equalization. 
Restricting the scope to two real-valued equalizer taps makes the surface 
possible to visualize as a 3D object and allows us to identify important 
features which also exist in the general (i.e. higher dimensional) case.
Illustrations of the CMA-FSE Cost Surface
In one regard, the goal of CMA
is to transfer a set of initial equalizer parameters to
the lowest point on the CMA cost surface.  As evident below, there may
be multiple locations on the cost surface attaining the minimum cost.  
As such, they represent equally desirable solutions.
All of the contour plots exhibit the following properties:  at the
origin there exists a maximum, surrounded by a ring of up to 
four minima.  Minima reflected across the origin correspond to
solutions differing only in sign, which is inconsequential when 
using differential coding.  Finally, as we travel further from the 
origin, the surface rises steeply in all directions.  
It should be noted that, though the following surfaces were created from
particular channels, 
they are intended to be prototypical of their 
respective channel classes.  For simplicity, BPSK was chosen as the 
model for the source statistics, though the thrust of the conclusions 
below applies to higher-order constellations (e.g. 64-QAM) as well.
|   | Perfect Equalizability with Well-Behaved ChannelIn the case of perfect equalizability, all four minima correspond to
the same (optimal) CMA cost of zero. Each minima reflects a 
particular combination of system delay and system polarity.  The
asterisk indicates the location of a particular Wiener solution.  
Since there is no noise, it appears coincident with the CMA cost 
minimum of corresponding system delay and polarity. | 
|   | Perfect Equalizability with Nearly-Reflected Channel RootsAs channel roots become more nearly reflected, the surface becomes 
more elongated.
Note the scaling on the vertical axis is 100 times that on the 
horizontal axis!  This characteristic will be
manifested as an extremely slow convergence of the 'f1' equalizer tap 
due to the shallowness of the gradient along that dimension. 
Theoretically, however, perfect equalizability is still
possible in the noiseless case.  In other words, all minima correspond 
to zero cost and are aligned with the respective Wiener solutions. | 
|   | Noisy but Well-Behaved ChannelWhen the well-behaved channel (i.e. the first plot above) is operated 
in a 7dB SNR environment, 
note the appearance of local minima distinct from the global 
minima. For particular initializations, local minima are capable 
of capturing the parameter
trajectory and preventing it from attaining an optimum setting.
Note that all minima correspond to costs greater than zero since perfect
equalization is not possible in the presence of noise. | 
|   | Noisy Channel with Nearly-Reflected RootsWhen the channel with nearly reflected roots (above) is operated in a 7dB 
SNR environment, we see a dramatic change in the cost surface.  
Note the equivalent scaling of horizontal and vertical axes; the cost surface
has become much less elongated than in the noiseless case (see the second 
picture in the series).  
Also note the merging of two pairs of
minima into one pair. For this channel, the optimal solution in the 
presence of noise is very different from the solution which perfectly 
equalizes the channel. | 
|   | Undermodelled but Well-Behaved Channel, No NoiseWhen the delay spread of the channel is longer than the delay
spread of the FSE, we lose the ability to perfectly equalize.  Thus,
none of the minima correspond to zero cost.  More importantly, note the
existence of local minima with greater CMA cost than the global minima.  
This implies that the equalizer initialization will determine the 
optimality of the resulting CMA solution. | 
 Conclusions 
The examples above emphasize a few facts.  First, the FSE-CMA cost surface
is multimodal, by virtue of the existence of multiple minima.  In the 
case where an FSE is able to perfectly equalize the channel, all minima 
correspond to the same optimal CMA cost.  Furthermore, it can
be shown that these multiple minima occur very near to the Wiener solutions
corresponding to particular combinations of system polarity and system
delay [3].  By system delay, we mean the delay through the cascaded channel
and equalizer.  By sytem polarity, we mean {+,-} in the case of real
signals or {+,-,+j,-j} in the case of complex signals.  
Next, channels with nearly-reflected roots will most likely exhibit
very slow FSE parameter convergence rates in the absence of noise.  Adding the 
effects of noise, the FSE may not even come close to equalizing the channel.
Well behaved channels are not as sensitive to noise as those
with nearly reflected roots.  
 
Any time noise is present, though, 
we can expect local minima.  This implies the potential for
convergence to non-optimal CMA solutions and places importance on 
the choice of initialization.  The same holds true for FSEs that 
undermodel the channel, even for well-behaved channels in the absence
of noise.  Initialization can be extremely important if, in these cases,
there exist bad local minima which do not correspond to  open
eye solutions.  By open eye, we mean that the channel has been 
equalized enough to successfully transfer adaptation to a decision-
directed equalization scheme.  If CMA converges to one of these
bad local minima, there is little chance of successfully demodulating the
incoming data.  In such cases, the system must be restarted, possibly with
a different initialization, in the hope that CMA will converge to an 
open-eye solution. 
 Questions for Practitioners 
The examples presented in this demonstration help to motivate the
search for techniques that avoid the problems associated with CMA's
sensitivity to poor choices of initialization.  In practice, it is
typical to use a center-spike initialization, whereby the center 
equalizer tap is made unity and all other taps are made zero.
Though originally suggested in [1], we know of no theoretical work 
that has rigorously defended this technique.
The question remains as to how well the center-spike initialization 
really works.  We have received reports of the center-spike method 
failing on occasion, but have not received any specific examples.  
Can anyone provide us with some?
 References 
[1] D.N. Godard, "Self-Recovering Equalization and Carrier Tracking in
Two-Dimensional Data Communication Systems," IEEE Trans. Commun.,
Vol. COM-28, No.11, Nov. 1980.
[2] J.R. Treichler, I. Fijalkow, and C.R. Johnson, Jr., "Fractionally
Spaced Equalizers - How long should they really be?," Signal 
Processing, pp.65-81, May 1996.
[3] H.H. Zeng, L. Tong, and C.R. Johnson, Jr., "Relationships Between
CMA and Wiener Receivers," Submitted to IEEE Trans. on Inform.
Theory.
 Further Reading 
The examples and discussion above are drawn from a 
technical report written by members of Cornell's 
Blind Equalization Research Group (BERG).  The report includes
a derivation of the equations respresenting CMA-FSE cost with arbitrary
source kurtosis (i.e. for any constellation) and in the presence
of additive white gaussian channel noise.
 Matlab Code 
The 
BERGulator can be used to generate these surfaces and others.