Although they are often compared, it seems like the FFT and FWT are apples and oranges. As I understand it, a more appropriate comparison would be the STFT (FFTs of small chunks over time) and the complex Morlet WT, since they're both time-frequency representations based on complex sinusoids (please correct me if I'm wrong). This is often shown with a diagram like this:

The left shows how the STFT is a bunch of FFTs stacked on top of each other as time passes (this representation is the origin of the spectrogram), while the right shows the dyadic WT, which has better time resolution at high frequencies and better frequency resolution at low frequencies (this representation is called a scalogram).

In this example, N for the STFT is the number of vertical columns (ω axis), and a single O(N log N) FFT operation calculates a single row of N coefficients from N samples.

What I don't understand: How many coefficients does a single O(N) FWT operation compute, and where are they located on the time-frequency chart above? Which tiles get filled in by a single computation?

If we calculate an equal-area block of time-frequency coefficients using both, do we get the same amount of data out? Is the FWT still more efficient than the FFT?

2 Answers
2

Well, since the wavelet transform is not precisely a time-frequency analysis but more a time-scale analysis one can not really display the wavelet coefficients in the time frequency plane. (However, even for the STFT the time-frequency plane is only a crutch for illustration purposes.)

Maybe it helps to think of $N=2^n$. Then you can analyze your signal into $n$ scales with the wavelet transform. See, e.g., this image with $n=3$:

Then the wavelet-transform stores the $c_{31}$ and all the $d_{ij}$'s. If you want to put the coefficients in the time-frequency plane you would have to stretch the tiles as you indicated in your figures. There would be $n$ vertical colums (in $\omega$), the rightmost having $2^{n/2}$ boxes and the number of boxes halving in each column (the last two containing just one box). This adds up to $2^n$.

Considering the efficiency: To be a bit more precise, the discrete wavelet transform (DWT) of a signal of length $2^n$ and a filter with length $k$ takes $O(k2^n)$, while FFT is $O(n2^n)$ with comparable constants. Hence, DWT is faster it the filter is not too long.

Is there really a difference between time-frequency and time-scale if using the Morlet wavelet? I think it's just a sinusoid with a gaussian window, so frequency still makes sense in this case. I'm trying to make the FT and WT as similar as possible to clarify the differences between them.
–
endolithMar 23 '11 at 20:12

I think I understand better now. My PyWavelets example would just be providing the output for Level 1? And to fill in the dyadic table, I'd need to do a multi-level DWT, which is documented here: pybytes.com/pywavelets/regression/multilevel.html It produces 4 outputs with 4, 2, 1, and 1 elements. So how does this connect with the claim "the complexity of the FWT algorithm is O(M). This compares to the FFT which is O(M*log(M))"? It seems there is more than one way to do an FWT.
–
endolithMar 23 '11 at 20:18

A single O(N) FWT computes as many output samples as there are inputs.

I'm not sure about the exact ordering of the coefficients, but for a Haar-transform
of length 8 is calculated in the following 3 steps:

Step 1: 8 -> 4:
4 samples should contain the (2x subsampled) result of a [+1,-1] filter.
These 4 will be stored in the final output
The other 4 samples contain the (2x subsampled) result of a [+1,+1] filter.
These 4 will be filtered again.

Step 2: 4 -> 2:
2 samples of the [+1,-1] filter as stored in the final output.
The other two will be filtered again.

Step 3: 2 -> 1:
The [+1,-1] filter yields 1 output sample. Since this is the last step, the remaining
sample of the [+1,+1] filter is also stored.