Appendix A. Simulation study of wavelet coefficient regression.
To illustrate why strong associations
between variables are observed at some scales and not others, we consider how
wavelet coefficients from sequences sharing similar patterns covary at different
scales. We begin with a few definitions, following the notation of Percival
and Guttorp (1994). Given a sequence
,
we can quantify the pattern exhibited at a particular scale
in terms of its Allen variance
|
|
(A.1) |
where
.
If a sequence has a large Allen variance a particular scale
,
but is small at all other scales, then we say that the sequence is patchy or
patterned at scale
.
The Haar wavelet transform of the sequence
yields wavelet coefficients
|
|
(A.2) |
where
is the scale of the transform and
is the translation along the sequence. It is straightforward to show that
|
|
(A.3) |
Thus, a large wavelet-coefficient variance at a given scale equates to a large Allan variance at that scale, which by definition, indicates the presence of pattern at that scale. Similar arguments extend this result to other wavelet bases as well (Percival and Guttorp 1994).
We now consider the wavelet-covariance
of two sequences patterned at the same scale. Let
be a sequence of length
with a large Allen variance at scale
and small Allen variance at all other scales. Furthermore, let
be a sequence that inherits the pattern in
through the relationship
,
where
represents uncorrelated random errors. We can partition the covariance relationship
between
and
into a hierarchy of scales
via the wavelet transform
.
It follows that
|
|
|
|
|
|
|
|
||
|
|
|
(A.4) |
Hence, the wavelet-covariance of
and
will reach its maximum when
,
i.e., at exactly the scale at which the variance of the wavelet coefficients
of
is maximal and the scale at which
and
exhibit a shared pattern. We can easily extend these results to multivariate
regression. It is the increase in covariance at certain scales that generates
scale-specific models in our analysis.
To validate the approach of wavelet-coefficient regression (WCR), we applied the method to simulated data with known properties. In the following, we present these results.
We generated a test data set consisting
of three independent variables and one dependent variable. The data are shown
in Fig. A1. The three independent variables
,
,
and
were generated as 1-dimensional periodic sequences, each with 1024 samples.
We used a sine function to generate the values, but altered the periodicity
with each sequence (we used periods of 8, 32, and 128 for the three variables).
The independent variable was generated by simply summing the dependent variables
and adding random noise. We used normally distributed errors with a standard
deviation equal to 0.5.
We then wavelet transformed the simulated data using the Daubechies Least-Asymmetric wavelet (Daubechies 1992) of length 8, generating six new datasets representing the decomposition at first six levels. As described in the main text, we then selected a linear model by minimizing the AIC score across stepwise removals and additions of the independent variables.
Table A1
shows the results of the model selection and regression. Parameter estimates
are given at each level of the transform; where an estimate is missing, that
variable was omitted from the model by the model selection routine (in a couple
of cases, the selected model generated a singularity in the estimation procedure
and so the variable responsible for the singularity was also dropped). Clearly,
the model selection approach is identifying the scales at which independent
and dependent variables interact. Level 2 highlights the association with
.
Level 4 includes
,
and Level 6 shows a strong covariance between
and
.
Generally, the best fit occurs when the scale of the transform (defined as
where
is the transform level) is 1/2 the period of the input pattern. Often, there
is also a less strong relationship at 1/4 the period, probably owing to resonance
with the larger scale pattern.
TABLE A1. Regression results for simulated data.
|
|
|
|
|
|||||
|
Level |
Estimate |
P |
Estimate |
P |
Estimate |
P |
Value |
P |
|
0 |
|
|
|
|
|
|
2086 |
|
|
1 |
|
|
20.36 |
|
||||
|
2 |
|
|
|
0.084 |
43.19 |
|
||
|
3 |
|
|
28.32 |
|
||||
|
4 |
|
|
1608 |
|
||||
|
5 |
|
0.001 |
13.51 |
0.001 |
||||
|
6 |
|
|
733.8 |
|
||||
Daubechies, I. 1992. Ten lectures on wavelets. CBMS-NSF Regional Conference Series in Applied Mathematics, Society for Industrial and Applied Mathematics, Philadelphia, Pennsylvania, USA.
Percival, D. B., and P. Guttorp. 1994. Long-memory processes, the allan variance and wavelets. Pages 325334 in E. Foufoula-Georgiou, editor. Wavelets in geophysics. Volume 4 of Wavelet analysis and its applications. Academic Press, San Diego, California, USA.