We recall from section 2 that we may apply the
approximations in equations 3 and 4
provided that the limits of integration are much greater or lesser
than 1 or -1 respectively. To make the problem more tractable, we
presently restate the condition on the limits as: the limits of
integration are greater than or lesser than
or
respectively, where
is a constant we deem sufficiently
large. Now, given a specified frame length
and chirp signal
characterized by
, we may conceptualize these limits as
functions of
. To verify that these limits satisfy the
requirement, we must solve the inequalities
![]() |
(57) |
Concerning our choice of
, we note that choosing the ``risky''
yields fractional error bounds (deduced from
figure 2) of about 14%. Choosing
drops
this bound to 2.3%, and
leads to 1.3%. A decaying
exponential may be roughly fit to the fractional error
characteristic for purposes of choosing
based on the
fractional error tolerated.7
Now, given a desired accuracy, we may view the large limits
requirement either as a function of
or as a function of
. If the former, we may consider that the model is only
valid for certain bins. This is not a complete picture, however,
since we see below that in some cases, no choice of
will
satisfy equation 60. Thus we must also consider the
large limits requirement as a constraint on
values for
which we may consider the model valid. The relationship of each
of
and
to the large limits requirement is now
discussed separately.
Satisfying Requirements for the Large
Limits Approximation by Choice of
Despite small errors when
, equation 60 may
raise the concern that
has been restricted significantly. To
see if this is true, we compare the above bound to the positive
and negative frequency bins representing the maximum frequency
present in the signal,
(equation 15). To do this rigorously, we define
as a
constant between 0 and 1 indicating what fraction of
contains a valid range for our approximation. We can then solve
for
to determine if we have significantly restricted the range
of bins in which our approximation is valid. Expressing the
constrained
in equation 60 as
,
our constraint becomes:
![]() |
(59) |
![]() |
(60) |
We take a moment to consider consequences of this result. First,
We see a tradeoff between accuracy of the approximation (achieved
by requiring a large
value) and the size of the frequency
range in which our approximation is valid (represented by
).
Simply put, the higher quality approximation we wish to make, the
smaller the range in which we may consider the approximation
valid. The range of validity will become crucially important when
we backsolve for
.
Second, we note that certain values of
and
can lead
to
values outside the range (0,1]. A reasonably high
and low
in equation 63 may actually cause
,
suggesting that the approximation is valid outside the range
representing frequencies present in the signal. This case will
not give rise to any special analysis, as we do not find that
consideration of bins outside
enhances our analysis.
Furthermore, doing so strains our inverse midpoint integral
approximation above.
A more significant problem is that for a very small
, we
get
values less than or equal to zero, implying that no
frequency bins are valid for the approximation. Specifically,
solving for
we have that
Though it may not be obvious from inspecting this equation, this
is a nontrivial constraint on
for speech and music
signals, even when
. In such practical cases, the large
limits approximation is often invalid by this metric, and an
alternative model must be used.
To make this more concrete, we offer the example of a musical tone
of frequency 440 Hz, sounded with a 14 Hz frequency vibrato at 6
pulses per second. Listening to such a signal reveals that this
range and speed represent the largest musically realistic
(``Wagnerian opera-singer'') values for such parameters; anything
larger or faster sounds machine-like. A frequency range half this
size is in fact more typical (and to some, will sound more
tasteful). Thus, the
representing the underlying
quasi-linear piecewise frequency functions in our Wagnerian singer
signal is a representative maximum. Given these parameters, we
calculate the maximum
value by considering an FFT
analysis frame that happens to catch the ideal point in time
where, in one half vibrato pulse, the frequency range will change
14 Hz (from 433 Hz to 447 Hz) over a time period of 83.3 ms.
Assuming a sampling rate
Hz for convenience, we thus
have a 14 Hz frequency change over 833 samples, or .0168 Hz per
sample, yielding
. This in turn leads to
. Comparing this to the
minimum value required by equation 65 requires that we
choose
and
values, which we modestly require to be 0.75
and 1.5, respectively. Doing so leads to
. We thus see the razor-thin margin for validity
of this model; requiring greater accuracy in
or the more
``tasteful'' vibrato alluded to above would invalidate the model.
Hence, the need for modified models summarized in
appendix A and to be rigorously discussed in future
writing.
Effect of
on Satisfying the Large
Limits Approximation
We take a moment to further explore the relationship between
,
and the large limits approximation. As can be seen
from inspecting equations 65 and 58, there is
a nonlinear relationship between the limits and
, and
therefore between the validity of the large limits approximation
and
. In figure 6, we plot the limits as
a function of
for each of 9
values. In each plot, we
use a horizontal range of
, which necessarily is
different for each plot since
is a function of
. We recall that the lower and upper limits must be
outside -1 and 1 respectively in order to use the large limits
approximation, and see that these point have been marked by
horizontal dotted lines. The key observation here is that as
becomes small, the range over which the approximation
holds decreases, until no range remains. Hence, our abovementioned
claim that this approximation is not valid for very small
. We do, however, observe that the limits in those cases
fall in a range where other approximations may be used. This
is the modification alluded to just above, and will be covered in
future writing. Such modifications are also summarized in
appendix A.
|
Application of the Large Limits
Approximation
Having restricted
and
as above, our approximations
may be applied to
and
. Defining
and
for notational simplicity, we may now write
We now apply the trigonometric identities
| (68) | |||
| (69) | |||
| (70) |
![]() |
|||
![]() |
(71) | ||
![]() |
|||
![]() |
(72) |
After so much discussion, this is not an entirely satisfying
result; though the above equations are highly accurate, they are
cumbersome and not readily manipulated into an elegant closed form
expression for the phase of
. Fortunately, we realize
that the approximations can be further simplified if we can
eliminate the second and fourth sinusoidal terms in each. We
will show that this can be done in either of two ways:
In the next subsection, we consider the second method. Therein we also show a very elegant fact: two ways of conceptualizing the attenuation of the sinusoids are mathematically equivalent. Those methods are the liftering mentioned in the second bullet above, and consideration of the Hann window as a direct attenuator of Fresnel ringing in the time domain.