Comment on page

Nonlinear autocorrelation

These features capture nonlinear autocorrelation properties of a time series

CO_trev_1_num

This statistic computes computes the average across the time series of the cube of successive time-series differences. It will be close to zero for time series for which the distribution of successive decreases in the time series matches the distribution of successive increases, but will be positive if increases tend to be larger in magnitude and negative if decreases tend to be larger in magnitude.
The statistic is computed as:
(xi+1xi)3t,\langle (x_{i+1} - x_i)^3\rangle_t\,,
for time-series values, x, averaged across all possible time points, t (i.e., from index 1 to N-1, for a time series of length N).
It is based on a statistic used in nonlinear time-series analysis, cf. Surrogate time series, Schreiber and Schmitz, Physica D, 142: 346 (2000).
The original hctsa implementation is the code CO_trev(x_z,1) , for a z-scored time series, x_z.
For example, this Chua map time series has increments that are approximately symmetric (blue histogram), as are the cubic increments (orange histogram), and so this statistic has a value (the mean of the orange distribution) of approximately zero.
This simple flow has a multimodal distribution of successive increments (blue), with an asymmetry towards larger increases compared to smaller decreases. This is accentuated after taking a cube (orange), yielding a positive value of this statistic:
This Lozi map is an example of the opposite behavior: time-series decreases can be sudden and large in magnitude, whereas increases are relatively gradual. This leads to an asymmetry towards large negative values, leading to a negative value for this statistic.

CO_HistogramAMI_even_2_5

(NB: renamed in current hctsa library, as the feature named CO_HistogramAMI_even_5bin_ami2)
This feature is a nonlinear version of the autocorrelation function: using a nonlinear correlation metric (mutual information) instead of a conventional linear correlation metric, evaluated using a histogram with 5 bins and at a time delay τ = 2 (from the hctsa code CO_HistogramAMI(x_z,2,'even',5)).
This feature gives high values to time series like this Chaotic Web map:
, which has clear dependence structure of the time-series value at the current point,
xtx_t
, and the value two time points ahead,
xt+2x_{t+2}
, yielding a high value for this feature of 1.25:
This Tent map also has a clear (nonlinear) dependence of present time-series value with that two-steps into the future (and a moderate value of this feature, 0.473):
as seen in its embedding:
And a moderate value (0.25) for this heart rhythm:
But a low value (0.02) for this earthquake time series:

IN_AutoMutualInfoStats_40_gaussian_fmmi

This feature outputs a measure of the timescale of (potentially nonlinear) autocorrelation in the time series, as the minimum of the automutual information function. This is a conventional way of selecting the timescale for a time-delay embedding.
Automutual information is estimated here using a Gaussian assumption on the data (and is thus a nonlinear transformation of the linear autocorrelation function). The hctsa version maxes out at 40.