# Subsampling Inference for the Autocorrelations of GARCH Processes

Subsampling Inference for the Autocorrelations of GARCH Processes Abstract We provide self-normalization for the sample autocorrelations of power GARCH(p, q) processes whose higher moments might be infinite. To validate the studentization, whose goal is to match the growth rate dependent on the index of regular variation of the process, we substantially extend existing weak-convergence results. Since asymptotic distributions are non-pivotal, we construct subsampling-based confidence intervals for the autocorrelations and cross-correlations, which are shown to have satisfactory empirical coverage rates in a simulation study. The methodology is further applied to daily returns of CAC40 and FTSA100 indices and their squares. In the exploratory analysis of time series, it is common practice to examine the sample autocorrelations (acs) of the observed data (suitably transformed to remove nonstationarity) X1,X2,…,Xn ⁠, to see whether the process differs significantly from white noise. For financial time series, such as log-returns, there is also interest in studying the acs of the squared data. In order to ascertain whether a sample autocorrelation at a particular lag differs significantly from zero, it is necessary to obtain an accurate construction of the parameter’s confidence interval, and this in turn requires some knowledge of the asymptotic behavior of the sample acs of {Xt} (or of {Xt2} ⁠). For linear processes with finite variance, the sample acs (under mild conditions) are asymptotically normal with the asymptotic variance-covariance matrix given by the standard Bartlett’s formula (Brockwell and Davis, 1991), whereas for nonlinear processes with potentially infinite variance the asymptotic behavior is much more complex. In this article we focus on nonlinear processes, namely, the popular class of GARCH(p,q) processes {Xt} (Bollerslev, 1986), which are widely used for modeling financial log-returns. For GARCH processes, the asymptotic normality of the sample acs with the standard n-rate holds if E[Xt4]<∞ ⁠. Since the marginal distributions of GARCH processes are regularly varying with index κ>0 (Davis and Mikosch, 1998; Mikosch and Stărică, 2000; Basrak, Davis, and Mikosch, 2002), this moment requirement is met if κ > 4. However, the asymptotic variance-covariance matrix can no longer be expressed via the standard Bartlett’s formula, and instead is given via the general Bartlett’s formula (Chapter 5 of Francq and Zakoian, 2010; Kokoszka and Politis, 2011). Nevertheless, this matrix can be consistently estimated (Francq and Zakoian, 2010; Kokoszka and Politis, 2011) so long as E[Xt4]<∞ ⁠. This is in tension with the empirical evidence suggesting that the tails of financial log-returns are heavier, having an infinite fourth moment (Mittnik, Paolella, and Rachev, 2002; Carrasco and Chen, 2002; Tully and Lusey, 2007). Statistical inference for the acs of GARCH processes in the absence of a finite fourth moment (2 < κ < 4) is rather challenging. Under this scenario, the convergence rate of the sample acs is slower than the classical n and is determined by the abovementioned index κ, which depends on the model parameters and the distribution of the innovations {Zt} and is difficult to estimate (Wagner and Marsh, 2004; Baek et al., 2009). Closed-form expressions for κ exist only for ARCH(1) and GARCH(1,1). Additionally, if E[Zt4]=∞ ⁠, estimation of model coefficients via the quasi maximum likelihood poses difficulties (nonstandard rates of convergence and non-normal asymptotics) and other methods have been proposed in the literature (Hall and Yao, 2003; Peng and Yao, 2003; Huang, Wang, and Yao, 2008). The limit distributions of sample acs when 2 < κ < 4 involve the infinite-variance stable laws (Basrak, Davis, and Mikosch, 2002) and thus the asymptotic quantiles cannot be determined analytically. When 0 < κ < 2, the population acs of GARCH processes are not well-defined and the sample acs are inconsistent. For similar reasons, the asymptotics and the convergence rates for the sample acs of squares of GARCH processes also show trichotomous behavior, which is subject to κ>8, κ∈(4,8) and κ∈(0,4) ⁠, respectively. Our primary goal in the current article is to construct confidence intervals for the acs of GARCH(p,q) processes, of their squares and of the cross-correlations between the process and its squares. Recall that the acs of a GARCH process are zero, whereas the acs of squares decay with geometric rate. Moreover, as shown herein, the cross-correlations between values and squares for the GARCH process are zero whenever the marginal distributions are symmetric. Therefore, any empirical evidence against these three features will render dubious the hypothesis that the data’s dynamics can be adequately captured through a GARCH model. Thus, our procedures for the construction of confidence intervals can be viewed as several misspecification tests for the GARCH hypothesis. Also in the case that the acs of the absolute values of the process are of interest, we establish convergence results for power GARCH (PGARCH) processes. In prior literature, Kokoszka, Teyssière, and Zhang (2004) compared several resampling methods of constructing confidence intervals for lag-1-autocorrelation of squares in GARCH-type models, recommending residual bootstrap as the best approach. In contrast, we examine all the lags of the acs, and employ a nonparametric approach that combines the concepts of self-normalization and subsampling. This approach is valid irrespective of whether the asymptotic distributions for the acs are Gaussian or not (i.e., without assuming finiteness of the fourth moment), and does not require knowledge of model orders p and q. Our procedure does not involve parameter estimation, which under some scenarios can be particularly troublesome, for example, when the error distribution is heavy-tailed with an infinite fourth moment. Self-normalization (McElroy and Politis, 2007; Jach, McElroy, and Politis, 2012) addresses the issue of parameter-dependent convergence rates and is accomplished by dividing the sample ac of {Xt} (resp. {Xt2} ⁠) by a quantity that correctly matches its asymptotic growth rate, without knowing a priori whether the fourth (resp. eighth) population moment is finite or not. We show that the fourth (resp. eighth) sample moment is suitable for such studentization; we also provide a studentization for the cross-correlations. The identification of suitable studentizations is nontrivial, and is a novel facet of this work. In order to validate this technique, it is necessary to substantially extend some of the weak-convergence results of Mikosch and Stărică (2000), which is a stand-alone contribution of this article. Clearly, self-normalization can only resolve half of the problem—namely, eliminating the need to know the convergence rate to compute the studentized statistic—since the limit distributions will be unconventional and non-pivotal. Subsampling (Politis, Romano, and Wolf, 1999) can be used to empirically estimate the quantiles of the sampling distribution. This scheme operates by computing the same statistics on a small subsample—typically a contiguous stretch—drawn from the original time series data, with the unknown parameter being replaced by its best large-sample estimate. Consistency of the resulting empirical distribution for the sampling (or asymptotic) distribution is typically established through a strong mixing assumption, together with strict stationarity, which in the context of GARCH processes is immediate. The article is organized as follows: Section 1 develops the statistical methodology of self-normalization for sample acs, as well as other unobserved quantities. While these results are not of direct statistical applicability, they are necessary for establishing the subsequent subsampling methodology. Section 2 provides a detailed asymptotic theory, upon which the statistical methodology relies. An application to stock returns is given in Section 3, while Section 4 concludes. Simulations that explore finite-sample performance of the subsampling estimators, as well as proofs of technical results, are provided in the appendices (see Supplementary Data). 1 Self-normalization for Autocovariances and Autocorrelations 1.1 Process The GARCH process satisfies Xt=σt Zt for an iid sequence {Zt} that are only assumed to be symmetric about zero, and σt2=α0+∑j=1pαjXt−j2+∑j=1qβjσt−j2. (1) We refer to {σt} as the volatility process. A discussion of the conditions for stationarity are summarized in Lindner (2009); necessary and sufficient conditions in terms of the process’ Lyapunov exponent are given in Bougerol and Picard (1992), and a sufficient condition—in the case that Z0 has finite variance—in terms of the coefficients αj and βj is given in Bollerslev (1986). In this article, we consider {Zt} that are heavy-tailed, but are interested in stationary GARCH processes—see the discussion in Remark 3.2 of Basrak, Davis, and Mikosch (2002) (henceforth BDM) for stationarity conditions. Theorem 3.1 of BDM discusses the properties of the GARCH process, and in their Corollary 3.5 they show that Ut=(Xt,σt) has heavy-tailed marginal distributions of index κ, for some κ > 0 that depends on distributional properties of Z0 and the GARCH coefficients in a complicated fashion. The bivariate process {Ut} is also strong mixing with geometric rate, and is regularly varying with some rate an. The rate an is related to the tail index, being given by an=c n1/κ for some constant c > 0—see Remark 2.1 of BDM. The PGARCH process {Xt} is defined via Xt=σt Zt together with volatility satisfying |σt|ν=α0+∑j=1pαj|Xt−j|ν+∑j=1qβj|σt−j|ν, (2) where ν > 0 is the exponent of the process (ν = 2 corresponds to the GARCH). For references on PGARCH, see Mittnik, Paolella, and Rachev, 2002; Carrasco and Chen, 2002; and Tully and Lusey, 2007. Interest focuses on the transformed variables Yt=|Xt|ν   and  St=σtν. (3) Define the following sample quantities for k integer: γ^Y,Y(k)=n−1∑t=1nYtYt+kγ^Y,S(k)=n−1∑t=1nYtSt+kγ^S,S(k)=n−1∑t=1nStSt+k. (The latter two quantities are not statistics, because {St} is not observed.) Also γ^S,Y(k) is defined by swapping the order of Y and S. Up to negligible errors (i.e., terms that converge to zero in probability—see below for the precise discussion), γ^Y,Y(−k)≈γ^Y,Y(k) ⁠, γ^Y,S(−k)≈γ^S,Y(k) ⁠, and γ^S,S(−k)≈γ^S,S(k) ⁠. If we remove the  ^  symbol, we refer to expectations (whenever these exist), and the above relations in the lags are exact. Because X0 is regularly varying of index κ, the autocovariances and cross-covariances for {Yt} and {St} exist whenever ν < κ/2. We are interested in weak convergence of the so-called roots defined by γ˜Y,Y(k)=γ^Y,Y(k)−γY,Y(k) ⁠, and secondarily in weak convergence of the analogous quantities for the volatilities S and cross-covariances. Although the latter roots involving the volatilities are theoretical quantities, the distribution of γ˜Y,Y(k) depends upon γ˜Y,S(k) and γ˜S,S(k) ⁠, so it behooves us to analyze these objects together. When κ < 2ν the theoretical quantity γY,Y (k) does not exist, and the root is just defined via γ˜Y,Y(k)=γ^Y,Y(k) ⁠. We extend this definition to γ˜Y,S(k) ⁠, and γ˜S,S(k) in the obvious fashion. The appropriate rate of convergence actually depends on the regular variation rate an. Trivially, regular variation for the νth power implies a rate of anν for Yt, and the product of two variables (in the autocovariances or cross-covariances) indicates a rate of an2ν ⁠. In the case of a GARCH(1,1), previous work indicates that nan−4γ˜Y,Y(k) converges weakly to a nondegenerate random variable, jointly in k, when κ < 8; here we utilize the rate an2ν in lieu of an4 (i.e., replacing the exponent 2 by ν). When κ > 2ν the autocovariances and cross-covariances exist, and centering of the sample quantities becomes possible, but when κ < 2ν no centering is necessary. 1.2 Self-normalization For our applications we investigate sample autocovariances, acs, and cross-covariances and cross-correlations for the process {Xt} with its powers {Yt} ⁠, where Yt=|Xt|ν ⁠. Recall that γX,X(h) = 0 for h≠0, whenever κ > ν. With the notation for γ^(k) of the previous subsection, and with ρ^(k)=γ^(k)/γ^(0) by definition, we here study γ^X,X, ρ^X,X, γ^X,Y, ρ^X,Y, γ^Y,Y, ρ^Y,Y. In the case of the cross-correlation, the normalization is γ^X,X(0)γ^Y,Y(0) ⁠. In each case, there are rates of convergence (sometimes with a mean centering) for each statistic with the results dependent on κ and ν. Since we are interested in developing simple studentizations for the statistics, we prove joint results involving absolute sample moments, abbreviated by μ^Xj=n−1∑t=1n|Xt|j for j real. Here we discuss self-normalized statistics, such that the studentized quantity’s rate of convergence does not depend on unknown quantities. First consider self-normalization for the acs of a GARCH(p,q) process. It follows from Theorem 2 below that cn−1nρ^X,X(k)=OP(1) ⁠, where cn={n1 if κ∈(0,2)an2 if κ∈(2,4)n1/2 if κ∈(4,∞). Excluding κ∈{2,4} ⁠, the rate cn equals a constant times n raised to the power 1∧(2/κ∨1/2) ⁠. A suitable choice for the self-normalization that matches this growth rate cn is σ^X,X=(n−1+(nμ^X4)−1/2)−1, as is demonstrated in Theorem 5 below. Similarly, the growth rate for the acs of powers takes the form of n raised to the power 1∧(2ν/κ∨1/2) ⁠, and hence we can normalize with σ^Y,Y=(n−1+(nμ^Y4)−1/2)−1. The case of cross-correlations is more complex. In the case that ν∈(0,1) ⁠, the growth rate cn takes the form of n raised to the power [1∧(1/2+ν/κ)]∧[(1+ν)/κ∨1/2] ⁠. Therefore, we can use the studentization σ^X,Y=(n−1+n−1/2 (nμ^X2)−ν/2+(nμ^X2Y2)−1/2)−1. On the other hand, if ν≥1 then the growth rate cn takes the form of n raised to the power [1∧(1/2+1/κ)]∧[(1+ν)/κ∨1/2] ⁠. Then we can use the studentization σ^X,Y=(n−1+n−1/2 (nμ^Y2)−1/2ν+(nμ^X2Y2)−1/2)−1. Each self-normalization converges jointly with the respective correlation statistics, and is bounded in probability under the assumptions of Theorem 5 given below. 1.3 Subsampling We now proceed to the statistical portion of the article, namely conducting inference for the process’ (and its powers’) acs. For a PGARCH(p,q) process, the acs and cross-correlations with the powers are zero, while the acs of the powers decay exponentially. Our testing paradigm is as follows: we assume as null hypothesis that the process is PGARCH(p, q) and check whether the subsampling-based confidence intervals capture zero (for the process acs and for the cross-correlations with the powers) or decay exponentially fast (in case of the powers). In both cases, we replace the value of the population parameter in the root by its full-sample estimate, although in the former case we have the choice of substituting it with zero. So if zero is not contained in a 1 – α level confidence interval for the acs or cross-cs at some lag, then we can reject the GARCH hypothesis with Type I error rate α. We begin with the definitions, and then consider consistency of the estimators. To construct confidence intervals for ρX,X(k) (although for a PGARCH process this parameter is always zero) we need to approximate the sampling distribution of TX,X(k)=n (ρ^X,X(k)−ρX,X(k)σ^X,X), that is, LX,X,k(x)=ℙ[TX,X(k)≤x] ⁠. Let L∞,X,X,k denote the cumulative distribution function (cdf) of the corresponding limiting random variable—which by Equation (17) below depends on κ. The use of this asymptotic distribution is impractical, as L∞,X,X,k depends on the unknown parameter and there is no known analytic formula for it. Hence, we propose to approximate LX,X,k (and L∞,X,X,k) nonparametrically via subsampling (Politis, Romano, and Wolf, 1999). According to this procedure we divide the sample into overlapping blocks of size b (⁠ b→∞, b/n→0 ⁠), containing Xt,Xt+1,…,Xt+b−1 for t=1,2,…,n−b+1 ⁠, and calculate the self-normalized statistic upon each block, treating each block as if it were a full sample. Moreover, the parameter ρX,X(k) is replaced by its large-sample estimate ρ^X,X(k) ⁠. This produces n – b + 1 subsampling statistics TX,X,t(k)=b (ρ^X,X,t(k)−ρ^X,X(k)σ^X,X,t), where ρ^X,X,t(k)=γ^X,X,t(k)/γ^X,X,t(0) with γ^X,X,t(k)=1b∑ℓ=tt+b−1−k(Xℓ−X¯t)(Xℓ+k−X¯t) and X¯t=∑ℓ=tt+b−1−kXℓ/b ⁠. Note that we could also replace ρ^X,X(k) by zero, if we wish to utilize the “null hypothesis” that the process is PGARCH, but we elect instead to utilize a large-sample estimate, which makes the confidence interval construction more intellectually consistent with the case of the powers’ acs. The normalization σ^X,X,t is given by σ^X,X,t=(b−1+[∑ℓ=tt+b−1−kXℓ4]−1/2)−1 and the sampling distribution LX,X,k(x) is approximated by L^X,X,k(x)=1n−b+1∑t=1n−b+11{TX,X,t(k)≤x}. If by cX,X,k(1−p)=inf⁡{x: L^X,X,k(x)≥1−p} we denote its lower 1 – p quantile, we obtain for ρX,X(k) a (1−p) equal-tailed subsampling confidence interval CIet;1−p(ρX,X(k)) ⁠, defined as [ρ^X,X(k)−σ^X,Xn cX,X,k(1−p/2),ρ^X,X(k)−σ^X,Xn cX,X,k(p/2)]. (4) Introducing L^X,X,k,|·|(x)=∑t=1n−b+11{|TX,X,t(k)|≤x}/(n−b+1) ⁠, another related distribution, and its quantile cX,X,k,|·|(1−p)=inf⁡{x: L^X,X,k,|·|(x)≥1−p} offers an (1 – p) symmetric subsampling confidence interval for ρX,X(k) ⁠, CIs;1−p(ρX,X(k))=[ρ^X,X(k)∓σ^X,Xn cX,X,k,|·|(1−p)]. (5) For the powers {Yt} ⁠, we have the following, analogous definitions, starting with the statistic and its subsampling version TY,Y(k)=n (ρ^Y,Y(k)−ρY,Y(k)σ^Y,Y), TY,Y,t(k)=b (ρ^Y,Y,t(k)−ρ^Y,Y(k)σ^Y,Y,t), where ρ^Y,Y,t(k)=γ^Y,Y,t(k)/γ^Y,Y,t(0) with γ^Y,Y,t(k)=1b∑ℓ=tt+b−1−k (Yℓ−Y¯t)(Yℓ+h−Y¯t) and Y¯t=∑ℓ=tt+b−1−kYℓ/b ⁠. Unlike the previous case of the regular acs, we do not presume that ρY,Y(k) equals zero, so we must estimate it instead. The normalization σ^Y,Y,t is given by σ^Y,Y,t=(b−1+[∑ℓ=tt+b−1−kYℓ4]−1/2)−1. The sampling distribution LY,Y,k(x)=ℙ[TY,Y(k)≤x] is approximated by L^Y,Y,k(x)=1n−b+1∑t=1n−b+11{TY,Y,t(k)≤x}, and is an estimator of L∞,Y,Y,k(x) ⁠, the cdf of the limit variable given in (18). For ρY,Y(k) a (1−p) equal-tailed subsampling confidence interval CIet;1−p(ρY,Y(k)) is given by [ρ^Y,Y(k)−σ^Y,YncY,Y,k(1−p/2),ρ^Y,Y(k)−σ^Y,YncY,Y,k(p/2)], (6) where cY,Y,k(1−p)=inf⁡{x: L^Y,Y,k(x)≥1−p} ⁠. An (1−p) symmetric subsampling confidence interval for ρY,Y(k) is then CIs;1−p(ρY,Y(k))=[ρ^Y,Y(k)∓σ^Y,Yn cY,Y,k,|·|(1−p)], (7) with cY,Y,k,|·|(1−p)=inf⁡{x: L^Y,Y,k,|·|(x)≥1−p} and L^Y,Y,k,|·|(x)=∑t=1n−b+11{|TY,Y,t(k)|≤x}/(n−b+1) ⁠. For the cross-correlations between the data process and its powers, the normalized difference and its subsampling counterpart are TX,Y(k)=n (ρ^X,Y(k)−ρX,Y(k)σ^X,Y), TX,Y,t(k)=b (ρ^X,Y,t(k)−ρ^X,Y(k)σ^X,Y,t), where ρ^X,Y,t(k)=γ^X,Y,t(k)/γ^X,X,t(0) γ^Y,Y,t(0) ⁠. Again, we could replace ρX,Y(k) by zero under the PGARCH hypothesis, but in keeping with the above treatment we utilize the parameter’s large-sample estimate. The normalization σ^X,Y,t is given (in the case of ν≥1 ⁠) by σ^X,Y,t=(b−1+b−1/2[∑ℓ=tt+b−1−kYℓ2]−1/2ν+[∑ℓ=tt+b−1−k|Xℓ|2+2ν]−1/2)−1. The sampling distribution LX,Y,k(x)=ℙ[TX,Y(k)≤x] is approximated by L^X,Y,k(x)=1n−b+1∑t=1n−b+11{TX,Y,t(k)≤x}, and is an estimator of L∞,X,Y,k(x) ⁠, the cdf of the limit variable given in (19). For ρX,Y(k) a (1−p) equal-tailed subsampling confidence interval CIet;1−p(ρX,Y(k)) is defined as [ρ^X,Y(k)−σ^X,Yn cX,Y,k(1−p/2),ρ^X,Y(k)−σ^X,Yn cX,Y,k(p/2)], (8) where cX,Y,k(1−p)=inf⁡{x: L^X,Y,k(x)≥1−p} ⁠. An (1−p) symmetric subsampling confidence interval for ρX,Y(k) is then CIs;1−p(ρX,Y(k))=[ρ^X,Y(k)∓σ^X,Yn cX,Y,k,|·|(1−p)], (9) with cX,Y,k,|·|(1−p)=inf⁡{x: L^X,Y,k,|·|(x)≥1−p} and L^X,Y,k,|·|(x)=∑t=1n−b+11{|TX,Y,t(k)|≤x}/(n−b+1) ⁠. 2 Asymptotic Theory We expand the theoretical results of Davis and Mikosch (1998) and Mikosch and Stărică (2000) in two substantial directions: we derive asymptotic relations for process and volatility autocovariances and cross-covariances (where the volatility is defined to be {σt} ⁠) for the PGARCH process, and we derive cross-covariance results for a PGARCH process and its power. Our main objective is to obtain nondegenerate weak limits of all studentized roots, so that the subsampling method is applicable. The first subsection reviews some background concepts and notation (also see Appendix A in Supplementary Data), while the second subsection applies these results to certain statistics, providing our main theorems. The third subsection applies these theorems to produce studentized statistics, and the fourth subsection establishes the consistency of subsampling. 2.1 GARCH and PGARCH The GARCH(p,q) process {Xt} has been studied in Mikosch and Stărică (2000) and BDM. In addition to establishing results on autocovariances, BDM provides results on Stochastic Recurrence Equations (SREs), from which they derive regular variation properties of GARCH processes; by adapting their methods of proof, many of the results can be trivially extended to PGARCH processes. We first review BDM’s GARCH results. The vector process of dimension d=q+p−1 defined by W⃗t=[σt+1,σt,⋯,σt−q+2,Xt,Xt−1,⋯,Xt−p+2]′ (10) satisfies a SRE in the squares of its components, as described in Equation (3.1) of BDM. Let ɛx be the point measure concentrated at x ⁠, and let ⇒L denote convergence in distribution of point measures on ℝ¯d(m+1)∖{0} ⁠, where ℝ¯=ℝ∪{±∞} and d=q+p−1 ⁠. BDM shows that W⃗t(m)= vec(W⃗t,⋯,W⃗t+m) yields a point process convergence Nn=∑t=1nɛW⃗t(m)an−1⇒LN∞=∑i,j≥1ɛPiQ⃗ij (11) as n→∞ ⁠, where an is the rate of regular variation, being implicitly defined by n ℙ[|W⃗0(m)|>an]→1. (12) The {Pi}i≥1 are the points of a Poisson process defined on (0,∞) with intensity measure given in terms of κ and the extremal index. The Poisson process ∑i≥1ɛPi is independent of each iid point process ∑i≥1ɛQij (for each j≥1 ⁠). The points {Pi} and {Qij} correspond to the radial and spherical portions of the limiting points W⃗t(m)an−1 ⁠; see Corollary 2.4 of Davis and Mikosch (1998) for more detail. It is apparent that Q⃗ij has m + 1 blocks of d components, which we denote via Q⃗ij= vec{Qij(0),Qij(1),⋯,Qij(m)}, (13) and each Qij(ℓ) itself has d=q+p−1 components, where 0≤ℓ≤m ⁠. To refer to the kth component, with 1≤k≤d ⁠, of Qij(ℓ) ⁠, we write Qij(ℓ,k) ⁠. BDM then makes an application to establishing weak convergence of sample autocovariances of {Xt} ⁠, extending earlier results by Davis and Mikosch (1998) and Mikosch and Stărică (2000) on the GARCH(1,1). These latter articles also discuss results for the autocovariances of {|Xt|} and {Xt2} ⁠, which are objects of interest for financial applications; BDM claim that these results can be extended to the GARCH(p,q) case without proof. For the sample autocovariances of {Xt2} ⁠, (asymptotic) recursive relations can be derived that extend ideas in Davis and Mikosch (1998) and Mikosch and Stărică (2000) for the GARCH(1,1), but these relations seem hard to extend to the autocovariances of {|Xt|}—something rather special happens when p,q≤1 that prevents the method of proof to be easily generalized for p > 1 or q > 1. Essentially, these same results also hold for the PGARCH, which we prove below. Set X⃗t=[St+1,⋯,St−q+2,Yt,⋯,Yt−p+2]′ ⁠. Then {Yt} and {St} ⁠, defined via (3), satisfy the following SRE: X⃗t=At X⃗t−1+Bt, (14) with Bt=[α0,0,⋯,0]′ and At=[α1 |Zt|ν+β1β2⋯βq−1βqα2α3⋯αp10⋯0000⋯001⋯0000⋯0⋮⋮⋱⋮⋮⋮⋮⋱⋮00⋯1000⋯0|Zt|ν0⋯0000⋯000⋯0010⋯0⋮⋮⋱⋮⋮⋮⋮⋱⋮00⋯000⋯10]. The key assumptions needed to establish the existence of a stationary solution, and its regular variation properties, are given below. The Lyapunov exponent of the SRE is given in terms of the L1 matrix norm by γ=inf⁡{n−1 Elog⁡||A1⋯An||,n∈ℕ}, and sufficient conditions (in the GARCH case) for the exponent’s negativity are discussed in Remark 3.2 of BDM. Assumption A α0>0 and the Lyapunov exponent γ<0 ⁠. Assumption B Z0 has a positive density on ℝ such that E|Z0|h<∞ for all h<h0 and E|Z0|h0=∞ for some h0∈(0,∞] ⁠; not all the parameters αj and βk for 1≤j≤p and 1≤k≤q are zero. Assumption C The density of Z0 is positive in an interval containing zero. Assumption D {Ut=(Xt,σt)} is strong mixing with geometric rate. Theorem 1 For the SRE (14), assume Assumptions A and B. Then there exists a unique strictly stationary solution to the SRE, and there exists ρ such that for all x∈ℝd∖{0} the inner product ⟨x,X⃗1⟩ is regularly varying with index ρ. Furthermore, if ρ is not an even integer, then X⃗1 is regularly varying with index ρ. Remark 1 This theorem is proved by exactly following the proof of Theorem 3.1 of BDM, only substituting |Zt|ν for Zt2 and consolidating parts A and B; therefore, its proof is omitted. Because it is unknown whether the PGARCH process is strong mixing—though in part C of Theorem 3.1 of BDM it is stated that for a GARCH process Assumption C implies Assumption D—we omit discussion of this point in Theorem 1. Also see Fryzlewicz and Subba Rao (2011). Corollary 1 For the SRE (14), assume Assumptions A, B, and D. Then a stationary version of the process {Ut=(Xt,σt)} exists, and if the index ρ of Theorem 1 is not an even integer, then the finite-dimensional distributions of {Ut} are regularly varying with index κ=νρ. Remark 2 The proof of Corollary 1 follows almost verbatim from the proof of Corollary 3.5 of BDM, the only change being that κ=νρ and ν need not be 2, as in the GARCH case. Note we assume the strong mixing property, although in the GARCH case it suffices to assume the weaker Assumption C instead. Hence using Theorem 2.8 of Davis and Mikosch (1998), from Corollary 1 we can conclude that—under our working assumptions A, B, and D—with W⃗t given by (10) the convergence (11) holds, along with (12). Note that the points Pi and Q⃗ij pertain to the process {Ut} ⁠, not {Yt} and {St} ⁠. The regular variation is of index κ, such that κ/ν is not an even integer. 2.2 Sampling Behavior for PGARCH Processes Under Assumptions A, B, and D we can state our convergence results for the roots of interest. Each theorem below splits into three main cases: (i) κ is sufficiently low that the mean centering does not exist, and a stable limit is obtained; (ii) κ is a bit larger so that centerings are utilized, and a stable limit for the root is obtained; (iii) κ is large enough that a central limit theorem holds, so that the limit of the root is Gaussian. (The third theorem splits case (ii) into two sub-cases.) In cases (i) and (ii) the limit distributions are stated in terms of the points of (11), which are explained in more detail in Theorem 2.10 of BDM; also see Corollary 2.4 of Davis and Mikosch (1998). In case (iii) we only need to know the joint covariance of the limiting Gaussian variables. In the first Theorem below, case (ii) involves a limit variable that is described as follows (also see Proposition 3.3 of Davis and Mikosch, 1998). The regular variation of the PGARCH process (cf. Corollary 1) yields the vague convergence of n ℙ[an−1W⃗0(m)∈·] to a measure τ on ℝ¯d(m+1)∖{0} ⁠, the Lévy measure of a κ-stable random vector with κ∈(0,2) (and such that κ/ν is not an even integer). Then we have the definition Vk=lim⁡δ→0(∑i,j≥1Pi2Qij(0,q+1)Qij(k,q+1)1{Pi2|Qij(0,q+1)Qij(k,q+1)|≥δ}−∫Bδ,kx(0,q+1)x(k,q+1) ρ(dx)) (15) for Bδ,k={x∈ℝd(m+1):δ<|x(0,q+1)x(k,q+1)|} ⁠, where k≥0 ⁠. The superscript notation is explained in (13). The points Qij(0,q+1) and Qij(k,q+1) correspond to the (0,q+1) and (k,q+1) components of W⃗t(m) ⁠, that is, Xt and Xt+k ⁠. In addition, all the theorems contain the random variables Wr=∑i,j≥1|Pi|r|Qij(0,q+1)|r, for r > 0. This first result extends the work of BDM, which is concerned with the GARCH(p,q) process, in three ways: the process is a PGARCH (which nest the GARCH), we consider convergence jointly with sample moments, and we also derive the result for acs. Theorem 2 Let {Xt} be a PGARCH(p,q) process (2) satisfying Assumptions A, B, and D; also assume that Z0 is symmetric about zero. Then: (i): in the case κ∈(0,2): ({nan−2 γ^X,X(k)}k=0m,nan−4 μ^X4)⇒L({Vk}k=0m,W4)({ρ^X,X(k)}k=1m,nan−4 μ^X4)⇒L({VkV0−1}k=1m,W4),where Vk=∑i,j≥1Pi2Qij(0,q+1)Qij(k,q+1). (ii): in the case κ∈(2,4): ({nan−2 [γ^X,X(k)−γX,X(k)]}k=0m,nan−4μ^X4)⇒L({Vk}k=0m,W4)({nan−2 [ρ^X,X(k)−ρX,X(k)]}k=1m,nan−4μ^X4)⇒L({γX,X−1(0)[Vk−ρX,X(k)V0]}k=1m,W4),where Vk is defined by (15). (iii): in the case κ>4: ({n1/2 [γ^X,X(k)−γX,X(k)]}k=0m,μ^X4)⇒L({Gk}k=0m,EX4)({n1/2 [ρ^X,X(k)−ρX,X(k)]}k=1m,μ^X4)⇒L({γX,X−1(0)[Gk−ρX,X(k)G0]}k=1m,EX4),where the Gk satisfy Cov(Gj,Gk)=∑ℓ Cov(X0Xj,XℓXℓ+k) and are multivariate normal. Remark 3 In cases (ii) and (iii) of Theorem 2, γX,X(k)=0 if k > 0, and in particular ρX,X(k)=0 in the results for ρ^X,X(k) ⁠. Therefore, these results could be stated as nan−2ρ^X,X(k)⇒LγX,X−1(0)Vk and n ρ^X,X(k)⇒LγX,X−1(0)Gk for cases (ii) and (iii), respectively. Note that these limiting distributions are nondegenerate, which is important for our subsampling applications discussed in the next section. The next result is concerned with autocovariances and acs for the powered process {Yt} ⁠. The middle case (ii) involves a somewhat complicated limit distribution, which is described in Proposition 1 of Appendix A (Supplementary Data), which requires the additional Assumption M discussed therein. The result for the autocovariances is stated without proof in BDM (and for the GARCH(1,1) case, a full proof is given in Davis and Mikosch, 1998), and our proof here relies on the recursive relations of the previous subsection. We also describe results jointly with the sample moments, and provide the autocorrelation results. Theorem 3 Let {Xt} be a PGARCH(p,q) process (2) satisfying Assumptions A, B, and D; also assume that Z0 is symmetric about zero. Then: (i): in the case κ∈(0,2ν): ({nan−2ν γ^Y,Y(k)}k=0m,nan−4ν μ^Y4)⇒L({Vk}k=0m,W4ν)({ρ^Y,Y(k)}k=1m,nan−4ν μ^Y4)⇒L({VkV0−1}k=1m,W4ν),where Vk=∑i,j≥1|Pi|2ν|Qij(0,q+1)Qij(k,q+1)|ν. (ii): in the case κ∈(2ν,4ν), assume Assumption M as well: with Uk defined as in Proposition 1. (iii): in the case κ>4ν: ({n1/2 [γ^Y,Y(k)−γY,Y(k)]}k=0m,μ^Y4)⇒L({Gk}k=0m,E|X|4ν)({n1/2 [ρ^Y,Y(k)−ρY,Y(k)]}k=1m,μ^Y4)⇒L({γY,Y−1(0)[Gk−ρY,Y(k)G0]}k=1m,E|X|4ν),where the Gk satisfy Cov(Gj,Gk)=∑ℓ Cov(Y0Yj,YℓYℓ+k) and are multivariate normal. The final result of this section utilizes some of the results of Theorems 2 and 3, because the normalization for the cross-correlations involves both γX,X(0) and γY,Y(0) ⁠. It turns out that symmetry in the cross-covariance variables makes the results resemble those of Theorem 2, though the limits depend on whether ν is less than or greater than one. Also, when κ>1+ν ⁠, the limit involves the random variables Vk=lim⁡δ→0(∑i,j≥1Pi|Pi|νQij(0,q+1)|Qij(k,q+1)|ν1{|Pi|ν+1|Qij(0,q+1)||Qij(k,q+1)|ν≥δ}−∫Bδ,kx(0,q+1)|x(k,q+1)|ν ρ(dx)) (16) for Bδ,k={x∈ℝd(m+1):δ<|x(0,q+1)[x(k,q+1)]ν|} ⁠, where k≥0 ⁠. The points Qij(0,q+1) and [Qij(k,q+1)]ν correspond to the (0,q+1) component and the powered (k,q+1) component, respectively, of W⃗t(m) ⁠, that is, Xt and Yt+k ⁠. These types of statistics have not been mathematically studied in prior literature, to our knowledge. Theorem 4 Let {Xt} be a PGARCH(p,q) process (2) satisfying Assumptions A, B, and D; also assume that Z0 is symmetric about zero. If κ>1+ν, then γX,Y(k) exists and equals zero for all k. Supposing that ν∈(0,1), then: (i) in the case κ∈(0,2ν): ({nan−(1+ν) γ^X,Y(k)}k=0m,nan−2γ^X,X(0),nan−2ν γ^Y,Y(0),nan−(2+2ν) μ^X2Y2)⇒L({Vk}k=0m,W2,W2ν,W2+2ν)({ρ^X,Y(k)}k=1m,nan−(2+2ν) μ^X2Y2)⇒L({Vk/W2 W2ν}k=1m,W2+2ν),where Vk=∑i,j≥1Pi|Pi|νQij(0,q+1)|Qij(k,q+1)|ν. (ii) in the case κ∈(2ν,1+ν)∪(1+ν,2), assuming Assumption M: ({nan−(1+ν) γ^X,Y(k)}k=0m,nan−2γ^X,X(0),γ^Y,Y(0),nan−(2+2ν) μ^X2Y2)⇒L({Vk}k=0m,W2,γY,Y(0),W2+2ν)({n1/2an−νρ^X,Y(k)}k=1m,nan−(2+2ν) μ^X2Y2)⇒L({Vk/W2 γY,Y(0)}k=1m,W2+2ν),with Vk given by case (i) if κ∈(2ν,1+ν) and by (16) if κ∈(1+ν,2). (iii) in the case κ∈(2,2+2ν), assuming Assumption M: ({nan−(1+ν) γ^X,Y(k)}k=0m,γ^X,X(0),γ^Y,Y(0),nan−(2+2ν) μ^X2Y2)⇒L({Vk}k=0m,γX,X(0),γY,Y(0),W2+2ν)({nan−(1+ν)ρ^X,Y(k)}k=1m,nan−(2+2ν) μ^X2Y2)⇒L({Vk/γX,X(0) γY,Y(0)}k=1m,W2+2ν),with Vk given by (16). (iv) in the case κ>2+2ν, assuming Assumption M: ({n1/2 γ^X,Y(k)}k=0m,γ^X,X(0),γ^Y,Y(0),μ^X2Y2)⇒L({Gk}k=0m,γX,X(0),γY,Y(0),γX2,Y2(0))({n1/2ρ^X,Y(k)}k=1m,μ^X2Y2)⇒L({Gk/γX,X(0) γY,Y(0)}k=1m,E|X|2+2ν),where the Gk satisfy Cov(Gj,Gk)=∑ℓ Cov(X0Yj,XℓYℓ+k) and are multivariate normal. Supposing that ν≥1, then: (i) in the case κ∈(0,2): ({nan−(1+ν) γ^X,Y(k)}k=0m,nan−2γ^X,X(0),nan−2ν γ^Y,Y(0),nan−(2+2ν) μ^X2Y2)⇒L({Vk}k=0m,W2,W2ν,W2+2ν)({ρ^X,Y(k)}k=1m,nan−(2+2ν) μ^X2Y2)⇒L({Vk/W2 W2ν}k=1m,W2+2ν),where Vk=∑i,j≥1Pi|Pi|νQij(0,q+1)|Qij(k,q+1)|ν. (ii) in the case κ∈(2,1+ν)∪(1+ν,2ν), assuming Assumption M: ({nan−(1+ν) γ^X,Y(k)}k=0m,γ^X,X(0),nan−2ν γ^Y,Y(0),nan−(2+2ν) μ^X2Y2)⇒L({Vk}k=0m,γX,X(0),W2ν,W2+2ν)({n1/2an−1ρ^X,Y(k)}k=1m,nan−(2+2ν) μ^X2Y2)⇒L({Vk/γX,X(0) W2ν}k=1m,W2+2ν),with Vk given by case (i) if κ∈(2,1+ν) and by (16) if κ∈(1+ν,2ν). (iii) in the case κ∈(2ν,2+2ν), assuming Assumption M: ({nan−(1+ν) γ^X,Y(k)}k=0m,γ^X,X(0),γ^Y,Y(0),nan−(2+2ν) μ^X2Y2)⇒L({Vk}k=0m,γX,X(0),γY,Y(0),W2+2ν)({nan−(1+ν)ρ^X,Y(k)}k=1m,nan−(2+2ν) μ^X2Y2)⇒L({Vk/γX,X(0) γY,Y(0)}k=1m,W2+2ν),with Vk given by (16). (iv) in the case κ>2+2ν, assuming Assumption M: ({n1/2 γ^X,Y(k)}k=0m,γ^X,X(0),γ^Y,Y(0),μ^X2Y2)⇒L({Gk}k=0m,γX,X(0),γY,Y(0),γX2,Y2(0))({n1/2ρ^X,Y(k)}k=1m,μ^X2Y2)⇒L({Gk/γX,X(0) γY,Y(0)}k=1m,E|X|2+2ν),where the Gk satisfy Cov(Gj,Gk)=∑ℓ Cov(X0Yj,XℓYℓ+k) and are multivariate normal. Remark 4 The rates of convergence for the sample cross-correlation, and its root, vary considerably depending upon κ. To summarize (recall that an∼n1/κ up to a constant), when ν∈(0,1) the rate is n0 for κ<2ν ⁠; n1/2−ν/κ for κ∈(2ν,2) ⁠; n1−(1+ν)/κ for κ∈(2,2+2ν) ⁠; n1/2 for κ>2+2ν ⁠. Whereas for ν≥1 ⁠, the rates are n0 for κ<2 ⁠; n1/2−1/κ for κ∈(2,2ν) ⁠; n1−(1+ν)/κ for κ∈(2ν,2+2ν) ⁠; n1/2 for κ>2+2ν ⁠. See Figure 1 for a depiction of this rate in the case of ν∈{0.5,2} ⁠; the odd behavior is due to the unusual normalization involving both γ^X,X(0) and γ^Y,Y(0) ⁠. Figure 1. View largeDownload slide δ(κ) versus κ, where δ(κ) determines the order in probability of the sample cross-correlation, nδ(κ)ρ^X,Y(k)=OP(1) ⁠. Figure 1. View largeDownload slide δ(κ) versus κ, where δ(κ) determines the order in probability of the sample cross-correlation, nδ(κ)ρ^X,Y(k)=OP(1) ⁠. 2.3 Self-normalization for Correlation Statistics We next provide the asymptotic results for self-normalized roots. Theorem 5 Let {Xt} be a PGARCH(p,q) process (2) satisfying Assumptions A, B, and D; also assume that Z0 is symmetric about zero. (i) With quantities defined in Theorem 2 according to the value of κ, n{ρ^X,X(k)σ^X,X}k=1m⇒L{{VkV0}k=1m  if κ∈(0,2){VkγX,X(0)W4}k=1m  if κ∈(2,4){GkγX,X(0)EX4}k=1m  if κ>4. (17) (ii) With quantities defined in Theorem 3 according to the value of κ, and assuming Assumption M when κ>2ν, n{ρ^Y,Y(k)−ρY,Y(k)σ^Y,Y}k=1m⇒L{{VkV0}k=1m  if κ∈(0,2ν){Uk−ρY,Y(k)U0γY,Y(0)W4ν}k=1m  if κ∈(2ν,4ν){Gk−ρY,Y(k)G0γY,Y(0)E[|X|4ν]}k=1m  if κ>4ν. (18) (iii) If ν∈(0,1), with quantities defined in Theorem 4 according to the value of κ n{ρ^X,Y(k)σ^X,Y}k=1m⇒L{{VkW2 W2ν}k=1m  if κ∈(0,2ν){VkγY,Y(0)W21+ν}k=1m  if κ∈(2ν,2){VkγX,X(0) γY,Y(0) W2+2ν}k=1m  if κ∈(2,2+2ν){GkγX,X(0) γY,Y(0) E|X|2+2ν}k=1m  if κ>2+2ν. (19) If ν≥1, with quantities defined in Theorem 4 according to the value of κ n{ρ^X,Y(k)σ^X,Y}k=1m⇒L{{VkW2 W2ν}k=1m  if κ∈(0,2){VkγX,X(0)W2ν1+1/ν}k=1m  if κ∈(2,2ν){VkγX,X(0) γY,Y(0) W2+2ν}k=1m  if κ∈(2ν,2+2ν){GkγX,X(0) γY,Y(0) E|X|2+2ν}k=1m  if κ>2+2ν. In each case, the studentized correlations have a nondegenerate limit distribution, and the rate of convergence does not depend on κ. Although the limit distributions are not pivotal, they can be approximated via subsampling. Remark 5 Although our normalization by the sample moment matches the growth rate of its corresponding root, it is not scale-invariant. Hence, to obtain a scale-invariant statistic, we propose to divide each sample kth moment by the exponential of the log moment μ^log⁡(|X|k)=n−1∑t=1nlog⁡(|Xt|k). This quantity converges to Elog⁡(|X|k) no matter the value of κ, and it also scales such that μ^log⁡(|aX|k)=log⁡(|a|k)+μ^log⁡(|X|k) ⁠. Since μ^log⁡(|X|k) converges in probability to its expectation, we can use its exponential to correct for scale; exp⁡μ^log⁡(|aX|k)=|a|k exp⁡μ^log⁡(|X|k) ⁠. For notational transparency the estimators of Subsection 1.3 are expressed in terms of the original normalizations, but the simulations in Appendix C (Supplementary Data) contain results with the scale-invariant normalizations introduced in Remark 5. 2.4 Consistency of the Subsampling Estimators To establish the consistency of the various subsampling estimators introduced in Section 1, we employ the mixing properties of the corresponding processes and Theorem 11.3.1 of Politis, Romano, and Wolf (1999) combined with the convergence results derived in Section 2. By consistency, we mean that the subsampling distribution estimators converge in probability to the respective cdf of the limiting distributions. Corollary 2 Let {Xt} be a PGARCH(p,q) process (2) satisfying Assumptions A, B, and D; also assume that Z0 is symmetric about zero. Assume that b/n+1/b→0 as n→∞. (i) With quantities defined in Theorem 2 according to the value of κ L^X,X,k(x)→PL∞,X,X,k(x) for each x that is a continuity point of the limit distribution of (17). If this distribution is continuous, the convergence is also uniform. Moreover, the asymptotic coverage of the intervals (4) and (5) is the nominal level 1−p. (ii) With quantities defined in Theorem 3 according to the value of κ, and assuming Assumption M when κ>2ν, L^Y,Y,k(x)→PL∞,Y,Y,k(x) for each x that is a continuity point of the limit distribution of (18). If this distribution is continuous, the convergence is also uniform. Moreover, the asymptotic coverage of the intervals (6) and (7) is the nominal level 1−p. (iii) With quantities defined in Theorem 4 according to the value of κ L^X,Y,k(x)→PL∞,X,Y,k(x) for each x that is a continuity point of the limit distribution of (19). If this distribution is continuous, the convergence is also uniform. Moreover, the asymptotic coverage of the intervals (8) and (9) is the nominal level 1−p. 3 Application to Returns on Stock Indices We consider two datasets analyzed in Section 5.5 of Francq and Zakoian (2010). One corresponds to the daily log returns of CAC 40 stock index from 02/03/1990 to 29/12/2006 (4244 observations) and other to those of FTSE 100 index from 04/04/1984 to 03/04/2007 (5811 observations). According to Francq and Zakoian (2010), a GARCH(1,1) model is a reasonable guess for the first series, but higher order GARCH or ARCH is more likely to fit the second series. In Figure 2, we show Hill plots of the returns. For both series, the assumption about the finiteness of the second moment does not seem to be too unreasonable; however, the assumption regarding the finiteness of the fourth moment might be questionable. Figure 2. View largeDownload slide Hill plot of CAC 40 and FTSE 100 returns. Figure 2. View largeDownload slide Hill plot of CAC 40 and FTSE 100 returns. The 95% confidence intervals for the acf of the returns are plotted in Figure 3. PLUGIN intervals were calculated by fitting a GARCH(1,1) model with normal innovations (R routine garchFit of package fGarch) to the returns and plugging in the estimated α1 and β1 values into the exact normal asymptotic confidence bounds (under normal errors) formula. The estimated values for CAC 40 returns were α^1=0.076907 and β^1=0.907047 (⁠ α^0=0.000003 ⁠) while those for FTSE 100 returns were α^1=0.089941 and β^1=0.891497 (⁠ α^0=0.000002 ⁠). The remaining confidence intervals are those employed in the simulation study of Appendix C (Supplementary Data). Figure 3. View largeDownload slide 95% confidence intervals for the acs of returns of CAC 40 and FTSE 100 indices based on several methods. Figure 3. View largeDownload slide 95% confidence intervals for the acs of returns of CAC 40 and FTSE 100 indices based on several methods. For CAC 40 returns, the confidence intervals for ρX,X(k) based on the subsampling method are much wider than the other two types and oscillate around ±0.4. For both series, and all interval types, the acs seem to be not significantly different from zero. The 95% confidence intervals for ρX2,X2(k) are depicted in Figure 4 (these intervals have to be treated with caution if we are unsure about the finiteness of the fourth moment, because in such case ρX2,X2(k) is undefined). As pointed out in Appendix C (Supplementary Data), the symmetric confidence intervals are too wide for the acs of the squares, hence we focus our attention on the equal-tailed intervals. For CAC 40 and FTSE 100 squared returns, the width of the confidence intervals for the acs is about 0.25. Bounds for CAC 40 indicate slower decay of ρX2,X2(k) compared to bounds for FTSE 100. Figure 4. View largeDownload slide 95% subsampling confidence intervals for the acs of squared returns of CAC 40 and FTSE 100 indices. Figure 4. View largeDownload slide 95% subsampling confidence intervals for the acs of squared returns of CAC 40 and FTSE 100 indices. The 95% confidence intervals for ρX,X2(k) are shown in Figure 5. The two subsampling methods yield similar bounds, which are slightly asymmetric. Both types of intervals indicate that the process’ cross-correlations are non-significant. Figure 5. View largeDownload slide 95% subsampling confidence intervals for the cross-correlations of the returns and squared returns of CAC 40 and FTSE 100 indices. Figure 5. View largeDownload slide 95% subsampling confidence intervals for the cross-correlations of the returns and squared returns of CAC 40 and FTSE 100 indices. 4 Summary In this article, we study the asymptotic properties of sample autocovariances and acs of PGARCH processes. Autocorrelations of a time series and its powers have been used in the literature to determine a nonlinear process’ serial structure, but the inference for PGARCH is complicated because the rate of convergence of sample autocovariance and autocorrelation estimators depends upon the tail index. This tail index is typically unknown, unless it has been previously estimated. However, through an appropriate studentization of autocovariance/autocorrelation roots (i.e., the estimator minus its estimand) it is possible to avoid the necessity of knowing the tail index; this strategy was adopted for heavy-tailed moving average time series in Davis and Resnick (1985) and McElroy and Politis (2002). The latter paper approximated the limiting quantiles of the studentized root by the subsampling methodology. Here we study the related inference problem for PGARCH processes: we derive the limiting distributions, provide effective studentizations, and examine subsampling methods for estimating the limiting quantiles. To obtain nondegenerate weak limits of all studentized roots, we derive the recursive relationships for the PGARCH and thus substantially extend existing theoretical results. A simulation study, which can be found in Appendix C (Supplementary Data), indicates that the subsampling confidence intervals for the acs of GARCH processes with a finite fourth moment are generally wider than the asymptotic confidence intervals (approximate or exact). The subsampling approach can still be employed when no asymptotic formula is available (providing intervals for the acs when the fourth moment is infinite, and for the acs of squares and ccs of the process with its squares). Nevertheless, the subsampling-based empirical coverage probabilities tend to be higher than the nominal level. Equal-tailed subsampling confidence intervals are preferable over the symmetric ones. Footnotes * The authors are grateful for helpful comments from the Associate Editor and anonymous referees. This work was partially supported by the Spanish Ministry of Economy and Competitiveness [grant number ECO2012-38442 to AJ]. This report is released to inform interested parties of research and to encourage discussion. The views expressed on statistical issues are those of the authors and not necessarily those of the U.S. Census Bureau. References Baek C. , Pipiras V. , Wendt H. and Abry P. . 2009 . Second Order Properties of Distribution Tails and Estimation of Tail Exponents in Random Difference Equations . Extremes 12 : 361 – 400 . Google Scholar Crossref Search ADS WorldCat Basrak B. , Davis R. and Mikosch T. . 2002 . Regular Variation of GARCH Processes . Stochastic Processes and their Applications 99 : 95 – 115 . Google Scholar Crossref Search ADS WorldCat Bollerslev T. 1986 . Generalized Autoregressive Conditional Heteroskedasticity . Journal of Econometrics 31 : 307 – 327 . Google Scholar Crossref Search ADS WorldCat Bougerol P. and Picard N. . 1992 . Stationarity of GARCH Processes and of Some Nonnegative Time Series . Journal of Econometrics 52 : 115 – 127 . Google Scholar Crossref Search ADS WorldCat Brockwell P. J. and Davis R. A. . 1991 . Time Series: Theory and Methods . New York : Springer . Google Preview WorldCat Carrasco M. and Chen X. . 2002 . Mixing and Moment Properties of Various GARCH and Stochastic Volatility Models . Econometric Theory 18 : 17 – 39 . Google Scholar Crossref Search ADS WorldCat Davis R. and Mikosch T. . 1998 . The Sample Autocorrelations of Heavy-Tailed Processes with Applications to ARCH . The Annals of Statistics 26 : 2049 – 2080 . Google Scholar Crossref Search ADS WorldCat Davis R. A. and Resnick S. I. . 1985 . Limit Theory for Moving Averages of Random Variables with Regularly Varying Tail Probabilities . The Annals of Probability 13 ( 1 ): 179 – 195 . Google Scholar Crossref Search ADS WorldCat Francq C. and Zakoian J.-M. . 2010 . GARCH Models. Structure, Statistical Inference and Finanial Applications . UK : Wiley . Google Preview WorldCat Fryzlewicz P. and Subba Rao S. . 2011 . Mixing Properties of ARCH and Time-Varying ARCH Processes . Bernoulli 1 : 320 – 346 . Google Scholar Crossref Search ADS WorldCat Hall P. and Yao Q. . 2003 . Inference in ARCH and GARCH Models . Econometrica 71 : 285 – 317 . Google Scholar Crossref Search ADS WorldCat Huang D. , Wang H. and Yao Q. . 2008 . Estimating GARCH Models: When to Use What? Econometrics Journal 11 : 27 – 38 . Google Scholar Crossref Search ADS WorldCat Jach A. , McElroy T. and Politis D. . 2012 . Subsampling Inference for the Mean of Heavy-Tailed Long-Memory Time Series . Journal of Time Series Analysis 33 : 96 – 111 . Google Scholar Crossref Search ADS WorldCat Kokoszka P. and Politis D. . 2011 . Nonlinearity of ARCH and Stochastic Volatility Models and Bartlett’s Formula . Probability and Mathematical Statistics 31 : 47 – 59 . WorldCat Kokoszka P. , Teyssière G. and Zhang A. . 2004 . “Confidence Intervals for the Autocorrelations of the Squares of GARCH Sequences” . In Computational Science - ICCS 2004, Volume 3039 of Lecture Notes in Computer Science . Springer , pp. 837 – 844 . Google Preview WorldCat Lindner A. 2009 . Stationarity, Mixing, Distributional Properties and Moments of GARCH(p, q)-Processes . In Andersen J.-P. K. T. , Davis R. and Mikosch T. (eds.), Handbook of Financial Time Series . Berlin, Germany : Springer , pp. 481 – 496 . Google Preview WorldCat McElroy T. and Politis D. N. . 2002 . Robust Inference for the Mean in the Presence of Serial Correlation and Heavy-Tailed Distributions . Econometric Theory 18 : 1019 – 1039 . Google Scholar Crossref Search ADS WorldCat McElroy T. and Politis D. N. . 2007 . Self-Normalization for Heavy-Tailed Time Series with Long Memory . Statistica Sinica 17 ( 1 ): 199 – 220 . WorldCat Mikosch T. and Stărică C. . 2000 . Limit Theory for the Sample Autocorrelation and Extremes of a GARCH(1, 1) Process . The Annals of Statistics 28 : 1427 – 1451 . Google Scholar Crossref Search ADS WorldCat Mittnik S. , Paolella M. and Rachev S. . 2002 . Stationarity of Stable Power-GARCH Processes . Journal of Econometrics 106 : 97 – 107 . Google Scholar Crossref Search ADS WorldCat Peng L. and Yao Q. . 2003 . Least Absolute Deviations Estimation for ARCH and GARCH Models . Biometrika 90 : 967 – 975 . Google Scholar Crossref Search ADS WorldCat Politis D. N. , Romano J. P. and Wolf M. . 1999 . Subsampling . New York : Springer . Google Preview WorldCat Tully E. and Lusey B. . 2007 . A Power GARCH Examination of the Gold Market . Research in International Business and Finance 21 : 316 – 325 . Google Scholar Crossref Search ADS WorldCat Wagner N. and Marsh T. . 2004 . Measuring Tail Thickness under GARCH and an Application to Extreme Exchange Rate Changes . Journal of Empirical Finance 12 : 165 – 185 . Google Scholar Crossref Search ADS WorldCat © The Author, 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model) http://www.deepdyve.com/assets/images/DeepDyve-Logo-lg.png Journal of Financial Econometrics Oxford University Press

# Subsampling Inference for the Autocorrelations of GARCH Processes

, Volume 17 (3) – Jun 1, 2019
21 pages

Loading next page...

/lp/ou_press/subsampling-inference-for-the-autocorrelations-of-garch-processes-RHYtnwsIb5
Publisher
Oxford University Press
Copyright
© The Author, 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
ISSN
1479-8409
eISSN
1479-8417
DOI
10.1093/jjfinec/nbx037
Publisher site
See Article on Publisher Site

### Abstract

Abstract We provide self-normalization for the sample autocorrelations of power GARCH(p, q) processes whose higher moments might be infinite. To validate the studentization, whose goal is to match the growth rate dependent on the index of regular variation of the process, we substantially extend existing weak-convergence results. Since asymptotic distributions are non-pivotal, we construct subsampling-based confidence intervals for the autocorrelations and cross-correlations, which are shown to have satisfactory empirical coverage rates in a simulation study. The methodology is further applied to daily returns of CAC40 and FTSA100 indices and their squares. In the exploratory analysis of time series, it is common practice to examine the sample autocorrelations (acs) of the observed data (suitably transformed to remove nonstationarity) X1,X2,…,Xn ⁠, to see whether the process differs significantly from white noise. For financial time series, such as log-returns, there is also interest in studying the acs of the squared data. In order to ascertain whether a sample autocorrelation at a particular lag differs significantly from zero, it is necessary to obtain an accurate construction of the parameter’s confidence interval, and this in turn requires some knowledge of the asymptotic behavior of the sample acs of {Xt} (or of {Xt2} ⁠). For linear processes with finite variance, the sample acs (under mild conditions) are asymptotically normal with the asymptotic variance-covariance matrix given by the standard Bartlett’s formula (Brockwell and Davis, 1991), whereas for nonlinear processes with potentially infinite variance the asymptotic behavior is much more complex. In this article we focus on nonlinear processes, namely, the popular class of GARCH(p,q) processes {Xt} (Bollerslev, 1986), which are widely used for modeling financial log-returns. For GARCH processes, the asymptotic normality of the sample acs with the standard n-rate holds if E[Xt4]<∞ ⁠. Since the marginal distributions of GARCH processes are regularly varying with index κ>0 (Davis and Mikosch, 1998; Mikosch and Stărică, 2000; Basrak, Davis, and Mikosch, 2002), this moment requirement is met if κ > 4. However, the asymptotic variance-covariance matrix can no longer be expressed via the standard Bartlett’s formula, and instead is given via the general Bartlett’s formula (Chapter 5 of Francq and Zakoian, 2010; Kokoszka and Politis, 2011). Nevertheless, this matrix can be consistently estimated (Francq and Zakoian, 2010; Kokoszka and Politis, 2011) so long as E[Xt4]<∞ ⁠. This is in tension with the empirical evidence suggesting that the tails of financial log-returns are heavier, having an infinite fourth moment (Mittnik, Paolella, and Rachev, 2002; Carrasco and Chen, 2002; Tully and Lusey, 2007). Statistical inference for the acs of GARCH processes in the absence of a finite fourth moment (2 < κ < 4) is rather challenging. Under this scenario, the convergence rate of the sample acs is slower than the classical n and is determined by the abovementioned index κ, which depends on the model parameters and the distribution of the innovations {Zt} and is difficult to estimate (Wagner and Marsh, 2004; Baek et al., 2009). Closed-form expressions for κ exist only for ARCH(1) and GARCH(1,1). Additionally, if E[Zt4]=∞ ⁠, estimation of model coefficients via the quasi maximum likelihood poses difficulties (nonstandard rates of convergence and non-normal asymptotics) and other methods have been proposed in the literature (Hall and Yao, 2003; Peng and Yao, 2003; Huang, Wang, and Yao, 2008). The limit distributions of sample acs when 2 < κ < 4 involve the infinite-variance stable laws (Basrak, Davis, and Mikosch, 2002) and thus the asymptotic quantiles cannot be determined analytically. When 0 < κ < 2, the population acs of GARCH processes are not well-defined and the sample acs are inconsistent. For similar reasons, the asymptotics and the convergence rates for the sample acs of squares of GARCH processes also show trichotomous behavior, which is subject to κ>8, κ∈(4,8) and κ∈(0,4) ⁠, respectively. Our primary goal in the current article is to construct confidence intervals for the acs of GARCH(p,q) processes, of their squares and of the cross-correlations between the process and its squares. Recall that the acs of a GARCH process are zero, whereas the acs of squares decay with geometric rate. Moreover, as shown herein, the cross-correlations between values and squares for the GARCH process are zero whenever the marginal distributions are symmetric. Therefore, any empirical evidence against these three features will render dubious the hypothesis that the data’s dynamics can be adequately captured through a GARCH model. Thus, our procedures for the construction of confidence intervals can be viewed as several misspecification tests for the GARCH hypothesis. Also in the case that the acs of the absolute values of the process are of interest, we establish convergence results for power GARCH (PGARCH) processes. In prior literature, Kokoszka, Teyssière, and Zhang (2004) compared several resampling methods of constructing confidence intervals for lag-1-autocorrelation of squares in GARCH-type models, recommending residual bootstrap as the best approach. In contrast, we examine all the lags of the acs, and employ a nonparametric approach that combines the concepts of self-normalization and subsampling. This approach is valid irrespective of whether the asymptotic distributions for the acs are Gaussian or not (i.e., without assuming finiteness of the fourth moment), and does not require knowledge of model orders p and q. Our procedure does not involve parameter estimation, which under some scenarios can be particularly troublesome, for example, when the error distribution is heavy-tailed with an infinite fourth moment. Self-normalization (McElroy and Politis, 2007; Jach, McElroy, and Politis, 2012) addresses the issue of parameter-dependent convergence rates and is accomplished by dividing the sample ac of {Xt} (resp. {Xt2} ⁠) by a quantity that correctly matches its asymptotic growth rate, without knowing a priori whether the fourth (resp. eighth) population moment is finite or not. We show that the fourth (resp. eighth) sample moment is suitable for such studentization; we also provide a studentization for the cross-correlations. The identification of suitable studentizations is nontrivial, and is a novel facet of this work. In order to validate this technique, it is necessary to substantially extend some of the weak-convergence results of Mikosch and Stărică (2000), which is a stand-alone contribution of this article. Clearly, self-normalization can only resolve half of the problem—namely, eliminating the need to know the convergence rate to compute the studentized statistic—since the limit distributions will be unconventional and non-pivotal. Subsampling (Politis, Romano, and Wolf, 1999) can be used to empirically estimate the quantiles of the sampling distribution. This scheme operates by computing the same statistics on a small subsample—typically a contiguous stretch—drawn from the original time series data, with the unknown parameter being replaced by its best large-sample estimate. Consistency of the resulting empirical distribution for the sampling (or asymptotic) distribution is typically established through a strong mixing assumption, together with strict stationarity, which in the context of GARCH processes is immediate. The article is organized as follows: Section 1 develops the statistical methodology of self-normalization for sample acs, as well as other unobserved quantities. While these results are not of direct statistical applicability, they are necessary for establishing the subsequent subsampling methodology. Section 2 provides a detailed asymptotic theory, upon which the statistical methodology relies. An application to stock returns is given in Section 3, while Section 4 concludes. Simulations that explore finite-sample performance of the subsampling estimators, as well as proofs of technical results, are provided in the appendices (see Supplementary Data). 1 Self-normalization for Autocovariances and Autocorrelations 1.1 Process The GARCH process satisfies Xt=σt Zt for an iid sequence {Zt} that are only assumed to be symmetric about zero, and σt2=α0+∑j=1pαjXt−j2+∑j=1qβjσt−j2. (1) We refer to {σt} as the volatility process. A discussion of the conditions for stationarity are summarized in Lindner (2009); necessary and sufficient conditions in terms of the process’ Lyapunov exponent are given in Bougerol and Picard (1992), and a sufficient condition—in the case that Z0 has finite variance—in terms of the coefficients αj and βj is given in Bollerslev (1986). In this article, we consider {Zt} that are heavy-tailed, but are interested in stationary GARCH processes—see the discussion in Remark 3.2 of Basrak, Davis, and Mikosch (2002) (henceforth BDM) for stationarity conditions. Theorem 3.1 of BDM discusses the properties of the GARCH process, and in their Corollary 3.5 they show that Ut=(Xt,σt) has heavy-tailed marginal distributions of index κ, for some κ > 0 that depends on distributional properties of Z0 and the GARCH coefficients in a complicated fashion. The bivariate process {Ut} is also strong mixing with geometric rate, and is regularly varying with some rate an. The rate an is related to the tail index, being given by an=c n1/κ for some constant c > 0—see Remark 2.1 of BDM. The PGARCH process {Xt} is defined via Xt=σt Zt together with volatility satisfying |σt|ν=α0+∑j=1pαj|Xt−j|ν+∑j=1qβj|σt−j|ν, (2) where ν > 0 is the exponent of the process (ν = 2 corresponds to the GARCH). For references on PGARCH, see Mittnik, Paolella, and Rachev, 2002; Carrasco and Chen, 2002; and Tully and Lusey, 2007. Interest focuses on the transformed variables Yt=|Xt|ν   and  St=σtν. (3) Define the following sample quantities for k integer: γ^Y,Y(k)=n−1∑t=1nYtYt+kγ^Y,S(k)=n−1∑t=1nYtSt+kγ^S,S(k)=n−1∑t=1nStSt+k. (The latter two quantities are not statistics, because {St} is not observed.) Also γ^S,Y(k) is defined by swapping the order of Y and S. Up to negligible errors (i.e., terms that converge to zero in probability—see below for the precise discussion), γ^Y,Y(−k)≈γ^Y,Y(k) ⁠, γ^Y,S(−k)≈γ^S,Y(k) ⁠, and γ^S,S(−k)≈γ^S,S(k) ⁠. If we remove the  ^  symbol, we refer to expectations (whenever these exist), and the above relations in the lags are exact. Because X0 is regularly varying of index κ, the autocovariances and cross-covariances for {Yt} and {St} exist whenever ν < κ/2. We are interested in weak convergence of the so-called roots defined by γ˜Y,Y(k)=γ^Y,Y(k)−γY,Y(k) ⁠, and secondarily in weak convergence of the analogous quantities for the volatilities S and cross-covariances. Although the latter roots involving the volatilities are theoretical quantities, the distribution of γ˜Y,Y(k) depends upon γ˜Y,S(k) and γ˜S,S(k) ⁠, so it behooves us to analyze these objects together. When κ < 2ν the theoretical quantity γY,Y (k) does not exist, and the root is just defined via γ˜Y,Y(k)=γ^Y,Y(k) ⁠. We extend this definition to γ˜Y,S(k) ⁠, and γ˜S,S(k) in the obvious fashion. The appropriate rate of convergence actually depends on the regular variation rate an. Trivially, regular variation for the νth power implies a rate of anν for Yt, and the product of two variables (in the autocovariances or cross-covariances) indicates a rate of an2ν ⁠. In the case of a GARCH(1,1), previous work indicates that nan−4γ˜Y,Y(k) converges weakly to a nondegenerate random variable, jointly in k, when κ < 8; here we utilize the rate an2ν in lieu of an4 (i.e., replacing the exponent 2 by ν). When κ > 2ν the autocovariances and cross-covariances exist, and centering of the sample quantities becomes possible, but when κ < 2ν no centering is necessary. 1.2 Self-normalization For our applications we investigate sample autocovariances, acs, and cross-covariances and cross-correlations for the process {Xt} with its powers {Yt} ⁠, where Yt=|Xt|ν ⁠. Recall that γX,X(h) = 0 for h≠0, whenever κ > ν. With the notation for γ^(k) of the previous subsection, and with ρ^(k)=γ^(k)/γ^(0) by definition, we here study γ^X,X, ρ^X,X, γ^X,Y, ρ^X,Y, γ^Y,Y, ρ^Y,Y. In the case of the cross-correlation, the normalization is γ^X,X(0)γ^Y,Y(0) ⁠. In each case, there are rates of convergence (sometimes with a mean centering) for each statistic with the results dependent on κ and ν. Since we are interested in developing simple studentizations for the statistics, we prove joint results involving absolute sample moments, abbreviated by μ^Xj=n−1∑t=1n|Xt|j for j real. Here we discuss self-normalized statistics, such that the studentized quantity’s rate of convergence does not depend on unknown quantities. First consider self-normalization for the acs of a GARCH(p,q) process. It follows from Theorem 2 below that cn−1nρ^X,X(k)=OP(1) ⁠, where cn={n1 if κ∈(0,2)an2 if κ∈(2,4)n1/2 if κ∈(4,∞). Excluding κ∈{2,4} ⁠, the rate cn equals a constant times n raised to the power 1∧(2/κ∨1/2) ⁠. A suitable choice for the self-normalization that matches this growth rate cn is σ^X,X=(n−1+(nμ^X4)−1/2)−1, as is demonstrated in Theorem 5 below. Similarly, the growth rate for the acs of powers takes the form of n raised to the power 1∧(2ν/κ∨1/2) ⁠, and hence we can normalize with σ^Y,Y=(n−1+(nμ^Y4)−1/2)−1. The case of cross-correlations is more complex. In the case that ν∈(0,1) ⁠, the growth rate cn takes the form of n raised to the power [1∧(1/2+ν/κ)]∧[(1+ν)/κ∨1/2] ⁠. Therefore, we can use the studentization σ^X,Y=(n−1+n−1/2 (nμ^X2)−ν/2+(nμ^X2Y2)−1/2)−1. On the other hand, if ν≥1 then the growth rate cn takes the form of n raised to the power [1∧(1/2+1/κ)]∧[(1+ν)/κ∨1/2] ⁠. Then we can use the studentization σ^X,Y=(n−1+n−1/2 (nμ^Y2)−1/2ν+(nμ^X2Y2)−1/2)−1. Each self-normalization converges jointly with the respective correlation statistics, and is bounded in probability under the assumptions of Theorem 5 given below. 1.3 Subsampling We now proceed to the statistical portion of the article, namely conducting inference for the process’ (and its powers’) acs. For a PGARCH(p,q) process, the acs and cross-correlations with the powers are zero, while the acs of the powers decay exponentially. Our testing paradigm is as follows: we assume as null hypothesis that the process is PGARCH(p, q) and check whether the subsampling-based confidence intervals capture zero (for the process acs and for the cross-correlations with the powers) or decay exponentially fast (in case of the powers). In both cases, we replace the value of the population parameter in the root by its full-sample estimate, although in the former case we have the choice of substituting it with zero. So if zero is not contained in a 1 – α level confidence interval for the acs or cross-cs at some lag, then we can reject the GARCH hypothesis with Type I error rate α. We begin with the definitions, and then consider consistency of the estimators. To construct confidence intervals for ρX,X(k) (although for a PGARCH process this parameter is always zero) we need to approximate the sampling distribution of TX,X(k)=n (ρ^X,X(k)−ρX,X(k)σ^X,X), that is, LX,X,k(x)=ℙ[TX,X(k)≤x] ⁠. Let L∞,X,X,k denote the cumulative distribution function (cdf) of the corresponding limiting random variable—which by Equation (17) below depends on κ. The use of this asymptotic distribution is impractical, as L∞,X,X,k depends on the unknown parameter and there is no known analytic formula for it. Hence, we propose to approximate LX,X,k (and L∞,X,X,k) nonparametrically via subsampling (Politis, Romano, and Wolf, 1999). According to this procedure we divide the sample into overlapping blocks of size b (⁠ b→∞, b/n→0 ⁠), containing Xt,Xt+1,…,Xt+b−1 for t=1,2,…,n−b+1 ⁠, and calculate the self-normalized statistic upon each block, treating each block as if it were a full sample. Moreover, the parameter ρX,X(k) is replaced by its large-sample estimate ρ^X,X(k) ⁠. This produces n – b + 1 subsampling statistics TX,X,t(k)=b (ρ^X,X,t(k)−ρ^X,X(k)σ^X,X,t), where ρ^X,X,t(k)=γ^X,X,t(k)/γ^X,X,t(0) with γ^X,X,t(k)=1b∑ℓ=tt+b−1−k(Xℓ−X¯t)(Xℓ+k−X¯t) and X¯t=∑ℓ=tt+b−1−kXℓ/b ⁠. Note that we could also replace ρ^X,X(k) by zero, if we wish to utilize the “null hypothesis” that the process is PGARCH, but we elect instead to utilize a large-sample estimate, which makes the confidence interval construction more intellectually consistent with the case of the powers’ acs. The normalization σ^X,X,t is given by σ^X,X,t=(b−1+[∑ℓ=tt+b−1−kXℓ4]−1/2)−1 and the sampling distribution LX,X,k(x) is approximated by L^X,X,k(x)=1n−b+1∑t=1n−b+11{TX,X,t(k)≤x}. If by cX,X,k(1−p)=inf⁡{x: L^X,X,k(x)≥1−p} we denote its lower 1 – p quantile, we obtain for ρX,X(k) a (1−p) equal-tailed subsampling confidence interval CIet;1−p(ρX,X(k)) ⁠, defined as [ρ^X,X(k)−σ^X,Xn cX,X,k(1−p/2),ρ^X,X(k)−σ^X,Xn cX,X,k(p/2)]. (4) Introducing L^X,X,k,|·|(x)=∑t=1n−b+11{|TX,X,t(k)|≤x}/(n−b+1) ⁠, another related distribution, and its quantile cX,X,k,|·|(1−p)=inf⁡{x: L^X,X,k,|·|(x)≥1−p} offers an (1 – p) symmetric subsampling confidence interval for ρX,X(k) ⁠, CIs;1−p(ρX,X(k))=[ρ^X,X(k)∓σ^X,Xn cX,X,k,|·|(1−p)]. (5) For the powers {Yt} ⁠, we have the following, analogous definitions, starting with the statistic and its subsampling version TY,Y(k)=n (ρ^Y,Y(k)−ρY,Y(k)σ^Y,Y), TY,Y,t(k)=b (ρ^Y,Y,t(k)−ρ^Y,Y(k)σ^Y,Y,t), where ρ^Y,Y,t(k)=γ^Y,Y,t(k)/γ^Y,Y,t(0) with γ^Y,Y,t(k)=1b∑ℓ=tt+b−1−k (Yℓ−Y¯t)(Yℓ+h−Y¯t) and Y¯t=∑ℓ=tt+b−1−kYℓ/b ⁠. Unlike the previous case of the regular acs, we do not presume that ρY,Y(k) equals zero, so we must estimate it instead. The normalization σ^Y,Y,t is given by σ^Y,Y,t=(b−1+[∑ℓ=tt+b−1−kYℓ4]−1/2)−1. The sampling distribution LY,Y,k(x)=ℙ[TY,Y(k)≤x] is approximated by L^Y,Y,k(x)=1n−b+1∑t=1n−b+11{TY,Y,t(k)≤x}, and is an estimator of L∞,Y,Y,k(x) ⁠, the cdf of the limit variable given in (18). For ρY,Y(k) a (1−p) equal-tailed subsampling confidence interval CIet;1−p(ρY,Y(k)) is given by [ρ^Y,Y(k)−σ^Y,YncY,Y,k(1−p/2),ρ^Y,Y(k)−σ^Y,YncY,Y,k(p/2)], (6) where cY,Y,k(1−p)=inf⁡{x: L^Y,Y,k(x)≥1−p} ⁠. An (1−p) symmetric subsampling confidence interval for ρY,Y(k) is then CIs;1−p(ρY,Y(k))=[ρ^Y,Y(k)∓σ^Y,Yn cY,Y,k,|·|(1−p)], (7) with cY,Y,k,|·|(1−p)=inf⁡{x: L^Y,Y,k,|·|(x)≥1−p} and L^Y,Y,k,|·|(x)=∑t=1n−b+11{|TY,Y,t(k)|≤x}/(n−b+1) ⁠. For the cross-correlations between the data process and its powers, the normalized difference and its subsampling counterpart are TX,Y(k)=n (ρ^X,Y(k)−ρX,Y(k)σ^X,Y), TX,Y,t(k)=b (ρ^X,Y,t(k)−ρ^X,Y(k)σ^X,Y,t), where ρ^X,Y,t(k)=γ^X,Y,t(k)/γ^X,X,t(0) γ^Y,Y,t(0) ⁠. Again, we could replace ρX,Y(k) by zero under the PGARCH hypothesis, but in keeping with the above treatment we utilize the parameter’s large-sample estimate. The normalization σ^X,Y,t is given (in the case of ν≥1 ⁠) by σ^X,Y,t=(b−1+b−1/2[∑ℓ=tt+b−1−kYℓ2]−1/2ν+[∑ℓ=tt+b−1−k|Xℓ|2+2ν]−1/2)−1. The sampling distribution LX,Y,k(x)=ℙ[TX,Y(k)≤x] is approximated by L^X,Y,k(x)=1n−b+1∑t=1n−b+11{TX,Y,t(k)≤x}, and is an estimator of L∞,X,Y,k(x) ⁠, the cdf of the limit variable given in (19). For ρX,Y(k) a (1−p) equal-tailed subsampling confidence interval CIet;1−p(ρX,Y(k)) is defined as [ρ^X,Y(k)−σ^X,Yn cX,Y,k(1−p/2),ρ^X,Y(k)−σ^X,Yn cX,Y,k(p/2)], (8) where cX,Y,k(1−p)=inf⁡{x: L^X,Y,k(x)≥1−p} ⁠. An (1−p) symmetric subsampling confidence interval for ρX,Y(k) is then CIs;1−p(ρX,Y(k))=[ρ^X,Y(k)∓σ^X,Yn cX,Y,k,|·|(1−p)], (9) with cX,Y,k,|·|(1−p)=inf⁡{x: L^X,Y,k,|·|(x)≥1−p} and L^X,Y,k,|·|(x)=∑t=1n−b+11{|TX,Y,t(k)|≤x}/(n−b+1) ⁠. 2 Asymptotic Theory We expand the theoretical results of Davis and Mikosch (1998) and Mikosch and Stărică (2000) in two substantial directions: we derive asymptotic relations for process and volatility autocovariances and cross-covariances (where the volatility is defined to be {σt} ⁠) for the PGARCH process, and we derive cross-covariance results for a PGARCH process and its power. Our main objective is to obtain nondegenerate weak limits of all studentized roots, so that the subsampling method is applicable. The first subsection reviews some background concepts and notation (also see Appendix A in Supplementary Data), while the second subsection applies these results to certain statistics, providing our main theorems. The third subsection applies these theorems to produce studentized statistics, and the fourth subsection establishes the consistency of subsampling. 2.1 GARCH and PGARCH The GARCH(p,q) process {Xt} has been studied in Mikosch and Stărică (2000) and BDM. In addition to establishing results on autocovariances, BDM provides results on Stochastic Recurrence Equations (SREs), from which they derive regular variation properties of GARCH processes; by adapting their methods of proof, many of the results can be trivially extended to PGARCH processes. We first review BDM’s GARCH results. The vector process of dimension d=q+p−1 defined by W⃗t=[σt+1,σt,⋯,σt−q+2,Xt,Xt−1,⋯,Xt−p+2]′ (10) satisfies a SRE in the squares of its components, as described in Equation (3.1) of BDM. Let ɛx be the point measure concentrated at x ⁠, and let ⇒L denote convergence in distribution of point measures on ℝ¯d(m+1)∖{0} ⁠, where ℝ¯=ℝ∪{±∞} and d=q+p−1 ⁠. BDM shows that W⃗t(m)= vec(W⃗t,⋯,W⃗t+m) yields a point process convergence Nn=∑t=1nɛW⃗t(m)an−1⇒LN∞=∑i,j≥1ɛPiQ⃗ij (11) as n→∞ ⁠, where an is the rate of regular variation, being implicitly defined by n ℙ[|W⃗0(m)|>an]→1. (12) The {Pi}i≥1 are the points of a Poisson process defined on (0,∞) with intensity measure given in terms of κ and the extremal index. The Poisson process ∑i≥1ɛPi is independent of each iid point process ∑i≥1ɛQij (for each j≥1 ⁠). The points {Pi} and {Qij} correspond to the radial and spherical portions of the limiting points W⃗t(m)an−1 ⁠; see Corollary 2.4 of Davis and Mikosch (1998) for more detail. It is apparent that Q⃗ij has m + 1 blocks of d components, which we denote via Q⃗ij= vec{Qij(0),Qij(1),⋯,Qij(m)}, (13) and each Qij(ℓ) itself has d=q+p−1 components, where 0≤ℓ≤m ⁠. To refer to the kth component, with 1≤k≤d ⁠, of Qij(ℓ) ⁠, we write Qij(ℓ,k) ⁠. BDM then makes an application to establishing weak convergence of sample autocovariances of {Xt} ⁠, extending earlier results by Davis and Mikosch (1998) and Mikosch and Stărică (2000) on the GARCH(1,1). These latter articles also discuss results for the autocovariances of {|Xt|} and {Xt2} ⁠, which are objects of interest for financial applications; BDM claim that these results can be extended to the GARCH(p,q) case without proof. For the sample autocovariances of {Xt2} ⁠, (asymptotic) recursive relations can be derived that extend ideas in Davis and Mikosch (1998) and Mikosch and Stărică (2000) for the GARCH(1,1), but these relations seem hard to extend to the autocovariances of {|Xt|}—something rather special happens when p,q≤1 that prevents the method of proof to be easily generalized for p > 1 or q > 1. Essentially, these same results also hold for the PGARCH, which we prove below. Set X⃗t=[St+1,⋯,St−q+2,Yt,⋯,Yt−p+2]′ ⁠. Then {Yt} and {St} ⁠, defined via (3), satisfy the following SRE: X⃗t=At X⃗t−1+Bt, (14) with Bt=[α0,0,⋯,0]′ and At=[α1 |Zt|ν+β1β2⋯βq−1βqα2α3⋯αp10⋯0000⋯001⋯0000⋯0⋮⋮⋱⋮⋮⋮⋮⋱⋮00⋯1000⋯0|Zt|ν0⋯0000⋯000⋯0010⋯0⋮⋮⋱⋮⋮⋮⋮⋱⋮00⋯000⋯10]. The key assumptions needed to establish the existence of a stationary solution, and its regular variation properties, are given below. The Lyapunov exponent of the SRE is given in terms of the L1 matrix norm by γ=inf⁡{n−1 Elog⁡||A1⋯An||,n∈ℕ}, and sufficient conditions (in the GARCH case) for the exponent’s negativity are discussed in Remark 3.2 of BDM. Assumption A α0>0 and the Lyapunov exponent γ<0 ⁠. Assumption B Z0 has a positive density on ℝ such that E|Z0|h<∞ for all h<h0 and E|Z0|h0=∞ for some h0∈(0,∞] ⁠; not all the parameters αj and βk for 1≤j≤p and 1≤k≤q are zero. Assumption C The density of Z0 is positive in an interval containing zero. Assumption D {Ut=(Xt,σt)} is strong mixing with geometric rate. Theorem 1 For the SRE (14), assume Assumptions A and B. Then there exists a unique strictly stationary solution to the SRE, and there exists ρ such that for all x∈ℝd∖{0} the inner product ⟨x,X⃗1⟩ is regularly varying with index ρ. Furthermore, if ρ is not an even integer, then X⃗1 is regularly varying with index ρ. Remark 1 This theorem is proved by exactly following the proof of Theorem 3.1 of BDM, only substituting |Zt|ν for Zt2 and consolidating parts A and B; therefore, its proof is omitted. Because it is unknown whether the PGARCH process is strong mixing—though in part C of Theorem 3.1 of BDM it is stated that for a GARCH process Assumption C implies Assumption D—we omit discussion of this point in Theorem 1. Also see Fryzlewicz and Subba Rao (2011). Corollary 1 For the SRE (14), assume Assumptions A, B, and D. Then a stationary version of the process {Ut=(Xt,σt)} exists, and if the index ρ of Theorem 1 is not an even integer, then the finite-dimensional distributions of {Ut} are regularly varying with index κ=νρ. Remark 2 The proof of Corollary 1 follows almost verbatim from the proof of Corollary 3.5 of BDM, the only change being that κ=νρ and ν need not be 2, as in the GARCH case. Note we assume the strong mixing property, although in the GARCH case it suffices to assume the weaker Assumption C instead. Hence using Theorem 2.8 of Davis and Mikosch (1998), from Corollary 1 we can conclude that—under our working assumptions A, B, and D—with W⃗t given by (10) the convergence (11) holds, along with (12). Note that the points Pi and Q⃗ij pertain to the process {Ut} ⁠, not {Yt} and {St} ⁠. The regular variation is of index κ, such that κ/ν is not an even integer. 2.2 Sampling Behavior for PGARCH Processes Under Assumptions A, B, and D we can state our convergence results for the roots of interest. Each theorem below splits into three main cases: (i) κ is sufficiently low that the mean centering does not exist, and a stable limit is obtained; (ii) κ is a bit larger so that centerings are utilized, and a stable limit for the root is obtained; (iii) κ is large enough that a central limit theorem holds, so that the limit of the root is Gaussian. (The third theorem splits case (ii) into two sub-cases.) In cases (i) and (ii) the limit distributions are stated in terms of the points of (11), which are explained in more detail in Theorem 2.10 of BDM; also see Corollary 2.4 of Davis and Mikosch (1998). In case (iii) we only need to know the joint covariance of the limiting Gaussian variables. In the first Theorem below, case (ii) involves a limit variable that is described as follows (also see Proposition 3.3 of Davis and Mikosch, 1998). The regular variation of the PGARCH process (cf. Corollary 1) yields the vague convergence of n ℙ[an−1W⃗0(m)∈·] to a measure τ on ℝ¯d(m+1)∖{0} ⁠, the Lévy measure of a κ-stable random vector with κ∈(0,2) (and such that κ/ν is not an even integer). Then we have the definition Vk=lim⁡δ→0(∑i,j≥1Pi2Qij(0,q+1)Qij(k,q+1)1{Pi2|Qij(0,q+1)Qij(k,q+1)|≥δ}−∫Bδ,kx(0,q+1)x(k,q+1) ρ(dx)) (15) for Bδ,k={x∈ℝd(m+1):δ<|x(0,q+1)x(k,q+1)|} ⁠, where k≥0 ⁠. The superscript notation is explained in (13). The points Qij(0,q+1) and Qij(k,q+1) correspond to the (0,q+1) and (k,q+1) components of W⃗t(m) ⁠, that is, Xt and Xt+k ⁠. In addition, all the theorems contain the random variables Wr=∑i,j≥1|Pi|r|Qij(0,q+1)|r, for r > 0. This first result extends the work of BDM, which is concerned with the GARCH(p,q) process, in three ways: the process is a PGARCH (which nest the GARCH), we consider convergence jointly with sample moments, and we also derive the result for acs. Theorem 2 Let {Xt} be a PGARCH(p,q) process (2) satisfying Assumptions A, B, and D; also assume that Z0 is symmetric about zero. Then: (i): in the case κ∈(0,2): ({nan−2 γ^X,X(k)}k=0m,nan−4 μ^X4)⇒L({Vk}k=0m,W4)({ρ^X,X(k)}k=1m,nan−4 μ^X4)⇒L({VkV0−1}k=1m,W4),where Vk=∑i,j≥1Pi2Qij(0,q+1)Qij(k,q+1). (ii): in the case κ∈(2,4): ({nan−2 [γ^X,X(k)−γX,X(k)]}k=0m,nan−4μ^X4)⇒L({Vk}k=0m,W4)({nan−2 [ρ^X,X(k)−ρX,X(k)]}k=1m,nan−4μ^X4)⇒L({γX,X−1(0)[Vk−ρX,X(k)V0]}k=1m,W4),where Vk is defined by (15). (iii): in the case κ>4: ({n1/2 [γ^X,X(k)−γX,X(k)]}k=0m,μ^X4)⇒L({Gk}k=0m,EX4)({n1/2 [ρ^X,X(k)−ρX,X(k)]}k=1m,μ^X4)⇒L({γX,X−1(0)[Gk−ρX,X(k)G0]}k=1m,EX4),where the Gk satisfy Cov(Gj,Gk)=∑ℓ Cov(X0Xj,XℓXℓ+k) and are multivariate normal. Remark 3 In cases (ii) and (iii) of Theorem 2, γX,X(k)=0 if k > 0, and in particular ρX,X(k)=0 in the results for ρ^X,X(k) ⁠. Therefore, these results could be stated as nan−2ρ^X,X(k)⇒LγX,X−1(0)Vk and n ρ^X,X(k)⇒LγX,X−1(0)Gk for cases (ii) and (iii), respectively. Note that these limiting distributions are nondegenerate, which is important for our subsampling applications discussed in the next section. The next result is concerned with autocovariances and acs for the powered process {Yt} ⁠. The middle case (ii) involves a somewhat complicated limit distribution, which is described in Proposition 1 of Appendix A (Supplementary Data), which requires the additional Assumption M discussed therein. The result for the autocovariances is stated without proof in BDM (and for the GARCH(1,1) case, a full proof is given in Davis and Mikosch, 1998), and our proof here relies on the recursive relations of the previous subsection. We also describe results jointly with the sample moments, and provide the autocorrelation results. Theorem 3 Let {Xt} be a PGARCH(p,q) process (2) satisfying Assumptions A, B, and D; also assume that Z0 is symmetric about zero. Then: (i): in the case κ∈(0,2ν): ({nan−2ν γ^Y,Y(k)}k=0m,nan−4ν μ^Y4)⇒L({Vk}k=0m,W4ν)({ρ^Y,Y(k)}k=1m,nan−4ν μ^Y4)⇒L({VkV0−1}k=1m,W4ν),where Vk=∑i,j≥1|Pi|2ν|Qij(0,q+1)Qij(k,q+1)|ν. (ii): in the case κ∈(2ν,4ν), assume Assumption M as well: with Uk defined as in Proposition 1. (iii): in the case κ>4ν: ({n1/2 [γ^Y,Y(k)−γY,Y(k)]}k=0m,μ^Y4)⇒L({Gk}k=0m,E|X|4ν)({n1/2 [ρ^Y,Y(k)−ρY,Y(k)]}k=1m,μ^Y4)⇒L({γY,Y−1(0)[Gk−ρY,Y(k)G0]}k=1m,E|X|4ν),where the Gk satisfy Cov(Gj,Gk)=∑ℓ Cov(Y0Yj,YℓYℓ+k) and are multivariate normal. The final result of this section utilizes some of the results of Theorems 2 and 3, because the normalization for the cross-correlations involves both γX,X(0) and γY,Y(0) ⁠. It turns out that symmetry in the cross-covariance variables makes the results resemble those of Theorem 2, though the limits depend on whether ν is less than or greater than one. Also, when κ>1+ν ⁠, the limit involves the random variables Vk=lim⁡δ→0(∑i,j≥1Pi|Pi|νQij(0,q+1)|Qij(k,q+1)|ν1{|Pi|ν+1|Qij(0,q+1)||Qij(k,q+1)|ν≥δ}−∫Bδ,kx(0,q+1)|x(k,q+1)|ν ρ(dx)) (16) for Bδ,k={x∈ℝd(m+1):δ<|x(0,q+1)[x(k,q+1)]ν|} ⁠, where k≥0 ⁠. The points Qij(0,q+1) and [Qij(k,q+1)]ν correspond to the (0,q+1) component and the powered (k,q+1) component, respectively, of W⃗t(m) ⁠, that is, Xt and Yt+k ⁠. These types of statistics have not been mathematically studied in prior literature, to our knowledge. Theorem 4 Let {Xt} be a PGARCH(p,q) process (2) satisfying Assumptions A, B, and D; also assume that Z0 is symmetric about zero. If κ>1+ν, then γX,Y(k) exists and equals zero for all k. Supposing that ν∈(0,1), then: (i) in the case κ∈(0,2ν): ({nan−(1+ν) γ^X,Y(k)}k=0m,nan−2γ^X,X(0),nan−2ν γ^Y,Y(0),nan−(2+2ν) μ^X2Y2)⇒L({Vk}k=0m,W2,W2ν,W2+2ν)({ρ^X,Y(k)}k=1m,nan−(2+2ν) μ^X2Y2)⇒L({Vk/W2 W2ν}k=1m,W2+2ν),where Vk=∑i,j≥1Pi|Pi|νQij(0,q+1)|Qij(k,q+1)|ν. (ii) in the case κ∈(2ν,1+ν)∪(1+ν,2), assuming Assumption M: ({nan−(1+ν) γ^X,Y(k)}k=0m,nan−2γ^X,X(0),γ^Y,Y(0),nan−(2+2ν) μ^X2Y2)⇒L({Vk}k=0m,W2,γY,Y(0),W2+2ν)({n1/2an−νρ^X,Y(k)}k=1m,nan−(2+2ν) μ^X2Y2)⇒L({Vk/W2 γY,Y(0)}k=1m,W2+2ν),with Vk given by case (i) if κ∈(2ν,1+ν) and by (16) if κ∈(1+ν,2). (iii) in the case κ∈(2,2+2ν), assuming Assumption M: ({nan−(1+ν) γ^X,Y(k)}k=0m,γ^X,X(0),γ^Y,Y(0),nan−(2+2ν) μ^X2Y2)⇒L({Vk}k=0m,γX,X(0),γY,Y(0),W2+2ν)({nan−(1+ν)ρ^X,Y(k)}k=1m,nan−(2+2ν) μ^X2Y2)⇒L({Vk/γX,X(0) γY,Y(0)}k=1m,W2+2ν),with Vk given by (16). (iv) in the case κ>2+2ν, assuming Assumption M: ({n1/2 γ^X,Y(k)}k=0m,γ^X,X(0),γ^Y,Y(0),μ^X2Y2)⇒L({Gk}k=0m,γX,X(0),γY,Y(0),γX2,Y2(0))({n1/2ρ^X,Y(k)}k=1m,μ^X2Y2)⇒L({Gk/γX,X(0) γY,Y(0)}k=1m,E|X|2+2ν),where the Gk satisfy Cov(Gj,Gk)=∑ℓ Cov(X0Yj,XℓYℓ+k) and are multivariate normal. Supposing that ν≥1, then: (i) in the case κ∈(0,2): ({nan−(1+ν) γ^X,Y(k)}k=0m,nan−2γ^X,X(0),nan−2ν γ^Y,Y(0),nan−(2+2ν) μ^X2Y2)⇒L({Vk}k=0m,W2,W2ν,W2+2ν)({ρ^X,Y(k)}k=1m,nan−(2+2ν) μ^X2Y2)⇒L({Vk/W2 W2ν}k=1m,W2+2ν),where Vk=∑i,j≥1Pi|Pi|νQij(0,q+1)|Qij(k,q+1)|ν. (ii) in the case κ∈(2,1+ν)∪(1+ν,2ν), assuming Assumption M: ({nan−(1+ν) γ^X,Y(k)}k=0m,γ^X,X(0),nan−2ν γ^Y,Y(0),nan−(2+2ν) μ^X2Y2)⇒L({Vk}k=0m,γX,X(0),W2ν,W2+2ν)({n1/2an−1ρ^X,Y(k)}k=1m,nan−(2+2ν) μ^X2Y2)⇒L({Vk/γX,X(0) W2ν}k=1m,W2+2ν),with Vk given by case (i) if κ∈(2,1+ν) and by (16) if κ∈(1+ν,2ν). (iii) in the case κ∈(2ν,2+2ν), assuming Assumption M: ({nan−(1+ν) γ^X,Y(k)}k=0m,γ^X,X(0),γ^Y,Y(0),nan−(2+2ν) μ^X2Y2)⇒L({Vk}k=0m,γX,X(0),γY,Y(0),W2+2ν)({nan−(1+ν)ρ^X,Y(k)}k=1m,nan−(2+2ν) μ^X2Y2)⇒L({Vk/γX,X(0) γY,Y(0)}k=1m,W2+2ν),with Vk given by (16). (iv) in the case κ>2+2ν, assuming Assumption M: ({n1/2 γ^X,Y(k)}k=0m,γ^X,X(0),γ^Y,Y(0),μ^X2Y2)⇒L({Gk}k=0m,γX,X(0),γY,Y(0),γX2,Y2(0))({n1/2ρ^X,Y(k)}k=1m,μ^X2Y2)⇒L({Gk/γX,X(0) γY,Y(0)}k=1m,E|X|2+2ν),where the Gk satisfy Cov(Gj,Gk)=∑ℓ Cov(X0Yj,XℓYℓ+k) and are multivariate normal. Remark 4 The rates of convergence for the sample cross-correlation, and its root, vary considerably depending upon κ. To summarize (recall that an∼n1/κ up to a constant), when ν∈(0,1) the rate is n0 for κ<2ν ⁠; n1/2−ν/κ for κ∈(2ν,2) ⁠; n1−(1+ν)/κ for κ∈(2,2+2ν) ⁠; n1/2 for κ>2+2ν ⁠. Whereas for ν≥1 ⁠, the rates are n0 for κ<2 ⁠; n1/2−1/κ for κ∈(2,2ν) ⁠; n1−(1+ν)/κ for κ∈(2ν,2+2ν) ⁠; n1/2 for κ>2+2ν ⁠. See Figure 1 for a depiction of this rate in the case of ν∈{0.5,2} ⁠; the odd behavior is due to the unusual normalization involving both γ^X,X(0) and γ^Y,Y(0) ⁠. Figure 1. View largeDownload slide δ(κ) versus κ, where δ(κ) determines the order in probability of the sample cross-correlation, nδ(κ)ρ^X,Y(k)=OP(1) ⁠. Figure 1. View largeDownload slide δ(κ) versus κ, where δ(κ) determines the order in probability of the sample cross-correlation, nδ(κ)ρ^X,Y(k)=OP(1) ⁠. 2.3 Self-normalization for Correlation Statistics We next provide the asymptotic results for self-normalized roots. Theorem 5 Let {Xt} be a PGARCH(p,q) process (2) satisfying Assumptions A, B, and D; also assume that Z0 is symmetric about zero. (i) With quantities defined in Theorem 2 according to the value of κ, n{ρ^X,X(k)σ^X,X}k=1m⇒L{{VkV0}k=1m  if κ∈(0,2){VkγX,X(0)W4}k=1m  if κ∈(2,4){GkγX,X(0)EX4}k=1m  if κ>4. (17) (ii) With quantities defined in Theorem 3 according to the value of κ, and assuming Assumption M when κ>2ν, n{ρ^Y,Y(k)−ρY,Y(k)σ^Y,Y}k=1m⇒L{{VkV0}k=1m  if κ∈(0,2ν){Uk−ρY,Y(k)U0γY,Y(0)W4ν}k=1m  if κ∈(2ν,4ν){Gk−ρY,Y(k)G0γY,Y(0)E[|X|4ν]}k=1m  if κ>4ν. (18) (iii) If ν∈(0,1), with quantities defined in Theorem 4 according to the value of κ n{ρ^X,Y(k)σ^X,Y}k=1m⇒L{{VkW2 W2ν}k=1m  if κ∈(0,2ν){VkγY,Y(0)W21+ν}k=1m  if κ∈(2ν,2){VkγX,X(0) γY,Y(0) W2+2ν}k=1m  if κ∈(2,2+2ν){GkγX,X(0) γY,Y(0) E|X|2+2ν}k=1m  if κ>2+2ν. (19) If ν≥1, with quantities defined in Theorem 4 according to the value of κ n{ρ^X,Y(k)σ^X,Y}k=1m⇒L{{VkW2 W2ν}k=1m  if κ∈(0,2){VkγX,X(0)W2ν1+1/ν}k=1m  if κ∈(2,2ν){VkγX,X(0) γY,Y(0) W2+2ν}k=1m  if κ∈(2ν,2+2ν){GkγX,X(0) γY,Y(0) E|X|2+2ν}k=1m  if κ>2+2ν. In each case, the studentized correlations have a nondegenerate limit distribution, and the rate of convergence does not depend on κ. Although the limit distributions are not pivotal, they can be approximated via subsampling. Remark 5 Although our normalization by the sample moment matches the growth rate of its corresponding root, it is not scale-invariant. Hence, to obtain a scale-invariant statistic, we propose to divide each sample kth moment by the exponential of the log moment μ^log⁡(|X|k)=n−1∑t=1nlog⁡(|Xt|k). This quantity converges to Elog⁡(|X|k) no matter the value of κ, and it also scales such that μ^log⁡(|aX|k)=log⁡(|a|k)+μ^log⁡(|X|k) ⁠. Since μ^log⁡(|X|k) converges in probability to its expectation, we can use its exponential to correct for scale; exp⁡μ^log⁡(|aX|k)=|a|k exp⁡μ^log⁡(|X|k) ⁠. For notational transparency the estimators of Subsection 1.3 are expressed in terms of the original normalizations, but the simulations in Appendix C (Supplementary Data) contain results with the scale-invariant normalizations introduced in Remark 5. 2.4 Consistency of the Subsampling Estimators To establish the consistency of the various subsampling estimators introduced in Section 1, we employ the mixing properties of the corresponding processes and Theorem 11.3.1 of Politis, Romano, and Wolf (1999) combined with the convergence results derived in Section 2. By consistency, we mean that the subsampling distribution estimators converge in probability to the respective cdf of the limiting distributions. Corollary 2 Let {Xt} be a PGARCH(p,q) process (2) satisfying Assumptions A, B, and D; also assume that Z0 is symmetric about zero. Assume that b/n+1/b→0 as n→∞. (i) With quantities defined in Theorem 2 according to the value of κ L^X,X,k(x)→PL∞,X,X,k(x) for each x that is a continuity point of the limit distribution of (17). If this distribution is continuous, the convergence is also uniform. Moreover, the asymptotic coverage of the intervals (4) and (5) is the nominal level 1−p. (ii) With quantities defined in Theorem 3 according to the value of κ, and assuming Assumption M when κ>2ν, L^Y,Y,k(x)→PL∞,Y,Y,k(x) for each x that is a continuity point of the limit distribution of (18). If this distribution is continuous, the convergence is also uniform. Moreover, the asymptotic coverage of the intervals (6) and (7) is the nominal level 1−p. (iii) With quantities defined in Theorem 4 according to the value of κ L^X,Y,k(x)→PL∞,X,Y,k(x) for each x that is a continuity point of the limit distribution of (19). If this distribution is continuous, the convergence is also uniform. Moreover, the asymptotic coverage of the intervals (8) and (9) is the nominal level 1−p. 3 Application to Returns on Stock Indices We consider two datasets analyzed in Section 5.5 of Francq and Zakoian (2010). One corresponds to the daily log returns of CAC 40 stock index from 02/03/1990 to 29/12/2006 (4244 observations) and other to those of FTSE 100 index from 04/04/1984 to 03/04/2007 (5811 observations). According to Francq and Zakoian (2010), a GARCH(1,1) model is a reasonable guess for the first series, but higher order GARCH or ARCH is more likely to fit the second series. In Figure 2, we show Hill plots of the returns. For both series, the assumption about the finiteness of the second moment does not seem to be too unreasonable; however, the assumption regarding the finiteness of the fourth moment might be questionable. Figure 2. View largeDownload slide Hill plot of CAC 40 and FTSE 100 returns. Figure 2. View largeDownload slide Hill plot of CAC 40 and FTSE 100 returns. The 95% confidence intervals for the acf of the returns are plotted in Figure 3. PLUGIN intervals were calculated by fitting a GARCH(1,1) model with normal innovations (R routine garchFit of package fGarch) to the returns and plugging in the estimated α1 and β1 values into the exact normal asymptotic confidence bounds (under normal errors) formula. The estimated values for CAC 40 returns were α^1=0.076907 and β^1=0.907047 (⁠ α^0=0.000003 ⁠) while those for FTSE 100 returns were α^1=0.089941 and β^1=0.891497 (⁠ α^0=0.000002 ⁠). The remaining confidence intervals are those employed in the simulation study of Appendix C (Supplementary Data). Figure 3. View largeDownload slide 95% confidence intervals for the acs of returns of CAC 40 and FTSE 100 indices based on several methods. Figure 3. View largeDownload slide 95% confidence intervals for the acs of returns of CAC 40 and FTSE 100 indices based on several methods. For CAC 40 returns, the confidence intervals for ρX,X(k) based on the subsampling method are much wider than the other two types and oscillate around ±0.4. For both series, and all interval types, the acs seem to be not significantly different from zero. The 95% confidence intervals for ρX2,X2(k) are depicted in Figure 4 (these intervals have to be treated with caution if we are unsure about the finiteness of the fourth moment, because in such case ρX2,X2(k) is undefined). As pointed out in Appendix C (Supplementary Data), the symmetric confidence intervals are too wide for the acs of the squares, hence we focus our attention on the equal-tailed intervals. For CAC 40 and FTSE 100 squared returns, the width of the confidence intervals for the acs is about 0.25. Bounds for CAC 40 indicate slower decay of ρX2,X2(k) compared to bounds for FTSE 100. Figure 4. View largeDownload slide 95% subsampling confidence intervals for the acs of squared returns of CAC 40 and FTSE 100 indices. Figure 4. View largeDownload slide 95% subsampling confidence intervals for the acs of squared returns of CAC 40 and FTSE 100 indices. The 95% confidence intervals for ρX,X2(k) are shown in Figure 5. The two subsampling methods yield similar bounds, which are slightly asymmetric. Both types of intervals indicate that the process’ cross-correlations are non-significant. Figure 5. View largeDownload slide 95% subsampling confidence intervals for the cross-correlations of the returns and squared returns of CAC 40 and FTSE 100 indices. Figure 5. View largeDownload slide 95% subsampling confidence intervals for the cross-correlations of the returns and squared returns of CAC 40 and FTSE 100 indices. 4 Summary In this article, we study the asymptotic properties of sample autocovariances and acs of PGARCH processes. Autocorrelations of a time series and its powers have been used in the literature to determine a nonlinear process’ serial structure, but the inference for PGARCH is complicated because the rate of convergence of sample autocovariance and autocorrelation estimators depends upon the tail index. This tail index is typically unknown, unless it has been previously estimated. However, through an appropriate studentization of autocovariance/autocorrelation roots (i.e., the estimator minus its estimand) it is possible to avoid the necessity of knowing the tail index; this strategy was adopted for heavy-tailed moving average time series in Davis and Resnick (1985) and McElroy and Politis (2002). The latter paper approximated the limiting quantiles of the studentized root by the subsampling methodology. Here we study the related inference problem for PGARCH processes: we derive the limiting distributions, provide effective studentizations, and examine subsampling methods for estimating the limiting quantiles. To obtain nondegenerate weak limits of all studentized roots, we derive the recursive relationships for the PGARCH and thus substantially extend existing theoretical results. A simulation study, which can be found in Appendix C (Supplementary Data), indicates that the subsampling confidence intervals for the acs of GARCH processes with a finite fourth moment are generally wider than the asymptotic confidence intervals (approximate or exact). The subsampling approach can still be employed when no asymptotic formula is available (providing intervals for the acs when the fourth moment is infinite, and for the acs of squares and ccs of the process with its squares). Nevertheless, the subsampling-based empirical coverage probabilities tend to be higher than the nominal level. Equal-tailed subsampling confidence intervals are preferable over the symmetric ones. Footnotes * The authors are grateful for helpful comments from the Associate Editor and anonymous referees. This work was partially supported by the Spanish Ministry of Economy and Competitiveness [grant number ECO2012-38442 to AJ]. This report is released to inform interested parties of research and to encourage discussion. The views expressed on statistical issues are those of the authors and not necessarily those of the U.S. Census Bureau. References Baek C. , Pipiras V. , Wendt H. and Abry P. . 2009 . Second Order Properties of Distribution Tails and Estimation of Tail Exponents in Random Difference Equations . Extremes 12 : 361 – 400 . Google Scholar Crossref Search ADS WorldCat Basrak B. , Davis R. and Mikosch T. . 2002 . Regular Variation of GARCH Processes . Stochastic Processes and their Applications 99 : 95 – 115 . Google Scholar Crossref Search ADS WorldCat Bollerslev T. 1986 . Generalized Autoregressive Conditional Heteroskedasticity . Journal of Econometrics 31 : 307 – 327 . Google Scholar Crossref Search ADS WorldCat Bougerol P. and Picard N. . 1992 . Stationarity of GARCH Processes and of Some Nonnegative Time Series . Journal of Econometrics 52 : 115 – 127 . Google Scholar Crossref Search ADS WorldCat Brockwell P. J. and Davis R. A. . 1991 . Time Series: Theory and Methods . New York : Springer . Google Preview WorldCat Carrasco M. and Chen X. . 2002 . Mixing and Moment Properties of Various GARCH and Stochastic Volatility Models . Econometric Theory 18 : 17 – 39 . Google Scholar Crossref Search ADS WorldCat Davis R. and Mikosch T. . 1998 . The Sample Autocorrelations of Heavy-Tailed Processes with Applications to ARCH . The Annals of Statistics 26 : 2049 – 2080 . Google Scholar Crossref Search ADS WorldCat Davis R. A. and Resnick S. I. . 1985 . Limit Theory for Moving Averages of Random Variables with Regularly Varying Tail Probabilities . The Annals of Probability 13 ( 1 ): 179 – 195 . Google Scholar Crossref Search ADS WorldCat Francq C. and Zakoian J.-M. . 2010 . GARCH Models. Structure, Statistical Inference and Finanial Applications . UK : Wiley . Google Preview WorldCat Fryzlewicz P. and Subba Rao S. . 2011 . Mixing Properties of ARCH and Time-Varying ARCH Processes . Bernoulli 1 : 320 – 346 . Google Scholar Crossref Search ADS WorldCat Hall P. and Yao Q. . 2003 . Inference in ARCH and GARCH Models . Econometrica 71 : 285 – 317 . Google Scholar Crossref Search ADS WorldCat Huang D. , Wang H. and Yao Q. . 2008 . Estimating GARCH Models: When to Use What? Econometrics Journal 11 : 27 – 38 . Google Scholar Crossref Search ADS WorldCat Jach A. , McElroy T. and Politis D. . 2012 . Subsampling Inference for the Mean of Heavy-Tailed Long-Memory Time Series . Journal of Time Series Analysis 33 : 96 – 111 . Google Scholar Crossref Search ADS WorldCat Kokoszka P. and Politis D. . 2011 . Nonlinearity of ARCH and Stochastic Volatility Models and Bartlett’s Formula . Probability and Mathematical Statistics 31 : 47 – 59 . WorldCat Kokoszka P. , Teyssière G. and Zhang A. . 2004 . “Confidence Intervals for the Autocorrelations of the Squares of GARCH Sequences” . In Computational Science - ICCS 2004, Volume 3039 of Lecture Notes in Computer Science . Springer , pp. 837 – 844 . Google Preview WorldCat Lindner A. 2009 . Stationarity, Mixing, Distributional Properties and Moments of GARCH(p, q)-Processes . In Andersen J.-P. K. T. , Davis R. and Mikosch T. (eds.), Handbook of Financial Time Series . Berlin, Germany : Springer , pp. 481 – 496 . Google Preview WorldCat McElroy T. and Politis D. N. . 2002 . Robust Inference for the Mean in the Presence of Serial Correlation and Heavy-Tailed Distributions . Econometric Theory 18 : 1019 – 1039 . Google Scholar Crossref Search ADS WorldCat McElroy T. and Politis D. N. . 2007 . Self-Normalization for Heavy-Tailed Time Series with Long Memory . Statistica Sinica 17 ( 1 ): 199 – 220 . WorldCat Mikosch T. and Stărică C. . 2000 . Limit Theory for the Sample Autocorrelation and Extremes of a GARCH(1, 1) Process . The Annals of Statistics 28 : 1427 – 1451 . Google Scholar Crossref Search ADS WorldCat Mittnik S. , Paolella M. and Rachev S. . 2002 . Stationarity of Stable Power-GARCH Processes . Journal of Econometrics 106 : 97 – 107 . Google Scholar Crossref Search ADS WorldCat Peng L. and Yao Q. . 2003 . Least Absolute Deviations Estimation for ARCH and GARCH Models . Biometrika 90 : 967 – 975 . Google Scholar Crossref Search ADS WorldCat Politis D. N. , Romano J. P. and Wolf M. . 1999 . Subsampling . New York : Springer . Google Preview WorldCat Tully E. and Lusey B. . 2007 . A Power GARCH Examination of the Gold Market . Research in International Business and Finance 21 : 316 – 325 . Google Scholar Crossref Search ADS WorldCat Wagner N. and Marsh T. . 2004 . Measuring Tail Thickness under GARCH and an Application to Extreme Exchange Rate Changes . Journal of Empirical Finance 12 : 165 – 185 . Google Scholar Crossref Search ADS WorldCat © The Author, 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)

### Journal

Journal of Financial EconometricsOxford University Press

Published: Jun 1, 2019