Multiplicative Seasonal ARIMA Models - The moving average model of order q, or MA(q) model, is

ARIMA Models

Deﬁnition 3.3 The moving average model of order q, or MA(q) model, is deﬁned to be

3.9 Multiplicative Seasonal ARIMA Models

Example 3.39 Model Choice for the U.S. GNP Series

Returning to the analysis of the U.S. GNP data presented in Exam-ples 3.35 and 3.36, recall that two models, an AR(1) and an MA(2), ﬁt the GNP growth rate well. To choose the ﬁnal model, we compare the AIC, the AICc, and the SIC for both models.

Below are the R commands for the comparison. The eﬀective sample size in this example is 222. We note that R returns AIC⁸ as part of the ARIMA ﬁt.

> # AIC

> gnpgr.ma$aic

[1] -1431.929 # MA(2)

> gnpgr.ar$aic

[1] -1431.221 # AR(1)

> # AICc - see Section 2.2

> log(gnpgr.ma$sigma2)+(222+2)/(222-2-2) [1] -8.297199 # MA(2)

> log(gnpgr.ar$sigma2)+(222+1)/(222-1-2) [1] -8.294156 # AR(1)

> # SIC or BIC - see Section 2.2

> log(gnpgr.ma$sigma2)+(2*log(222)/222) [1] -9.276049 # MA(2)

> log(gnpgr.ar$sigma2)+(1*log(222)/222) [1] -9.288084 # AR(1)

The AIC and AICc both prefer the MA(2) ﬁt, whereas the SIC (or BIC) prefers the simpler AR(1) model. It is often the case that the SIC will select a model of smaller order than the AIC or AICc. It would not be unreasonable in this case to retain the AR(1) because pure autoregressive models are easier to work with.

3.9: Seasonal ARIMA 155 of the strong connections of all activity to the calendar year. Data taken quarterly will exhibit the yearly repetitive period ats= 4 quarters. Natural phenomena such as temperature also have strong components corresponding to seasons. Hence, the natural variability of many physical, biological, and economic processes tends to match with seasonal ﬂuctuations. Because of this, it is appropriate to introduce autoregressive and moving average polynomials that identify with the seasonal lags. The resulting pure seasonal autoregressive moving average model, say, ARMA(P, Q)_s, then takes the form

Φ_P(B^s)x_t= Θ_Q(B^s)w_t, (3.139) with the following deﬁnition.

Deﬁnition 3.12 The operators

Φ_P(B^s) = 1−Φ₁B^s−Φ₂B²^s− · · · −Φ_PB^{P s} (3.140) and

ΘQ(B^s) = 1 + Θ₁B^s+ Θ₂B²^s+· · ·+ ΘQB^Qs (3.141) are the seasonal autoregressive operatorand the seasonal moving av-erage operatorof ordersP andQ, respectively, with seasonal periods.

Analogous to the properties of nonseasonal ARMA models, the pure sea-sonal ARMA(P, Q)_s is causal only when the roots of Φ_P(z^s) lie outside the unit circle, and it is invertible only when the roots of Θ_Q(z^s) lie outside the unit circle.

Example 3.40 A Seasonal ARMA Series

A ﬁrst-order seasonal autoregressive moving average series that might run over months could be written as

(1−ΦB¹²)x_t= (1 + ΘB¹²)w_t or

x_t= Φx_t₋₁₂+w_t+ Θw_t₋₁₂.

This model exhibits the series x_t in terms of past lags at the multi-ple of the yearly seasonal period s = 12 months. It is clear from the above form that estimation and forecasting for such a process involves only straightforward modiﬁcations of the unit lag case already treated.

In particular, the causal condition requires |Φ| < 1, and the invertible condition requires|Θ|<1.

For the ﬁrst-order seasonal (s = 12) MA model, x_t = w_t+ Θw_t₋₁₂, it is easy to verify that

γ(0) = (1 + Θ²)σ²

Table 3.2Behavior of the ACF and PACF for Causal and Invertible Pure Seasonal ARMA Models

AR(P)_s MA(Q)_s ARMA(P, Q)_s

ACF* Tails off at lagsks, Cuts off after Tails off at k= 1,2, . . . , lagQs lagsks PACF* Cuts off after Tails off at lagsks Tails off at

lag P s k= 1,2, . . . , lagsks

*The values at nonseasonal lags h=ks, fork= 1,2, . . ., are zero.

γ(±12) = Θσ²

γ(h) = 0, otherwise.

Thus, the only nonzero correlation, aside from lag zero, is ρ(±12) = Θ/(1 + Θ²).

For the ﬁrst-order seasonal (s= 12) AR model, using the techniques of the nonseasonal AR(1), we have

γ(0) = σ²/(1−Φ²)

γ(±12k) = σ²Φ^k/(1−Φ²) k= 1,2, . . . γ(h) = 0, otherwise.

In this case, the only non-zero correlations are

ρ(±12k) = Φ^k, k= 0,1,2, . . . .

These results can be veriﬁed using the general result thatγ(h) = Φγ(h−12), forh ≥1. For example, when h= 1, γ(1) = Φγ(11), but when h= 11, we haveγ(11) = Φγ(1), which implies thatγ(1) =γ(11) = 0. In addition to these results, the PACF have the analogous extensions from nonseasonal to seasonal models.

As an initial diagnostic criterion, we can use the properties for the pure seasonal autoregressive and moving average series listed in Table 3.2. These properties may be considered as generalizations of the properties for nonsea-sonal models that were presented in Table 3.1.

In general, we can combine the seasonal and nonseasonal operators into a multiplicative seasonal autoregressive moving average model, denoted by ARMA(p, q)×(P, Q)_s, and write

ΦP(B^s)φ(B)xt= ΘQ(B^s)θ(B)wt (3.142)

3.9: Seasonal ARIMA 157 as the overall model. Although the diagnostic properties in Table 3.2 are not strictly true for the overall mixed model, the behavior of the ACF and PACF tends to show rough patterns of the indicated form. In fact, for mixed models, we tend to see a mixture of the facts listed in Tables 3.1 and 3.2.

In ﬁtting such models, focusing on the seasonal autoregressive and moving average components ﬁrst generally leads to more satisfactory results.

Example 3.41 A Mixed Seasonal Model Consider an ARMA(0,1)×(1,0)₁₂model

x_t= Φx_t₋₁₂+w_t+θw_t₋₁,

where |Φ| < 1 and |θ| < 1. Then, because x_t₋₁₂, w_t, and w_t₋₁ are uncorrelated, andxtis stationary,γ(0) = Φ²γ(0) +σ²_w+θ²σ_w²,or

γ(0) = 1 +θ² 1−Φ² σ_w².

In addition, multiplying the model byxt−h,h >0, and taking expecta-tions, we haveγ(1) = Φγ(11) +θσ_w², and γ(h) = Φγ(h−12), forh≥2.

Thus, the ACF for this model is

ρ(12h) = Φ^h h= 1,2, . . . ρ(12h−1) = ρ(12h+ 1) = θ

1 +θ²Φ^h h= 0,1,2, . . . , ρ(h) = 0, otherwise.

The ACF and PACF for this model, with Φ =.8 andθ=−.5, are shown in Figure 3.21. These type of correlation relationships, although idealized here, are typically seen with seasonal data.

To reproduce Figure 3.21 in R, use the following commands:

> phi = c(rep(0,11),.8)

> acf = ARMAacf(ar=phi, ma=-.5, 50)

> pacf = ARMAacf(ar=phi, ma=-.5, 50, pacf=T)

> par(mfrow=c(1,2))

> plot(acf, type="h", xlab="lag")

> abline(h=0)

> plot(pacf, type="h", xlab="lag")

> abline(h=0)

Seasonal nonstationarity can occur, for example, when the process is nearly periodic in the season. For example, with average monthly temperatures over the years, each January would be approximately the same, each February would be approximately the same, and so on. In this case, we might think of average monthly temperaturex_tas being modeled as

xt=St+wt,

Figure 3.21 ACF and PACF of the mixed seasonal ARMA model x_t = .8x_t₋₁₂+w_t−.5w_t₋₁.

whereS_tis a seasonal component that varies slowly from one year to the next, according to a random walk,

St=St−12+vt.

In this model,w_tandv_tare uncorrelated white noise processes. The tendency of data to follow this type of model will be exhibited in a sample ACF that is large and decays very slowly at lagsh= 12k, fork= 1,2, . . . .If we subtract the eﬀect of successive years from each other, we ﬁnd that

(1−B¹²)x_t=x_t−x_t₋₁₂=v_t+w_t−w_t₋₁₂.

This model is a stationary MA(1)₁₂, and its ACF will have a peak only at lag 12. In general, seasonal diﬀerencing can be indicated when the ACF decays slowly at multiples of some season s, but is negligible between the periods.

Then, a seasonal diﬀerence of order D is deﬁned as

∇^Dsx_t= (1−B^s)^Dx_t, (3.143) whereD= 1,2, . . .takes integer values. Typically,D= 1 is suﬃcient to obtain seasonal stationarity.

Incorporating these ideas into a general model leads to the following deﬁ-nition.

3.9: Seasonal ARIMA 159 Deﬁnition 3.13 The multiplicative seasonal autoregressive integrated moving average model, or SARIMA model, of Box and Jenkins (1970) is given by

Φ_P(B^s)φ(B)∇^Ds∇^dx_t=α+ Θ_Q(B^s)θ(B)w_t, (3.144) where w_t is the usual Gaussian white noise process. The general model is denoted as ARIMA(p, d, q)(p, d, q)(p, d, q)×××(P, D, Q)(P, D, Q)(P, D, Q)_s_s_s. The ordinary autoregressive and moving average components are represented by polynomialsφ(B)andθ(B)of orderspandq, respectively [see (3.5) and (3.17)], and the seasonal autoregres-sive and moving average components byΦ_P(B^s)andΘ_Q(B^s)[see (3.140) and (3.141)] of ordersP and Qand ordinary and seasonal diﬀerence components by∇^d = (1−B)^d and∇^Ds = (1−B^s)^D.

Example 3.42 A SARIMA Model

Consider the following model, which often provides a reasonable repre-sentation for seasonal, nonstationary, economic time series. We exhibit the equations for the model, denoted by ARIMA(0,1,1)×(0,1,1)₁₂ in the notation given above, where the seasonal ﬂuctuations occur every 12 months. Then, the model (3.144) becomes

(1−B¹²)(1−B)x_t= (1 + ΘB¹²)(1 +θB)w_t. (3.145) Expanding both sides of (3.145) leads to the representation

(1−B−B¹²+B¹³)x_t= (1 +θB+ ΘB¹²+ ΘθB¹³)w_t, or in diﬀerence equation form

xt=xt−1+xt−12−xt−13+wt+θwt−1+ Θwt−12+ Θθwt−13.

Selecting the appropriate model for a given set of data from all of those represented by the general form (3.144) is a daunting task, and we usually think first in terms of finding difference operators that produce a roughly stationary series and then in terms of finding a set of simple autoregressive moving average or multiplicative seasonal ARMA to fit the resulting residual series. Differencing operations are applied first, and then the residuals are constructed from a series of reduced length. Next, the ACF and the PACF of these residuals are evaluated. Peaks that appear in these functions can often be eliminated by fitting an autoregressive or moving average component in accordance with the general properties of Tables 3.1 and 3.2. In considering whether the model is satisfactory, the diagnostic techniques discussed in§3.8 still apply.

1950 1955 1960 1965 1970 1975 40

60 80 100 120 140 160

Production

1950 1955 1960 1965 1970 1975

0 200 400 600 800 1000

Unemployment

Figure 3.22Values of the Monthly Federal Reserve Board Production Index and Unemployment (1948-1978,n= 372 months).

Example 3.43 Analysis of the Federal Reserve Board Production Index.

A problem of great interest in economics involves ﬁrst identifying a model within the Box–Jenkins class for a given time series and then producing forecasts based on the model. For example, we might consider applying this methodology to the Federal Reserve Board Production Index shown in Figure 3.22. The ACFs and PACFs for this series are shown in Fig-ure 3.23, and we note the slow decay in the ACF and the peak at lag h= 1 in the PACF, indicating nonstationary behavior.

Following the recommended procedure, a first difference was taken, and the ACF and PACF of the first difference

∇xt=xt−xt−1

are shown in Figure 3.24. Noting the peaks at 12,24,36,and 48 with rel-atively slow decay suggested a seasonal difference and Figure 3.25 shows the seasonal difference of the differenced production, say,

∇12∇x_t= (1−B¹²)(1−B)x_t.

Characteristics of the ACF and PACF of this series tend to show a strong peak ath= 12 in the autocorrelation function, with smaller peaks ap-pearing at h= 24,36, combined with peaks at h= 12,24,36,48, in the

3.9: Seasonal ARIMA 161

Figure 3.23ACF and PACF of the production series.

partial autocorrelation function. Using Table 3.2, this suggests either a seasonal moving average of order Q = 1, a seasonal autoregression of possible orderP = 2, or due to the fact that both the ACF and PACF may be tailing oﬀ at the seasonal lags, perhaps both components,P = 2 andQ= 1, are needed.

Inspecting the ACF and the PACF at the within season lags, h = 1, . . . ,11, it appears that both the ACF and PACF are tailing off. Based on Table 3.1, this result indicates that we should consider fitting a model with bothp > 0 and q >0 for the nonseasonal components. Hence, at first we will considerp= 1 andq= 1.

Fitting the three models suggested by these observations and computing the AIC for each, we obtain:

(i) ARIMA(1,1,1)×(0,1,1)₁₂, AIC = 1162.30 (ii) ARIMA(1,1,1)×(2,1,0)₁₂, AIC = 1169.04 (iii) ARIMA(1,1,1)×(2,1,1)₁₂, AIC = 1148.43

On the basis of the AICs, we prefer the ARIMA(1,1,1)×(2,1,1)₁₂model.

Figure 3.26 shows the diagnostics for this model, leading to the conclusion that the model is adequate. We note, however, the presence of a few outliers.

Figure 3.24ACF and PACF of diﬀerenced production, (1−B)x_t.

Figure 3.25 ACF and PACF of first differenced and then seasonally differ-enced production, (1−B)(1−B¹²)x_t.

3.9: Seasonal ARIMA 163

Standardized Residuals

Time

0 100 200 300

−4−2024

0 5 10 15 20 25

−0.20.20.61.0

Lag

ACF

ACF of Residuals

0 10 20 30 40

0.00.40.8

p values for Ljung−Box statistic

lag

p value

Figure 3.26 Diagnostics for the ARIMA(1,1,1)×(2,1,1)₁₂ ﬁt on the Pro-duction data.

The ﬁtted ARIMA(1,1,1)×(2,1,1)₁₂is

(1 +.22₍_.₀₈₎B¹²+.28₍_.₀₆₎B²⁴)(1−.58₍_.₁₁₎B)∇12∇xt

= (1−.50₍_.₀₇₎B¹²)(1−.27₍_.₁₃₎B)w_t

with σ_w² = 1.35. Forecasts based on the ﬁtted model for the next 12 months are shown in Figure 3.27.

Finally, we present the R code necessary to reproduce most of the analy-ses performed in Example 3.43.

> prod=scan("/mydata/prod.dat")

> par(mfrow=c(2,1)) # (P)ACF of data

> acf(prod, 48)

> pacf(prod, 48)

> par(mfrow=c(2,1)) # (P)ACF of d1 data

> acf(diff(prod), 48)

> pacf(diff(prod), 48)

340 350 360 370 380

100120140160180

month

Production

Figure 3.27Forecasts and limits for production index. The vertical dotted line separates the data from the predictions.

> par(mfrow=c(2,1)) # (P)ACF of d1-d12 data

> acf(diff(diff(prod),12), 48)

> pacf(diff(diff(prod),12), 48)

> ### fit model (iii)

> prod.fit3 = arima(prod, order=c(1,1,1), + seasonal=list(order=c(2,1,1), period=12))

> prod.fit3 # to view the results

> tsdiag(prod.fit3, gof.lag=48) # diagnostics

> ### forecasts for the final model

> prod.pr = predict(prod.fit3, n.ahead=12)

> U = prod.pr$pred + 2*prod.pr$se

> L = prod.pr$pred - 2*prod.pr$se

> month=337:372

> plot(month, prod[month], type="o", xlim=c(337,384), + ylim=c(100,180), ylab="Production")

> lines(prod.pr$pred, col="red", type="o")

> lines(U, col="blue", lty="dashed")

> lines(L, col="blue", lty="dashed")

> abline(v=372.5,lty="dotted")

Problems 165

Problems

Section 3.2

3.1 For an MA(1),xt=wt+θwt−1, show that|ρx(1)| ≤1/2 for any number θ. For which values ofθ doesρx(1) attain its maximum and minimum?

3.2 Let w_t be white noise with varianceσ_w² and let|φ| < 1 be a constant.

Consider the process

x₁ = w₁

x_t = φx_t₋₁+w_t t= 2,3, . . . .

(a) Find the mean and the variance of{x_t, t= 1,2, . . .}. Isx_t station-ary?

(b) Show

corr(xt, xt−h) =φ^h

,var(xt−h) var(x_t)

-₁/2

forh≥0.

var(xt)≈ σ²_w 1−φ² and

corr(x_t, x_t₋_h)≈φ^h, h≥0, so in a sense,x_tis “asymptotically stationary.”

(d) Comment on how you could use these results to simulaten obser-vations of a stationary Gaussian AR(1) model from simulated iid N(0,1) values.

(e) Now supposex₁=w₁/

1−φ². Is this process stationary?

3.3 Identify the following models as ARMA(p, q) models (watch out for pa-rameter redundancy), and determine whether they are causal and/or invertible:

(a) x_t=.80x_t₋₁−.15x_t₋₂+w_t−.30w_t₋₁. (b) x_t=x_t₋₁−.50x_t₋₂+w_t−w_t₋₁.

3.4 Verify the causal conditions for an AR(2) model given in (3.27). That is, show that an AR(2) is causal if and only if (3.27) holds.

Section 3.3

3.5 For the AR(2) model given byx_t=−.9x_t₋₂+w_t, ﬁnd the roots of the autoregressive polynomial, and then sketch the ACF,ρ(h).

3.6 For the AR(2) autoregressive series shown below, determine a set of dif-ference equations that can be used to ﬁndψ_j, j= 0,1, . . .in the represen-tation (3.24) and the autocorrelation functionρ(h), h = 0,1, . . .. Solve for the constants in the ACF using the known initial conditions, and plot the ﬁrst eight values.

(a) x_t+ 1.6x_t₋₁+.64x_t₋₂=w_t. (b) x_t−.40x_t₋₁−.45x_t₋₂=w_t. (c) x_t−1.2x_t₋₁+.85x_t₋₂=w_t.

Section 3.4

3.7 Verify the calculations for the autocorrelation function of an ARMA(1,1) process given in Example 3.11. Compare the form with that of the ACF for the ARMA(1,0) and the ARMA(0,1) series. Plot the ACFs of the three series on the same graph forφ =.6, θ=.9, and comment on the diagnostic capabilities of the ACF in this case.

3.8 Generaten= 100 observations from each of the three models discussed in Problem 3.7. Compute the sample ACF for each model and compare it to the theoretical values. Compute the sample PACF for each of the generated series and compare the sample ACFs and PACFs with the general results given in Table 3.1.

Section 3.5

3.9 LetMtrepresent the cardiovascular mortality series discussed in Chap-ter 2, Example 2.2.

(a) Fit an AR(2) toM_t using linear regression as in Example 3.16.

(b) Assuming the ﬁtted model in (a) is the true model, ﬁnd the fore-casts over a four-week horizon, xⁿ_n₊_m, for m = 1,2,3,4, and the corresponding 95% prediction intervals.

3.10 Consider the MA(1) series

x_t=w_t+θw_t₋₁, wherewtis white noise with varianceσ_w².

Problems 167 (a) Derive the minimum mean square error one-step forecast based on the inﬁnite past, and determine the mean square error of this fore-cast.

(b) Letx+ⁿ_n₊₁be the truncated one-step-ahead forecast as given in (3.82).

Show that

(x_n₊₁−x+ⁿ_n₊₁)² =σ²(1 +θ²⁺²ⁿ).

Compare the result with (a), and indicate how well the ﬁnite ap-proximation works in this case.

3.11 In the context of equation (3.56), show that, ifγ(0)>0 andγ(h)→ 0 ash→ ∞, then Γ_n is positive deﬁnite.

3.12 Supposex_t is stationary with zero mean and recall the deﬁnition of the PACF given by (3.49) and (3.50). That is, let

_t=x_t−

h−1 i=1

a_ix_t₋_i

and

δ_t₋_h=x_t₋_h−

h−1 j=1

b_jx_t₋_j

be the two residuals where{a₁, . . . , a_h₋₁} and{b₁, . . . , b_h₋₁}are chosen so that they minimize the mean-squared errors

E[²_t] and E[δ²_t₋_h].

The PACF at lag hwas deﬁned as the cross-correlation betweent and δt−h; that is,

φ_hh= E(_tδ_t₋_h)

E(²_t)E(δ²_t₋_h) .

LetRh be the h×hmatrix with elements ρ(i−j), i, j = 1, . . . , h, and letρρρ_h = (ρ(1), ρ(2), . . . , ρ(h)) be the vector of lagged autocorrelations, ρ(h) = corr(xt+h, xt). Let+ρρρ_h= (ρ(h), ρ(h−1), . . . , ρ(1))be the reversed vector. In addition, letx^h_t denote the BLP ofx_t given{x_t₋₁, . . . , x_t₋_h}:

x^h_t =α_h₁x_t₋₁+· · ·+α_hhx_t₋_h, as described in Property P3.3. Prove

φ_hh=ρ(h)−+ρρρ_h₋₁R⁻¹_h₋₁ρρρ_h

1−+ρρρ_h₋₁R⁻¹_h₋₁+ρρρ_h₋₁ =α_hh. In particular, this result proves Property P3.4.

Hint: Divide the prediction equations [see (3.56)] byγ(0) and write the matrix equation in the partitioned form as

R_h₋₁ +ρρρ_h₋₁ +

ρρρ_h₋₁ ρ(0) αα α₁ α_hh

= ρρρ_h₋₁

ρ(h)

where the h×1 vector of coeﬃcients ααα= (αh1, . . . , αhh) is partitioned asααα= (ααα₁, α_hh).

3.13 Suppose we wish to ﬁnd a prediction functiong(x) that minimizes M SE=E[(y−g(x))²],

wherexandyare jointly distributed random variables with density func-tionf(x, y).

(a) Show that MSE is minimized by the choice g(x) =E(y**x).

Hint:

M SE= ,

(y−g(x))²f(y|x)dy

-f(x)dx.

(b) Apply the above result to the model y=x²+z,

where x and z are independent zero-mean normal variables with variance one. Show thatM SE = 1.

g(x) =a+bx

and determineaand bto minimizeM SE. Show thata= 1 and b= E(xy)

E(x²) = 0

andM SE= 3. What do you interpret this to mean?

3.14 For an AR(1) model, determine the general form of the m-step-ahead forecastx^t_t₊_m and show

E[(xt+m−x^t_t₊_m)²] =σ²_w1−φ²^m 1−φ² .

3.15 Consider the ARMA(1,1) model discussed in Example 3.6, equation (3.26);

that is, xt =.9xt−1+.5wt−1+wt. Show that truncated prediction as deﬁned in (3.81) is equivalent to truncated prediction using the recursive formula (3.82).

3.16 Verify statement (3.78), that for a ﬁxed sample size, the ARMA predic-tion errors are correlated.

Problems 169 Section 3.6

3.17 LetM_trepresent the cardiovascular mortality series discussed in Chap-ter 2, Example 2.2. Fit an AR(2) model to the data using linear regres-sion and using Yule–Walker.

(a) Compare the parameter estimates obtained by the two methods.

(b) Compare the estimated standard errors of the coeﬃcients obtained by linear regression with their corresponding asymptotic approxi-mations, as given in Property P3.9.

3.18 Supposex₁, . . . , x_n are observations from an AR(1) process withµ= 0.

(a) Show the backcasts can be written as xⁿ_t =φ¹⁻^tx₁, fort≤1.

(b) In turn, show, for t ≤ 1, the backcasted errors are w_t(φ) = xⁿ_t − φxⁿ_t₋₁=φ¹⁻^t(1−φ²)x₁.

t=−∞w²_t(φ) = (1−φ²)x²₁.

(d) Use the result of (c) to verify the unconditional sum of squares, S(φ), can be written in the innovations form asn

t=−∞w²_t(φ).

(e) Findn x^t_t⁻¹ and r_t^t⁻¹, and show that S(φ) can also be written as

t=1(x_t−x^t_t⁻¹)² r_t^t⁻¹.

3.19 Generaten= 500 observations from the ARMA model given by x_t=.9x_t₋₁+w_t−.9w_t₋₁,

withw_t∼iid N(0,1). Plot the simulated data, compute the sample ACF and PACF of the simulated data, and ﬁt an ARMA(1,1) model to the data. What happened and how do you explain the results?

3.20 Generate 10 realizations of lengthn= 200 of a series from an ARMA(1,1) model with φ₁ = .90, θ₁ =.2 and σ² = .25. Fit the model by nonlin-ear least squares or maximum likelihood in each case and compare the estimators to the true values.

3.21 Generaten= 50 observations from a Gaussian AR(1) model withφ=.99 and σ_w = 1. Using an estimation technique of your choice, compare the approximate asymptotic distribution of your estimate (the one you would use for inference) with the results of a bootstrap experiment (use B= 200).

3.22 Using Example 3.30 as your guide, ﬁnd the Gauss–Newton procedure for estimating the autoregressive parameter, φ, from the AR(1) model, x_t=φx_t₋₁+w_t, given datax₁, . . . , x_n. Does this procedure produce the unconditional or the conditional estimator? Hint: Write the model as wt(φ) =xt−φxt−1; your solution should work out to be a non-recursive procedure.

3.23 Consider the stationary series generated by x_t=α+φx_t₋₁+w_t+θw_t₋₁,

where E(x_t) =µ, |θ| < 1,|φ| <1 and the w_t are iid random variables with zero mean and varianceσ²_w.

(a) Determine the mean as a function of αfor the above model. Find the autocovariance and ACF of the processx_t, and show that the process is weakly stationary. Is the process strictly stationary?

(b) Prove the limiting distribution asn→ ∞of the sample mean,

¯ x=n⁻¹

n t=1

x_t,

is normal, and ﬁnd its limiting mean and variance in terms ofα,φ, θ, andσ²_w. (Note: This part uses results from Appendix A.) 3.24 A problem of interest in the analysis of geophysical time series involves

a simple model for observed data containing a signal and a reflected version of the signal with unknown amplification factoraand unknown time delay δ. For example, the depth of an earthquake is proportional to the time delayδfor the P wave and its reflected form pP on a seismic record. Assume the signal is white and Gaussian with variance σ_s², and consider the generating model

x_t=s_t+as_t₋_δ.

(a) Prove the processxt is stationary. If|a|<1, show that s_t=

∞ j=0

(−a)^jx_t₋_δj

is a mean square convergent representation for the signal st, for t= 1,±1,±2, . . ..

(b) If the time delayδis assumed to be known, suggest an approximate computational method for estimating the parametersaandσ_s²using maximum likelihood and the Gauss–Newton method.

(c) If the time delay δ is an unknown integer, specify how we could estimate the parameters including δ. Generate a n = 500 point series with a =.9, σ²_w = 1 and δ = 5. Estimate the integer time delayδby searching overδ= 3,4, . . . ,7.

Problems 171 3.25 Forecasting with estimated parameters: Let x₁, x₂, . . . , xn be a sample of size n from a causal AR(1) process, x_t = φx_t₋₁+w_t. Let φbe the Yule–Walker estimator ofφ.

(a) Show φ−φ = Op(n⁻¹^/²). See Appendix A for the deﬁnition of O_p(·).

(b) Let xⁿ_n₊₁ be the one-step-ahead forecast of x_n₊₁ given the data x₁, . . . , x_n, based on the known parameter,φ, and letxⁿ_n₊₁ be the one-step-ahead forecast when the parameter is replaced byφ. Show xⁿ_n₊₁−xⁿ_n₊₁=O_p(n⁻¹^/²).

Section 3.7 3.26 Suppose

y_t=β₀+β₁t+· · ·+β_qt^q+x_t, β_q= 0,

where xt is stationary. First, show that∇^kxt is stationary for anyk= 1,2, . . . , and then show that ∇^ky_t is not stationary for k < q, but is stationary fork≥q.

3.27 Verify that the IMA(1,1) model given in (3.131) can be inverted and written as (3.132).

3.28 For the logarithm of the glacial varve data, say,x_t, presented in Example 3.31, use the ﬁrst 100 observations and calculate the EWMA,+x^t_t₊₁, given in (3.134) for t = 1, . . . ,100, using λ = .25, .50, and .75, and plot the EWMAs and the data superimposed on each other. Comment on the results.

Section 3.8

3.29 In Example 3.36, we presented the diagnostics for the MA(2) ﬁt to the GNP growth rate series. Using that example as a guide, complete the diagnostics for the AR(1) ﬁt.

3.30 Using the gas price series described in Problem 2.9, ﬁt an ARIMA(p, d, q) model to the data, performing all necessary diagnostics. Comment.

3.31 The second column in the data file globtemp2.dat are annual global temperature deviations from 1880 to 2004. The data are an update to the Hansen-Lebedeff global temperature data and the url of the data source is in the file. Fit an ARIMA(p, d, q) model to the data, performing all of the necessary diagnostics. After deciding on an appropriate model, forecast (with limits) the next 10 years. Comment. In R, useread.table to load the data file.

3.32 One of the series collected along with particulates, temperature, and mortality described in Example 2.2 is the sulfur dioxide series. Fit an ARIMA(p, d, q) model to the data, performing all of the necessary diag-nostics. After deciding on an appropriate model, forecast the data into the future four time periods ahead (about one month) and calculate 95%

prediction intervals for each of the four forecasts. Comment.

Section 3.9

3.33 Consider the ARIMA model

x_t=w_t+ Θw_t₋₂.

(a) Identify the model using the notation ARIMA(p, d, q)×(P, D, Q)_s. (b) Show that the series is invertible for|Θ|<1, and ﬁnd the coeﬃcients

in the representation

w_t= ∞ k=0

π_kx_t₋_k.

(c) Develop equations for the m-step ahead forecast, x+n+m, and its variance based on the inﬁnite past,xn, xn−1, . . ..

3.34 Sketch the ACF of the seasonal ARIMA(0,1)×(1,0)₁₂model with Φ =.8 andθ=.5.

3.35 Fit a seasonal ARIMA model of your choice to the unemployment data displayed in Figure 3.22. Use the estimated model to forecast the next 12 months.

3.36 Fit a seasonal ARIMA model of your choice to the U.S. Live Birth Series (birth.dat). Use the estimated model to forecast the next 12 months.

3.37 Fit an appropriate seasonal ARIMA model to the log-transformed John-son and JohnJohn-son earnings series of Example 1.1. Use the estimated model to forecast the next 4 quarters.

The following problems require the supplemental material given in Appendix B 3.38 Supposext=p

j=1φjxt−j+wt, whereφp= 0 andwtis white noise such thatwtis uncorrelated with{xk;k < t}. Use the Projection Theorem to show that, forn > p, the BLP ofx_n₊₁on sp{x_k, k≤n}is

x_n₊₁=

p j=1

φ_jx_n₊₁₋_j.

Problems 173 3.39 Use the Projection Theorem to derive the Innovations Algorithm, Prop-erty P3.6, equations (3.68)-(3.70). Then, use Theorem B.2 to derive the m-step-ahead forecast results given in (3.71) and (3.72).

3.40 Consider the series x_t = w_t−w_t₋₁, where w_t is a white noise process with mean zero and variance σ²_w. Suppose we consider the problem of predictingx_n₊₁, based on only x₁, . . . , x_n. Use the Projection Theorem to answer the questions below.

(a) Show the best linear predictor is xⁿ_n₊₁=− 1

n+ 1 n k=1

k x_k.

(b) Prove the mean square error is

E(x_n₊₁−xⁿ_n₊₁)²= n+ 2 n+ 1σ_w². 3.41 Use Theorem B.2 and B.3 to verify (3.105).

3.42 Prove Theorem B.2.

3.43 Prove Property P3.2.

Chapter 4 Spectral Analysis and

No documento Springer Texts in Statistics (páginas 167-187)