Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

9
Multivariate spectral analysis of time series

Similar to the univariate time series analysis, where one can study a univariate time series through its autocovariance/autocorrelation functions and lag relationships or through its spectrum properties, we can study a multivariate time series through a time domain approach or a frequency domain approach. In the time domain approach, we use the covariance/correlation matrices, and in frequency domain approach we will use the spectrum matrices. In this chapter, after a brief review of the univariate frequency domain method, we will introduce the spectral analysis for both stationary and nonstationary vector time series. With no loss of generality, we will assume a zero‐mean time series in the following discussion.

9.1 Introduction

Recall that for a univariate stationary time series process, Z_t, its spectral representation is given by

(9.1)

where dU(ω) is a complex‐valued orthogonal stochastic process for each ω such that

(9.2)

(9.3)

and

(9.4)

where U^*(ω) is the complex conjugate of U(ω). Let γ_k be the autocovariance function of Z_t. Its spectral representation is given by

(9.5)

where F(ω) is the spectral distribution function.

When the autocovariance function is absolutely summable, the spectrum or the spectral density exists and is equal to

(9.6)

In this case, we have

(9.7)

with

(9.8)

Given a time series Z₁, Z₂, …, Z_n, its Fourier transform at the Fourier frequencies ω_j = 2πj/n, − [(n − 1)/2] ≤ j ≤ [n/2], is

(9.9)

It can be shown that the sample spectrum is given by

(9.10)

Looking at the following expression of the previous double sums for Z_tZ_r

and letting k = t − r, we see that

(9.11)

where

is the sample autocovariance function. If Z_t is a Gaussian white noise process with mean 0 and variance σ², then for j = 1, 2, …, (n − 1)/2, are distributed independently and identically as the following Chi‐square distribution, that is

(9.12)

where (σ²/2π) is actually the spectrum of Z_t.

Although is an unbiased estimate, since the sample spectrum is only defined at the Fourier frequencies and its variance is independent of the sample size n, it is a rather poor estimate of the spectrum. To solve this problem, we introduce a suitable kernel or spectral window to smooth the sample spectrum, that is,

(9.13)

where ω_j = 2πj/n, j = 0, ± 1, …, ± [n/2], are Fourier frequencies; M_n is a function of n such that M_n → ∞ but M_n/n → 0 as n → ∞ ; and W_n(ω_j) is the kernel or spectral window that is a weighting function with the following properties

(9.14)

From Eq. (9.7), we see that the spectrum is the Fourier transform of the autocovariance function. So, we can also apply a weighting function to the sample autocovariances, that is,

(9.15)

where M is the truncation point that is a function of n such that M/n → 0 as n → ∞; W(k) is the lag window chosen to be an absolutely summable sequence

(9.16)

which is derived from a bounded even continuous function W(x) satisfying

(9.17)

Some commonly used spectral and lag windows include the Daniell (1946) Rectangular, Bartlett (1950), Blackman–Tukey (1959), and Parzen (1961, 1963) windows. For more details, we refer readers to Priestley (1981), and Wei (2006, 2008).

When Z_t is a Gaussian process with the spectrum f(ω), under a properly chosen lag window, we have

(9.18)

with

and W(x) is the continuous weighting function used in the associated lag window. This smoothed spectrum has a distribution related to the Chi‐square distribution. is an asymptotically unbiased estimate of f(ω) so that

and

The estimation procedure for series observed at high sampling frequency that does not require equally spaced observations, we refer readers to a recent study by Chang, Hall, and Tang (2017).

9.2 Spectral representations of multivariate time series processes

These univariate time series results can be readily generalized to the m‐dimensional vector process. Let Z_t = [Z_1,t, Z_{2, t}, …, Z_m,t]′ be a zero‐mean jointly stationary m‐dimensional vector process with the covariance matrix function, Γ(k) = [γ_i,j(k)], the spectral representation of Z_t is given by

(9.19)

where dU(ω) = [dU₁(ω), dU₂(ω), …, dU_m(ω)]′ is a m‐dimensional complex‐valued process with dU_i(ω), for i = 1, 2, …, m, being both orthogonal as well as cross‐orthogonal such that

(9.20)

and

(9.21)

The spectral representation of the covariance matrix function is given by

(9.22)

where

(9.23)

and F(ω) is the spectral distribution matrix function of Z_t. The diagonal elements F_i,i(ω) are the spectral distribution functions of the Z_i,t and the off‐diagonal elements F_i,j(ω) are the cross‐spectral distribution functions between the Z_i,t and the Z_j,t.

If the covariance matrix function is absolutely summable in the sense that each of the m × m sequence γ_i,j(k) is absolutely summable, then the spectrum matrix or the spectral density matrix function exists and is given by

(9.24)

Thus, we can write

(9.25)

and

(9.26)

where

(9.27)

When k = 0 in Eq. (9.22), we have

(9.28)

So, the area under the multivariate spectrum is the variance–covariance matrix of Z_t. The element f_i,i(ω) in Eq. (9.26) is the spectrum or the spectral density of Z_i,t, and the element f_i,j(ω) in Eq. (9.26) is the cross‐spectrum or the cross‐spectral density of Z_i,t and Z_j,t. It is easily seen that the spectral density matrix function f(ω) is positive semidefinite, that is, c′f(ω)c ≥ 0 for any nonzero m‐dimensional complex vector c. Also, the matrix f(ω) is Hermitian, that is,

(9.29)

Hence, for all i and j.

Since f_i,j(ω) is in general complex, we can write it as

(9.30)

where c_i,j(ω) and − iq_i,j(ω) are the real and imaginary parts of f_i,j(ω), that is,

(9.31)

and

(9.32)

The function c_i,j(ω) is known as the co‐spectrum, and q_i,j(ω) is known as the quadrature spectrum, between Z_i,t and Z_j,t. We can also write f_i,j(ω) in the polar form,

(9.33)

where

(9.34)

and

(9.35)

The function α_i,j(ω) is called the cross‐amplitude spectrum, and the function ϕ_i,j(ω) is called the phase spectrum.

To properly understand these spectra, we note that for any ω, the function dU_i(ω) is a complex‐valued random variable, and we can write it as

(9.36)

where α_i(ω) and ϕ_i(ω) are the amplitude spectrum and the phase spectrum of the Z_i,t series. From Eqs. (9.23), (9.24), and (9.33), since

(9.37)

where for simplicity we assume that the amplitude and phase spectra are independent. Thus, α_i,j(ω) can be thought as the average value of the product of amplitudes of the ω − frequency components of Z_i,t and Z_j,t, and the phase spectrum ϕ_i,j(ω) represents the average phase shift, ϕ_i(ω) − ϕ_j(ω), between the ω − frequency components of Z_i,t and Z_j,t. In terms of a causal relationship, Z_j,t = αZ_{i,t − τ} + e_t, or Z_i,t = αZ_{j,t − τ} + e_t, where there is no feedback relationship between them, the phase spectrum is a measure of the extent to which each frequency component of one series leads the other. The ω − frequency component of Z_i,t leads the ω − frequencycomponent of Z_j,t if the phase ϕ_i,j(ω) is negative. The ω − frequency component of Z_i,t lags the ω − frequency component of Z_j,t if the phase ϕ_i,j(ω) is positive. For a given ϕ_i,j(ω), the shift in time units is ϕ_i,j(ω)/ω, and the actual time delay of the ω − frequency component of Z_j,t is equal to

(9.38)

Other useful functions in the multivariate spectral analysis are the gain function defined as

(9.39)

and the squared coherency function defined as

(9.40)

From Eqs. (9.23) and (9.24), it is easy to see that

(9.41)

which is actually square of the correlation coefficient between the ω − frequency components of Z_i,t and Z_j,t. A value of close to 1 implies that the ω − frequency components of the two series are strongly linearly related, and a value of close to 0 implies that they are very weakly linearly related. It should be noted that just like the correlation coefficient between two random variables, the square coherency is invariant under linear transformations.

9.3 The estimation of the spectral density matrix

9.3.1 The smoothed spectrum matrix

Given a zero‐mean m‐dimensional time series, Z₁, Z₂, …, and Z_n, its Fourier transform at the Fourier frequencies ω_p = 2πp/n, − [(n − 1)/2] ≤ p ≤ [n/2], is

(9.44)

Then, the m × m sample spectrum matrix, which is also known as periodogram matrix, is simply the extension of Eqs. (9.9)–(9.11). Thus,

(9.45)

where

and

When Z_t is a multivariate Gaussian process with mean vector 0 and variance–covariance matrix Σ, has a distribution related to the sample variance–covariance matrix that is known as Wishart distribution with n degrees of freedom, which is the multivariate analog of the Chi‐square distribution. We refer readers to Goodman (1963), Hannan (1970), and Brillinger (2002) for further discussion of the properties of the periodogram and Wishart distribution.

Similar to the univariate extension of the sample spectral density discussed in Section 9.1, the sample spectrum matrix or periodogram matrix is also a poor estimate. So, we replace it by the following smoothed spectrum matrix

(9.46)

where

(9.47)

W_i(ω) is a smoothing function, also known as kernel or spectral window, and M_i is the bandwidth of the spectral window, and

(9.48)

where W_i,j(ω) is a spectral window, and M_i,j is the corresponding bandwidth. Similar to the extension of the univariate smoothed spectrum, the smoothed spectrum matrix can also be approximated by the Wishart distribution.

Once f_i,i(ω) and f_i,j(ω)are estimated, we can estimate the co‐spectrum, c_i,j(ω), the quadrature spectrum, q_i,j(ω), the cross‐amplitude spectrum, α_i,j(ω), phase spectrum, ϕ_i,j(ω), the gain function, G_i,j(ω), and the squared coherency function,

Note that the spectrum matrix is the Fourier transform of the covariance function, Γ(k) = [γ_i,j(k)], and the sample spectrum matrix is

(9.49)

Instead of spectrum smoothing, we can also apply the smoothing function to the sample covariance matrices, that is,

(9.50)

where

(9.51)

is the sample autocovariance for the Z_i,t series, W_i(k) is a suitable lag window, and M_i is the truncation point, and

(9.52)

is the sample cross‐covariance function between Z_i,t and Z_j,t, W_i,j(k) is a suitable lag window, and M_i,j is the corresponding truncation point. In this case, the estimations of the co‐spectrum, the quadrature spectrum, the cross‐amplitude spectrum, the phase spectrum, the gain function, and the squared coherency function are given by

(9.53)

(9.54)

(9.55)

(9.56)

(9.57)

and

(9.58)

For commonly used spectral or lag windows and their properties, we refer readers to Priestley (1981), Brillinger (2002), and Wei (2006).

Some important remarks are in order here.

The smoothed lag windows and the associated truncation points used in estimating f_i,i(ω) and f_i,j(ω) are not necessarily the same for all i and j. Even the same lag window is used, the associated truncation points can be different. However, the use of the same lag window and truncation point for all estimates will in general make the study of sampling properties of the estimates simpler.
In estimating the cross‐spectrum, the lag window places heavier weight on the sample cross‐covariances around the zero lag. If one series leads another and the cross‐covariance does not peak at zero lag, then the estimated cross‐spectrum will be biased. This bias is especially severe for the estimated square coherency. Because the square coherency is invariant under the linear transformation, to reduce the bias, one can properly align two series by shifting one of the series by a time lag τ so that the peak in the cross‐spectrum function of the aligned series occurs at the zero lag. In practice, the time lag τ can be estimated by the lag corresponding to the maximum cross‐correlation.
The estimates of the cross‐amplitude, phase, and gain functions are not reliable when the square coherency is small.
It should be noted that when we write the spectral density as , it implies that the range of the frequencies is from −π to π. Given a time series of length n, due to symmetry, the Fourier frequencies used in the sample spectrum will be ω_j = 2πj/n, with j = 0, 1, …, [n/2]. However, if one writes the spectral density as it implies that the range of the frequencies will be −1/2 to 1/2. In this case, the frequencies used in the sample spectrum will be ω_j = j/n, with j = 0, 1, …, [n/2]. In the following discussions, we may use either one depending on what is used in software and the referenced papers.

9.3.2 Multitaper smoothing

Developed by Thompson (1982) for univariate processes and extended by Walden (2000) for multivariate processes, multitaper smoothing is another useful way to estimate power spectrum density that balances the bias and variance of nonparametric spectral estimation. The multitaper reduces estimation bias by averaging modified periodograms obtained using a family of mutually orthogonal tapers from the same sample data. Let h_j(t) for t = 1, …, n and j = 1, …, n, be n orthonormal tapers such that

(9.59)

From Eq. (9.45), we note that

(9.60)

where is the discrete Fourier transform of Z_t. The multitaper power spectral estimator at frequency ω is

(9.61)

where K is chosen through the method shown below and is the tapered Fourier transform such that

(9.62)

The multitaper spectral estimator is asymptotically unbiased and is consistent when the number of tapers K increase with the number of observations n.

One of the most commonly used multitapers is the Discrete Prolate Spheroidal Sequences (DPSS) or tapers suggested by Thompson (1982), also known as Slepian sequences or tapers because the term DPSS was given in Slepian (1978). The Slepian tapers are sequences of functions, which are the Fourier transform of the DPSS, that were designed to maximize the spectral concentration defined as

(9.63)

for a frequency W, ∣W ∣ < π/2 is the bandwidth defining local and normally in the order of 1/n, where Consequently, the first of the sequences, {h₁(t), t = 1, …, n}, is chosen such that its corresponding spectral window or taper maximizing the concentration ratio in Eq. (9.63) over the interval (−W, W), which is equal to λ₁(n,W). This is done by maximizing the power

subject to the constraint that the total power is normalized such that

which leads to the following eigenvalue equation,

(9.64)

for t′ = 1, …, n. So, λ₁(n,W) is the largest eigenvalue and {h₁(t), t = 1, …, n} is the first taper. The second taper, {h₂(t), t = 1, …, n} is selected to maximize the corresponding concentration ratio but subject to being orthogonal to the first. Going further, the third taper maximizes the concentration ratio, but subject to being orthogonal to the first two tapers. In general, the jth taper corresponds to the jth largest eigenvalue, λ_j(n,W).

By maximizing concentration ratio, we are essentially minimizing the sidelobe energy outside a frequency band (−W,W). The eigenvalues should be close to unity for well‐behaved tapers. Let M be the matrix formed by those orthogonal tapers. Then M is an n × n positive–definite matrix with K dominant eigenvalues, λ₁(n,W) > λ₂(n,W) > ⋯ > λ_K(n,W) > ⋯ > λ_n(n,W), which are close to 1.

Another widely used set of tapers is called the Sine tapers (Riedel and Sidorenko, 1995), which are in the form of

(9.65)

for t = 1, …, n. They were used by Dai and Guo (2004) to obtain a preliminary estimate of the spectral matrix in the smoothing spline framework discussed in the next section.

9.3.3 Smoothing spline

The main disadvantage of the kernel smoothing method and the multitaper smoothing method is that they cannot guarantee that the final estimate is positive semidefinite while allowing flexible smoothing for each element of the spectral matrix. Thus, the same bandwidth is often applied to smoothing all the spectral components. However, in many applications, different components of the spectral matrix may need different smoothnesses, and require different smoothing parameters to get optimal estimates. To overcome this difficulty, Dai and Guo (2004) proposed a Cholesky decomposition based smoothing spline method for the spectrum estimation. The method models each Cholesky component separately by using different smoothing parameters. The method first obtains positive–definite and asymptotically unbiased initial spectral estimator through sine multitapers as shown in Eq. (9.65). Then, it further smooths the Cholesky components of the spectral matrix via the smoothing spline and penalized sum of squares, which allows different degrees of smoothness for different Cholesky elements.

Suppose the spectral matrix has Cholesky decomposition such that , where Γ is m × m lower triangular matrix. To obtain unique decomposition, the diagonal elements of Γ are constrained to be positive. The diagonal elements γ_j,j, j = 1, …, m, the real part of γ_j,k, ℜ(γ_j,k), and imaginary part of γ_j,k, ℑ(γ_j,k), j > k are smoothed by spline with different smoothing parameters. Suppose γ ∈ {γ_j,j, ℜ(γ_j,k), ℑ(γ_j,k), j > k, for j,k = 1, …, m}, we have

(9.66)

where a(ω_ℓ) = E{γ(ω_ℓ)}, and the e(ω_ℓ), ℓ = 1, …, n,are independent errors with zero means and the variances depending on the frequency point ω_ℓ. a(⋅) is periodic and is fitted by periodic smoothing spline (Wahba, 1990) of the form,

(9.67)

The estimator of power spectrum component is obtained by minimizing the following

(9.68)

where because of issues related asymptotic distributions, ω_ℓ = 0, 0.5, and 1 are not used in the estimation, τ = n − 1 if n is odd, and τ = n − 2 if n is even; is the estimator of γ(ω_ℓ); and λ is a smoothing parameter. The final form of the estimator is given by

(9.69)

where [⋅] is the greatest integer function. The smoothing parameter λ can be chosen by the generalized cross‐validation or the generalized maximum likelihood. For more details, we refer readers to Wahba (1990) and Dai and Guo (2004). Qin and Wang (2008) suggested, through simulations, that the generalized maximum likelihood is stable and performs better than the generalized cross‐validation.

9.3.4 Bayesian method

Recall that the discrete Fourier transform , ℓ = 0, …, (n − 1)/2, are approximately independent complex multivariate normal random variables. The large‐sample distribution of leads to the Whittle likelihood (Whittle, 1953, 1954),

(9.70)

where ω_k = ℓ/n and L = [(n − 1)/2]. Based on the Whittle likelihood, Rosen and Stoffer (2007) proposed a Bayesian method to the estimate spectrum of the second‐order stationary multivariate time series. They model the spectrum f(ω) by the modified complex Cholesky factorization, such that,

(9.71)

where Γ_ℓ is a complex‐valued lower triangular matrix with one on its diagonal,

and D_ℓ is a diagonal matrix with positive real values, that is, . Let Θ = (θ₁, …, θ_L), and D = {D₁, …, D_L},the modified Cholesky representation facilitates development of Bayesian sampler by noticing that the Whittle likelihood can be rewritten as

(9.72)

where θ_ℓ is an m(m − 1)/2–dimensional vector and R_ℓ is a m × m(m − 1)/2 design matrix such that

and Y_{j, ℓ} is the jth entry of . Let and Each component of the Cholesky decomposition is fitted by the Demmler–Reinsch basis functions for linear smoothing splines of Eubank and Hsing (2008) as follows

(9.73)

for each frequency ω_ℓ. Let X be the design matrix of the basis functions and be the associated parameters. We then have

(9.74)

for j = 1, …, m, k = 1, …, j − 1. The priors on c_j, c_{j,k, (re)}, and c_{j,k, (im)} are chosen to be bivariate normal distributions and the priors on d_j, d_{j,k, (re)}, and d_{j,k, (im)} are chosen to be L–dimensional normal distributions , , and , respectively. The hyperparameters , and are smoothing parameters, which control the amount of smoothness. As the smoothing parameters tend to zero, the spline becomes a linear fit; as the smoothing parameters tend to infinity, the spline will be an interpolating spline. Gibbs sampling with the Metropolis–Hastings algorithm is used to draw parameters from posterior distribution.

9.3.5 Penalized Whittle likelihood

Penalized methods have been one of the most popular research topics for the past decade. In the area of spectrum estimation, Pawitan and O'Sullivan (1994) developed penalized likelihood method for the nonparametric estimation of the power spectrum of a univariate time series. Pawitan (1996) developed a penalized Whittle likelihood estimator for a bivariate time series. However, their approach cannot be easily generalized to higher dimension. More recently, Krafty and Collinge (2013) proposed a penalized Whittle likelihood method to estimate the power spectrum of a vector‐valued time series. The method allows for varying levels of smoothness among spectral components while accounting for the positive definiteness of spectral matrices and the Hermitian and periodic structures of power spectra as functions of frequency. In this section, we briefly discuss the method by Krafty and Collinge (2013).

Recall that are discrete Fourier transform of the time series, and define the negative log Whittle likelihood as

(9.75)

where ω_ℓ = ℓ/n and L = [(n − 1)/2]. Based on L, the method penalizes the roughness of the estimated spectrum. The spectral matrix f(ω) is modeled by the Cholesky decomposition such that f(ω) = [ΓΓ^*]⁻¹, where Γ is a m × m lower triangular matrix with real‐valued diagonal elements. Further, denote Γ_i,j,R(ω;f) and Γ_i,j,I(ω;f) as the real and imaginary parts of the (i,j) element of Γ. The method proposes a measure of roughness of a power spectrum through the integrated squared kth derivatives of the m(m + 1)/2 real and m(m − 1)/2 imaginary components of the Cholesky decomposition. Suppose the smoothing parameters, λ = {ρ_i,j, θ_i,j : i ≤ j = 1, …, m},of Γ_i,j,R and Γ_i,j,I that control the roughness penalty are ρ_i,j > 0 and θ_i,j > 0, the roughness measure for a spectrum is

(9.76)

The interpretation is that the penalty function J shrinks the estimates of power spectra toward real‐valued matrix functions that are constant across frequency. Consequently, we consider minimizing the penalized Whittle negative loglikelihood

The estimation procedure is based on an iterative algorithm adopted from Wood (2011) and the confidence intervals are obtained via bootstrap.

9.3.6 VARMA spectral estimation

As shown in Chapter 2, the underlying vector time series process is often described by a vector autoregressive moving average (VARMA) model. Specifically, the m‐dimensional VARMA process of order p and q, VARMA(p, q) is given by

(9.77)

(9.78)

where a_t is a sequence of m‐dimensional vector white noise process, VWN(0, Σ), and

(9.79)

We assume that all zeros of ∣Φ_p(B)∣ and ∣Θ_q(B)∣ lie outside of the unit circle, so that we can also represent it as

(9.80)

where and the sequence Ψ_j is square summable. The spectral density matrix of VARMA(p, q) model is given by

(9.81)

where and Ψ^*(e^−iω) is its conjugate transpose.

Given a vector time series of n observations, Z = (Z₁, Z₂, …, Z_n), we will first build its VARMA(p,q) model including the estimation of its parameter matrices. Let and be the corresponding estimates of the parameter matrices. We have

(9.82)

where and a_t is a sequence of m‐dimensional vector white noise, Then, the spectral density matrix estimation of the underlying process is given by

(9.83)

where

(9.84)

In practice, given a vector time series Z_t, t = 1, 2, …, n, one can often approximate the underlying process with a VAR(p) model,

(9.85)

So, the commonly used parametric estimation of spectral matrix is

(9.86)

where the order p can be determined through some kind of model selection criteria such as Akaike’s information criterion (AIC) or the Bayesian information criterion (BIC).

9.4 Empirical examples of stationary vector time series

Example 9.3

In this example, we consider the monthly U.S. sales from June 2, 2009 to November 11, 2016 in five categories: AUT (automobile), BUM (building materials), GEM (general merchandise), GRO (grocery), and COM (consumer materials) introduced as WW2b in Chapter 2, where COM is the sum of CLO (clothing), BWL (beer wine and liquor), and FUR (furniture). The only difference is that we take the difference of these five series and work on the monthly changes of these sales. There are 90 points for each original sales, so in the following analysis, we will have 89 data points for each series, as shown in Figure 9.1. It is clear that the time series contain some periodic patterns.

Figure 9.1 Five U.S. monthly changes of sales from June 2, 2009 to November 11, 2016 in AUT (automobile), BUM (building materials), GEM (general merchandise), GRO (grocery), and COM (consumer materials).

9.4.1 Sample spectrum

Let us first examine the sample spectrum of each series, which as shown in Wei (2006) and Priestley (1981) can be used to test hidden periodic components. They are displayed in Figure 9.2 and they are noisy.

5 Graphs of raw periodogram vs. frequency for AUT, BUM, GEM, GRO, and COM, depicting sample spectrum of the five US monthly sales changes. — **Figure 9.2** The sample spectrum of the five U.S. monthly sales changes.

So, we will compute a smoothed kernel sample spectrum matrix

(9.88)

where

(9.89)

where W(ω) is a spectral window, and M is the corresponding bandwidth. Let us simply consider Daniell's window, that is,

(9.90)

where

(9.91)

The smoothed five individual sample spectra with M = 5 are shown on the first row in Figure 9.3. We then increase the truncation point M to 7. The corresponding five smoothed sample spectra are shown on the second row in Figure 9.3, and they are much smoother.

2 Sets of 5 graphs for AUT, BUM, GEM, GRO, and COM (left–right) depicting five individual smoothed sample spectrum using Daniell window with M = 5 (top row) and M = 7 (bottom row). — **Figure 9.3** The five individual smoothed sample spectrum using Daniell window with M = 5: first row; Daniell window with M = 7: second row.

It is clear that, except AUT, almost all other estimated spectra contain a large peak at two cycles per year, which corresponds to a half‐year periodic pattern. In addition, the estimated spectra of AUT and BUM have a peak at one cycle per year that is related to a yearly pattern. Finally, the spectra of GEM and COM also suggest a peak at four cycles per year, which is related to seasonal effects. In general, the power of AUT and GRO concentrates at higher frequencies; the power of BUM concentrates at lower frequencies; and the power of GEM and COM seems evenly spread out from medium to high frequencies.

From Eqs. (9.31) and (9.32), we have

(9.92)

and

(9.93)

Once the smoothed are obtained, we can compute the estimated co‐spectra, and the estimated quadrature spectra, as follows.

(9.94)

and

(9.95)

for i = 1, …, 5 and j = 1, …, 5. For an illustration, we use Daniell window with M = 7, and Figure 9.4 shows the estimated co‐spectra, and quadrature spectra, for j = 1, …, 4.

8 Graphs displaying estimated co-spectra for ĉ5,1 (ω), ĉ5,2 (ω), ĉ5,3 (ω) , and ĉ5,4 (ω) (a–d) and estimated quadrature spectra qˆ5,1 (ω), qˆ5,2 (ω), qˆ5,3 (ω), and qˆ5,4 (ω) (e–h). — **Figure 9.4** Estimated co‐spectra (a–d): , , , and ; estimated quadrature spectra (e–h): and .

The estimated squared coherences can be obtained by

(9.96)

They are shown in Figure 9.5. An unexpected observation is that the squared coherence between COM and GEM is large and is constant across all frequencies. This indicates a strong relationship between sales changes of consumer materials and general merchandise.

10 Graphs, each with a curve, illustrating the estimated squared coherences with the Daniell window and bandwidth of 7. — **Figure 9.5** The estimated squared coherences with the Daniell window and bandwidth of 7.

9.4.2 Bayesian method

Example 9.3(b)

Next, we apply the Bayesian spectral estimation method of Rosen and Stoffer (2007) to this dataset. Since the computer program is able to analyze at the maximum a three‐dimensional time series, we select AUT, BUM, and GEM to construct a trivariate time series in this analysis. The estimated spectra are shown in Figure 9.6. The results are largely consistent with the results of the kernel smoothing method. However, it appears that the spectra obtained from the Bayesian method are smoother compared to that of the kernel smoothing method. In addition, it seems that the GEM series only have one peak around three to four cycles per year, and its other harmonic peaks at higher frequencies disappear according to the Bayesian spectral analysis.

Figure 9.6 The three estimated spectra using the Bayesian method by Rosen and Stoffer (2007).

9.4.3 Penalized Whittle likelihood method

9.4.4 Example of VAR spectrum estimation

According to Eq. (9.86), the estimated spectral matrix is

(9.98)

where

Figure 9.8 shows the estimated spectrum from the VAR(4) model. Comparing with the kernel smoothed estimations given in Figure 9.3, we see that there are some similarities but also some differences. Similar to the nonparametric estimation, only the spectra of GEM and COM have peak at four cycles per year, which is related to seasonal effects. BUM concentrates at lower frequencies, GRO concentrates at higher frequencies, and the powers of GEM and COM are more evenly spread out in frequencies. From Figure 9.3, we see that the estimated spectra contain a large peak at two cycles per year except AUT. However, in terms of the VAR estimation shown in Figure 9.8, the estimated spectra contain a large peak at two cycles per year including AUT. Also, all estimated spectra have peak at one cycle per year, which is related to a yearly pattern, and unlike the kernel estimation, only AUT and BUM have peak at one cycle per year. Overall, the estimates of kernel window smoothing are much smoother than that of the VAR model. One possible reason is that the sample size of this particular example is small, which leads to unstable VAR parameter estimation.

5 Graphs for AUT, BUM, GEM, GRO, and COM (left–right) depicting the estimated power spectra by using the VAR(4) model. — **Figure 9.8** The estimated power spectra by using the VAR(4) model.

Figure 9.9 shows the estimated co‐spectra, and quadrature spectra, for j = 1, …, 4. Comparing with the kernel smoothed estimations given in Figure 9.4, again we see that there are some similarities and differences. However, the VAR estimation of the squared coherences as shown in Figure 9.10 is quite similar to the kernel estimation from Figure 9.5. Both estimation methods show that the estimated squared coherence between COM and GEM is large and constant across all frequencies, indicating a strong relationship between sales changes of consumer materials and general merchandise.

8 Graphs displaying estimated co-spectra for ĉ5,1 (ω), ĉ5,2 (ω), ĉ5,3 (ω) , and ĉ5,4 (ω) (a–d) and estimated quadrature spectra qˆ5,1 (ω), qˆ5,2 (ω), qˆ5,3 (ω), and qˆ5,4 (ω) (e–h) from the VAR(4) model. — **Figure 9.9** Estimated co‐spectra (a–d): , , , and estimated quadrature spectra (e–h): , , , and from the VAR(4) model.

10 Graphs, each with a curve, illustrating the estimated squared coherences with the VAR(4) model. — **Figure 9.10** The estimated squared coherences by the VAR(4) model.

9.5 Spectrum analysis of a nonstationary vector time series

9.5.1 Introduction

For a nonstationary time series, we normally use some transformation like variance stabilization and/or differencing to reduce it to stationary before performing its spectral matrix estimation. However, there are many kinds of nonstationary time series that cannot be reduced to stationary by these transformations. Let us first consider a univariate case. There are many univariate nonstationary processes Z_t, which cannot be represented by given in Eq. (9.1), because the function ϕ(ω) = e^iωt as a sine and cosine waves is stationary. Priestley (1965, 1966, and 1967) has pointed out that in this case, instead of using ϕ(ω) = e^iωt, we need to consider an oscillatory function, which is a generalized Fourier transform,

(9.99)

so that

(9.100)

where A(t,ω) is a time‐varying modulating or transfer function with absolute maximum at zero frequency. In other words, Z_t is an oscillatory process with an evolutionary spectrum, which has the same type of physical interpretation as the spectrum of a stationary process. The main difference is that while the spectrum of a stationary process describes the power distribution across frequencies over all time, the evolutionary spectrum describes power distribution over frequency at instantaneous time. However, within this framework, letting the length of series n → ∞ does not increase our knowledge about local behavior of spectrum since the pattern of the forthcoming series is different. As a result, the formulation does not allow the development of rigorous asymptotic theory of statistical inference. To overcome this problem, Dahlhaus (1996, 2000) introduced locally stationary time series that allows theoretical asymptotic analysis of the evolutionary spectrum for a univariate case, and further extended it to the multivariate case.

9.5.2 Spectrum representations of a nonstationary multivariate process

Let {Z_t : t = 1, …, n} be a m‐dimensional time series. The idea of locally stationary process is to rescale the transfer function to unit time scale, such that

(9.101)

More rigorously, we have following definition.

Based on Definition 9.1, the time‐varying power spectrum of the process is given by

(9.103)

Developed from locally stationary time series, methods for estimating the time‐varying spectrum of a multivariate time series can be roughly grouped into three categories. The first category consists of estimators in which second‐order frequency domain structures evolve continuously over time. This includes the slowly evolving multivariate locally stationary process, which can be analyzed parametrically by fitting time series models with time‐varying parameters (Dahlhaus, 2000), and nonparametrically via the bivariate smoothing of spectral components as functions of frequency and time (Guo and Dai, 2006). The second category of methods consist of estimators that are piecewise stationary. Approaches within this second category typically divide a time series into approximately stationary segments, then obtain the estimates of local spectrum within segments. These methods include both parametric approaches, such as fitting piecewise vector autoregressive models (Davis et al., 2006), and nonparametric approaches, such as using the multivariate smooth localized complex exponential (SLEX) library (Ombao von Sachs and Guo, 2005). The final category are methods that can automatically approximate both abrupt and slowly varying changes by averaging over piecewise stationary models, and they include the smoothing stochastic approximation Monte Carlo (SSAMC) based method of Zhang (2016) and the reversible jump Monte Carlo and Hamiltonian Monte Carlo (HMC) based method by Li and Krafty (2018).

9.5.2.1 Time‐varying autoregressive model

One of the methods used in Dahlhaus (2000) is the time‐varying VARMA(p,q) model. For a vector autoregressive model VAR(p), it is defined by

(9.104)

where ε_t are m‐dimensional independent random variable with mean zero and unit variance I. In addition, we assume some smoothness conditions on Σ(⋅) and Φ_j(⋅). In some neighborhood of a fixed time point u₀ = t₀/n, the process Z_t can be approximated by the stationary process Z_t(u₀) given by

(9.105)

Z_t has a unique time‐varying power spectrum, which is locally the same as the power spectrum of Z_t(u), that is,

(9.106)

where u = t/n, and

Similarly, the locally covariance matrix is

(9.107)

Based on this statement, the time‐varying spectrum can be obtained by estimating the time‐varying parameters of the VAR model. For more properties of the estimation based on time‐varying VARMA(p,q), VAR(p), and VMA(q) models, we refer readers to Dahlhaus (2000).

9.5.2.2 Smoothing spline ANOVA model

Based on the locally stationary process, the time‐varying spectrum can also be estimated nonparametrically via the smoothing spline Analysis of Variance (ANOVA) model by Guo and Dai (2006). However, their definition of locally stationary process is slightly different from Dahlhaus (2000). In Section 9.5.2, we mentioned that Dahlhaus (2000) assumes a series of transfer functions A⁰(t/n,ω) that converge to a large‐sample transfer function A(u,ω) in order to allow for the fitting of parametric models. Since Guo and Dai (2006) considered a nonparametric estimation, they used A(u,ω) directly.

Based on this definition, the smoothing ANOVA model takes a two‐stage estimation procedure. At the first stage, the locally stationary process is approximated by piecewise stationary time series with small blocks to obtain initial spectrum estimates and the Cholesky decomposition. The initial spectrum estimates are obtained by the multitaper method to reduce variance. At the second stage, each element of the Cholesky decomposition is treated as a bivariate smooth function of time and frequency and is modeled by the smoothing spline ANOVA model by Gu and Wahba (1993). The final estimated time‐varying spectrum is reconstructed from the smoothed elements of the Cholesky decomposition. Thus, the method provides a way to ensure the final estimate of the multivariate spectrum is positive–definite while allowing enough flexibility in the smoothness of its elements.

We shall briefly discuss the smoothing spline ANOVA step. Suppose the spectral matrix has the Cholesky decomposition such that where L(u,ω) is a m × m lower triangular matrix. The method smooths the diagonal elements γ_j,j(u,ω), j = 1, …, m, the real part of γ_j,k(u,ω), ℜ{γ_j,k(u,ω)}, and the imaginary part of γ_j,k(u,ω), ℑ{γ_j,k(u,ω)}, for j > k separately with their own smoothing parameters. Let γ(u,ω) ∈ {γ_j,j(u,ω), ℜ(γ_j,k)(u, ω), ℑ(γ_j,k)(u,ω), j > k, for j, k = 1, …, m}. We have

where a(u,ω) is the corresponding Cholesky decomposition element of the spectrum, such that a(u,ω) = E{γ(u,ω)}, the ε(u,ω) are independent errors with zero‐mean and the variance depending on the time‐frequency point (u,ω).

The smoothing spline ANOVA model is defined by the corresponding reproducing kernels (RKs) for the time and frequency domains. In the frequency domain, the reproducing kernel Hilbert space (RKHS) W₁ for ω can be decomposed as W₁ = 1 ⊕ H₁. The reproducing kernel for H₁ is R₁(ω₁, ω₂) = − K₄(∣ω₁ − ω₂∣)/24, where K_k(⋅) is the kth order Bernoulli polynomial. In the time domain, the RKHS W₂ can be decomposed as W₂ = 1 ⊕ (u − 0.5) ⊕ H₂. The reproducing kernel for H₂ is R₂(u₁, u₂) = K₂(u₁)K₂(u₂)/4 − K₄(∣u₁ − u₂∣)/24. The full tensor product RKHS for (u,ω) is then

(9.108)

(9.109)

(9.110)

where the RK for H₃ = H₁ ⊗ (u − 0.5) is R₃{(u₁, ω₁), (u₂, ω₂)} = R₁(ω₁, ω₂)(u₁ − 0.5)(u₂ − 0.5), the RK for H₄ = H₁ ⊗ H₂ is R₄{(u₁, ω₁), (u₂, ω₂)} = R₁(ω₁, ω₂)R₂(u₁, u₂). Thus, a(u,ω) has the ANOVA‐like decomposition

(9.111)

where d₁ + d₂(u − 0.5) is the linear trend, a₁(ω) is the smooth main effect for frequency, a₂(u) is the smooth main effect across time, a₃(u,ω) is the smooth in frequency and linear in time, and a₄(u,ω) is the interaction that is smooth at both time and frequency. The least square estimation can be used to obtain the estimates For the details, see Guo and Dai (2006).

9.5.2.3 Piecewise vector autoregressive model

Davis, Lee, and Rodriguez‐Yam (2006) considered modeling nonstationary time series using piecewise VAR processes. The number and locations of the piecewise VAR segments, and the orders of the corresponding VAR process, are assumed to be unknown. Based on the minimum description length (MDL) principle, the method penalizes complexity of the model, and thus provides the criteria to define the best fitting model. We refer readers to Rissanen (1989), Hansen and Yu (2001) for more details about the MDL principle.

Let Z_t be a m‐dimensional time series with k segments, and assume that there are partitions or changepoints δ = (δ₀, …, δ_k)′ with δ₀ = 0 and δ_k = n. The time series within the jth segment is modeled by VAR(p_j) process, such that

(9.112)

where are m × m dimensional coefficient matrices of the VAR process. We define the entire class of piecewise VAR models as M and a model from this class as F ∈ M. The principle of MDL is to find the best fitting model from M as the one that produces the shortest code length. The code length of an object is the amount of memory space required to store the data Z_t. The MDL has two components, a fitted model and the portion that is unexplained by . The later component can be defined as the residuals . Let be the code length of fitted model and be the code length of the residuals of the jth segment. Then, the total code length of the data can be decomposed to

(9.113)

The MDL principle suggests that a best fitting model by minimizing L( Z_t).

Let n_j be the sample size in the jth segment and δ be the information set about the partition of segments. Since the piecewise VAR model can be characterized by following parameters: k, δ , p = (p₁, …, p_k), and , we can further decompose by

(9.114)

(9.115)

The equality is due to the fact that the complete knowledge of δ implies complete knowledge of n₁, …, n_k. In order to store an integer I whose value is not bounded, approximately log₂I bits are needed. Thus, we have L(k) = log₂k, L(p_j) = log₂p_j, and . On the other hand, if integer I is upper‐bounded say I_U, then log₂I_U bits are needed. Since n_j is bounded by n, we have L(n_j) = log₂n and L(n₁, …, n_k) = klog₂n (Hansen and Yu, 2001). It is known that a maximum likelihood estimate of a real parameter computed from n observations can be effectively encoded with (1/2)log₂n bits. Because the total number of parameters in is (p_j + 1)m², we have

(9.116)

Combining the results, we have

(9.117)

Rissanen (1989) showed that the code length of is approximated by the negative of the loglikelihood of the fitted model . Thus, we have

(9.118)

where L denotes the Whittle likelihood. Consequently, from Eqs. (9.114) and (9.115), the MDL objective function is defined as

(9.119)

The best fitting model is the one that minimizes the MDL. To overcome difficulties raised in optimization, the method proposes using a genetic algorithm.

9.5.2.4 Bayesian methods

There are two recently proposed Bayesian methods to estimate multivariate time‐varying spectrum, including Zhang (2016) and Li and Krafty (2018). In this section, we focus on the method of Li and Krafty (2018). The method is also based on locally stationary time series to estimate the time‐varying spectrum

(9.120)

The method assumes that for every u ∈ (0, 1), each component of f(u, ⋅) possesses a square‐integrable first derivative as a function of frequency; for every ω, each component of f(⋅, ω) is continuous as a function of scaled time at all but a possible finite number of points. The assumption on transfer function and time‐varying spectrum are slightly different from Definitions 9.1 and 9.2 in two ways. First, Definition 9.1 assumes a series of transfer functions A⁰(u,ω) that converge to a large‐sample transfer function A(u,ω) in order to allow for the fitting of parametric models. Since the method considers nonparametric estimation, in a manner similar to the smoothing ANOVA method, A(u,ω) is used. Second, Definitions 9.1 and 9.2 require the time‐varying spectrum to be continuous in both time and frequency. The method is more flexible and allows for components of spectrum to evolve not only continuously, but also abruptly in time.

The analysis of a locally stationary time series { Z_t : t = 1, …, n} begins by using the piecewise stationary approximation. Consider a partition of the time series into k segments defined by partition points δ = (δ₀, …, δ_k) with δ₀ = 0 and δ_k = n such that Z_t is approximately stationary within the segments {t : δ_q − 1 < t ≤ δ_q} for q = 1, …, k. Then

(9.121)

where A_q(ω) = A(u_q, ω)I(δ_q − 1 < t ≤ δ_q), I(⋅) is the indicator function, and u_q = (δ_q + δ_q − 1)/2n is the scaled midpoint of the qth segment. Within the qth segment, the time series is approximately second‐order stationary with local power spectrum f(u_q, ω) = A_q(ω)A_q(ω)^*.

Conditional on an approximately stationary partition δ, known approaches and properties for stationary time series can be applied. In particular, the large‐sample distribution of the discrete Fourier transform of a stationary time series that provides the Whittle likelihood allows for the formulation of a product of Whittle likelihoods. Let us define the local discrete Fourier transform at frequency ℓ within segment q as

(9.122)

where n_q = δ_q − δ_q − 1 is the number of observations in segment q, ω_q,ℓ = ℓ/n_q are the Fourier frequencies, and L_q = [(n_q − 1)/2].Given a partition of k segments, δ, the y_{q, ℓ} are approximately independent zero‐mean complex multivariate Gaussian random variables with covariance matrices, which equals spectral matrices f(u_q, ω_q,ℓ). This leads to a loglikelihood that can be approximated by a sum of log Whittle likelihoods

(9.123)

where Y is collection of Fourier transformations.

The spectral matrix is positive–definite and, to nonparametrically allow for flexible smoothing among the different components while preserving positive definiteness, the procedure modified the Cholesky components of local spectra via linear penalized splines. The modified Cholesky decomposition represents a time‐varying spectral matrix as

(9.124)

for a complex‐valued m × m lower triangular matrix Θ(u,ω) with ones on the diagonal and a positive diagonal matrix Ψ(u,ω). For a piecewise stationary approximation with partition δ into k segments, we define the local modified Cholesky decomposition as f⁻¹(u_q, ω) = Θ(u_q, ω)Ψ(u_q, ω)⁻¹Θ(u_q, ω)^*, for q = 1, …, k, and let θ_j,k,q(ω) and ψ_j,j,q(ω) be the (j,k) and (j,j) elements of Θ(u_q, ω) and Ψ(u_q, ω), respectively. Then, for each segment, there are m² components to estimate: ℜ{θ_j,k,q(ω)} for (j > k) = 1, …, m,ℑ{θ_j,k,q(ω)} for (j > k) = 1, …, m, and ψ_j,j,q(ω) for j = 1, …, m. The Cholskey components are modeled by periodic even and odd linear splines by considering

(9.125)

(9.126)

and

(9.127)

The Fourier frequencies for each segment form an equally spaced grid, so that Demmler–Reinsch bases for periodic even and odd smoothing splines for local periodograms are given by {cos(2πsω) : s = 0, 1, …, (L_q − 1)} and {sin(2πsω) : s = 1, …, L_q}, respectively. For the details, we refer readers to Schwarz and Krivobokova (2016).

The method relies on reversible jump Markov chain and HMC methods to sample from posterior distributions. The estimates of time‐varying spectrum components are obtained by averaging over the distribution of partitions so that both abrupt and slowly varying changes can be recovered.

9.6 Empirical spectrum example of nonstationary vector time series

We first applied the method of Li and Krafty (2018) to the time series. Figures 9.12 and 9.13 displays estimated spectra and pairwise squared coherence respectively. There are several obvious changes in the spectra and coherences, including (i) the period 1997–1998, which corresponds to the Asia financial crisis in 1997, which also affected the U.S. economy; (ii) the year 2003, which is the period of the U.S. invasion of Iraq; and (iii) the period 2007–2009, which corresponds to the global financial crisis that began in 2007. All the coherences had a big jump during this period instead of drop as in 1997–1998, indicating that the financial crisis in 2007–2009 had broad impacts on both the DJIA and the NASDAQ.

3D graphs depicting the estimated spectrum of Dow Jones Industrial Average (DJIA) (left), NASDAQ (middle), and S&P 500 (right). — **Figure 9.12** Estimated spectrum of Dow Jones Industrial Average (DJIA) (left), NASDAQ (middle), and S&P 500 (right).

3D graphs depicting the estimated squared pairwise coherence between DJIA and NASDAQ (left), DJIA and S&P 500 (middle), and NASDAQ and S&P 500 (right). — **Figure 9.13** Estimated squared pairwise coherence between DJIA and NASDAQ (left), DJIA and S&P 500 (middle), and NASDAQ and S&P 500 (right).

When a finite number of stationary segments are identified with a high probability, the results can be displayed as estimated local spectra at certain time point. For example, we obtain the local spectra of DJIA and NASDAQ and their coherence at the week of October 18, 2007 in the Figure 9.14. It appears that the spectra and coherence are all flat, and they clearly indicate a well‐known phenomenon that the log returns of stock time series are white noise.

3 Graphs, each displaying a flat curve representing the local spectrum of DJIA (top left), NASDAQ (top right), and their coherences (bottom) at October 18, 2007. — **Figure 9.14** Local spectrum of DJIA (top left), NASDAQ (top right), and their coherences (bottom) at October 18, 2007.

We then applied the piecewise vector autoregressive method of Davis, Lee, and Rodriguez‐Yam (2006) to the same data set. The procedure produces changepoints at the following dates: July 13, 1998, July 15, 2002, September 8, 2003, and June 25, 2007. Figure 9.15 presents the DJIA time series along with the changepoints found by the approach of Davis, et al. (2006). The changepoints found by the piecewise autoregressive model are somewhat similar to that found by the Bayesian method. However, one of the main differences between the assumptions of the Bayesian method and the piecewise vector autoregressive method is that the number and location of partitions (changepoints) is random for the Bayesian method, while they are fixed for the piecewise autoregressive method.

Graph displaying a fluctuating curve with 4 vertical dotted lines, illustrating the Weekly log returns for the Dow Jones from April 1990 to December 2011. — **Figure 9.15** Weekly log returns for the Dow Jones from April 1990 to December 2011 along with the changepoints found by the piecewise vector autoregressive model of Davis, Lee, and Rodriguez‐Yam (2006).

Before concluding this chapter on multivariate spectral analysis and its applications, I would like to mention some more recent references including Grant and Quinn (2017), Ray et al. (2017), and Wilson (2017), among others, and point out that multivariate spectral analysis can also be used to analyze space–time data sets. However, it should be stated that, so far, the developed procedures can only apply to a stationary or properly transformed stationary space–time series. For references on spectral analysis with space–time series, we refer readers to Bandyopadhyay, Jentsch, and Rao (2017), Rao and Terdick (2017), among others.

Software code

The software program for this chapter is very long, because there are no standard software programs available in the market for many spectral procedures introduced in this book. I sincerely appreciate Mr. Zeda Li who helped me use R and/or MATLAB to create numerous programs for the implementation of the procedures introduced in the chapter.

Nonparametric multivariate kernel spectrum estimation – Example 9.3(a) using R

library(astsa)

# data preparation

ms = as.data.frame(read.csv("C:/Bookdata/WW2b.csv")[,-1])
rownames(ms) = seq(as.Date("2009/6/1"), by="month", length=90)
z.orig = ts(ms,start=c(2009,6),frequency=12)
z = diff(z.orig,1)
plot(z)
z = ts(apply(z,2,scale), freq=1)  # scaling strips ts attributes
n = nrow(z)

# get raw periodogram using fft

h = mvfft(z)/sqrt(n) # Fourier Transformation
nf = floor(n/2)+1
h = h[1:nf,]  # Take half of the frequencies
I = matrix(h*Conj(h),c(nf,5))/(2*pi)
nn = dim(I)[1]-1
par(mfrow=c(1,5))
plot((0:nn)/88*12,Re(I[,1]),type='l',lty=1,ylab="Raw Periodogram",xlab="frequency",main=colnames(ms)[1])
plot((0:nn)/88*12,Re(I[,2]),type='l',lty=1,ylab="Raw Periodogram",xlab="frequency",main=colnames(ms)[2])
plot((0:nn)/88*12,Re(I[,3]),type='l',lty=1,ylab="Raw Periodogram",xlab="frequency",main=colnames(ms)[3])
plot((0:nn)/88*12,Re(I[,4]),type='l',lty=1,ylab="Raw Periodogram",xlab="frequency",main=colnames(ms)[4])
plot((0:nn)/88*12,Re(I[,5]),type='l',lty=1,ylab="Raw Periodogram",xlab="frequency",main=colnames(ms)[5])

# window span 5

L = c(5,5)  # degree of smoothing
sales.spec = mvspec(z, spans=L, kernel="daniel", detrend=FALSE, taper=0, plot=FALSE)
f = sales.spec$fxx/(2*pi) # estimated spectral matrix
freq = sales.spec$freq
par(mfrow=c(1,5))
plot(sales.spec$freq*12,Re(f[1,1,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[1])
plot(sales.spec$freq*12,Re(f[2,2,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[2])
plot(sales.spec$freq*12,Re(f[3,3,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[3])
plot(sales.spec$freq*12,Re(f[4,4,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[4])
plot(sales.spec$freq*12,Re(f[5,5,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[5])

# quadrature spectrum
c51 = Re(f[5,1,])
c52 = Re(f[5,2,])
c53 = Re(f[5,3,])
c54 = Re(f[5,4,])
c41 = Re(f[4,1,])
c42 = Re(f[4,2,])
c43 = Re(f[4,3,])
c31 = Re(f[3,1,])
c32 = Re(f[3,2,])
c21 = Re(f[2,1,])

# plot part of quadrature spectrum
par(mfrow=c(1,4))
plot(freq*12, c51, type='l',lty=1,ylab="Co-spectrum",xlab="Cycles/year",main="(a)")
plot(freq*12, c52, type='l',lty=1,ylab="Co-spectrum",xlab="Cycles/year",main="(b)")
plot(freq*12, c53, type='l',lty=1,ylab="Co-spectrum",xlab="Cycles/year",main="(c)")
plot(freq*12, c54, type='l',lty=1,ylab="Co-spectrum",xlab="Cycles/year",main="(d)")

# co spectrum
q51 = Im(f[5,1,])
q52 = Im(f[5,2,])
q53 = Im(f[5,3,])
q54 = Im(f[5,4,])
q41 = Im(f[4,1,])
q42 = Im(f[4,2,])
q43 = Im(f[4,3,])
q31 = Im(f[3,1,])
q32 = Im(f[3,2,])
q21 = Im(f[2,1,])

# plot part of co spectrum
par(mfrow=c(1,4))
plot(freq*12,q51,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="(a)")
plot(freq*12,q52,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="(b)")
plot(freq*12,q53,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="(c)")
plot(freq*12,q54,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="(d)")

# phase spectrum
phi = sales.spec$phase # estimated phase spectrum

phi[,1] # between 2-1
phi[,2] # between 3-1
phi[,3] # between 3-2
phi[,4] # between 4-1
phi[,5] # between 4-2
phi[,6] # between 4-3
phi[,7] # between 5-1
phi[,8] # between 5-2
phi[,9] # between 5-3
phi[,10] # between 5-4

# plot part of phase spectrum
par(mfrow=c(1,4))
plot(freq*12,phi[,7],type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="5,1")
plot(freq*12,phi[,8],type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="5,2")
plot(freq*12,phi[,9],type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="5,3")
plot(freq*12,phi[,10],type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="5,4")

# coherency
K = sales.spec$coh
plot.spec.coherency(sales.spec, ci=NA, main="Squared Coherencies")


# window span 7

L = c(7,7) # degree of smoothing
sales.spec = mvspec(z, spans=L, kernel="daniel", detrend=FALSE, taper=0, plot=FALSE)
f = sales.spec$fxx/(2*pi) # estimated spectral matrix
freq = sales.spec$freq
par(mfrow=c(1,5))
plot(sales.spec$freq*12,Re(f[1,1,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[1])
plot(sales.spec$freq*12,Re(f[2,2,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[2])
plot(sales.spec$freq*12,Re(f[3,3,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[3])
plot(sales.spec$freq*12,Re(f[4,4,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[4])
plot(sales.spec$freq*12,Re(f[5,5,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[5])

# quadrature spectrum
c51 = Re(f[5,1,])
c52 = Re(f[5,2,])
c53 = Re(f[5,3,])
c54 = Re(f[5,4,])
c41 = Re(f[4,1,])
c42 = Re(f[4,2,])
c43 = Re(f[4,3,])
c31 = Re(f[3,1,])
c32 = Re(f[3,2,])
c21 = Re(f[2,1,])

# plot part of quadrature spectrum
par(mfrow=c(1,4))
plot(freq*12, c51, type='l',lty=1,ylab="Co-spectrum",xlab="Cycles/year",main="(a)")
plot(freq*12, c52, type='l',lty=1,ylab="Co-spectrum",xlab="Cycles/year",main="(b)")
plot(freq*12, c53, type='l',lty=1,ylab="Co-spectrum",xlab="Cycles/year",main="(c)")
plot(freq*12, c54, type='l',lty=1,ylab="Co-spectrum",xlab="Cycles/year",main="(d)")

# co spectrum
q51 = Im(f[5,1,])
q52 = Im(f[5,2,])
q53 = Im(f[5,3,])
q54 = Im(f[5,4,])
q41 = Im(f[4,1,])
q42 = Im(f[4,2,])
q43 = Im(f[4,3,])
q31 = Im(f[3,1,])
q32 = Im(f[3,2,])
q21 = Im(f[2,1,])

# plot part of co spectrum
par(mfrow=c(1,4))
plot(freq*12,q51,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="(a)")
plot(freq*12,q52,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="(b)")
plot(freq*12,q53,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="(c)")
plot(freq*12,q54,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="(d)")

# phase spectrum
phi = sales.spec$phase # estimated phase spectrum

phi[,1] # between 2-1
phi[,2] # between 3-1
phi[,3] # between 3-2
phi[,4] # between 4-1
phi[,5] # between 4-2
phi[,6] # between 4-3
phi[,7] # between 5-1
phi[,8] # between 5-2
phi[,9] # between 5-3
phi[,10] # between 5-4

# plot part of phase spectrum
par(mfrow=c(1,4))
plot(freq*12,phi[,7],type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="5,1")
plot(freq*12,phi[,8],type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="5,2")
plot(freq*12,phi[,9],type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="5,3")
plot(freq*12,phi[,10],type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="5,4")

# coherency
K = sales.spec$coh
plot.spec.coherency(sales.spec, ci=NA, main="Squared Coherencies")

Bayesian spectral estimation – Example 9.3(b) using MATLAB

For the Bayesian spectral estimation, the first part of the code contains required functions, and the second part of the code is to read in a data set and implement the procedure. I would like to thank Professor Ori Rosen from University of Texas, El Paso for sharing the Bayesian spectral estimation code.

function [f,g,h]=gamma_deriv(param,xx,ngamma,nf,nbasis,tau,sigalpha,coef)
 f=-sum(xx*param+exp(-xx*param).*coef)-0.5*(param(1:2)'*param(1:2))/...
  sigalpha-0.5*(param(3:ngamma)'*param(3:ngamma))/tau;
 g=zeros(ngamma,1);
 g(1:2)=-param(1:2)/sigalpha;
 g(3:ngamma)=-param(3:ngamma)/tau;
 h=zeros(ngamma,ngamma);
 h(1:2,1:2)=-1/sigalpha*eye(2);
 h(3:ngamma,3:ngamma)=-1/tau*eye(nbasis);
 for k=1:nf
  g=g-xx(k,:)'+coef(k)*exp(-xx(k,:)*param)*xx(k,:)';
  h=h-coef(k)*exp(-xx(k,:)*param)*xx(k,:)'*xx(k,:);
 end
 f=-f;
 g=-g;
 h=-h;
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% read in data and implement the analysis
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
sales = csvread('C:/Bookdata/WW2b.csv',1,1);
sales = diff(sales);
ts = sales(:,1:3);
dim=size(ts);
nobs=dim(1);
n=nobs;
ts=ts(1:n,:); %use the first n observations only
yy=fft(ts)/sqrt(n); %find the dft
y=yy(2:round(n/2)-1,:);%drop first and last Fourier frequencies
nf=length(y); %number of Fourier freqs
z=ones(nf,2);
z(:,2)=(1:round(n/2)-2)/n;
tt=(1:round(n/2)-2)/n;
%%for spectral envelope calculation
var_ts=cov(ts);
[evec,evl]=eig(var_ts);
invroot=evec*diag(diag(evl).^-0.5)*evec';
%%
nbasis=10;
ngamma=nbasis+2;
sigma=10; %variance of prior on theta
sigalpha=100;
taua=nbasis/2;
iter=2000;
nwarmup=1000;
epsilon=zeros(3,iter);
delta_sq=zeros(3,nf,iter);
gamma=zeros(ngamma,9,iter+1);
tau=100*ones(9,iter+1); %corresponding to the gammas
theta=zeros(nf,3,iter+1);
cubic=0;
xx=ones(nf,ngamma);
xx(:,2)=tt';
if(cubic==0)
  for j=3:ngamma
   xx(:,j)=sqrt(2)*cos((j-1)*pi*tt);
  end
else
omega=ones(nf,nf);
for i=1:nf
  for j=1:nf
   if(tt(i) <= tt(j))
     omega(i,j)=tt(i)^2*(tt(j)-tt(i)/3)/2;
     omega(j,i)=omega(i,j);
   end
  end
end
[Q,D]=eig(omega);
xx=Q*D^.5;
xx=[ones(nf,1) reshape(tt,nf,1) xx(:,nf-nbasis+1:nf)];
end
spectmat=zeros(3,3,nf,iter-nwarmup);
%spectenv=zeros(nf,iter-nwarmup);
for p=1:iter
  p
  for j=1:3
   delta_sq(j,:,p)=exp(xx*gamma(:,j,p))';
  end
  %Drawing the gammas to smooth the thetas
  %Drawing first gamma4 and gamma7 corresponding to real(theta_1)
  %and imag(theta_1)
  A4=zeros(ngamma,ngamma);
  A4(1:2,1:2)=0.5/sigalpha*eye(2);
  A4(3:ngamma,3:ngamma)=0.5/tau(4,p)*eye(nbasis);
  A7=zeros(ngamma,ngamma);
  A7(1:2,1:2)=0.5/sigalpha*eye(2);
  A7(3:ngamma,3:ngamma)=0.5/tau(7,p)*eye(nbasis);
  B4=zeros(ngamma,ngamma);
  v4=zeros(1,ngamma);
  v7=zeros(1,ngamma);
  for k=1:nf
   ck=conj(y(k,2))*y(k,1)+conj(y(k,1))*y(k,2);
   dk=abs(y(k,1))^2;
   gk=sqrt(-1)*(conj(y(k,2))*y(k,1)-conj(y(k,1))*y(k,2));
   B4=B4+1/delta_sq(2,k,p)*dk*xx(k,:)'*xx(k,:);
   v4=v4+1/delta_sq(2,k,p)*ck*xx(k,:);
   v7=v7+1/delta_sq(2,k,p)*gk*xx(k,:);
  end
  var_gamma4=0.5*inv(B4+A4);
  mu_gamma4=var_gamma4*v4';
  B7=B4;
  var_gamma7=0.5*inv(B7+A7);
  mu_gamma7=var_gamma7*v7';
  var_gamma4=0.5*(var_gamma4+var_gamma4');
  var_gamma7=0.5*(var_gamma7+var_gamma7');
  gamma(:,4,p+1)=mvnrnd(mu_gamma4,var_gamma4)';
  gamma(:,7,p+1)=mvnrnd(mu_gamma7,var_gamma7)';
  theta(:,1,p+1)=xx*gamma(:,4,p+1)+sqrt(-1)*xx*gamma(:,7,p+1);
  %Drawing next gamma5 and gamma8 corresponding to real(theta_2)
  %and imag(theta_2)
  A5=zeros(ngamma,ngamma);
  A5(1:2,1:2)=0.5/sigalpha*eye(2);
  A5(3:ngamma,3:ngamma)=0.5/tau(5,p)*eye(nbasis);
  A8=zeros(ngamma,ngamma);
  A8(1:2,1:2)=0.5/sigalpha*eye(2);
  A8(3:ngamma,3:ngamma)=0.5/tau(8,p)*eye(nbasis);
  v5=zeros(1,ngamma);
  v8=zeros(1,ngamma);
  B5=zeros(ngamma,ngamma);
  B8=zeros(ngamma,ngamma);
  for k=1:nf
   ck=real(theta(k,3,p))*(conj(y(k,1))*y(k,2)+conj(y(k,2))*y(k,1))-...
    (conj(y(k,3))*y(k,1)+y(k,3)*conj(y(k,1)));
   dk=abs(y(k,1))^2;
   k=imag(theta(k,3,p))*(conj(y(k,1))*y(k,2)+conj(y(k,2))*y(k,1))-...
    sqrt(-1)*(conj(y(k,3))*y(k,1)-y(k,3)*conj(y(k,1)));
   B5=B5+1/delta_sq(3,k,p)*dk*xx(k,:)'*xx(k,:);
   v5=v5-1/delta_sq(3,k,p)*ck*xx(k,:);
   v8=v8-1/delta_sq(3,k,p)*gk*xx(k,:);
  end
  B8=B5;
  var_gamma5=0.5*inv(B5+A5);
  mu_gamma5=var_gamma5*v5';
  var_gamma8=0.5*inv(B8+A8);
  mu_gamma8=var_gamma8*v8';
  var_gamma5=0.5*(var_gamma5+var_gamma5');
  var_gamma8=0.5*(var_gamma8+var_gamma8');
  gamma(:,5,p+1)=mvnrnd(mu_gamma5,var_gamma5)';
  gamma(:,8,p+1)=mvnrnd(mu_gamma8,var_gamma8)';
  theta(:,2,p+1)=xx*gamma(:,5,p+1)+sqrt(-1)*xx*gamma(:,8,p+1);
  %Drawing gamma6 and gamma9 corresponding to real(theta_3)
  %and imag(theta_3)
  A6=zeros(ngamma,ngamma);
  A6(1:2,1:2)=0.5/sigalpha*eye(2);
  A6(3:ngamma,3:ngamma)=0.5/tau(6,p)*eye(nbasis);
  A9=zeros(ngamma,ngamma);
  A9(1:2,1:2)=0.5/sigalpha*eye(2);
  A9(3:ngamma,3:ngamma)=0.5/tau(9,p)*eye(nbasis);
  v6=zeros(1,ngamma);
  v9=zeros(1,ngamma);
  B6=zeros(ngamma,ngamma);
  B9=zeros(ngamma,ngamma);
  for k=1:nf
   ck=real(theta(k,2,p))*(conj(y(k,1))*y(k,2)+conj(y(k,2))*y(k,1))-...
    (conj(y(k,3))*y(k,2)+y(k,3)*conj(y(k,2)));
   dk=abs(y(k,2))^2;
   gk=imag(theta(k,2,p))*(conj(y(k,1))*y(k,2)+conj(y(k,2))*y(k,1))-...
    sqrt(-1)*(conj(y(k,3))*y(k,2)-y(k,3)*conj(y(k,2)));
   B6=B6+1/delta_sq(3,k,p)*dk*xx(k,:)'*xx(k,:);
   v6=v6-1/delta_sq(3,k,p)*ck*xx(k,:);
   v9=v9-1/delta_sq(3,k,p)*gk*xx(k,:);
  end
  B9=B6;
  var_gamma6=0.5*inv(B6+A6);
  mu_gamma6=var_gamma6*v6';
  var_gamma9=0.5*inv(B9+A9);
  mu_gamma9=var_gamma9*v9';
  var_gamma6=0.5*(var_gamma6+var_gamma6');
  var_gamma9=0.5*(var_gamma9+var_gamma9');
  gamma(:,6,p+1)=mvnrnd(mu_gamma6,var_gamma6)';
  gamma(:,9,p+1)=mvnrnd(mu_gamma9,var_gamma9)';
  theta(:,3,p+1)=xx*gamma(:,6,p+1)+sqrt(-1)*xx*gamma(:,9,p+1);
  %Drawing the gammas corresponding to the deltas
  for j=1:3
   if(j==1)
    coef=abs(y(:,1)).^2;
   elseif(j==2)
    coef=abs(y(:,2)-theta(:,1,p+1).*y(:,1)).^2;
   else
    coef=abs(y(:,3)-theta(:,2,p+1).*y(:,1)-theta(:,3,p+1).*y(:,2)).^2;
  end
   param=gamma(:,j,p);
   if verLessThan('matlab','9.2')
    options = optimset('Display','off','GradObj','on','Hessian','on','MaxIter',10000,...
    'MaxFunEvals',10000,'TolFun',0.001,'TolX',0.00001);
   else
    options = optimoptions(@fminunc,'Display','off','GradObj','on','Hessian','on','MaxIter',10000,...
    'MaxFunEvals',10000,'TolFun',0.001,'TolX',0.00001,'Algorithm','trust-region');
   end
   [gamma_mean,fval,exitflag,output,grad,hessian]=...
   fminunc(@gamma_deriv,param,options,xx,ngamma,nf,nbasis,tau(j,p),...
    sigalpha,coef);
   if(p<nwarmup)
    gamma_var=4*inv(hessian);
   else
    gamma_var=1.6*inv(hessian);
   end
   gamma_var=0.5*(gamma_var+gamma_var');
   gamma(:,j,p+1)=mvnrnd(gamma_mean,gamma_var)';
   log_prop_gamma_new=-0.5*(gamma(:,j,p+1)-gamma_mean)'*inv(gamma_var)*...
      (gamma(:,j,p+1)-gamma_mean);
   log_prop_gamma_old=-0.5*(gamma(:,j,p)-gamma_mean)'*inv(gamma_var)*...
      (gamma(:,j,p)-gamma_mean);
   log_target_new=-sum(xx*gamma(:,j,p+1)+exp(-xx*gamma(:,j,p+1)).*coef)-...
      0.5*(gamma(1:2,j,p+1)'*gamma(1:2,j,p+1))/sigalpha-...
      0.5*(gamma(3:ngamma,j,p+1)'*gamma(3:ngamma,j,p+1))/tau(j,p);
   log_target_old=-sum(xx*gamma(:,j,p)+exp(-xx*gamma(:,j,p)).*coef)-...
      0.5*(gamma(1:2,j,p)'*gamma(1:2,j,p))/sigalpha-...
      0.5*(gamma(3:ngamma,j,p)'*gamma(3:ngamma,j,p))/tau(j,p);
   met_rat=log_target_new-log_target_old+log_prop_gamma_old-...
     log_prop_gamma_new;
   u=rand;
   epsilon(j,p)=min(1,exp(met_rat));
   if(u>epsilon(j,p)&p>1)
    gamma(:,j,p+1)=gamma(:,j,p);
   end
  end
  %Drawing the taus
  for j=1:9
   taub=2/(gamma(3:ngamma,j,p+1)'*gamma(3:ngamma,j,p+1));
   tau(j,p+1)=1/gamrnd(taua,taub);
  end
  if(p>nwarmup)
   for k=1:nf
    T=eye(3);
    T(2,1)=-theta(k,1,p);
    T(3,1)=-theta(k,2,p);
    T(3,2)=-theta(k,3,p);
    spectmat(:,:,k,p-nwarmup)=inv(T)*diag(delta_sq(:,k,p))*inv(T');
    %spectenv(k,p-nwarmup)=max(eig(2*invroot*...
    %    real(spectmat(:,:,k,p-nwarmup))*invroot./n));
   end
  end
end
mean_spectmat=mean(spectmat,4);
s11=reshape(mean_spectmat(1,1,:),nf,1);
s22=reshape(mean_spectmat(2,2,:),nf,1);
s33=reshape(mean_spectmat(3,3,:),nf,1);
s21=reshape(mean_spectmat(2,1,:),nf,1);
s31=reshape(mean_spectmat(3,1,:),nf,1);
s32=reshape(mean_spectmat(3,2,:),nf,1);
coh21=abs(s21).^2./(s11.*s22);
coh31=abs(s31).^2./(s11.*s33);
coh32=abs(s32).^2./(s22.*s33);
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
freq=(0:(nf-1))./(2*nf);
figure
subplot(1,3,1)
plot(freq*12,real(s11));xlabel('cycles/year');title('AUT')
subplot(1,3,2)
plot(freq*12,real(s22));xlabel('cycles/year');title('BUM')
subplot(1,3,3)
plot(freq*12,real(s33));xlabel('cycles/year');title('GEM')

Penalized Whittle likelihood method – Example 9.3(c) using MATLAB

The first part of the code is for various required functions, which can be placed in a different order. The second part of the code is to read in a data set and implement the procedure. I would like to thank Dr. Robert Krafty from University of Pittsburgh for sharing the penalized Whittle likelihood estimation codes.


function indexer=CholIndexer(P)
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Used for the general program to index the conversion of a NxN Cholesky
% matrix into a vector of size P^2.

Nreal = (P*(P+1))/2;
Nimag = (P*(P-1))/2;

indexer = zeros(P^2,5);
indexer(:,1) = 1:(P^2);
indexer(1:Nreal,2) =1;

bigguy = [kron(ones(P,1), (1:P)'), kron((1:P)',ones(P,1))];
indexer(1:Nreal, 3:4) = bigguy(bigguy(:,1)>=bigguy(:,2), :);
indexer((Nreal+1):end, 3:4) = bigguy(bigguy(:,1)>bigguy(:,2), :);

indexer(indexer(:,3)==indexer(:,4), 5)=1;
function spec =inChol2spec(inChol,indexer);
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Takes the inverse Cholesky components and returns the components of the
% spectral matrix

[P2,K] = size(inChol);
P=sqrt(P2);
Nreal = P*(P+1)/2;
spec = zeros(P2,K);

for k = 1:K
 % inv cholesky at freq k
 cholMat = zeros(P,P);
 for p = 1:Nreal
  cholMat(indexer(p,3), indexer(p,4)) = inChol(p,k);
 end;
 for p=(Nreal+1):P2
   cholMat(indexer(p,3), indexer(p,4)) = cholMat(indexer(p,3), indexer(p,4)) + 1i*inChol(p,k);
 end;
 % spec matrix freq k
 F_k = inv(cholMat*cholMat');
 for p=1:(Nreal)
   spec(p,k) = real( F_k( indexer(p,3), indexer(p,4)));
 end;
 for p=(Nreal+1):P2
   spec(p,k) = imag( F_k( indexer(p,3), indexer(p,4)));
 end;
end;

function [hatF, gcv]=NWspec(omegas,perds,logh)
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Computes the Gaussian Nadaraya-Watson kernel regression estimate of the
% components spectral matrix % by smoothing the components of the raw
% periodograms.

[N,K] = size(perds);
h=exp(logh);

hatF=zeros(N,K);
gcv = 0;
I_A = 0;

for k=1:K
 z=kerf((omegas(k)-omegas)/h);
 hatF(:,k) = perds*z'/sum(z);
 I_A = I_A + 1 - z(k)/sum(z);
end

gcv = log(sum(sum( (perds - hatF).^2))) - log(I_A*I_A);

function kerval=kerf(z)
kerval=exp(-z.*z/2)/sqrt(2*pi);

function inChol =spec2inChol(spec,indexer);
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Takes spectral components and gives the inverse Cholesky components

[P2,K] = size(spec);
P=sqrt(P2);
Nreal = P*(P+1)/2;
inChol = zeros(P2,K);

for k = 1:K
 % spectral matrix at freq k
 F_k = zeros(P,P);
 for p = 1:Nreal
   F_k(indexer(p,3), indexer(p,4)) = spec(p,k);
   F_k(indexer(p,4), indexer(p,3)) = spec(p,k);
 end;
 for p=(Nreal+1):P2
   F_k(indexer(p,3), indexer(p,4)) = F_k(indexer(p,3), indexer(p,4)) + 1i*spec(p,k);
   F_k(indexer(p,4), indexer(p,3)) = F_k(indexer(p,4), indexer(p,3)) - 1i*spec(p,k);
 end;
 % inv cholesky at freq k
 cholMat = chol(inv(F_k))';
 for p = 1:Nreal
   inChol(p,k) = real(cholMat(indexer(p,3), indexer(p,4)));
 end;
 for p = (Nreal+1):P2
   inChol(p,k) = imag(cholMat(indexer(p,3), indexer(p,4)));
 end;
end;

function [tildeX, perds, omegas] =time2freq(X);
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Takes a time series X and returns its DFT and the periodogram

[P,T] = size(X);
P2 = P^2;
K = floor((T-1)/2);
omegas = (1:K)/T;

indexer=CholIndexer(size(X,1));

tildeX = zeros(P,T);
for p=1:P
   tildeX(p,:) = fft(X(p,:))/sqrt(T);
end;
tildeX = tildeX(:,2:(K+1));

perds = zeros(P2,K);
Nreal = P*(P+1)/2;
for k = 1:K
 P_mat = tildeX(:,k)*tildeX(:,k)';
 for p=1:Nreal
   perds(p,k) = real(P_mat(indexer(p,3), indexer(p,4)));
 end;
 for p = (Nreal+1):P2
   perds(p,k) = imag(P_mat(indexer(p,3), indexer(p,4)));
 end;
end;

function [workvals, eW] = WorkingFish(startvals,tildeX,indexer);
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Gets the working variables and weight function for an iteration using
% the inverse Cholesky parameterization.

[P,K] = size(tildeX);
P2 = P*P;
Nreal = P*(P+1)/2;
spec = inChol2spec(startvals,indexer);

W = zeros(P2*K, P2*K);
eW = zeros(P2*K, P2*K);
workvals = zeros(P2,K);
for k = 1:K
 %
 % get the G and F matrices
 G = zeros(P,P);
 F = zeros(P,P);
 for p=1:Nreal;
   G(indexer(p,3), indexer(p,4))=startvals(p,k);
   F(indexer(p,3), indexer(p,4))=spec(p,k);
   F(indexer(p,4), indexer(p,3))=spec(p,k);
 end;
 for p=(Nreal+1):P2;
   G(indexer(p,3), indexer(p,4))= G(indexer(p,3), indexer(p,4)) + 1i*startvals(p,k);
   F(indexer(p,3), indexer(p,4))= F(indexer(p,3), indexer(p,4)) + 1i*spec(p,k);
   F(indexer(p,4), indexer(p,3))= F(indexer(p,4), indexer(p,3)) - 1i*spec(p,k);
 end;
 %
 % get the score
 U = zeros(P2,1);
 for p=1:P2
   dG = zeros(P,P);
   dG(indexer(p,3), indexer(p,4)) = indexer(p,2) + (1-indexer(p,2))*1i;
   %
   U(p) = tildeX(:,k)'*(dG*G' + G*dG')*tildeX(:,k) ...
     + indexer(p,5)*(-2/startvals(p,k));
 end;
 %
 % get the Hessian
 Hsmall = zeros(P2,P2);
 Fishmat = zeros(P2,P2);
 for p=1:P2
   dGp = zeros(P,P);
   dGp(indexer(p,3), indexer(p,4)) = indexer(p,2) + (1-indexer(p,2))*1i;
   for q=1:P2
    dGq = zeros(P,P);
    dGq(indexer(q,3), indexer(q,4)) = indexer(q,2) + (1-indexer(q,2))*1i;
    %
    Hsmall(p,q) = tildeX(:,k)'*(dGp*dGq' + dGq*dGp')*tildeX(:,k) ...
     + indexer(p,5)*(p==q)*(2/startvals(p,k)^2);
    Fishmat(p,q) = trace((dGp*dGq' + dGq*dGp')*F) ...
     + indexer(p,5)*(p==q)*(2/startvals(p,k)^2);
   end;
 end;
 W( (1+P2*(k-1)):(P2*k), (1+P2*(k-1)):(P2*k) ) = real(Hsmall);
 eW( (1+P2*(k-1)):(P2*k), (1+P2*(k-1)):(P2*k) ) = real(Fishmat);
 %
 % get the working variables
 workvals(:,k) = real(startvals(:,k) - real(Fishmat)U);
end;

function [hatF, GML] = penWhit(tildeX,omegavals,logLambda0);
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Fits the penalized multivariate Whittle likelihood

tolval = .0001;
MaxIt = 2;
[N,K] = size(tildeX);
P=N*N;
indexer=CholIndexer(N);
Preal=N*(N+1)/2;
Pimag=N*(N-1)/2;

perds = zeros(P,K);
for k = 1:K
 P_mat = tildeX(:,k)*tildeX(:,k)';
 for p=1:Preal
   perds(p,k) = real(P_mat(indexer(p,3), indexer(p,4)));
 end;
 for p = (Preal+1):P
   perds(p,k) = imag(P_mat(indexer(p,3), indexer(p,4)));
 end;
end;
%
%  smooth and get initial values
[hatF_NW, G] = NWspec(omegavals', perds,2*log(20/K));
current = spec2inChol(hatF_NW,indexer);

nb = min( floor(K/4),25 );
Q = zeros(P*K, P*nb+Preal);
S = zeros(P*nb+Preal, P*nb+Preal);
Qc = [ones(K,1), sqrt(2)*cos(2*pi*omegavals*(1:nb))];
for p=1:Preal
 ep=zeros(P,1);
 ep(p)=1;
 locp = ((p-1)*(nb+1)+1):p*(nb+1);
 Q(:, locp ) = kron(Qc,ep);
 S(locp,locp) = diag(exp(logLambda0(p))*(2*pi*(0:nb)).^4);
end;
Qs = sqrt(2)*sin(2*pi*omegavals*(1:nb));
for p=(Preal+1):P
 ep=zeros(P,1);
 ep(p)=1;
 pc=p-Preal;
 locp = ((pc-1)*nb + Preal*(nb+1)+1):(pc*nb+Preal*(nb+1));
 Q(:, locp ) = kron(Qs,ep);
 S(locp,locp) = diag(exp(logLambda0(p))*(2*pi*(1:nb)).^4);
end;
R=blkdiag( kron(eye(Preal,Preal),Qc), kron(eye(Pimag,Pimag),Qs));

old = current;
tol=1;
ct=0;
while (tol>tolval) && (ct<=MaxIt-1)
 [workvars, H] = WorkingFish(current,tildeX,indexer);
 tildeBeta = reshape(workvars, K*P ,1);
 bigest = (Q'*H*Q + S)Q'*H*tildeBeta;
 current = reshape(R*bigest, K,P)';
 %
 tol = sum(sum((current-old).^2))/sum(sum(old.^2));
 old=current;
 ct = ct+1;
end;

hatF =inChol2spec(current,indexer);

Ell = 0;
for k = 1:K;
 % get the inverse cholesky matrix
 Cholmat = zeros(N,N);
 for p = 1:Preal
  Cholmat(indexer(p,3), indexer(p,4)) = current(p,k);
 end;
 for p=(Preal+1):P
  Cholmat(indexer(p,3), indexer(p,4)) = Cholmat(indexer(p,3), indexer(p,4)) + 1i*current(p,k);
 end;
 %
 % add to the -2loglike
 Ell = Ell - log(det(Cholmat*Cholmat')) + tildeX(:,k)'*Cholmat*Cholmat'*tildeX(:,k);
end;

GML = real(Ell) ...
     -nb*sum(logLambda0)...
     + bigest'*S*bigest ...
     + sum(log(eig(Q'*H*Q + S)));

function [logLamHat,G]= optGML(tildeX, perds, omegas, logLamStart, options);
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Used for selecting smoothing parameters by optimizing the GML

indexer=CholIndexer(size(tildeX,1));

[Fstart, Gval] = NWspec(omegas, perds,2*log(20/length(omegas)));
startvals = spec2inChol(Fstart,indexer);

[logLamHat,G] = fminunc(@(llam) GLMfish(llam,tildeX,omegas',startvals,indexer),logLamStart, options);

function GML = GLMfish(logLambda0,tildeX,omegavals, startvals,indexer);

tolval = .0001;
MaxIt = 2;

[N,K] = size(tildeX);
P=N*N;
Preal=N*(N+1)/2;
Pimag=N*(N-1)/2;
current = startvals;

nb = min( floor(K/4),25 );
Q = zeros(P*K, P*nb+Preal);
S = zeros(P*nb+Preal, P*nb+Preal);
Qc = [ones(K,1), sqrt(2)*cos(2*pi*omegavals*(1:nb))];
for p=1:Preal
 ep=zeros(P,1);
 ep(p)=1;
 locp = ((p-1)*(nb+1)+1):p*(nb+1);
 Q(:, locp ) = kron(Qc,ep);
 S(locp,locp) = diag(exp(logLambda0(p))*(2*pi*(0:nb)).^4);
end;
Qs = sqrt(2)*sin(2*pi*omegavals*(1:nb));
for p=(Preal+1):P
 ep=zeros(P,1);
 ep(p)=1;
 pc=p-Preal;
 locp = ((pc-1)*nb + Preal*(nb+1)+1):(pc*nb+Preal*(nb+1));
 Q(:, locp ) = kron(Qs,ep);
 S(locp,locp) = diag(exp(logLambda0(p))*(2*pi*(1:nb)).^4);
end;
R=blkdiag( kron(eye(Preal,Preal),Qc), kron(eye(Pimag,Pimag),Qs));
old = current;
tol=1;
ct=0;
while (tol>tolval) &&#38; (ct<=MaxIt-1)
 [workvars, H] = WorkingFish(current,tildeX,indexer);
 tildeBeta = reshape(workvars, K*P ,1);
 bigest = (Q'*H*Q + S)Q'*H*tildeBeta;
 current = reshape(R*bigest, K,P)';
 %
 tol = sum(sum((current-old).^2))/sum(sum(old.^2));
 old=current;
 ct = ct+1;
end;
hatBeta = current;

hatF =inChol2spec(current,indexer);

Ell = 0;
for k = 1:K;
 % get the inverse cholesky matrix
 Cholmat = zeros(N,N);
 for p = 1:Preal
  Cholmat(indexer(p,3), indexer(p,4)) = current(p,k);
 end;
 for p=(Preal+1):P
  Cholmat(indexer(p,3), indexer(p,4)) = Cholmat(indexer(p,3), indexer(p,4)) + 1i*current(p,k);
 end;
 %
 % add to the -2loglike
 Ell = Ell - log(det(Cholmat*Cholmat')) + tildeX(:,k)'*Cholmat*Cholmat'*tildeX(:,k);
end;

GML = real(Ell) ...
     -nb*sum(logLambda0) ...
     + bigest'*S*bigest ...
     + sum(log(eig(Q'*H*Q + S)));

function bootcell = penWhitBoot(hatF,omegavals, logLambda0, nBoot, display);
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Get bootstrap estimates from the penalized multivariate Whittle
% likelihood

tolval = .0001;
MaxIt = 2;
%
[P,K]=size(hatF);
N=sqrt(P);
Preal = N*(N+1)/2;
Pimag = P - Preal;
bootcell=cell(P,1);
for j = 1:P
 bootcell{j} =zeros(nBoot,K);
end;

indexer=CholIndexer(N);

Fsqrt = cell(K,1);
for k = 1:K
 F = zeros(N,N);
 for p = 1:Preal
   F(indexer(p,3), indexer(p,4)) = hatF(p,k);
   F(indexer(p,4), indexer(p,3)) = hatF(p,k);
 end;
 for p = (Preal+1):P
   F(indexer(p,3), indexer(p,4)) = F(indexer(p,3), indexer(p,4)) + 1i*hatF(p,k);
   F(indexer(p,4), indexer(p,3)) = F(indexer(p,4), indexer(p,3)) - 1i*hatF(p,k);
 end;
 Fsqrt{k} = chol(F)';
end;

nb = min( floor(K/4),25 );
Q = zeros(P*K, P*nb+Preal);
S = zeros(P*nb+Preal, P*nb+Preal);
Qc = [ones(K,1), sqrt(2)*cos(2*pi*omegavals*(1:nb))];
for p=1:Preal
 ep=zeros(P,1);
 ep(p)=1;
 locp = ((p-1)*(nb+1)+1):p*(nb+1);
 Q(:, locp ) = kron(Qc,ep);
 S(locp,locp) = diag(exp(logLambda0(p))*(2*pi*(0:nb)).^4);
end;
Qs = sqrt(2)*sin(2*pi*omegavals*(1:nb));
for p=(Preal+1):P
 ep=zeros(P,1);
 ep(p)=1;
 pc=p-Preal;
 locp = ((pc-1)*nb + Preal*(nb+1)+1):(pc*nb+Preal*(nb+1));
 Q(:, locp ) = kron(Qs,ep);
 S(locp,locp) = diag(exp(logLambda0(p))*(2*pi*(1:nb)).^4);
end;
R=blkdiag( kron(eye(Preal,Preal),Qc), kron(eye(Pimag,Pimag),Qs));

tic
for b=1:nBoot
 %%%%%% display options
 if display==1
   b
 end;
 if display==2
   [b, toc]
   tic
 end;

 %%%%%% pull the DFT residuals
 tildeXBoot=zeros(N,K);
 resids = normrnd(0,sqrt(1/2),N,K)+1i*normrnd(0,sqrt(1/2),N,K);

 % get the fourier transform and periodograms
 perds = zeros(P,K);
 for k = 1:K
   tildeXBoot(:,k) = Fsqrt{k}*resids(:,k);
   P_mat = tildeXBoot(:,k)*tildeXBoot(:,k)';
   for p=1:Preal
    perds(p,k) = real(P_mat(indexer(p,3), indexer(p,4)));
   end;
   for p = (Preal+1):P
    perds(p,k) = imag(P_mat(indexer(p,3), indexer(p,4)));
   end;
 end;

 % run iterative least squares
 % use NWKern to get the starting values
 [hatF_NW, Gval] = NWspec(omegavals', perds,2*log(20/K));
   current = spec2inChol(hatF_NW,indexer);

   old = current;
   tol=1;
   ct=0;
   while (tol>tolval) && (ct<=MaxIt-1)
    [workvars, H] = WorkingFish(current,tildeXBoot,indexer);
    tildeBeta = reshape(workvars, K*P ,1);
    bigest = (Q'*H*Q + S)Q'*H*tildeBeta;
    current = reshape(R*bigest, K,P)';
    %
    tol = sum(sum((current-old).^2))/sum(sum(old.^2));
    old=current;
    ct = ct+1;
    end;
   %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
   % get hatF and save
   hatFboot =inChol2spec(current,indexer);
   for p = 1:P
    bootcell{p}(b,:) = hatFboot(p,:);
   end;
   %%%%%%%%%%%%%%%%%%%%%%%%%%%
end;

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% The code reproduces the analysis of the simulated bivariate AR(2) from
% the supplement to "Penalized Multivariate Whittle Likelihood for Power
% Spectrum Estimation" by Krafty and Collinge
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

sales = csvread('C:/Bookdata/WW2b.csv',1,1);
sales = diff(sales);
X = sales';

[tildeX, perds, omegas] =time2freq(X);

logLamStart = -2*ones(25,1); %starting values
options = optimset('LargeScale','off','Display','iter', 'TolX', .01); %options for the optimization
[logLamHat,G]= optGML(tildeX, perds, omegas, logLamStart, options);

[hatF, GLM] = penWhit(tildeX,omegas',logLamHat);

figure
subplot(1,5,1); plot((0:43)/88*12,hatF(1,:))
subplot(1,5,2); plot((0:43)/88*12,hatF(6,:))
subplot(1,5,3); plot((0:43)/88*12,hatF(10,:))
subplot(1,5,4); plot((0:43)/88*12,hatF(13,:))
subplot(1,5,5); plot((0:43)/88*12,hatF(15,:))

VAR estimation of multivariate spectrum – Example 9.3(d) using R

library(MTS)
library(astsa)
#=================
#VAR analysis
#=================

# Two necessary functions

# Obtain Spectrum from autoregressive model
ARspect <- function(phi,sigma,freq){
 dimen <- dim(phi)[2]
 len <- dim(phi)[1]

 bigphi <- array(0,dim=c(dimen,dimen,length(freq)))
 spect <- array(0,dim=c(dimen,dimen,length(freq)))
 for(k in 1:length(freq)){
   bigphi[,,k] <- diag(dimen)
   for(j in 1:(len/dimen)){
    if(j==1){
     bigmat <- phi[1:dimen,]*exp(-2*pi*(1i)*freq[k])
    }else{
      bigmat <- phi[((dimen*j-(dimen-1)):(dimen*j)),]*exp(-2*j*pi*(1i)*freq[k])
    }
    bigphi[,,k] = bigphi[,,k] - bigmat
   }
   spect[,,k] = solve(bigphi[,,k])%*%sigma%*%solve(Conj(t(bigphi[,,k])))
 }
 return(spect/(2*pi))
}

# Plot the coherence
plot.coherency<-function (x, ci = 0.95, xlab = "frequency", ylab = "squared coherency",
       ylim = c(0, 1), type = "l", main = NULL, ci.col = "blue",
       ci.lty = 3, ...)
{
 nser <- NCOL(x$spec)
 gg <- 2/x$df
 se <- sqrt(gg/2)
 z <- -qnorm((1 - ci)/2)
 if (is.null(main))
   main <- paste(paste("Series:", x$series), "Squared Coherency",
       sep = " -- ")
 if (nser == 2) {
   plot(x$freq, x$coh, type = type, xlab = xlab, ylab = ylab,
     ylim = ylim, ...)
   coh <- pmin(0.99999, sqrt(x$coh))
   lines(x$freq, (tanh(atanh(coh) + z * se))^2, lty = ci.lty,
     col = ci.col)
   lines(x$freq, (pmax(0, tanh(atanh(coh) - z * se)))^2,
     lty = ci.lty, col = ci.col)
   title(main)
 }
 else {
   dev.hold()
   on.exit(dev.flush())
   opar <- par(mfrow = c(nser - 1, nser - 1), mar = c(1.5, 1.5, 0.5, 0.5), oma = c(4, 4, 6, 4))
   on.exit(par(opar), add = TRUE)
   plot.new()
   for (j in 2:nser) for (i in 1L:(j - 1)) {
     par(mfg = c(j - 1, i, nser - 1, nser - 1))
     ind <- i + (j - 1) * (j - 2)/2
     plot(x$freq, x$coh[, ind], type = type, ylim = ylim,
      axes = FALSE, xlab = "", ylab = "", ...)
     box()
     if (i == 1) {
       axis(2, xpd = NA)
       title(ylab = x$snames[j], xpd = NA)
     }
     if (j == nser) {
       axis(1, xpd = NA)
       title(xlab = x$snames[i], xpd = NA)
     }
     mtext(main, 3, 3, TRUE, 0.5, cex = par("cex.main"),
       font = par("font.main"))
   }
 }
 invisible()
}

# data preparation

ms <- as.data.frame(read.csv("C:/Bookdata/WW2b.csv")[,-1])
rownames(ms) <- seq(as.Date("2009/6/1"), by="month", length=90)
z.orig <- ts(ms,start=c(2009,6),frequency=12)
z <- diff(z.orig,1)
plot(z)
z = ts(apply(z,2,scale), freq=1)  # scaling strips its attributes
n = nrow(z)

# VAR(4) model fitting and spectrum estimation

model <- VARMA(z,p=4,include.mean=FALSE)
nfreq = floor(n/2); freq = (0:nfreq)/(2*nfreq);
f <- ARspect(model$coef,model$Sigma,freq)

> par(mfrow=c(1,5))
plot(freq*12,Re(f[1,1,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[1])
plot(freq*12,Re(f[2,2,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[2])
plot(freq*12,Re(f[3,3,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[3])
plot(freq*12,Re(f[4,4,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[4])
plot(freq*12,Re(f[5,5,]),type='l',lty=1,ylab="Spectrum",xlab="Cycles/year",main=colnames(ms)[5])

# quadrature spectrum
c51 = Re(f[5,1,])
c52 = Re(f[5,2,])
c53 = Re(f[5,3,])
c54 = Re(f[5,4,])
c41 = Re(f[4,1,])
c42 = Re(f[4,2,])
c43 = Re(f[4,3,])
c31 = Re(f[3,1,])
c32 = Re(f[3,2,])
c21 = Re(f[2,1,])

# plot part of quadrature spectrum
par(mfrow=c(1,4))
plot(freq*12, c51, type='l',lty=1,ylab="Co-spectrum",xlab="Cycles/year",main="(a)")
plot(freq*12, c52, type='l',lty=1,ylab="Co-spectrum",xlab="Cycles/year",main="(b)")
plot(freq*12, c53, type='l',lty=1,ylab="Co-spectrum",xlab="Cycles/year",main="(c)")
plot(freq*12, c54, type='l',lty=1,ylab="Co-spectrum",xlab="Cycles/year",main="(d)")

# co spectrum
q51 = Im(f[5,1,])
q52 = Im(f[5,2,])
q53 = Im(f[5,3,])
q54 = Im(f[5,4,])
q41 = Im(f[4,1,])
q42 = Im(f[4,2,])
q43 = Im(f[4,3,])
q31 = Im(f[3,1,])
q32 = Im(f[3,2,])
q21 = Im(f[2,1,])

# plot part of co spectrum
par(mfrow=c(1,4))
plot(freq*12,q51,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="(a)")
plot(freq*12,q52,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="(b)")
plot(freq*12,q53,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="(c)")
plot(freq*12,q54,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="(d)")

# phase spectrum
phi = matrix(0,length(freq),10)
phi[,1] = atan(-Im(f[2,1,])/Re(f[2,1,])) # between 2-1
phi[,2] = atan(-Im(f[3,1,])/Re(f[3,1,])) # between 3-1
phi[,3] = atan(-Im(f[3,2,])/Re(f[3,2,])) # between 3-2
phi[,4] = atan(-Im(f[4,1,])/Re(f[4,1,])) # between 4-1
phi[,5] = atan(-Im(f[4,2,])/Re(f[4,2,])) # between 4-2
phi[,6] = atan(-Im(f[4,3,])/Re(f[4,3,])) # between 4-3
phi[,7] = atan(-Im(f[5,1,])/Re(f[5,1,])) # between 5-1
phi[,8] = atan(-Im(f[5,2,])/Re(f[5,2,])) # between 5-2
phi[,9] = atan(-Im(f[5,3,])/Re(f[5,3,])) # between 5-3
phi[,10] = atan(-Im(f[5,4,])/Re(f[5,4,])) # between 5-4

# plot part of phase spectrum
par(mfrow=c(1,4))
plot(freq*12, phi[,7] ,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="5,1")
plot(freq*12, phi[,8] ,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="5,2")
plot(freq*12, phi[,9] ,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="5,3")
plot(freq*12, phi[,10] ,type='l',lty=1,ylab="Quadrature",xlab="Cycles/year",main="5,4")

# coherency
coh <- matrix(0,length(freq),10)
coh[,1] <- Re(Mod(f[2,1,])^2/(f[1,1,] * f[2,2,]))
coh[,2] <- Re(Mod(f[3,1,])^2/(f[1,1,] * f[3,3,]))
coh[,3] <- Re(Mod(f[3,2,])^2/(f[2,2,] * f[3,3,]))
coh[,4] <- Re(Mod(f[4,1,])^2/(f[1,1,] * f[4,4,]))
coh[,5] <- Re(Mod(f[4,2,])^2/(f[2,2,] * f[4,4,]))
coh[,6] <- Re(Mod(f[4,3,])^2/(f[3,3,] * f[4,4,]))
coh[,7] <- Re(Mod(f[5,1,])^2/(f[1,1,] * f[5,5,]))
coh[,8] <- Re(Mod(f[5,2,])^2/(f[2,2,] * f[5,5,]))
coh[,9] <- Re(Mod(f[5,3,])^2/(f[3,3,] * f[5,5,]))
coh[,10] <- Re(Mod(f[5,4,])^2/(f[4,4,] * f[5,5,]))
spec <- list()
spec$spec <- f
spec$freq <- freq
spec$coh <- coh
spec$snames <- colnames(z)
plot.coherency(spec, ci=NA, main="Squared Coherencies")
plot.coherency(spec, ci=NA, main="Squared Coherencies")

Code for multivariate nonstationary spectral procedures using MATLAB

Again, the first part of the following code is for various required functions, which can be placed in a different order. The last part is to read in a data set and implement the procedures.

Empirical Example 9.4 of nonstationary vector time series – analysis of financial data

function [f,gr,h] = Beta_derive1(x, yobs_tmp, chol_index, Phi_temp, tau_temp,...
            Beta_temp, sigmasqalpha, nbasis)
global dimen
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Function used for optimization process for coefficients selected to % be changed
%
% Input:
%  1) x - initial values for coefficient of basis functions need to %  be optimized
%  2) yobs_tmp - time series data within the segment
%  3) chol_index - index matrix
%  4) Phi_temp - which component changed
%  5) tau_temp - smoothing parameters
%  6) Beta_temp - current coefficients
%  7) sigmasqalpha - smoothing parameters for the constant in real %  components
%  8) nbasis - number of basis functions used
% Main Outputs:
%  1) f - log posterior probability based on input parameters
%  2) gr - gradients for optimization process
%  3) h - Hessian matrix for optimization process
%
% Required programs: lin_basis_func
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%initilize Beta_1 and Beta_2: Beta_2 is for imaginary components
nBeta = nbasis + 1;
Beta_1 = zeros(nBeta,(dimen + dimen*(dimen-1)/2));
Beta_2 = zeros(nBeta,dimen*(dimen-1)/2);

select = chol_index(Phi_temp,:).*(1:dimen^2);
select = select(select∼=0);

x = reshape(x,nBeta,length(select));
Beta_temp(:,select) = x;
Beta_1(:,:) = Beta_temp(:,1:(dimen + dimen*(dimen-1)/2));
Beta_2(:,:) = Beta_temp(1:nBeta,(dimen + dimen*(dimen-1)/2 + 1): end);

dim = size(yobs_tmp); n = dim(1);
nfreq = floor(n/2); tt = (0:nfreq)/(2*nfreq);
yy = fft(yobs_tmp)/sqrt(n); y = yy(1:(nfreq+1),:); nf = length(y);
[xx_r, xx_i]=lin_basis_func(tt);

%theta's
theta = zeros(dimen*(dimen-1)/2,nf);
for i=1:dimen*(dimen-1)/2
 theta_real = xx_r * Beta_1(:,i+dimen);
 theta_imag = xx_i * Beta_2(:,i);
 theta(i,:) = theta_real + sqrt(-1)*theta_imag;
end
%delta's
delta_sq = zeros(dimen,nf);
for i=1:dimen
 delta_sq(i,:) = exp(xx_r * Beta_1(:,i));
end

if dimen==2 %Bivariate Time Series
 if (mod(n,2)==1)%odd n
   f = -sum(log(delta_sq(1,2:end))' + log(delta_sq(2,2:end))' + abs(y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,1)) + ...
    abs(y(2:end,2) - theta(2:end).'.*y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,2))) - ...
    0.5*(log(delta_sq(1,1))' + log(delta_sq(2,1))' + abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1)) + ...
    abs(y(1,2) - theta(1).'.*y(1,1))^2.*exp(-xx_r(1,:)*Beta_1(:,2)));
 else
   f = -sum(log(delta_sq(1,2:nfreq))' + log(delta_sq(2,2:nfreq))' + abs(y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,1)) + ...
   abs(y(2:nfreq,2) - theta(2:nfreq).'.*y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,2))) - ...
   0.5*(log(delta_sq(1,1)) + log(delta_sq(2,1)) + abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1)) + ...
   abs(y(1,2) - theta(1).'.*y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,2))) - ...
   0.5*(log(delta_sq(1,end)) + log(delta_sq(2,end)) + abs(y(end,1)).^2.*exp(-xx_r(end,:)*Beta_1(:,1)) + ...
   abs(y(end,2) - theta(end).'.*y(end,1))^2.*exp(-xx_r(end,:)*Beta_1(:,2)));
 end
 f = f - (0.5.*(Beta_1(1,1)* Beta_1(1,1)')/sigmasqalpha + 0.5.*(Beta_1(2:nBeta,1)'*Beta_1(2:nBeta,1))/tau_temp(1))*chol_index(Phi_temp,1) -...
   (0.5.*(Beta_1(1,2)* Beta_1(1,2)')/sigmasqalpha + 0.5.*(Beta_1(2:nBeta,2)'*Beta_1(2:nBeta,2))/tau_temp(2))*chol_index(Phi_temp,2) -...
   (0.5.*(Beta_1(1,3)* Beta_1(1,3)')/sigmasqalpha + 0.5.*(Beta_1(2:nBeta,3)'*Beta_1(2:nBeta,3))/tau_temp(3))*chol_index(Phi_temp,3) -...
   (0.5.*(Beta_2(1:nBeta,1)'*Beta_2(1:nBeta,1))/tau_temp(4))*chol_index(Phi_temp,4);

 gr1 = zeros(nBeta,1); gr2 = zeros(nBeta,1); gr3 = zeros(nBeta,1); gr4 = zeros(nBeta,1);
 gr1(1) = Beta_1(1,1)/sigmasqalpha; gr1(2:nBeta,1) = Beta_1(2:nBeta,1)/tau_temp(1);
 gr2(1) = Beta_1(1,2)/sigmasqalpha; gr2(2:nBeta,1) = Beta_1(2:nBeta,2)/tau_temp(2);
 gr3(1) = Beta_1(1,3)/sigmasqalpha; gr3(2:nBeta,1) = Beta_1(2:nBeta,3)/tau_temp(3);
 gr4(1:nBeta,1) = Beta_2(1:nBeta,1)/tau_temp(4);
 h11(1,1)=1/sigmasqalpha; h11(2:nBeta,2:nBeta)=1/tau_temp(1)*eye(nbasis);
 h22(1,1)=1/sigmasqalpha; h22(2:nBeta,2:nBeta)= 1/tau_temp(2)*eye(nbasis);
 h33(1,1)=1/sigmasqalpha; h33(2:nBeta,2:nBeta)= 1/tau_temp(3)*eye(nbasis);
 h44(1:nBeta,1:nBeta)=1/tau_temp(4)*eye(nBeta);
 h42 = zeros(nBeta,nBeta); h32 = zeros(nBeta,nBeta);

 if (mod(n,2)==1)
   %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
   %gradient
   %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
   rk = -y(2:end,1).*conj(y(2:end,2)) - y(2:end,2).*conj(y(2:end,1));
   ik = sqrt(-1)*(-y(2:end,1).*conj(y(2:end,2)) + y(2:end,2).*conj(y(2:end,1)));
  ck = 2*abs(y(2:end,1)).^2;

   gr1 = gr1 + xx_r(2:end,:)'*(1-abs(y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,1))) + ...
      0.5*(xx_r(1,:)'*(1-abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1))));
   gr2 = gr2 + xx_r(2:end,:)'*(1 - abs(y(2:end,2)-theta(2:end).'.*y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,2))) + ...
      0.5*(xx_r(1,:)'*(1 - abs(y(1,2)-theta(1).'.*y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,2))));
   temp_mat_31 = bsxfun(@times, xx_r(2:end,:),rk);
   temp_mat_32 = bsxfun(@times, ck, bsxfun(@times, xx_r(2:end,:),xx_r(2:end,:)*Beta_1(:,3)));
   gr3 = gr3 + sum( bsxfun(@times, (temp_mat_31+temp_mat_32), exp(-xx_r(2:end,:)*Beta_1(:,2)) ))' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(-y(1,1).*conj(y(1,2)) - y(1,2).*conj(y(1,1)))*xx_r(1,:)' +...
      exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,3))*xx_r(1,:)');
   temp_mat_41 = bsxfun(@times, ik, xx_i(2:end,:));
   temp_mat_42 = bsxfun(@times, ck, bsxfun(@times,xx_i(2:end,:),xx_i(2:end,:)*Beta_2(:,1)));
   gr4 = gr4 + sum( bsxfun(@times, (temp_mat_41 + temp_mat_42), exp(-xx_r(2:end,:)*Beta_1(:,2))))' + ...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(sqrt(-1)*(-y(1,1).*conj(y(1,2)) + y(1,2).*conj(y(1,1))))*xx_i(1,:)' +...
      exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,1))*xx_i(1,:)');
   %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
   %Hessian
   %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

   bigmat_h11 = kron(bsxfun(@times, abs(y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,1)), xx_r(2:end,:)),ones(nBeta,1)');
   coefmat_h11 = repmat(xx_r(2:end,:), 1,nBeta);
   h11 = h11 + reshape(sum(bsxfun(@times, bigmat_h11, coefmat_h11),1),nBeta,nBeta) +...
     0.5*(abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1))*xx_r(1,:)'*xx_r(1,:));

   bigmat_h22 = kron(bsxfun(@times, abs(y(2:end,2)-theta(2:end).'.*y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,2)),...
                   xx_r(2:end,:)), ones(nBeta,1)');
   coefmat_h22 = repmat(xx_r(2:end,:), 1,nBeta);
   h22 = h22 + reshape(sum(bsxfun(@times, bigmat_h22, coefmat_h22),1),nBeta,nBeta) +...
     0.5*(abs(y(1,2)-theta(1).'.*y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,2))*xx_r(1,:)'*xx_r(1,:));

   bigmat_h33 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,2)).*ck, xx_r(2:end,:)),ones(nBeta,1)');
   coefmat_h33 = repmat(xx_r(2:end,:), 1,nBeta);
   h33 = h33 + reshape(sum(bsxfun(@times, bigmat_h33, coefmat_h33),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*xx_r(1,:)'*xx_r(1,:));


   bigmat_h44 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,2)).*ck, xx_i(2:end,:)),ones(nBeta,1)');
   coefmat_h44 = repmat(xx_i(2:end,:), 1,nBeta);
   h44 = h44 + reshape(sum(bsxfun(@times, bigmat_h44, coefmat_h44),1),nBeta,nBeta) +...
     0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*xx_i(1,:)'*xx_i(1,:));

   bigmat_h42_1 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,2)).*ck.*(xx_i(2:end,:)*Beta_2(:,1)),...
     xx_i(2:end,:)), ones(nBeta,1)');
   coefmat_h42_1 = repmat(xx_r(2:end,:), 1,nBeta);
   bigmat_h42_2 = kron(bsxfun(@times,exp(-xx_r(2:end,:)*Beta_1(:,2)).*ik, xx_i(2:end,:)), ones(nBeta,1)');
   coefmat_h42_2 = repmat(xx_r(2:end,:), 1,nBeta);
   h42 = h42 + reshape(sum(bsxfun(@times, bigmat_h42_1, coefmat_h42_1) +...
   bsxfun(@times, bigmat_h42_2, coefmat_h42_2),1),nBeta,nBeta)' +...
     0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,1))*xx_i(1,:)+...
   sqrt(-1)*(-y(1,1).*conj(y(1,2)) + y(1,2).*conj(y(1,1)))*xx_i(1,:))'*xx_r(1,:));
   bigmat_h32_1 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,2)).*ck.*(xx_r(2:end,:)*Beta_1(:,3)),...
     xx_r(2:end,:)), ones(nBeta,1)');
   coefmat_h32_1 = repmat(xx_r(2:end,:), 1,nBeta);
   bigmat_h32_2 = kron(bsxfun(@times,exp(-xx_r(2:end,:)*Beta_1(:,2)).*rk, xx_r(2:end,:)), ones(nBeta,1)');
   coefmat_h32_2 = repmat(xx_r(2:end,:), 1,nBeta);
   h32 = h32 + reshape(sum(bsxfun(@times, bigmat_h32_1, coefmat_h32_1) +...
   bsxfun(@times, bigmat_h32_2, coefmat_h32_2),1),nBeta,nBeta)' +...
     0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,3))*xx_r(1,:)+...
     (-y(1,1).*conj(y(1,2)) - y(1,2).*conj(y(1,1)))*xx_r(1,:))'*xx_r(1,:));
   h24=h42'; h23=h32';
 else
   %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
   %gradient
   %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
   rk = -y(2:nfreq,1).*conj(y(2:nfreq,2)) - y(2:nfreq,2).*conj(y(2:nfreq,1));
   ik = sqrt(-1)*(-y(2:nfreq,1).*conj(y(2:nfreq,2)) + y(2:nfreq,2).*conj(y(2:nfreq,1)));
   ck = 2*abs(y(2:nfreq,1)).^2;

   gr1 = gr1 + xx_r(2:nfreq,:)'*(1-abs(y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,1))) + ...
      0.5*(xx_r(1,:)'*(1-abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1)))) +...
           0.5*(xx_r(end,:)'*(1-abs(y(end,1)).^2.*exp(-xx_r(end,:)*Beta_1(:,1))));
   gr2 = gr2 + xx_r(2:nfreq,:)'*(1 - abs(y(2:nfreq,2)-theta(2:nfreq).'.*y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,2))) + ...
      0.5*(xx_r(1,:)'*(1 - abs(y(1,2)-theta(1).'.*y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,2)))) + ...
       0.5*(xx_r(end,:)'*(1 - abs(y(end,2)-theta(end).'.*y(end,1)).^2.*exp(-xx_r(end,:)*Beta_1(:,2))));
   temp_mat_31 = bsxfun(@times,rk, xx_r(2:nfreq,:));
   temp_mat_32 = bsxfun(@times,ck,bsxfun(@times, xx_r(2:nfreq,:),xx_r(2:nfreq,:)*Beta_1(:,3)));
   gr3 = gr3 + sum(bsxfun(@times, (temp_mat_31 + temp_mat_32), exp(-xx_r(2:nfreq,:)*Beta_1(:,2)) ))' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(-y(1,1).*conj(y(1,2)) - y(1,2).*conj(y(1,1)))*xx_r(1,:)' +...
      exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,3))*xx_r(1,:)') +...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*(-y(end,1).*conj(y(end,2)) - y(end,2).*conj(y(end,1)))*xx_r(end,:)' +...
      exp(-xx_r(end,:)*Beta_1(:,2))*2*abs(y(end,1)).^2*(xx_r(end,:)*Beta_1(:,3))*xx_r(end,:)');
   temp_mat_41 = bsxfun(@times, ik, xx_i(2:nfreq,:));
   temp_mat_42 = bsxfun(@times, ck, bsxfun(@times,xx_i(2:nfreq,:),xx_i(2:nfreq,:)*Beta_2(:,1)));
   gr4 = gr4 + sum( bsxfun(@times, (temp_mat_41 + temp_mat_42), exp(-xx_r(2:nfreq,:)*Beta_1(:,2))))' + ...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(sqrt(-1)*(-y(1,1).*conj(y(1,2)) + y(1,2).*conj(y(1,1))))*xx_i(1,:)' +...
      exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,1))*xx_i(1,:)') +...
       0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*(sqrt(-1)*(-y(end,1).*conj(y(end,2)) + y(end,2).*conj(y(end,1))))*xx_i(end,:)' +...
      exp(-xx_r(end,:)*Beta_1(:,2))*2*abs(y(end,1)).^2*(xx_i(end,:)*Beta_2(:,1))*xx_i(end,:)');
   %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
   %Hessian
   %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
   bigmat_h11 = kron(bsxfun(@times, abs(y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,1)), xx_r(2:nfreq,:)),ones(nBeta,1)');
   coefmat_h11 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   h11 = h11 + reshape(sum(bsxfun(@times, bigmat_h11, coefmat_h11),1),nBeta,nBeta) +...
      0.5*(abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1))*xx_r(1,:)'*xx_r(1,:))+...
      0.5*(abs(y(end,1)).^2.*exp(-xx_r(end,:)*Beta_1(:,1))*xx_r(end,:)'*xx_r(end,:));

   bigmat_h22 = kron(bsxfun(@times, abs(y(2:nfreq,2)-theta(2:nfreq).'.*y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,2)),...
          xx_r(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h22 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   h22 = h22 + reshape(sum(bsxfun(@times, bigmat_h22, coefmat_h22),1),nBeta,nBeta) +...
      0.5*(abs(y(1,2)-theta(1).'.*y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,2))*xx_r(1,:)'*xx_r(1,:))+...
      0.5*(abs(y(end,2)-theta(end).'.*y(end,1)).^2.*exp(-xx_r(end,:)*Beta_1(:,2))*xx_r(end,:)'*xx_r(end,:));

   bigmat_h33 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,2)).*ck, xx_r(2:nfreq,:)),ones(nBeta,1)');
   coefmat_h33 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   h33 = h33 + reshape(sum(bsxfun(@times, bigmat_h33, coefmat_h33),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*xx_r(1,:)'*xx_r(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*2*abs(y(end,1)).^2*xx_r(end,:)'*xx_r(end,:));

   bigmat_h44 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,2)).*ck, xx_i(2:nfreq,:)),ones(nBeta,1)');
   coefmat_h44 = repmat(xx_i(2:nfreq,:), 1,nBeta);
   h44 = h44 + reshape(sum(bsxfun(@times, bigmat_h44, coefmat_h44),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*xx_i(1,:)'*xx_i(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*2*abs(y(end,1)).^2*xx_i(end,:)'*xx_i(end,:));

   bigmat_h42_1 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,2)).*ck.*(xx_i(2:nfreq,:)*Beta_2(:,1)),...
      xx_i(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h42_1 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   bigmat_h42_2 = kron(bsxfun(@times,exp(-xx_r(2:nfreq,:)*Beta_1(:,2)).*ik, xx_i(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h42_2 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   h42 = h42 + reshape(sum(bsxfun(@times, bigmat_h42_1, coefmat_h42_1) +...
      bsxfun(@times, bigmat_h42_2, coefmat_h42_2),1),nBeta,nBeta)' +...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,1))*xx_i(1,:)+...
           sqrt(-1)*(-y(1,1).*conj(y(1,2)) + y(1,2).*conj(y(1,1)))*xx_i(1,:))'*xx_r(1,:)) +...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*(2*abs(y(end,1)).^2*(xx_i(end,:)*Beta_2(:,1))*xx_i(end,:)+...
       sqrt(-1)*(-y(end,1).*conj(y(end,2)) + y(end,2).*conj(y(end,1)))*xx_i(end,:))'*xx_r(end,:));

   bigmat_h32_1 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,2)).*ck.*(xx_r(2:nfreq,:)*Beta_1(:,3)),...
      xx_r(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h32_1 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   bigmat_h32_2 = kron(bsxfun(@times,exp(-xx_r(2:nfreq,:)*Beta_1(:,2)).*rk, xx_r(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h32_2 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   h32 = h32 + reshape(sum(bsxfun(@times, bigmat_h32_1, coefmat_h32_1) +...
      bsxfun(@times, bigmat_h32_2, coefmat_h32_2),1),nBeta,nBeta)' +...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,3))*xx_r(1,:)+...
   (-y(1,1).*conj(y(1,2)) - y(1,2).*conj(y(1,1)))*xx_r(1,:))'*xx_r(1,:)) +...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*(2*abs(y(end,1)).^2*(xx_r(end,:)*Beta_1(:,3))*xx_r(end,:)+...
   (-y(end,1).*conj(y(end,2)) - y(end,2).*conj(y(end,1)))*xx_r(end,:))'*xx_r(end,:));
   h24=h42'; h23=h32';
 end
 h1 = [h11,zeros(nBeta,2*nBeta+nBeta)];
 h2 = [zeros(nBeta,nBeta),h22,-h23,-h24];
 h3 = [zeros(nBeta,nBeta),-h32,h33,zeros(nBeta,nBeta)];
 h4 = [zeros(nBeta,nBeta),-h42,zeros(nBeta,nBeta),h44];
 gr = [gr1;gr2;gr3;gr4]; h = [h1;h2;h3;h4]; f = -f;
 gr_index = (1:(4*nBeta)).*[kron(chol_index(Phi_temp,1:3),ones(nBeta,1)'), kron(chol_index(Phi_temp,4),ones(nBeta,1)')];
 gr_index = gr_index(find(gr_index~=0));
 gr = gr(gr_index); h = h(gr_index,gr_index);

elseif dimen==3

 if (mod(n,2)==1) %odd n
   f = -sum(log(delta_sq(1,2:end))' + log(delta_sq(2,2:end))' + log(delta_sq(3,2:end))' + ...
      conj(y(2:end,1)).*y(2:end,1).*exp(-xx_r(2:end,:)*Beta_1(:,1)) + ...
       conj(y(2:end,2) - theta(1,2:end).'.*y(2:end,1)).*(y(2:end,2) - theta(1,2:end).'.*y(2:end,1)).*exp(-xx_r(2:end,:)*Beta_1(:,2))+...
      conj(y(2:end,3) -(theta(2,2:end).'.*y(2:end,1)+ theta(3,2:end).'.*y(2:end,2))).*(y(2:end,3) -(theta(2,2:end).'.*y(2:end,1)+ theta(3,2:end).'.*y(2:end,2))).*...
      exp(-xx_r(2:end,:)*Beta_1(:,3))) - ...
       0.5*(log(delta_sq(1,1))' + log(delta_sq(2,1))' + log(delta_sq(3,1))' + ...
      conj(y(1,1)).*y(1,1).*exp(-xx_r(1,:)*Beta_1(:,1)) + ...
      conj(y(1,2) - theta(1,1).'.*y(1,1)).*(y(1,2) - theta(1,1).'.*y(1,1)).*exp(-xx_r(1,:)*Beta_1(:,2))+...
      conj(y(1,3) -(theta(2,1).'.*y(1,1)+ theta(3,1).'.*y(1,2))).*(y(1,3) -(theta(2,1).'.*y(1,1)+ theta(3,1).'.*y(1,2))).*...
      exp(-xx_r(1,:)*Beta_1(:,3)));
 else
   f = -sum(log(delta_sq(1,2:nfreq))' + log(delta_sq(2,2:nfreq))' + log(delta_sq(3,2:nfreq))' + ...
      conj(y(2:nfreq,1)).*y(2:nfreq,1).*exp(-xx_r(2:nfreq,:)*Beta_1(:,1)) + ...
     conj(y(2:nfreq,2) - theta(1,2:nfreq).'.*y(2:nfreq,1)).*(y(2:nfreq,2) - theta(1,2:nfreq).'.*y(2:nfreq,1)).*exp(-xx_r(2:nfreq,:)*Beta_1(:,2))+...
     conj(y(2:nfreq,3) -(theta(2,2:nfreq).'.*y(2:nfreq,1)+ theta(3,2:nfreq).'.*y(2:nfreq,2))).*(y(2:nfreq,3) -(theta(2,2:nfreq).'.*y(2:nfreq,1)+ theta(3,2:nfreq).'.*y(2:nfreq,2))).*...
     exp(-xx_r(2:nfreq,:)*Beta_1(:,3))) - ...
      0.5*(log(delta_sq(1,1))' + log(delta_sq(2,1))' + log(delta_sq(3,1))' + ..
     conj(y(1,1)).*y(1,1).*exp(-xx_r(1,:)*Beta_1(:,1)) + ...
     conj(y(1,2) - theta(1,1).'.*y(1,1)).*(y(1,2) - theta(1,1).'.*y(1,1)).*exp(-xx_r(1,:)*Beta_1(:,2))+...
     conj(y(1,3) -(theta(2,1).'.*y(1,1)+ theta(3,1).'.*y(1,2))).*(y(1,3) -(theta(2,1).'.*y(1,1)+ theta(3,1).'.*y(1,2))).*...
     exp(-xx_r(1,:)*Beta_1(:,3)))-...
     0.5*(log(delta_sq(1,1))' + log(delta_sq(2,1))' + log(delta_sq(3,1))' + ...
     conj(y(end,1)).*y(end,1).*exp(-xx_r(end,:)*Beta_1(:,1)) + ...
     conj(y(end,2) - theta(1,end).'.*y(end,1)).*(y(end,2) - theta(1,end).'.*y(end,1)).*exp(-xx_r(end,:)*Beta_1(:,2))+...
     conj(y(end,3) -(theta(2,end).'.*y(end,1)+ theta(3,end).'.*y(end,2))).*(y(end,3) -(theta(2,end).'.*y(end,1)+ theta(3,end).'.*y(end,2))).*...
     exp(-xx_r(end,:)*Beta_1(:,3)));
 end
 f = f - (0.5.*(Beta_1(1,1)* Beta_1(1,1)')/sigmasqalpha + 0.5.*(Beta_1(2:nBeta,1)'*Beta_1(2:nBeta,1))/tau_temp(1))*chol_index(Phi_temp,1) -...
     (0.5.*(Beta_1(1,2)* Beta_1(1,2)')/sigmasqalpha + 0.5.*(Beta_1(2:nBeta,2)'*Beta_1(2:nBeta,2))/tau_temp(2))*chol_index(Phi_temp,2) -...
     (0.5.*(Beta_1(1,3)* Beta_1(1,3)')/sigmasqalpha + 0.5.*(Beta_1(2:nBeta,3)'*Beta_1(2:nBeta,3))/tau_temp(3))*chol_index(Phi_temp,3) -...
     (0.5.*(Beta_1(1,4)* Beta_1(1,4)')/sigmasqalpha + 0.5.*(Beta_1(2:nBeta,4)'*Beta_1(2:nBeta,4))/tau_temp(4))*chol_index(Phi_temp,4) -...
     (0.5.*(Beta_1(1,5)* Beta_1(1,5)')/sigmasqalpha + 0.5.*(Beta_1(2:nBeta,5)'*Beta_1(2:nBeta,5))/tau_temp(5))*chol_index(Phi_temp,5) -...
     (0.5.*(Beta_1(1,6)* Beta_1(1,6)')/sigmasqalpha + 0.5.*(Beta_1(2:nBeta,6)'*Beta_1(2:nBeta,6))/tau_temp(6))*chol_index(Phi_temp,6) -...
     (0.5.*(Beta_2(1:nBeta,1)'*Beta_2(1:nBeta,1))/tau_temp(7))*chol_index(Phi_temp,7)-...
     (0.5.*(Beta_2(1:nBeta,2)'*Beta_2(1:nBeta,2))/tau_temp(8))*chol_index(Phi_temp,8)-...
     (0.5.*(Beta_2(1:nBeta,3)'*Beta_2(1:nBeta,3))/tau_temp(9))*chol_index(Phi_temp,9);

 gr1 = zeros(nBeta,1); gr2 = zeros(nBeta,1); gr3 = zeros(nBeta,1); gr4 = zeros(nBeta,1);
 gr5 = zeros(nBeta,1); gr6 = zeros(nBeta,1); gr7 = zeros(nBeta,1); gr8 = zeros(nBeta,1);
 gr9 = zeros(nBeta,1);

 gr1(1) = Beta_1(1,1)/sigmasqalpha; gr1(2:nBeta) = Beta_1(2:nBeta,1)/tau_temp(1);
 gr2(1) = Beta_1(1,2)/sigmasqalpha; gr2(2:nBeta) = Beta_1(2:nBeta,2)/tau_temp(2);
 gr3(1) = Beta_1(1,3)/sigmasqalpha; gr3(2:nBeta) = Beta_1(2:nBeta,3)/tau_temp(3);
 gr4(1) = Beta_1(1,4)/sigmasqalpha; gr4(2:nBeta) = Beta_1(2:nBeta,4)/tau_temp(4);
 gr5(1) = Beta_1(1,5)/sigmasqalpha; gr5(2:nBeta) = Beta_1(2:nBeta,5)/tau_temp(5);
 gr6(1) = Beta_1(1,6)/sigmasqalpha; gr6(2:nBeta) = Beta_1(2:nBeta,6)/tau_temp(6);
 gr7(1:nBeta) = Beta_2(1:nBeta,1)/tau_temp(7);
 gr8(1:nBeta) = Beta_2(1:nBeta,2)/tau_temp(8);
 gr9(1:nBeta) = Beta_2(1:nBeta,3)/tau_temp(9);
 h11(1,1)=1/sigmasqalpha; h11(2:nBeta,2:nBeta)=1/tau_temp(1)*eye(nbasis);
 h22(1,1)=1/sigmasqalpha; h22(2:nBeta,2:nBeta)=1/tau_temp(2)*eye(nbasis);
 h33(1,1)=1/sigmasqalpha; h33(2:nBeta,2:nBeta)=1/tau_temp(3)*eye(nbasis);
 h44(1,1)=1/sigmasqalpha; h44(2:nBeta,2:nBeta)=1/tau_temp(4)*eye(nbasis);
 h55(1,1)=1/sigmasqalpha; h55(2:nBeta,2:nBeta)=1/tau_temp(5)*eye(nbasis);
 h66(1,1)=1/sigmasqalpha; h66(2:nBeta,2:nBeta)=1/tau_temp(6)*eye(nbasis);
 h77(1:nBeta,1:nBeta)=1/tau_temp(7)*eye(nBeta);
 h88(1:nBeta,1:nBeta)=1/tau_temp(8)*eye(nBeta);
 h99(1:nBeta,1:nBeta)=1/tau_temp(9)*eye(nBeta);

 h42 = zeros(nBeta,nBeta);
 h53 = zeros(nBeta,nBeta);
 h56 = zeros(nBeta,nBeta);
 h59 = zeros(nBeta,nBeta);
 h63 = zeros(nBeta,nBeta);
 h68 = zeros(nBeta,nBeta);
 h72 = zeros(nBeta,nBeta);
 h83 = zeros(nBeta,nBeta);
 h93 = zeros(nBeta,nBeta);
 h98 = zeros(nBeta,nBeta);
 if (mod(n,2)==1)
   %%%%%%%%%%%%%%%%%%%%%%%%
   %gradient
   %%%%%%%%%%%%%%%%%%%%%%%%
   rk4 = -y(2:end,1).*conj(y(2:end,2)) - y(2:end,2).*conj(y(2:end,1));
   ck4 = 2*abs(y(2:end,1)).^2;
   rk5 = -y(2:end,1).*conj(y(2:end,3)) - y(2:end,3).*conj(y(2:end,1));
   ck5 = 2*abs(y(2:end,1)).^2;
   b = theta(3,:);
   dk5 = y(2:end,2).*conj(y(2:end,1)).*(b(2:end).') + conj(y(2:end,2)).*y(2:end,1).*conj(b(2:end).');
   rk6 = -y(2:end,2).*conj(y(2:end,3)) - y(2:end,3).*conj(y(2:end,2));
   ck6 = 2*abs(y(2:end,2)).^2;
   a = theta(2,:);
   dk6 = y(2:end,1).*conj(y(2:end,2)).*(a(2:end).') + conj(y(2:end,1)).*y(2:end,2).*conj(a(2:end).');
   ik7 = sqrt(-1)*(-y(2:end,1).*conj(y(2:end,2)) + y(2:end,2).*conj(y(2:end,1)));
   ck7 = 2*abs(y(2:end,1)).^2;
   ik8 = sqrt(-1)*(-y(2:end,1).*conj(y(2:end,3)) + y(2:end,3).*conj(y(2:end,1)));
   ck8 = 2*abs(y(2:end,1)).^2;
   dk8 = sqrt(-1)*(-y(2:end,2).*conj(y(2:end,1)).*(b(2:end).') + conj(y(2:end,2).*conj(y(2:end,1)).*(b(2:end).'))) ;
   ik9 = sqrt(-1)*(-y(2:end,2).*conj(y(2:end,3)) + y(2:end,3).*conj(y(2:end,2)));
   ck9 = 2*abs(y(2:end,2)).^2;
   dk9 = sqrt(-1)*(-y(2:end,1).*conj(y(2:end,2)).*(a(2:end).') + conj(y(2:end,1).*conj(y(2:end,2)).*(a(2:end).'))) ;

   gr1 = gr1 + xx_r(2:end,:)'*(1-abs(y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,1))) + ...
      0.5*(xx_r(1,:)'*(1-abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1))));
   gr2 = gr2 + xx_r(2:end,:)'*(1 - abs(y(2:end,2)-theta(1,2:end).'.*y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,2))) + ...
      0.5*(xx_r(1,:)'*(1 - abs(y(1,2)-theta(1,1).*y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,2))));
   gr3 = gr3 + xx_r(2:end,:)'*(1 - abs(y(2:end,3) - theta(2,2:end).'.*y(2:end,1) - theta(3,2:end).'.*y(2:end,2)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,3)))+...
      0.5*(xx_r(1,:)'*(1 - abs(y(1,3) - theta(2,1).*y(1,1) - theta(3,1).*y(1,2))^2.*exp(-xx_r(1,:)*Beta_1(:,3))));   
   temp_mat_41 = bsxfun(@times, xx_r(2:end,:),rk4);
   temp_mat_42 = bsxfun(@times, ck4, bsxfun(@times, xx_r(2:end,:), xx_r(2:end,:)*Beta_1(:,4)));
   gr4 = gr4 + sum( bsxfun(@times, (temp_mat_41+temp_mat_42), exp(-xx_r(2:end,:)*Beta_1(:,2))))' +...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(-y(1,1).*conj(y(1,2)) - y(1,2).*conj(y(1,1)))*xx_r(1,:)' +...
      exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,4))*xx_r(1,:)');
   temp_mat_51 = bsxfun(@times, xx_r(2:end,:),rk5);
   temp_mat_52 = bsxfun(@times, ck5, bsxfun(@times, xx_r(2:end,:), xx_r(2:end,:)*Beta_1(:,5)));
   temp_mat_53 = bsxfun(@times, xx_r(2:end,:),dk5);
   gr5 = gr5 + sum( bsxfun(@times, (temp_mat_51 + temp_mat_52 + temp_mat_53), exp(-xx_r(2:end,:)*Beta_1(:,3)) ))'+...
      0.5* (exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,1).*conj(y(1,3)) - y(1,3).*conj(y(1,1)))*xx_r(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,5))*xx_r(1,:)'+...
      exp(-xx_r(1,:)*Beta_1(:,3))*(y(1,2).*conj(y(1,1)).*(b(1).') + conj(y(1,2).*conj(y(1,1)).*(b(1).')))*xx_r(1,:)');
   temp_mat_61 = bsxfun(@times, xx_r(2:end,:),rk6);
   temp_mat_62 = bsxfun(@times, ck6, bsxfun(@times, xx_r(2:end,:), xx_r(2:end,:)*Beta_1(:,6)));
   temp_mat_63 = bsxfun(@times, xx_r(2:end,:),dk6);
   gr6 = gr6 + sum(bsxfun(@times, (temp_mat_61 + temp_mat_62 + temp_mat_63), exp(-xx_r(2:end,:)*Beta_1(:,3))))'+...
      0.5* (exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,2).*conj(y(1,3)) - y(1,3).*conj(y(1,2)))*xx_r(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,2)).^2*(xx_r(1,:)*Beta_1(:,6))*xx_r(1,:)'+...
      exp(-xx_r(1,:)*Beta_1(:,3))*(y(1,1).*conj(y(1,2)).*(a(1).') + conj(y(1,1).*conj(y(1,2)).*(a(1).')))*xx_r(1,:)');
   temp_mat_71 = bsxfun(@times, ik7, xx_i(2:end,:));
   temp_mat_72 = bsxfun(@times, ck7, bsxfun(@times,xx_i(2:end,:),xx_i(2:end,:)*Beta_2(:,1)));
   gr7 = gr7 + sum( bsxfun(@times, (temp_mat_71 + temp_mat_72), exp(-xx_r(2:end,:)*Beta_1(:,2))))' + ...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(imag(-y(1,1).*conj(y(1,2)) + y(1,2).*conj(y(1,1))))*xx_i(1,:)' +...
      exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,1))*xx_i(1,:)');
   temp_mat_81 = bsxfun(@times, ik8, xx_i(2:end,:));
   temp_mat_82 = bsxfun(@times, ck8, bsxfun(@times,xx_i(2:end,:),xx_i(2:end,:)*Beta_2(:,2))); 
   temp_mat_83 = bsxfun(@times, xx_i(2:end,:),dk8);
   gr8 = gr8 + sum( bsxfun(@times, (temp_mat_81 + temp_mat_82 + temp_mat_83), exp(-xx_r(2:end,:)*Beta_1(:,3))))' + ...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(imag(-y(1,1).*conj(y(1,3)) + y(1,3).*conj(y(1,1))))*xx_i(1,:)' +...
   exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,2))*xx_i(1,:)'+...
   sqrt(-1)*exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,1).*conj(y(1,2)).*(b(1).') + conj(y(1,1).*conj(y(1,2)).*(b(1).')))*xx_i(1,:)');
   temp_mat_91 = bsxfun(@times, ik9, xx_i(2:end,:));
   temp_mat_92 = bsxfun(@times, ck9, bsxfun(@times,xx_i(2:end,:),xx_i(2:end,:)*Beta_2(:,3))); 
   temp_mat_93 = bsxfun(@times, xx_i(2:end,:),dk9);
   gr9 = gr9 + sum( bsxfun(@times, (temp_mat_91 + temp_mat_92 + temp_mat_93), exp(-xx_r(2:end,:)*Beta_1(:,3))))' + ...
      0.5 * (exp(-xx_r(1,:)*Beta_1(:,3))*(imag(-y(1,2).*conj(y(1,3)) + y(1,3).*conj(y(1,2))))*xx_i(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,2)).^2*(xx_i(1,:)*Beta_2(:,3))*xx_i(1,:)'+...
      sqrt(-1)*exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,1).*conj(y(1,2)).*(a(1).') + conj(y(1,1).*conj(y(1,2)).*(a(1).')))*xx_i(1,:)'); 
   %%%%%%%%%%%%%%%%%%%
   %Hessian
   %%%%%%%%%%%%%%%%%%%
   bigmat_h11 = kron(bsxfun(@times, abs(y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,1)), xx_r(2:end,:)),ones(nBeta,1)');
   coefmat_h11 = repmat(xx_r(2:end,:), 1,nBeta);
   h11 = h11 + reshape(sum(bsxfun(@times, bigmat_h11, coefmat_h11),1),nBeta,nBeta) +...
      0.5*(abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1))*xx_r(1,:)'*xx_r(1,:));

   bigmat_h22 = kron(bsxfun(@times, abs(y(2:end,2)-theta(1,2:end).'.*y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,2)),xx_r(2:end,:)), ones(nBeta,1)');
   coefmat_h22 = repmat(xx_r(2:end,:), 1,nBeta);
   h22 = h22 + reshape(sum(bsxfun(@times, bigmat_h22, coefmat_h22),1),nBeta,nBeta) +...
      0.5*(abs(y(1,2)-theta(1,1).*y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,2))*xx_r(1,:)'*xx_r(1,:));
 
   bigmat_h33 = kron(bsxfun(@times, abs(y(2:end,3) - theta(2,2:end).'.*y(2:end,1) - theta(3,2:end).'.*y(2:end,2)).^2.*...
   exp(-xx_r(2:end,:)*Beta_1(:,3)),xx_r(2:end,:)), ones(nBeta,1)');
   coefmat_h33 = repmat(xx_r(2:end,:), 1,nBeta);
   h33 = h33 + reshape(sum(bsxfun(@times, bigmat_h33, coefmat_h33),1),nBeta,nBeta) +...
      0.5*(abs(y(1,3) - theta(2,1).'.*y(1,1) - theta(3,1).'.*y(1,2)).^2.*...
      exp(-xx_r(1,:)*Beta_1(:,3))*xx_r(1,:)'*xx_r(1,:));
 
   bigmat_h44 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,2)).*ck4, xx_r(2:end,:)),ones(nBeta,1)');
   coefmat_h44 = repmat(xx_r(2:end,:), 1,nBeta); 
   h44 = h44 + reshape(sum(bsxfun(@times, bigmat_h44, coefmat_h44),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*xx_r(1,:)'*xx_r(1,:));

   bigmat_h55 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*ck5, xx_r(2:end,:)),ones(nBeta,1)');
   coefmat_h55 = repmat(xx_r(2:end,:), 1,nBeta); 
   h55 = h55 + reshape(sum(bsxfun(@times, bigmat_h55, coefmat_h55),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,1)).^2*xx_r(1,:)'*xx_r(1,:));
 
   bigmat_h66 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*ck6, xx_r(2:end,:)),ones(nBeta,1)');
   coefmat_h66 = repmat(xx_r(2:end,:), 1,nBeta); 
   h66 = h66 + reshape(sum(bsxfun(@times, bigmat_h66, coefmat_h66),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,2)).^2*xx_r(1,:)'*xx_r(1,:));

   bigmat_h77 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,2)).*ck7, xx_i(2:end,:)),ones(nBeta,1)');
   coefmat_h77 = repmat(xx_i(2:end,:), 1,nBeta); 
   h77 = h77 + reshape(sum(bsxfun(@times, bigmat_h77, coefmat_h77),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*xx_i(1,:)'*xx_i(1,:));

   bigmat_h88 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*ck8, xx_i(2:end,:)),ones(nBeta,1)');
   coefmat_h88 = repmat(xx_i(2:end,:), 1,nBeta); 
   h88 = h88 + reshape(sum(bsxfun(@times, bigmat_h88, coefmat_h88),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,1)).^2*xx_i(1,:)'*xx_i(1,:));
 
   bigmat_h99 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*ck9, xx_i(2:end,:)),ones(nBeta,1)');
   coefmat_h99 = repmat(xx_i(2:end,:), 1,nBeta); 
   h99 = h99 + reshape(sum(bsxfun(@times, bigmat_h99, coefmat_h99),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,2)).^2*xx_i(1,:)'*xx_i(1,:));
   bigmat_h42_1 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,2)).*ck4.*(xx_r(2:end,:)*Beta_1(:,4)),...
        xx_r(2:end,:)), ones(nBeta,1)');
   coefmat_h42_1 = repmat(xx_r(2:end,:), 1,nBeta);
   bigmat_h42_2 = kron(bsxfun(@times,exp(-xx_r(2:end,:)*Beta_1(:,2)).*rk4, xx_r(2:end,:)), ones(nBeta,1)');
   coefmat_h42_2 = repmat(xx_r(2:end,:), 1,nBeta);
   h42 = h42 + reshape(sum(bsxfun(@times, bigmat_h42_1, coefmat_h42_1) +...
   bsxfun(@times, bigmat_h42_2, coefmat_h42_2),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,4))*xx_r(1,:)+...
   (-y(1,1).*conj(y(1,2)) - y(1,2).*conj(y(1,1)))*xx_r(1,:))'*xx_r(1,:)); 

   bigmat_h53_1 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*ck5.*(xx_r(2:end,:)*Beta_1(:,5)),...
        xx_r(2:end,:)), ones(nBeta,1)'); 
   coefmat_h53_1 = repmat(xx_r(2:end,:), 1,nBeta);
   bigmat_h53_2 = kron(bsxfun(@times,exp(-xx_r(2:end,:)*Beta_1(:,3)).*rk5, xx_r(2:end,:)), ones(nBeta,1)');
   coefmat_h53_2 = repmat(xx_r(2:end,:), 1,nBeta);
   bigmat_h53_3 = kron(bsxfun(@times,exp(-xx_r(2:end,:)*Beta_1(:,3)).*dk5, xx_r(2:end,:)), ones(nBeta,1)');
   coefmat_h53_3 = repmat(xx_r(2:end,:), 1,nBeta); 
   h53 = h53 + reshape(sum(bsxfun(@times, bigmat_h53_1, coefmat_h53_1) +...
      bsxfun(@times, bigmat_h53_2, coefmat_h53_2)+...
      bsxfun(@times, bigmat_h53_3, coefmat_h53_3),1),nBeta,nBeta)' +...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,5))*xx_r(1,:)+...
      (-y(1,1).*conj(y(1,3)) - y(1,3).*conj(y(1,1)))*xx_r(1,:)+...
      (y(1,2).*conj(y(1,1)).*(b(1).') + conj(y(1,2).*conj(y(1,1)).*(b(1).')))*xx_r(1,:))'*xx_r(1,:));
     
   bigmat_h56_1 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*conj(y(2:end,1)).*y(2:end,2),...
      xx_r(2:end,:)), ones(nBeta,1)'); 
   coefmat_h56_1 = repmat(xx_r(2:end,:), 1,nBeta);
   bigmat_h56_2 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*conj(y(2:end,2)).*y(2:end,1),...
      xx_r(2:end,:)), ones(nBeta,1)'); 
   coefmat_h56_2 = repmat(xx_r(2:end,:), 1,nBeta);
   h56 = real(h56 + reshape(sum(bsxfun(@times, bigmat_h56_1, coefmat_h56_1) +...
      bsxfun(@times, bigmat_h56_2, coefmat_h56_2),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*((conj(y(1,1)).*y(1,2))*xx_r(1,:)'*xx_r(1,:)+...
      (conj(y(1,2)).*y(1,1))*xx_r(1,:)'*xx_r(1,:)))); 
     
   bigmat_h59_1 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*conj(y(2:end,1)).*y(2:end,2),...
      xx_r(2:end,:)), ones(nBeta,1)'); 
   coefmat_h59_1 = repmat(xx_i(2:end,:), 1,nBeta);
   bigmat_h59_2 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*conj(y(2:end,2)).*y(2:end,1),...
      xx_r(2:end,:)), ones(nBeta,1)'); 
   coefmat_h59_2 = repmat(xx_i(2:end,:), 1,nBeta);
   h59 = imag(h59 + reshape(sum(bsxfun(@times, bigmat_h59_1, coefmat_h59_1) -...
      bsxfun(@times, bigmat_h59_2, coefmat_h59_2),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*((conj(y(1,1)).*y(1,2))*xx_r(1,:)'*xx_i(1,:) -...
      (conj(y(1,2)).*y(1,1))*xx_r(1,:)'*xx_i(1,:))));       

   bigmat_h63_1 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*ck6.*(xx_r(2:end,:)*Beta_1(:,6)),...
      xx_r(2:end,:)), ones(nBeta,1)'); 
   coefmat_h63_1 = repmat(xx_r(2:end,:), 1,nBeta);
   bigmat_h63_2 = kron(bsxfun(@times,exp(-xx_r(2:end,:)*Beta_1(:,3)).*rk6, xx_r(2:end,:)), ones(nBeta,1)');
   coefmat_h63_2 = repmat(xx_r(2:end,:), 1,nBeta);
   bigmat_h63_3 = kron(bsxfun(@times,exp(-xx_r(2:end,:)*Beta_1(:,3)).*dk6 , xx_r(2:end,:)), ones(nBeta,1)');
   coefmat_h63_3 = repmat(xx_r(2:end,:), 1,nBeta); 
   h63 = h63 + reshape(sum(bsxfun(@times, bigmat_h63_1, coefmat_h63_1) +...
      bsxfun(@times, bigmat_h63_2, coefmat_h63_2)+...
      bsxfun(@times, bigmat_h63_3, coefmat_h63_3),1),nBeta,nBeta)' +...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(2*abs(y(1,2)).^2*(xx_r(1,:)*Beta_1(:,6))*xx_r(1,:)+...
      (-y(1,2).*conj(y(1,3)) - y(1,3).*conj(y(1,2)))*xx_r(1,:)+...
      (y(1,2).*conj(y(1,1)).*(a(1).') + conj(y(1,2).*conj(y(1,1)).*(a(1).')))*xx_r(1,:))'*xx_r(1,:));
   bigmat_h68_1 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*conj(y(2:end,2)).*y(2:end,1),...
      xx_r(2:end,:)), ones(nBeta,1)'); 
   coefmat_h68_1 = repmat(xx_i(2:end,:), 1,nBeta);
   bigmat_h68_2 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*conj(y(2:end,1)).*y(2:end,2),...
      xx_r(2:end,:)), ones(nBeta,1)');
   coefmat_h68_2 = repmat(xx_i(2:end,:), 1,nBeta);
    h68 = imag(h68 + reshape(sum(bsxfun(@times, bigmat_h68_1, coefmat_h68_1) -...
   bsxfun(@times, bigmat_h68_2, coefmat_h68_2),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*((conj(y(1,2)).*y(1,1))*xx_r(1,:)'*xx_i(1,:)-...
      (conj(y(1,1)).*y(1,2))*xx_r(1,:)'*xx_i(1,:))));
     
   bigmat_h72_1 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,2)).*ck7.*(xx_i(2:end,:)*Beta_2(:,1)),...
      xx_i(2:end,:)), ones(nBeta,1)');    
   coefmat_h72_1 = repmat(xx_r(2:end,:), 1,nBeta);
   bigmat_h72_2 = kron(bsxfun(@times,exp(-xx_r(2:end,:)*Beta_1(:,2)).*ik7, xx_i(2:end,:)), ones(nBeta,1)');
   coefmat_h72_2 = repmat(xx_r(2:end,:), 1,nBeta);
   h72 = h72 + reshape(sum(bsxfun(@times, bigmat_h72_1, coefmat_h72_1) +...
      bsxfun(@times, bigmat_h72_2, coefmat_h72_2),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,1))*xx_i(1,:)+...
      imag(-y(1,1).*conj(y(1,2)) + y(1,2).*conj(y(1,1)))*xx_i(1,:))'*xx_r(1,:)); 

   bigmat_h83_1 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*ck8.*(xx_i(2:end,:)*Beta_2(:,2)),...
      xx_i(2:end,:)), ones(nBeta,1)'); 
   coefmat_h83_1 = repmat(xx_r(2:end,:), 1,nBeta);
   bigmat_h83_2 = kron(bsxfun(@times,exp(-xx_r(2:end,:)*Beta_1(:,3)).*ik8, xx_i(2:end,:)), ones(nBeta,1)');
   coefmat_h83_2 = repmat(xx_r(2:end,:), 1,nBeta);
   bigmat_h83_3 = kron(bsxfun(@times,exp(-xx_r(2:end,:)*Beta_1(:,3)).*(dk8), xx_i(2:end,:)), ones(nBeta,1)');
   coefmat_h83_3 = repmat(xx_r(2:end,:), 1,nBeta); 
   h83 = h83 + reshape(sum(bsxfun(@times, bigmat_h83_1, coefmat_h83_1) +...
      bsxfun(@times, bigmat_h83_2, coefmat_h83_2)+...
      bsxfun(@times, bigmat_h83_3, coefmat_h83_3),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,2))*xx_i(1,:)+...
      (-y(1,1).*conj(y(1,3)) + y(1,3).*conj(y(1,1)))*xx_i(1,:)+...
      imag(-y(1,2).*conj(y(1,1)).*(b(1).') + conj(y(1,2).*conj(y(1,1)).*(b(1).')))*xx_i(1,:))'*xx_r(1,:));  
     
   bigmat_h93_1 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*ck9.*(xx_i(2:end,:)*Beta_2(:,3)),...
      xx_i(2:end,:)), ones(nBeta,1)'); 
   coefmat_h93_1 = repmat(xx_r(2:end,:), 1,nBeta);
   bigmat_h93_2 = kron(bsxfun(@times,exp(-xx_r(2:end,:)*Beta_1(:,3)).*ik9, xx_i(2:end,:)), ones(nBeta,1)');
   coefmat_h93_2 = repmat(xx_r(2:end,:), 1,nBeta);
   bigmat_h93_3 = kron(bsxfun(@times,exp(-xx_r(2:end,:)*Beta_1(:,3)).*(dk9), xx_i(2:end,:)), ones(nBeta,1)');
   coefmat_h93_3 = repmat(xx_r(2:end,:), 1,nBeta); 
   h93 = h93 + reshape(sum(bsxfun(@times, bigmat_h93_1, coefmat_h93_1) +...
      bsxfun(@times, bigmat_h93_2, coefmat_h93_2)+...
      bsxfun(@times, bigmat_h93_3, coefmat_h93_3),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(2*abs(y(1,2)).^2*(xx_i(1,:)*Beta_2(:,3))*xx_i(1,:)+...
      (-y(1,2).*conj(y(1,3)) + y(1,3).*conj(y(1,2)))*xx_i(1,:)+...
      imag(-y(1,1).*conj(y(1,2)).*(a(1).') + conj(y(1,1).*conj(y(1,2)).*(a(1).')))*xx_i(1,:))'*xx_r(1,:)); 

   bigmat_h98_1 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*conj(y(2:end,1)).*y(2:end,2),...
      xx_i(2:end,:)), ones(nBeta,1)'); 
   coefmat_h98_1 = repmat(xx_i(2:end,:), 1,nBeta);
   bigmat_h98_2 = kron(bsxfun(@times, exp(-xx_r(2:end,:)*Beta_1(:,3)).*conj(y(2:end,2)).*y(2:end,1),...
      xx_i(2:end,:)), ones(nBeta,1)'); 
   coefmat_h98_2 = repmat(xx_i(2:end,:), 1,nBeta);
   h98 = real(h98 + reshape(sum(bsxfun(@times, bigmat_h98_1, coefmat_h98_1) +...
      bsxfun(@times, bigmat_h98_2, coefmat_h98_2),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*((conj(y(1,1)).*y(1,2))*xx_i(1,:)'*xx_i(1,:)+...
      (conj(y(1,2)).*y(1,1))*xx_i(1,:)'*xx_i(1,:))));

   h24=h42'; h35=h53'; h65=h56'; h95=h59'; h36=h63';
   h86=h68'; h27=h72'; h38=h83'; h39=h93'; h89=h98';
 else
   %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
   %gradient
   %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
   rk4 = -y(2:nfreq,1).*conj(y(2:nfreq,2)) - y(2:nfreq,2).*conj(y(2:nfreq,1));
   ck4 = 2*abs(y(2:nfreq,1)).^2;
   rk5 = -y(2:nfreq,1).*conj(y(2:nfreq,3)) - y(2:nfreq,3).*conj(y(2:nfreq,1));
   ck5 = 2*abs(y(2:nfreq,1)).^2;
   b = theta(3,:);
   dk5 = y(2:nfreq,2).*conj(y(2:nfreq,1)).*(b(2:nfreq).') + conj(y(2:nfreq,2)).*y(2:nfreq,1).*conj(b(2:nfreq).');
   rk6 = -y(2:nfreq,2).*conj(y(2:nfreq,3)) - y(2:nfreq,3).*conj(y(2:nfreq,2));
   ck6 = 2*abs(y(2:nfreq,2)).^2;
   a = theta(2,:);
   dk6 = y(2:nfreq,1).*conj(y(2:nfreq,2)).*(a(2:nfreq).') + conj(y(2:nfreq,1).*conj(y(2:nfreq,2)).*(a(2:nfreq).'));
   ik7 = sqrt(-1)*(-y(2:nfreq,1).*conj(y(2:nfreq,2)) + y(2:nfreq,2).*conj(y(2:nfreq,1)));
   ck7 = 2*abs(y(2:nfreq,1)).^2;
   ik8 = sqrt(-1)*(-y(2:nfreq,1).*conj(y(2:nfreq,3)) + y(2:nfreq,3).*conj(y(2:nfreq,1)));
   ck8 = 2*abs(y(2:nfreq,1)).^2;
   dk8 = sqrt(-1)*( -y(2:nfreq,2).*conj(y(2:nfreq,1)).*(b(2:nfreq).') + conj(y(2:nfreq,2).*conj(y(2:nfreq,1)).*(b(2:nfreq).')));
   ik9 = sqrt(-1)*(-y(2:nfreq,2).*conj(y(2:nfreq,3)) + y(2:nfreq,3).*conj(y(2:nfreq,2)));
   ck9 = 2*abs(y(2:nfreq,2)).^2;
   dk9 = sqrt(-1)*( -y(2:nfreq,1).*conj(y(2:nfreq,2)).*(a(2:nfreq).') + conj(y(2:nfreq,1).*conj(y(2:nfreq,2)).*(a(2:nfreq).')));

   gr1 = gr1 + xx_r(2:nfreq,:)'*(1-abs(y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,1))) + ...
      0.5*(xx_r(1,:)'*(1-abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1))))+...
      0.5*(xx_r(end,:)'*(1-abs(y(end,1)).^2.*exp(-xx_r(end,:)*Beta_1(:,1))));
   gr2 = gr2 + xx_r(2:nfreq,:)'*(1 - abs(y(2:nfreq,2)-theta(1,2:nfreq).'.*y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,2))) + ...
       0.5*(xx_r(1,:)'*(1 - abs(y(1,2)-theta(1,1).*y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,2)))) +...
      0.5*(xx_r(end,:)'*(1 - abs(y(end,2)-theta(1,end).*y(end,1)).^2.*exp(-xx_r(end,:)*Beta_1(:,2))));
   gr3 = gr3 + xx_r(2:nfreq,:)'*(1 - abs(y(2:nfreq,3) - theta(2,2:nfreq).'.*y(2:nfreq,1) - theta(3,2:nfreq).'.*y(2:nfreq,2)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,3)))+...
      0.5*(xx_r(1,:)'*(1 - abs(y(1,3) - theta(2,1).*y(1,1) - theta(3,1).*y(1,2))^2.*exp(-xx_r(1,:)*Beta_1(:,3))))+...
      0.5*(xx_r(end,:)'*(1 - abs(y(end,3) - theta(2,end).*y(end,1) - theta(3,end).*y(end,2))^2.*exp(-xx_r(end,:)*Beta_1(:,3))));   
   temp_mat_41 = bsxfun(@times, xx_r(2:nfreq,:),rk4);
   temp_mat_42 = bsxfun(@times, ck4, bsxfun(@times, xx_r(2:nfreq,:),xx_r(2:nfreq,:)*Beta_1(:,4)));
   gr4 = gr4 + sum( bsxfun(@times, (temp_mat_41+temp_mat_42), exp(-xx_r(2:nfreq,:)*Beta_1(:,2)) ))' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(-y(1,1).*conj(y(1,2)) - y(1,2).*conj(y(1,1)))*xx_r(1,:)' +...
   exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,4))*xx_r(1,:)')+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*(-y(end,1).*conj(y(end,2)) - y(end,2).*conj(y(end,1)))*xx_r(end,:)' +...
      exp(-xx_r(end,:)*Beta_1(:,2))*2*abs(y(end,1)).^2*(xx_r(end,:)*Beta_1(:,4))*xx_r(end,:)');
   temp_mat_51 = bsxfun(@times, xx_r(2:nfreq,:),rk5);
   temp_mat_52 = bsxfun(@times, ck5, bsxfun(@times, xx_r(2:nfreq,:),xx_r(2:nfreq,:)*Beta_1(:,5)));
   temp_mat_53 = bsxfun(@times, xx_r(2:nfreq,:),dk5);
   gr5 = gr5 + sum( bsxfun(@times, (temp_mat_51 + temp_mat_52 + temp_mat_53), exp(-xx_r(2:nfreq,:)*Beta_1(:,3)) ))'+...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,1).*conj(y(1,3)) - y(1,3).*conj(y(1,1)))*xx_r(1,:)' +...
      exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,5))*xx_r(1,:)'+...
      exp(-xx_r(1,:)*Beta_1(:,3))*(y(1,2).*conj(y(1,1)).*(b(1).')+conj(y(1,2).*conj(y(1,1)).*(b(1).')))*xx_r(1,:)')+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*(-y(end,1).*conj(y(end,3)) - y(end,3).*conj(y(end,1)))*xx_r(end,:)' +...
      exp(-xx_r(end,:)*Beta_1(:,3))*2*abs(y(end,1)).^2*(xx_r(end,:)*Beta_1(:,5))*xx_r(end,:)'+...
      exp(-xx_r(end,:)*Beta_1(:,3))*(y(end,2).*conj(y(end,1)).*(b(end).') + conj(y(end,2).*conj(y(end,1)).*(b(end).')))*xx_r(end,:)');
   temp_mat_61 = bsxfun(@times, xx_r(2:nfreq,:),rk6);
   temp_mat_62 = bsxfun(@times, ck6, bsxfun(@times, xx_r(2:nfreq,:),xx_r(2:nfreq,:)*Beta_1(:,6)));
   temp_mat_63 = bsxfun(@times, xx_r(2:nfreq,:),dk6);
   gr6 = gr6 + sum( bsxfun(@times, (temp_mat_61 + temp_mat_62 + temp_mat_63), exp(-xx_r(2:nfreq,:)*Beta_1(:,3)) ))'+...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,2).*conj(y(1,3)) - y(1,3).*conj(y(1,2)))*xx_r(1,:)' +...
      exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,2)).^2*(xx_r(1,:)*Beta_1(:,6))*xx_r(1,:)'+...
      exp(-xx_r(1,:)*Beta_1(:,3))*(y(1,1).*conj(y(1,2)).*(a(1).') + conj(y(1,1).*conj(y(1,2)).*(a(1).')))*xx_r(1,:)')+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*(-y(end,2).*conj(y(end,3)) - y(end,3).*conj(y(end,2)))*xx_r(end,:)' +...
      exp(-xx_r(end,:)*Beta_1(:,3))*2*abs(y(end,2)).^2*(xx_r(end,:)*Beta_1(:,6))*xx_r(end,:)'+...
       exp(-xx_r(end,:)*Beta_1(:,3))*(y(end,1).*conj(y(end,2)).*(a(end).') + conj(y(end,1).*conj(y(end,2)).*(a(end).')))*xx_r(end,:)');
   temp_mat_71 = bsxfun(@times, ik7, xx_i(2:nfreq,:));
   temp_mat_72 = bsxfun(@times, ck7, bsxfun(@times,xx_i(2:nfreq,:),xx_i(2:nfreq,:)*Beta_2(:,1)));
   gr7 = gr7 + sum( bsxfun(@times, (temp_mat_71 + temp_mat_72), exp(-xx_r(2:nfreq,:)*Beta_1(:,2))))' + ...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(imag(-y(1,1).*conj(y(1,2)) + y(1,2).*conj(y(1,1))))*xx_i(1,:)' +...
      exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,1))*xx_i(1,:)')+...
       0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*(imag(-y(end,1).*conj(y(end,2)) + y(end,2).*conj(y(end,1))))*xx_i(end,:)' +...
      exp(-xx_r(end,:)*Beta_1(:,2))*2*abs(y(end,1)).^2*(xx_i(end,:)*Beta_2(:,1))*xx_i(end,:)');
   temp_mat_81 = bsxfun(@times, ik8, xx_i(2:nfreq,:));
   temp_mat_82 = bsxfun(@times, ck8, bsxfun(@times,xx_i(2:nfreq,:),xx_i(2:nfreq,:)*Beta_2(:,2))); 
   temp_mat_83 = bsxfun(@times, xx_i(2:nfreq,:),dk8);
   gr8 = gr8 + sum( bsxfun(@times, (temp_mat_81 + temp_mat_82 + temp_mat_83), exp(-xx_r(2:nfreq,:)*Beta_1(:,3))))' + ...
      0.5 * (exp(-xx_r(1,:)*Beta_1(:,3))*(imag(-y(1,1).*conj(y(1,3)) + y(1,3).*conj(y(1,1))))*xx_i(1,:)' +...
      exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,2))*xx_i(1,:)'+...
      sqrt(-1)*exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,1).*conj(y(1,2)).*(b(1).') + conj(y(1,1).*conj(y(1,2)).*(b(1).')))*xx_i(1,:)')+...
      0.5 * (exp(-xx_r(end,:)*Beta_1(:,3))*(imag(-y(end,1).*conj(y(end,3)) + y(end,3).*conj(y(end,1))))*xx_i(end,:)' +...
      exp(-xx_r(end,:)*Beta_1(:,3))*2*abs(y(end,1)).^2*(xx_i(end,:)*Beta_2(:,2))*xx_i(end,:)'+...
      sqrt(-1)*exp(-xx_r(end,:)*Beta_1(:,3))*(-y(end,1).*conj(y(end,2)).*(b(end).') + conj(y(end,1).*conj(y(end,2)).*(b(end).')))*xx_i(end,:)');
   temp_mat_91 = bsxfun(@times, ik9, xx_i(2:nfreq,:));
   temp_mat_92 = bsxfun(@times, ck9, bsxfun(@times,xx_i(2:nfreq,:),xx_i(2:nfreq,:)*Beta_2(:,3))); 
   temp_mat_93 = bsxfun(@times, xx_i(2:nfreq,:),dk9);
   gr9 = gr9 + sum( bsxfun(@times, (temp_mat_91 + temp_mat_92 + temp_mat_93), exp(-xx_r(2:nfreq,:)*Beta_1(:,3))))' + ...
      0.5 * (exp(-xx_r(1,:)*Beta_1(:,3))*(imag(-y(1,2).*conj(y(1,3)) + y(1,3).*conj(y(1,2))))*xx_i(1,:)' +...
      exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,2)).^2*(xx_i(1,:)*Beta_2(:,3))*xx_i(1,:)'+...
      sqrt(-1)*exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,1).*conj(y(1,2)).*(a(1).') + conj(y(1,1).*conj(y(1,2)).*(a(1).')))*xx_i(1,:)')+...
      0.5 * (exp(-xx_r(end,:)*Beta_1(:,3))*(imag(-y(end,2).*conj(y(end,3)) + y(end,3).*conj(y(end,2))))*xx_i(end,:)' +...
      exp(-xx_r(end,:)*Beta_1(:,3))*2*abs(y(end,2)).^2*(xx_i(end,:)*Beta_2(:,3))*xx_i(end,:)'+...
      sqrt(-1)*exp(-xx_r(end,:)*Beta_1(:,3))*(-y(end,1).*conj(y(end,2)).*(a(end).') + conj(y(end,1).*conj(y(end,2)).*(a(end).')))*xx_i(end,:)');
   %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
   %Hessian
   %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
   bigmat_h11 = kron(bsxfun(@times, abs(y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,1)), xx_r(2:nfreq,:)),ones(nBeta,1)');
   coefmat_h11 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   h11 = h11 + reshape(sum(bsxfun(@times, bigmat_h11, coefmat_h11),1),nBeta,nBeta) +...
      0.5*(abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1))*xx_r(1,:)'*xx_r(1,:))+...
      0.5*(abs(y(end,1)).^2.*exp(-xx_r(end,:)*Beta_1(:,1))*xx_r(end,:)'*xx_r(end,:));

   bigmat_h22 = kron(bsxfun(@times, abs(y(2:nfreq,2)-theta(1,2:nfreq).'.*y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,2)),xx_r(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h22 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   h22 = h22 + reshape(sum(bsxfun(@times, bigmat_h22, coefmat_h22),1),nBeta,nBeta) +...
      0.5*(abs(y(1,2)-theta(1,1).*y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,2))*xx_r(1,:)'*xx_r(1,:))+...
      0.5*(abs(y(end,2)-theta(1,end).*y(end,1)).^2.*exp(-xx_r(end,:)*Beta_1(:,2))*xx_r(end,:)'*xx_r(end,:));
 
   bigmat_h33 = kron(bsxfun(@times, abs(y(2:nfreq,3) - theta(2,2:nfreq).'.*y(2:nfreq,1) - theta(3,2:nfreq).'.*y(2:nfreq,2)).^2.*...
      exp(-xx_r(2:nfreq,:)*Beta_1(:,3)),xx_r(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h33 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   h33 = h33 + reshape(sum(bsxfun(@times, bigmat_h33, coefmat_h33),1),nBeta,nBeta) +...
      0.5*(abs(y(1,3) - theta(2,1).'.*y(1,1) - theta(3,1).'.*y(1,2)).^2.*...
      exp(-xx_r(1,:)*Beta_1(:,3))*xx_r(1,:)'*xx_r(1,:))+...
      0.5*(abs(y(end,3) - theta(2,end).'.*y(end,1) - theta(3,end).'.*y(end,2)).^2.*...
      exp(-xx_r(end,:)*Beta_1(:,3))*xx_r(end,:)'*xx_r(end,:));
 
   bigmat_h44 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,2)).*ck4, xx_r(2:nfreq,:)),ones(nBeta,1)');
   coefmat_h44 = repmat(xx_r(2:nfreq,:), 1,nBeta); 
   h44 = h44 + reshape(sum(bsxfun(@times, bigmat_h44, coefmat_h44),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*xx_r(1,:)'*xx_r(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*2*abs(y(end,1)).^2*xx_r(end,:)'*xx_r(end,:));

   bigmat_h55 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*ck5, xx_r(2:nfreq,:)),ones(nBeta,1)');
   coefmat_h55 = repmat(xx_r(2:nfreq,:), 1,nBeta); 
   h55 = h55 + reshape(sum(bsxfun(@times, bigmat_h55, coefmat_h55),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,1)).^2*xx_r(1,:)'*xx_r(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*2*abs(y(end,1)).^2*xx_r(end,:)'*xx_r(end,:));
 
   bigmat_h66 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*ck6, xx_r(2:nfreq,:)),ones(nBeta,1)');
   coefmat_h66 = repmat(xx_r(2:nfreq,:), 1,nBeta); 
   h66 = h66 + reshape(sum(bsxfun(@times, bigmat_h66, coefmat_h66),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,2)).^2*xx_r(1,:)'*xx_r(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*2*abs(y(end,2)).^2*xx_r(end,:)'*xx_r(end,:));  
    
   bigmat_h77 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,2)).*ck7, xx_i(2:nfreq,:)),ones(nBeta,1)');
   coefmat_h77 = repmat(xx_i(2:nfreq,:), 1,nBeta); 
   h77 = h77 + reshape(sum(bsxfun(@times, bigmat_h77, coefmat_h77),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*xx_i(1,:)'*xx_i(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*2*abs(y(end,1)).^2*xx_i(end,:)'*xx_i(end,:));

   bigmat_h88 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*ck8, xx_i(2:nfreq,:)),ones(nBeta,1)');
   coefmat_h88 = repmat(xx_i(2:nfreq,:), 1,nBeta); 
   h88 = h88 + reshape(sum(bsxfun(@times, bigmat_h88, coefmat_h88),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,1)).^2*xx_i(1,:)'*xx_i(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*2*abs(y(end,1)).^2*xx_i(end,:)'*xx_i(end,:)); 
 
   bigmat_h99 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*ck9, xx_i(2:nfreq,:)),ones(nBeta,1)');
   coefmat_h99 = repmat(xx_i(2:nfreq,:), 1,nBeta); 
   h99 = h99 + reshape(sum(bsxfun(@times, bigmat_h99, coefmat_h99),1),nBeta,nBeta) +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,2)).^2*xx_i(1,:)'*xx_i(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*2*abs(y(end,2)).^2*xx_i(end,:)'*xx_i(end,:));   
  
    bigmat_h42_1 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,2)).*ck4.*(xx_r(2:nfreq,:)*Beta_1(:,4)),...
      xx_r(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h42_1 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   bigmat_h42_2 = kron(bsxfun(@times,exp(-xx_r(2:nfreq,:)*Beta_1(:,2)).*rk4, xx_r(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h42_2 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   h42 = h42 + reshape(sum(bsxfun(@times, bigmat_h42_1, coefmat_h42_1) +...
      bsxfun(@times, bigmat_h42_2, coefmat_h42_2),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,4))*xx_r(1,:)+...
      (-y(1,1).*conj(y(1,2)) - y(1,2).*conj(y(1,1)))*xx_r(1,:))'*xx_r(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*(2*abs(y(end,1)).^2*(xx_r(end,:)*Beta_1(:,4))*xx_r(end,:)+...
      (-y(end,1).*conj(y(end,2)) - y(end,2).*conj(y(end,1)))*xx_r(end,:))'*xx_r(end,:)); 

    bigmat_h53_1 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*ck5.*(xx_r(2:nfreq,:)*Beta_1(:,5)),...
      xx_r(2:nfreq,:)), ones(nBeta,1)'); 
   coefmat_h53_1 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   bigmat_h53_2 = kron(bsxfun(@times,exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*rk5, xx_r(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h53_2 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   bigmat_h53_3 = kron(bsxfun(@times,exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*dk5, xx_r(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h53_3 = repmat(xx_r(2:nfreq,:), 1,nBeta); 
   h53 = h53 + reshape(sum(bsxfun(@times, bigmat_h53_1, coefmat_h53_1) +...
      bsxfun(@times, bigmat_h53_2, coefmat_h53_2)+...
      bsxfun(@times, bigmat_h53_3, coefmat_h53_3),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,5))*xx_r(1,:)+...
      (-y(1,1).*conj(y(1,3)) - y(1,3).*conj(y(1,1)))*xx_r(1,:)+...
      (y(1,2).*conj(y(1,1)).*(b(1).') + conj(y(1,2).*conj(y(1,1)).*(b(1).')))*xx_r(1,:))'*xx_r(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*(2*abs(y(end,1)).^2*(xx_r(end,:)*Beta_1(:,5))*xx_r(end,:)+...
     (-y(end,1).*conj(y(end,3)) - y(end,3).*conj(y(end,1)))*xx_r(end,:)+...
      (y(end,2).*conj(y(end,1)).*(b(end).') + conj(y(end,2).*conj(y(end,1)).*(b(end).')))*xx_r(end,:))'*xx_r(end,:));

   bigmat_h56_1 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*conj(y(2:nfreq,1)).*y(2:nfreq,2),...
      xx_r(2:nfreq,:)), ones(nBeta,1)'); 
   coefmat_h56_1 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   bigmat_h56_2 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*conj(y(2:nfreq,2)).*y(2:nfreq,1),...
      xx_r(2:nfreq,:)), ones(nBeta,1)'); 
   coefmat_h56_2 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   h56 = real(h56 + reshape(sum(bsxfun(@times, bigmat_h56_1, coefmat_h56_1) +...
      bsxfun(@times, bigmat_h56_2, coefmat_h56_2),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*((conj(y(1,1)).*y(1,2))*xx_r(1,:)'*xx_r(1,:)+...
      (conj(y(1,2)).*y(1,1))*xx_r(1,:)'*xx_r(1,:)))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*((conj(y(end,1)).*y(end,2))*xx_r(end,:)'*xx_r(end,:)+...
      (conj(y(end,2)).*y(end,1))*xx_r(end,:)'*xx_r(end,:)))); 

   bigmat_h59_1 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*conj(y(2:nfreq,1)).*y(2:nfreq,2),...
      xx_r(2:nfreq,:)), ones(nBeta,1)'); 
   coefmat_h59_1 = repmat(xx_i(2:nfreq,:), 1,nBeta);
   bigmat_h59_2 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*conj(y(2:nfreq,2)).*y(2:nfreq,1),...
      xx_r(2:nfreq,:)), ones(nBeta,1)'); 
   coefmat_h59_2 = repmat(xx_i(2:nfreq,:), 1,nBeta);
    h59 = imag(h59 + reshape(sum(bsxfun(@times, bigmat_h59_1, coefmat_h59_1) -...
      bsxfun(@times, bigmat_h59_2, coefmat_h59_2),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*((conj(y(1,1)).*y(1,2))*xx_r(1,:)'*xx_i(1,:)-...
      (conj(y(1,2)).*y(1,1))*xx_r(1,:)'*xx_i(1,:)))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*((conj(y(end,1)).*y(end,2))*xx_r(end,:)'*xx_i(end,:)-...
         (conj(y(end,2)).*y(end,1))*xx_r(end,:)'*xx_i(end,:))));
    bigmat_h63_1 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*ck6.*(xx_r(2:nfreq,:)*Beta_1(:,6)),...
      xx_r(2:nfreq,:)), ones(nBeta,1)'); 
   coefmat_h63_1 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   bigmat_h63_2 = kron(bsxfun(@times,exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*rk6, xx_r(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h63_2 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   bigmat_h63_3 = kron(bsxfun(@times,exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*dk6, xx_r(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h63_3 = repmat(xx_r(2:nfreq,:), 1,nBeta); 
   h63 = h63 + reshape(sum(bsxfun(@times, bigmat_h63_1, coefmat_h63_1) +...
      bsxfun(@times, bigmat_h63_2, coefmat_h63_2)+...
      bsxfun(@times, bigmat_h63_3, coefmat_h63_3),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(2*abs(y(1,2)).^2*(xx_r(1,:)*Beta_1(:,6))*xx_r(1,:)+...
      (-y(1,2).*conj(y(1,3)) - y(1,3).*conj(y(1,2)))*xx_r(1,:)+...
      (y(1,2).*conj(y(1,1)).*(a(1).') + conj(y(1,2).*conj(y(1,1)).*(a(1).')))*xx_r(1,:))'*xx_r(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*(2*abs(y(end,2)).^2*(xx_r(end,:)*Beta_1(:,6))*xx_r(end,:)+...
      (-y(end,2).*conj(y(end,3)) - y(end,3).*conj(y(end,2)))*xx_r(end,:)+...
      (y(end,2).*conj(y(end,1)).*(a(end).') + conj(y(end,2).*conj(y(end,1)).*(a(end).')))*xx_r(end,:))'*xx_r(end,:));     

   bigmat_h68_1 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*conj(y(2:nfreq,2)).*y(2:nfreq,1),...
      xx_r(2:nfreq,:)), ones(nBeta,1)'); 
   coefmat_h68_1 = repmat(xx_i(2:nfreq,:), 1,nBeta);
   bigmat_h68_2 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*conj(y(2:nfreq,1)).*y(2:nfreq,2),...
      xx_r(2:nfreq,:)), ones(nBeta,1)'); 
   coefmat_h68_2 = repmat(xx_i(2:nfreq,:), 1,nBeta);
   h68 = imag(h68 + reshape(sum(bsxfun(@times, bigmat_h68_1, coefmat_h68_1) -...
   bsxfun(@times, bigmat_h68_2, coefmat_h68_2),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*((conj(y(1,2)).*y(1,1))*xx_r(1,:)'*xx_i(1,:)-...
      (conj(y(1,1)).*y(1,2))*xx_r(1,:)'*xx_i(1,:)))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*((conj(y(end,2)).*y(end,1))*xx_r(end,:)'*xx_i(end,:)-...
      (conj(y(end,1)).*y(end,2))*xx_r(end,:)'*xx_i(end,:))));
   bigmat_h72_1 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,2)).*ck7.*(xx_i(2:nfreq,:)*Beta_2(:,1)),...
      xx_i(2:nfreq,:)), ones(nBeta,1)');    
   coefmat_h72_1 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   bigmat_h72_2 = kron(bsxfun(@times,exp(-xx_r(2:nfreq,:)*Beta_1(:,2)).*ik7, xx_i(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h72_2 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   h72 = h72 + reshape(sum(bsxfun(@times, bigmat_h72_1, coefmat_h72_1) +...
      bsxfun(@times, bigmat_h72_2, coefmat_h72_2),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,1))*xx_i(1,:)+...
      imag(-y(1,1).*conj(y(1,2)) + y(1,2).*conj(y(1,1)))*xx_i(1,:))'*xx_r(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*(2*abs(y(end,1)).^2*(xx_i(end,:)*Beta_2(:,1))*xx_i(end,:)+...
      imag(-y(end,1).*conj(y(end,2)) + y(end,2).*conj(y(end,1)))*xx_i(end,:))'*xx_r(end,:)); 

   bigmat_h83_1 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*ck8.*(xx_i(2:nfreq,:)*Beta_2(:,2)),...
      xx_i(2:nfreq,:)), ones(nBeta,1)'); 
   coefmat_h83_1 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   bigmat_h83_2 = kron(bsxfun(@times,exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*ik8, xx_i(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h83_2 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   bigmat_h83_3 = kron(bsxfun(@times,exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*(dk8), xx_i(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h83_3 = repmat(xx_r(2:nfreq,:), 1,nBeta); 
   h83 = h83 + reshape(sum(bsxfun(@times, bigmat_h83_1, coefmat_h83_1) +...
      bsxfun(@times, bigmat_h83_2, coefmat_h83_2)+...
      bsxfun(@times, bigmat_h83_3, coefmat_h83_3),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,2))*xx_i(1,:)+...
      (-y(1,1).*conj(y(1,3)) + y(1,3).*conj(y(1,1)))*xx_i(1,:)+...
      imag(-y(1,2).*conj(y(1,1)).*(b(1).') + conj(y(1,2).*conj(y(1,1)).*(b(1).')))*xx_i(1,:))'*xx_r(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*(2*abs(y(end,1)).^2*(xx_i(end,:)*Beta_2(:,2))*xx_i(end,:)+...
      (-y(end,1).*conj(y(end,3)) + y(end,3).*conj(y(end,1)))*xx_i(end,:)+...
      imag(-y(end,2).*conj(y(end,1)).*(b(end).') + conj(y(end,2).*conj(y(end,1)).*(b(end).')))*xx_i(end,:))'*xx_r(end,:));  

   bigmat_h93_1 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*ck9.*(xx_i(2:nfreq,:)*Beta_2(:,3)),...
      xx_i(2:nfreq,:)), ones(nBeta,1)'); 
   coefmat_h93_1 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   bigmat_h93_2 = kron(bsxfun(@times,exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*ik9, xx_i(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h93_2 = repmat(xx_r(2:nfreq,:), 1,nBeta);
   bigmat_h93_3 = kron(bsxfun(@times,exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*(dk9), xx_i(2:nfreq,:)), ones(nBeta,1)');
   coefmat_h93_3 = repmat(xx_r(2:nfreq,:), 1,nBeta); 
   h93 = h93 + reshape(sum(bsxfun(@times, bigmat_h93_1, coefmat_h93_1) +...
      bsxfun(@times, bigmat_h93_2, coefmat_h93_2)+...
      bsxfun(@times, bigmat_h93_3, coefmat_h93_3),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(2*abs(y(1,2)).^2*(xx_i(1,:)*Beta_2(:,3))*xx_i(1,:)+...
      (-y(1,2).*conj(y(1,3)) + y(1,3).*conj(y(1,2)))*xx_i(1,:)+...
      imag(-y(1,1).*conj(y(1,2)).*(a(1).') + conj(y(1,1).*conj(y(1,2)).*(a(1).')))*xx_i(1,:))'*xx_r(1,:))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*(2*abs(y(end,2)).^2*(xx_i(end,:)*Beta_2(:,3))*xx_i(end,:)+...
      (-y(end,2).*conj(y(end,3)) + y(end,3).*conj(y(end,2)))*xx_i(end,:)+...
      imag(-y(end,1).*conj(y(end,2)).*(a(end).') + conj(y(end,1).*conj(y(end,2)).*(a(end).')))*xx_i(end,:))'*xx_r(end,:)); 

   bigmat_h98_1 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*conj(y(2:nfreq,1)).*y(2:nfreq,2),...
      xx_i(2:nfreq,:)), ones(nBeta,1)'); 
   coefmat_h98_1 = repmat(xx_i(2:nfreq,:), 1,nBeta);
   bigmat_h98_2 = kron(bsxfun(@times, exp(-xx_r(2:nfreq,:)*Beta_1(:,3)).*conj(y(2:nfreq,2)).*y(2:nfreq,1),...
      xx_i(2:nfreq,:)), ones(nBeta,1)'); 
   coefmat_h98_2 = repmat(xx_i(2:nfreq,:), 1,nBeta);
   h98 = real(h98 + reshape(sum(bsxfun(@times, bigmat_h98_1, coefmat_h98_1) +...
      bsxfun(@times, bigmat_h98_2, coefmat_h98_2),1),nBeta,nBeta)' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*((conj(y(1,1)).*y(1,2))*xx_i(1,:)'*xx_i(1,:)+...
      (conj(y(1,2)).*y(1,1))*xx_i(1,:)'*xx_i(1,:)))+...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*((conj(y(end,1)).*y(end,2))*xx_i(end,:)'*xx_i(end,:)+...
      (conj(y(end,2)).*y(end,1))*xx_i(end,:)'*xx_i(end,:))));
   h24=h42'; h35=h53'; h65=h56'; h95=h59'; h36=h63';
   h86=h68'; h27=h72'; h38=h83'; h39=h93'; h89=h98';
 end
 ze = zeros(nBeta,nBeta);
 h1 = [h11,repmat(ze,1,8)];
 h2 = [ze,h22,ze,-h24,ze,ze,-h27,ze,ze];
 h3 = [ze,ze,h33,ze,-h35,-h36,ze,-h38,-h39];
 h4 = [ze,-h42,ze,h44,repmat(ze,1,5)];
 h5 = [ze,ze,-h53,ze,h55,h56,ze,ze,h59];
 h6 = [ze,ze,-h63,ze,h65,h66,ze,h68,ze];
 h7 = [ze,-h72,ze,ze,ze,ze,h77,ze,ze];
 h8 = [ze,ze,-h83,ze,ze,h86,ze,h88,h89];
 h9 = [ze,ze,-h93,ze,h95,ze,ze,h98,h99];

 gr = [gr1;gr2;gr3;gr4;gr5;gr6;gr7;gr8;gr9]; h = [h1;h2;h3;h4;h5;h6;h7;h8;h9]; f = -f;
 gr_index = (1:(dimen^2*nBeta)).*kron(chol_index(Phi_temp,:),ones(nBeta,1)');
 gr_index = gr_index(find(gr_index~=0));
 gr = gr(gr_index); h = h(gr_index,gr_index);
end

function [f,gr,h] = Beta_derive2(x, yobs_tmp, chol_index, Phi_temp, tau_temp_1,...
      tau_temp_2, Beta_temp_1, Beta_temp_2, sigmasqalpha, nbasis, nseg)

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Function used for optimization process for coefficients are the same
% across segments
%
%  Input:
%    1) x - initial values for coefficient of basis functions need to
%    be optimized
%    2) yobs_tmp - time series data within the segment
%    3) chol_index - index matrix
%    4) Phi_temp - which component changed
%    5) tau_temp_1 - smoothing parameters for the first segment
%    6) tau_temp_2 - smoothing parameters for the second segment
%         7) Beta_temp_1 - current coefficients for the first segment
%         8) Beta_temp_2 - current coefficients for the second segment
%    9) sigmasqalpha - smoothing parameters for the constant in real
%    components
%    10) nbasis - number of basis function used
%    11) nseg - number of observation in the first segment
%  Main Outputs:
%    1) f - log posterior probability based on input parameters
%    2) gr - gradients for optimization process
%    3) h - Hessian matrix for optimization process
%
%  Required programs: lin_basis_func, Beta_derive1
yobs_tmp_1 = yobs_tmp(1:nseg,:);
yobs_tmp_2 = yobs_tmp(nseg+1:end,:);
[f1,grad1,hes1] = Beta_derive1(x, yobs_tmp_1, chol_index, Phi_temp, tau_temp_1, Beta_temp_1, sigmasqalpha, nbasis);
[f2,grad2,hes2] = Beta_derive1(x, yobs_tmp_2, chol_index, Phi_temp, tau_temp_2, Beta_temp_2, sigmasqalpha, nbasis);

f = f1 + f2;
gr = grad1 + grad2;
h = hes1 + hes2;

function[PI,nseg_prop,xi_prop,tau_prop,Beta_prop,Phi_prop]=...
    birth(chol_index,ts,nexp_curr,nexp_prop,...
      tau_curr_temp,xi_curr_temp,nseg_curr_temp,Beta_curr_temp,Phi_curr_temp,....
      log_move_curr,log_move_prop)

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% Does the birth step in the paper
%
%  Input:
%    1) chol_index - index matrix
%    2) ts - TxN matrix of time series data
%    3) nexp_curr - current number of segment
%    4) nexp_prop - proposed number of segment
%    5) tau_curr_temp - current smoothing parameters
%    6) xi_curr_temp - current partitions
%    7) nseg_curr_temp - current number of observations in each segment
%    8) Beta_curr_temp - current coefficients
%    9) Phi_curr_temp - which component changed
%    10) log_move_curr - probability: proposed to current
%    11) log_move_prop - probability: current to proposed
%  Main Outputs:
%    1) A - acceptance probability
%    2) nseg_prop - proposed number of observations in each segment
%    3) xi_prop - proposed partitions
%    4) tau_prop - proposed smoothing parameters
%    5) Beta_prop - proposed coefficients
%    6) Phi_prop - proposed indicator variable
%
%  Required programs: postBeta1, Beta_derive1, whittle_like
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

global nobs dimen nbasis nBeta sigmasqalpha tmin tau_up_limit

Beta_prop = zeros(nBeta,dimen^2,nexp_prop);
tau_prop = ones(dimen^2,nexp_prop,1);
nseg_prop = zeros(nexp_prop,1);
xi_prop = zeros(nexp_prop,1);
Phi_prop = zeros(nexp_prop,1);

%***************************************
%Drawing segment to split
%***************************************
kk = find(nseg_curr_temp>2*tmin); %Number of segments available for splitting
nposs_seg = length(kk);
seg_cut = kk(unidrnd(nposs_seg)); %Drawing segment to split
nposs_cut = nseg_curr_temp(seg_cut)-2*tmin+1; %Drawing new birthed partition

%************************************************
% Proposing new parameters, Beta, Phi, tau, xi
%************************************************
for jj=1:nexp_curr
 if jj<seg_cut %nothing updated or proposed here
   xi_prop(jj) = xi_curr_temp(jj);
   tau_prop(:,jj) = tau_curr_temp(:,jj);
   nseg_prop(jj) = nseg_curr_temp(jj);
   Beta_prop(:,:,jj) = Beta_curr_temp(:,:,jj);
   Phi_prop(jj) = Phi_curr_temp(jj);
 elseif jj==seg_cut %updating parameters in the selected paritions
   index = unidrnd(nposs_cut);
   if (seg_cut==1)
    xi_prop(seg_cut)=index+tmin-1;
   else
    xi_prop(seg_cut)=xi_curr_temp(jj-1)-1+tmin+index;
   end
   xi_prop(seg_cut+1) = xi_curr_temp(jj);

   %Determine which Cholesky components should change here
   Phi_prop(seg_cut) = randsample(2:2^(dimen^2),1);
   Phi_prop(seg_cut+1) = Phi_curr_temp(jj);
 
   %Drawing new tausq
   select = find(chol_index(Phi_prop(seg_cut),:)~=0);
   select_inv = find(chol_index(Phi_prop(seg_cut),:)==0);
   zz = rand(dimen^2,1)'.*chol_index(Phi_prop(seg_cut),:);
   uu = zz./(1-zz);
   uu(find(uu==0))= 1;
   tau_prop(:,seg_cut)= tau_curr_temp(:,seg_cut).*uu';
   tau_prop(:,seg_cut+1) = tau_curr_temp(:,seg_cut).*(1./uu)';

   %Drawing new values for coefficient of basis function for new birthed segments.
   nseg_prop(seg_cut) = index+tmin-1;
   nseg_prop(seg_cut+1) = nseg_curr_temp(jj)-nseg_prop(seg_cut);
   Phi_need = Phi_prop(seg_cut);
   for k=jj:(jj+1)
    if k==jj
     [Beta_mean_1, Beta_var_1,yobs_tmp_1] = postBeta1(chol_index, Phi_need, ...
     k, ts, tau_prop(:,k), Beta_curr_temp(:,:,jj), xi_prop);
     Beta_prop(:,select,k) = reshape(mvnrnd(Beta_mean_1,...
         0.5*(Beta_var_1+Beta_var_1')),nBeta,length(select));
     Beta_prop(:,select_inv,k) = Beta_curr_temp(:,select_inv,jj);

   else
     [Beta_mean_2, Beta_var_2,yobs_tmp_2] = postBeta1(chol_index, Phi_need, ...
        k, ts, tau_prop(:,k), Beta_curr_temp(:,:,jj), xi_prop);
        Beta_prop(:,select,k) = reshape(mvnrnd(Beta_mean_2,...
         0.5*(Beta_var_2+Beta_var_2')),nBeta,length(select));
     Beta_prop(:,select_inv,k) = Beta_curr_temp(:,select_inv,jj);
   end
  end
 else %nothing updated or proposed here
   xi_prop(jj+1) = xi_curr_temp(jj);
       tau_prop(:,jj+1) = tau_curr_temp(:,jj);
       nseg_prop(jj+1) = nseg_curr_temp(jj);
       Beta_prop(:,:,jj+1) = Beta_curr_temp(:,:,jj);
   Phi_prop(jj+1) = Phi_curr_temp(jj);
 end
end

%Calculating Jacobian
ja = tau_curr_temp(:,seg_cut)./(zz.*(1-zz))'; ja = ja(ja~=Inf);
log_jacobian = sum(log(2*ja));

%*************************************************************
%Calculations related to proposed values
%*************************************************************

%=======================================================================
%Evaluating the Likelihood, Proposal and Prior Densities at the Proposed values
%=======================================================================

log_Beta_prop = 0;
log_tau_prior_prop = 0;
log_Beta_prior_prop = 0;
loglike_prop = 0;

for jj=seg_cut:seg_cut+1
 if jj==seg_cut
   Beta_mean = Beta_mean_1;
   Beta_var = Beta_var_1;
   yobs_tmp = yobs_tmp_1;
 else
   Beta_mean = Beta_mean_2;
   Beta_var = Beta_var_2;
   yobs_tmp = yobs_tmp_2;
 end
 pb = reshape(Beta_prop(:,select,jj), numel(Beta_prop(:,select,jj)),1);
 %Proposed density for coefficient of basis functions
 log_Beta_prop = log_Beta_prop - 0.5*(pb-Beta_mean)'*matpower(0.5*(Beta_var+Beta_var'),-1)*(pb-Beta_mean);

 %Prior density for coefficient of basis functions
 prior_tau = reshape([[repmat(sigmasqalpha,1,dimen^2-dimen*(dimen-1)/2) tau_prop((dimen + dimen*(dimen-1)/2 + 1):end,jj)'];...
      reshape(kron(tau_prop(:,jj),ones(nbasis,1)), nbasis, dimen^2)], nBeta,(dimen^2));
 prior_tau = reshape(prior_tau(:,select),length(select)*nBeta,1);

 log_Beta_prior_prop = log_Beta_prior_prop - 0.5*(pb)'*matpower(diag(prior_tau),-1)*(pb);

 log_tau_prior_prop = log_tau_prior_prop-length(select)*log(tau_up_limit);  % Prior Density of tausq
 [log_prop_spec_dens] = whittle_like(yobs_tmp,Beta_prop(:,:,jj));
 loglike_prop = loglike_prop + log_prop_spec_dens; %Loglikelihood at proposed values
end
 
log_seg_prop = -log(nposs_seg);%Proposal density for segment choice
log_cut_prop = -log(nposs_cut);%Proposal density for partition choice
log_Phi_prop = -log(2^(dimen^2)-1); %proposal density for component choice
log_prior_Phi_prop = -log(2^(dimen^2)); %prior density for component choice

%Evaluating prior density for cut points at proposed values
log_prior_cut_prop = 0;
for k=1:nexp_prop-1
    if k==1
       log_prior_cut_prop=-log(nobs-(nexp_prop-k+1)*tmin+1);
    else
       log_prior_cut_prop=log_prior_cut_prop-log(nobs-xi_prop(k-1)-(nexp_prop-k+1)*tmin+1);
    end
end
%Calculating Log Proposal density at Proposed values
log_proposal_prop = log_Beta_prop + log_seg_prop + log_move_prop + log_cut_prop + log_Phi_prop;
%Calculating Log Prior density at Proposed values
log_prior_prop = log_Beta_prior_prop + log_tau_prior_prop + log_prior_cut_prop + log_prior_Phi_prop;
%Calculating Target density at Proposed values
log_target_prop = loglike_prop + log_prior_prop;

%*************************************************************
%Calculations related to current values
%*************************************************************

%==============================================================================
%Evaluating the Likelihood, Proposal and Prior Densities at the Current values
%==============================================================================

[Beta_mean, Beta_var, yobs_tmp] = postBeta1(chol_index, Phi_need, seg_cut, ts, tau_curr_temp(:,seg_cut),...
              Beta_curr_temp(:,:,seg_cut), xi_curr_temp);
pb = reshape(Beta_curr_temp(:,select,seg_cut),numel(Beta_curr_temp(:,select,seg_cut)),1);

%Current density for coefficient of basis functions
log_Beta_curr = -0.5*(pb-Beta_mean)'*matpower(0.5*(Beta_var+Beta_var'),-1)*(pb-Beta_mean);

%Prior density for coefficient of basis functions at current values
prior_tau = reshape([[repmat(sigmasqalpha,1,dimen^2-dimen*(dimen-1)/2) tau_curr_temp((dimen + dimen*(dimen-1)/2 + 1):end,seg_cut)'];...
      reshape(kron(tau_curr_temp(:,seg_cut),ones(nbasis,1)), nbasis, dimen^2)], nBeta,(dimen^2));
prior_tau = reshape(prior_tau(:,select),length(select)*nBeta,1);
                
log_Beta_prior_curr = -0.5*(pb)'*matpower(diag(prior_tau),-1)*(pb);
           
log_tau_prior_curr = -length(select)*log(tau_up_limit); %prior density for smoothing parameters
[log_curr_spec_dens] = whittle_like(yobs_tmp,Beta_curr_temp(:,:,seg_cut));
loglike_curr = log_curr_spec_dens; %Loglikelihood at current values

log_Phi_curr = -log(1); %proposal for component choice
%Calculating Log Proposal density at current values
log_proposal_curr = log_Beta_curr + log_move_curr + log_Phi_curr;

%Evaluating prior density for partition current values
log_prior_cut_curr = 0;
for k=1:nexp_curr-1
    if k==1
       log_prior_cut_curr = -log(nobs-(nexp_curr-k+1)*tmin+1);
    else
       log_prior_cut_curr = log_prior_cut_curr-log(nobs-xi_curr_temp(k-1)-(nexp_curr-k+1)*tmin+1);
    end
end

log_prior_Phi_curr = -log(2^(dimen^2)); %prior density for component choice
%Calculating Priors at Current Values
log_prior_curr = log_Beta_prior_curr + log_tau_prior_curr + log_prior_cut_curr + log_prior_Phi_curr;
%Evalulating Target densities at current values
log_target_curr = loglike_curr + log_prior_curr;

%*************************************************************
%Calculations acceptance probability
%*************************************************************
 PI = min(1,exp(log_target_prop - log_target_curr +...
    log_proposal_curr - log_proposal_prop + log_jacobian));
end

function[chol_index]=chol_ind(dimen)

chol_index = zeros(2^(dimen^2), dimen^2 );
k=0;
for i=0:dimen^2
 C = nchoosek(1:dimen^2,i);
 dim = size(C);
 if i==0
   k=k+1;
   chol_index(1,:) = 0;
 else
   for j=1:dim(1)
    k=k+1;
    chol_index(k,C(j,:)) = 1;
   end
 end
end

end

function [result] = conv_diag(nobs, nBeta, nseg,nexp_curr, Beta_curr, tausq_curr, dimen)

%  conv_diag Calculate convergence diagnostics for beta and tau sq params
%  Get max eigenvalue of beta and tausq matrices across time and cov

idx = cumsum(nseg);
tausq_diag = zeros(nobs,dimen^2);
Beta_diag = zeros(nobs,nBeta,dimen^2);

for i=1:nexp_curr
 if i==1
  for k=1:dimen^2
   tausq_diag(1:idx(i),k) = repmat(tausq_curr(k,i),nseg(i),1);
 
   for j=1:nBeta
    Beta_diag(1:idx(i),j,k)=repmat(Beta_curr(j,k),nseg(i),1);
   end
  end
 else
  for k=1:dimen^2
   tausq_diag((idx(i-1)+1):idx(i),k) = repmat(tausq_curr(k,i),nseg(i),1);
 
   for j=1:nBeta
    Beta_diag((idx(i-1)+1):idx(i),j,k)=repmat(Beta_curr(j,k),nseg(i),1);
   end
  end 
 end
 
end

  %calculate eigenval of symmetric transform
  result = zeros(1,dimen^2*(nBeta+1));
  for k=1:dimen^2
   result(k) = real(max(eig(tausq_diag(:,k)*tausq_diag(:,k)')));
  end
  k=1;
  for i=1:dimen^2
   for j=1:nBeta
    result(k+dimen^2) = real(max(eig(Beta_diag(:,j,i)*Beta_diag(:,j,i)')));
    k=k+1;
   end
 end
end

function[PI,nseg_prop,xi_prop,tau_prop,Beta_prop,Phi_prop]=...
death(chol_index,ts,nexp_curr,nexp_prop,...
    tau_curr_temp,xi_curr_temp,nseg_curr_temp,Beta_curr_temp,Phi_curr_temp,log_move_curr,log_move_prop)

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Does the death step in the paper
%
% Input:
%   1) chol_index - index matrix
%   2) ts - TxN matrix of time series data
%   3) nexp_curr - current number of segment
%   4) nexp_prop - proposed number of segment
%   5) tau_curr_temp - current smoothing parameters
%   6) xi_curr_temp - current partitions
%   7) nseg_curr_temp - current number of observations in each segment
%   8) Beta_curr_temp - current coefficients
%   9) Phi_curr_temp - which component changed
%   10) log_move_curr - probability: proposed to current
%   11) log_move_prop - probability: current to proposed
%  Main Outputs:
%   1) A - acceptance probability
%   2) nseg_prop - proposed number of observations in each segment
%   3) xi_prop - proposed partitions
%   4) tau_prop - proposed smoothing parameters
%   5) Beta_prop - proposed coefficients
%   6) Phi_prop - proposed indicator variable
%
%  Required programs: postBeta1, Beta_derive_1, whittle_like
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

global nobs dimen nBeta nbasis sigmasqalpha tmin tau_up_limit

Beta_prop = zeros(nBeta,dimen^2,nexp_prop);
tau_prop = ones(dimen^2,nexp_prop,1);
nseg_prop = zeros(nexp_prop,1);
xi_prop = zeros(nexp_prop,1);
Phi_prop = zeros(nexp_prop,1);

%***************************************
%Draw a partition to delete
%***************************************
cut_del=unidrnd(nexp_curr-1);

j=0;
for k = 1:nexp_prop
 j = j+1;
 if k==cut_del

   %*********************************************************
     %Calculations related to proposed values
   %*********************************************************
   xi_prop(k) = xi_curr_temp(j+1);
   tau_prop(:,k) = sqrt(tau_curr_temp(:,j).*tau_curr_temp(:,j+1)); %Combine two taus into one
   nseg_prop(k) = nseg_curr_temp(j) + nseg_curr_temp(j+1); %Combine two segments into one
   Phi_prop(k) = Phi_curr_temp(j+1);
 
%============================================================
   %Evaluate the Likelihood at proposed values
%============================================================
   need = sum(Beta_curr_temp(:,:,k)-Beta_curr_temp(:,:,k+1));
   need = need./need;
   need(isnan(need))=0;
   aa = zeros(2^(dimen^2),1);
   for i=1:2^(dimen^2)
    aa(i)=sum(chol_index(i,:)==need);
   end
   Phi_need = find(aa==dimen^2); 
   select = find(chol_index(Phi_need,:)~=0);
   select_inv = find(chol_index(Phi_need,:)==0);
 
   %Compute mean and variances for coefficents of basis functions
   [Beta_mean, Beta_var, yobs_tmp]= postBeta1(chol_index, Phi_need, k, ts, tau_prop(:,k),...
              Beta_curr_temp(:,:,k), xi_prop);
   Beta_prop(:,select,k) = reshape(mvnrnd(Beta_mean,0.5*(Beta_var+Beta_var')),nBeta,length(select));
   Beta_prop(:,select_inv,k) = Beta_curr_temp(:,select_inv,k);

   %Loglikelihood at proposed values
   [loglike_prop]=whittle_like(yobs_tmp,Beta_prop(:,:,k));
 %========================================================================
 %Evaluate the Proposal Densities at the Proposed values for tau, Phi, and Beta
 %========================================================================
   pb = reshape(Beta_prop(:,select,k), numel(Beta_prop(:,select,k)),1);
   %Proposed density for coefficient of basis functions
   log_Beta_prop = - 0.5*(pb-Beta_mean)'*matpower(0.5*(Beta_var+Beta_var'),-1)*(pb-Beta_mean);
           
   log_seg_prop = -log(nexp_curr-1); %Proposal for segment choice
   log_Phi_prop = -log(1); %proposal for component choice
   %Calcualte Jacobian
       log_jacobian = -sum(log(2*(sqrt(tau_curr_temp(select,j)) + sqrt(tau_curr_temp(select,j+1))).^2));
 
   %log proposal probabililty
   log_proposal_prop = log_Beta_prop + log_seg_prop + log_move_prop + log_Phi_prop;

%===================================================================
% Evaluate the PRIOR Densities at the Proposed values for tau, Phi, and Beta
%===================================================================
   prior_tau = reshape([[repmat(sigmasqalpha,1,dimen^2-dimen*(dimen-1)/2) tau_prop((dimen + dimen*(dimen-1)/2 + 1):end,k)'];...
        reshape(kron(tau_prop(:,k),ones(nbasis,1)), nbasis,...
        dimen^2)], nBeta,(dimen^2));
   prior_tau = reshape(prior_tau(:,select),length(select)*nBeta,1);
 
   %Prior density for coefficient of basis functions
   log_Beta_prior_prop = -0.5*(pb)'*matpower(diag(prior_tau),-1)*(pb);
                 
       %Prior Density of tausq
       log_tau_prior_prop = -length(select)*log(tau_up_limit);
   %Prior Density of Phi
   log_Phi_prior_prop = -log(2^(dimen^2));
   log_prior_prop = log_tau_prior_prop + log_Beta_prior_prop + log_Phi_prior_prop;
%*************************************************************
%Calculations related to current values
%*************************************************************
 
%===========================================================================
%Evaluate the Likelihood, Proposal and Prior Densities at the Current values
%===========================================================================
       log_Beta_curr = 0;
       log_tau_prior_curr = 0;
       log_Beta_prior_curr = 0;
       loglike_curr=0;
   for jj=j:j+1
    [Beta_mean, Beta_var, yobs_tmp]= postBeta1(chol_index, Phi_need, jj, ts, tau_curr_temp(:,jj),...
             Beta_curr_temp(:,:,jj), xi_curr_temp);
                
     pb = reshape(Beta_curr_temp(:,select,jj), numel(Beta_curr_temp(:,select,jj)),1);
     %Current density for coefficient of basis functions
      log_Beta_curr = log_Beta_curr -0.5*(pb-Beta_mean)'*matpower(0.5*(Beta_var+Beta_var'),-1)*...
                        (pb-Beta_mean);
                     
     prior_tau = reshape([[repmat(sigmasqalpha,1,dimen^2-dimen*(dimen-1)/2) tau_curr_temp((dimen + dimen*(dimen-1)/2 + 1):end,jj)'];...
       reshape(kron(tau_curr_temp(:,jj),ones(nbasis,1)), nbasis,...
       dimen^2)], nBeta,(dimen^2));
     prior_tau = reshape(prior_tau(:,select),length(select)*nBeta,1);
 
     %Prior density for coefficient of basis functions at current values    
     log_Beta_prior_curr = log_Beta_prior_curr - 0.5*(pb)'*matpower(diag(prior_tau),-1)*(pb);
     [log_curr_spec_dens] = whittle_like(yobs_tmp,Beta_curr_temp(:,:,jj));
     %Loglikelihood at proposed values
     loglike_curr = loglike_curr + log_curr_spec_dens;
 
     %prior density for smoothing parameters
     log_tau_prior_curr = log_tau_prior_curr - length(select)*log(tau_up_limit);
   end
 
   log_Phi_curr = -log(2^(dimen^2)-1); %proposal for component choice
   log_Phi_prior_curr = -log(2^(dimen^2)); %prior for component choice
 
   %Calculate Log proposal density at current values
   log_proposal_curr = log_move_curr + log_Beta_curr + log_Phi_curr;
 
   %Calculate Priors at Current Vlaues
   log_prior_curr = log_Beta_prior_curr + log_tau_prior_curr + log_Phi_prior_curr;
   j=j+1;
 else
   xi_prop(k) = xi_curr_temp(j);
       tau_prop(:,k) = tau_curr_temp(:,j);
       nseg_prop(k) = nseg_curr_temp(j);
   Beta_prop(:,:,k) = Beta_curr_temp(:,:,j);
   Phi_prop(k) = Phi_curr_temp(j);
 end
end

%=================================================
%Evaluate Target density at proposed values
%=================================================
log_prior_cut_prop=0;
for k=1:nexp_prop-1
    if k==1
       log_prior_cut_prop=-log(nobs-(nexp_prop-k+1)*tmin+1);
    else
       log_prior_cut_prop=log_prior_cut_prop-log(nobs-xi_prop(k-1)-(nexp_prop-k+1)*tmin+1);
    end
end
log_target_prop = loglike_prop + log_prior_prop + log_prior_cut_prop;

%==================================================
%Evaluate Target density at current values
%==================================================
log_prior_cut_curr=0;
for k=1:nexp_curr-1
    if k==1
       log_prior_cut_curr=-log(nobs-(nexp_curr-k+1)*tmin+1);
    else
       log_prior_cut_curr=log_prior_cut_curr-log(nobs-xi_curr_temp(k-1)-(nexp_curr-k+1)*tmin+1);
    end
end
log_target_curr = loglike_curr + log_prior_curr + log_prior_cut_curr;

%**********************************************************
%Calculations acceptance probability
%**********************************************************
PI = min(1,exp(log_target_prop - log_target_curr + ...
    log_proposal_curr - log_proposal_prop + log_jacobian));

function [gr] = Gradient1(yobs_tmp, chol_index, Phi_temp, tau_temp,...
            Beta_temp, sigmasqalpha, nbasis)
global dimen
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Calculate log gradients for coefficients selected to be different
%
% Input:
%   1) x - initial values for coefficient of basis functions need to
%   be optimized
%   2) yobs_tmp - time series data within the segment
%   3) chol_index - index matrix
%   4) Phi_temp - which component changed
%   5) tau_temp - smoothing parameters
%   6) Beta_temp - current coefficients
%   7) sigmasqalpha - smoothing parameters for the constant in real
%   components
%   8) nbasis - number of basis functions used
%  Main Outputs:
%   2) gr - gradients for optimization process
%
%  Required programs: lin_basis_func
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

%initilize Beta_1 and Beta_2: Beta_2 is for imaginary components
nBeta = nbasis + 1;

Beta_1(:,:) = Beta_temp(:,1:(dimen + dimen*(dimen-1)/2));
Beta_2(:,:) = Beta_temp(1:nBeta,(dimen + dimen*(dimen-1)/2 + 1): end);

dim = size(yobs_tmp); n = dim(1);
nfreq = floor(n/2); tt = (0:nfreq)/(2*nfreq);
yy = fft(yobs_tmp)/sqrt(n); y = yy(1:(nfreq+1),:); nf = length(y);
[xx_r, xx_i]=lin_basis_func(tt);

%theta's
theta = zeros(dimen*(dimen-1)/2,nf);
for i=1:dimen*(dimen-1)/2
 theta_real = xx_r * Beta_1(:,i+dimen);
 theta_imag = xx_i * Beta_2(:,i);
 theta(i,:) = theta_real + sqrt(-1)*theta_imag;
end
%delta's
delta_sq = zeros(dimen,nf);
for i=1:dimen
  delta_sq(i,:) = exp(xx_r * Beta_1(:,i));
end

if dimen==2 %Bivariate Time Series

  gr1 = zeros(nBeta,1); gr2 = zeros(nBeta,1); gr3 = zeros(nBeta,1); gr4 = zeros(nBeta,1);
  gr1(1) = Beta_1(1,1)/sigmasqalpha; gr1(2:nBeta,1) = Beta_1(2:nBeta,1)/tau_temp(1);
  gr2(1) = Beta_1(1,2)/sigmasqalpha; gr2(2:nBeta,1) = Beta_1(2:nBeta,2)/tau_temp(2);
  gr3(1) = Beta_1(1,3)/sigmasqalpha; gr3(2:nBeta,1) = Beta_1(2:nBeta,3)/tau_temp(3);
 gr4(1:nBeta,1) = Beta_2(1:nBeta,1)/tau_temp(4);

 if (mod(n,2)==1) 
   %%%%%%%%%%%%%%%%%%%%%%%%
   %gradient
   %%%%%%%%%%%%%%%%%%%%%%%%
   rk = -y(2:end,1).*conj(y(2:end,2)) - y(2:end,2).*conj(y(2:end,1));
   ik = sqrt(-1)*(-y(2:end,1).*conj(y(2:end,2)) + y(2:end,2).*conj(y(2:end,1)));
   ck = 2*abs(y(2:end,1)).^2;

   gr1 = gr1 + xx_r(2:end,:)'*(1-abs(y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,1))) + ...
       0.5*(xx_r(1,:)'*(1-abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1))));
   gr2 = gr2 + xx_r(2:end,:)'*(1 - abs(y(2:end,2)-theta(2:end).'.*y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,2))) + ...
       0.5*(xx_r(1,:)'*(1 - abs(y(1,2)-theta(1).'.*y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,2))));
   temp_mat_31 = bsxfun(@times, xx_r(2:end,:),rk);
   temp_mat_32 = bsxfun(@times, ck, bsxfun(@times, xx_r(2:end,:),xx_r(2:end,:)*Beta_1(:,3)));
   gr3 = gr3 + sum( bsxfun(@times, (temp_mat_31+temp_mat_32), exp(-xx_r(2:end,:)*Beta_1(:,2)) ))' +...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(-y(1,1).*conj(y(1,2)) - y(1,2).*conj(y(1,1)))*xx_r(1,:)' +...
        exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,3))*xx_r(1,:)');
   temp_mat_41 = bsxfun(@times, ik, xx_i(2:end,:));
   temp_mat_42 = bsxfun(@times, ck, bsxfun(@times,xx_i(2:end,:),xx_i(2:end,:)*Beta_2(:,1)));
   gr4 = gr4 + sum( bsxfun(@times, (temp_mat_41 + temp_mat_42), exp(-xx_r(2:end,:)*Beta_1(:,2))))' + ...
        0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(sqrt(-1)*(-y(1,1).*conj(y(1,2)) + y(1,2).*conj(y(1,1))))*xx_i(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,1))*xx_i(1,:)');
 else
   %%%%%%%%%%%%%%%%%%%%%
   %gradient
   %%%%%%%%%%%%%%%%%%%%%
   rk = -y(2:nfreq,1).*conj(y(2:nfreq,2)) - y(2:nfreq,2).*conj(y(2:nfreq,1));
   ik = sqrt(-1)*(-y(2:nfreq,1).*conj(y(2:nfreq,2)) + y(2:nfreq,2).*conj(y(2:nfreq,1)));
   ck = 2*abs(y(2:nfreq,1)).^2;

   gr1 = gr1 + xx_r(2:nfreq,:)'*(1-abs(y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,1))) + ...
        0.5*(xx_r(1,:)'*(1-abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1)))) +...
       0.5*(xx_r(end,:)'*(1-abs(y(end,1)).^2.*exp(-xx_r(end,:)*Beta_1(:,1))));
   gr2 = gr2 + xx_r(2:nfreq,:)'*(1 - abs(y(2:nfreq,2)-theta(2:nfreq).'.*y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,2))) + ...
       0.5*(xx_r(1,:)'*(1 - abs(y(1,2)-theta(1).'.*y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,2)))) + ...
      0.5*(xx_r(end,:)'*(1 - abs(y(end,2)-theta(end).'.*y(end,1)).^2.*exp(-xx_r(end,:)*Beta_1(:,2))));
   temp_mat_31 = bsxfun(@times,rk, xx_r(2:nfreq,:));
   temp_mat_32 = bsxfun(@times,ck,bsxfun(@times, xx_r(2:nfreq,:),xx_r(2:nfreq,:)*Beta_1(:,3)));
   gr3 = gr3 + sum( bsxfun(@times, (temp_mat_31 + temp_mat_32), exp(-xx_r(2:nfreq,:)*Beta_1(:,2)) ))' +...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(-y(1,1).*conj(y(1,2)) - y(1,2).*conj(y(1,1)))*xx_r(1,:)' +...
      exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,3))*xx_r(1,:)') +...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*(-y(end,1).*conj(y(end,2)) - y(end,2).*conj(y(end,1)))*xx_r(end,:)' +...
      exp(-xx_r(end,:)*Beta_1(:,2))*2*abs(y(end,1)).^2*(xx_r(end,:)*Beta_1(:,3))*xx_r(end,:)');
   temp_mat_41 = bsxfun(@times, ik, xx_i(2:nfreq,:));
   temp_mat_42 = bsxfun(@times, ck, bsxfun(@times,xx_i(2:nfreq,:),xx_i(2:nfreq,:)*Beta_2(:,1)));
   gr4 = gr4 + sum( bsxfun(@times, (temp_mat_41 + temp_mat_42), exp(-xx_r(2:nfreq,:)*Beta_1(:,2))))' + ...
      0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(sqrt(-1)*(-y(1,1).*conj(y(1,2)) + y(1,2).*conj(y(1,1))))*xx_i(1,:)' +...
      exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,1))*xx_i(1,:)') +...
      0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*(sqrt(-1)*(-y(end,1).*conj(y(end,2)) + y(end,2).*conj(y(end,1))))*xx_i(end,:)' +...
      exp(-xx_r(end,:)*Beta_1(:,2))*2*abs(y(end,1)).^2*(xx_i(end,:)*Beta_2(:,1))*xx_i(end,:)');
 end
 gr = [gr1;gr2;gr3;gr4];
 gr_index = (1:(4*nBeta)).*[kron(chol_index(Phi_temp,1:3),ones(nBeta,1)'), kron(chol_index(Phi_temp,4),ones(nBeta,1)')];
 gr_index = gr_index(find(gr_index~=0));
 gr = gr(gr_index);

elseif dimen==3 %trivariate time series

 gr1 = zeros(nBeta,1); gr2 = zeros(nBeta,1); gr3 = zeros(nBeta,1); gr4 = zeros(nBeta,1);
 gr5 = zeros(nBeta,1); gr6 = zeros(nBeta,1); gr7 = zeros(nBeta,1); gr8 = zeros(nBeta,1);
 gr9 = zeros(nBeta,1);

 gr1(1) = Beta_1(1,1)/sigmasqalpha; gr1(2:nBeta) = Beta_1(2:nBeta,1)/tau_temp(1);
 gr2(1) = Beta_1(1,2)/sigmasqalpha; gr2(2:nBeta) = Beta_1(2:nBeta,2)/tau_temp(2);
 gr3(1) = Beta_1(1,3)/sigmasqalpha; gr3(2:nBeta) = Beta_1(2:nBeta,3)/tau_temp(3);
 gr4(1) = Beta_1(1,4)/sigmasqalpha; gr4(2:nBeta) = Beta_1(2:nBeta,4)/tau_temp(4);
 gr5(1) = Beta_1(1,5)/sigmasqalpha; gr5(2:nBeta) = Beta_1(2:nBeta,5)/tau_temp(5);
 gr6(1) = Beta_1(1,6)/sigmasqalpha; gr6(2:nBeta) = Beta_1(2:nBeta,6)/tau_temp(6);
 gr7(1:nBeta) = Beta_2(1:nBeta,1)/tau_temp(7);
 gr8(1:nBeta) = Beta_2(1:nBeta,2)/tau_temp(8);
 gr9(1:nBeta) = Beta_2(1:nBeta,3)/tau_temp(9);

 if (mod(n,2)==1) 
   %%%%%%%%%%%%%%%%%%%%%%%%
   %gradient
   %%%%%%%%%%%%%%%%%%%%%%%%
   rk4 = -y(2:end,1).*conj(y(2:end,2)) - y(2:end,2).*conj(y(2:end,1));
   ck4 = 2*abs(y(2:end,1)).^2;
   rk5 = -y(2:end,1).*conj(y(2:end,3)) - y(2:end,3).*conj(y(2:end,1));
   ck5 = 2*abs(y(2:end,1)).^2;
   b = theta(3,:);
   dk5 = y(2:end,2).*conj(y(2:end,1)).*(b(2:end).') + conj(y(2:end,2)).*y(2:end,1).*conj(b(2:end).');
   rk6 = -y(2:end,2).*conj(y(2:end,3)) - y(2:end,3).*conj(y(2:end,2));
   ck6 = 2*abs(y(2:end,2)).^2;
   a = theta(2,:);
   dk6 = y(2:end,1).*conj(y(2:end,2)).*(a(2:end).') + conj(y(2:end,1)).*y(2:end,2).*conj(a(2:end).');
   ik7 = sqrt(-1)*(-y(2:end,1).*conj(y(2:end,2)) + y(2:end,2).*conj(y(2:end,1)));
   ck7 = 2*abs(y(2:end,1)).^2;
   ik8 = sqrt(-1)*(-y(2:end,1).*conj(y(2:end,3)) + y(2:end,3).*conj(y(2:end,1)));
   ck8 = 2*abs(y(2:end,1)).^2;
   dk8 = sqrt(-1)*(-y(2:end,2).*conj(y(2:end,1)).*(b(2:end).') + conj(y(2:end,2).*conj(y(2:end,1)).*(b(2:end).'))) ;
   ik9 = sqrt(-1)*(-y(2:end,2).*conj(y(2:end,3)) + y(2:end,3).*conj(y(2:end,2)));
   ck9 = 2*abs(y(2:end,2)).^2;
   dk9 = sqrt(-1)*(-y(2:end,1).*conj(y(2:end,2)).*(a(2:end).') + conj(y(2:end,1).*conj(y(2:end,2)).*(a(2:end).'))) ;

   gr1 = gr1 + xx_r(2:end,:)'*(1-abs(y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,1))) + ...
       0.5*(xx_r(1,:)'*(1-abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1))));
   gr2 = gr2 + xx_r(2:end,:)'*(1 - abs(y(2:end,2)-theta(1,2:end).'.*y(2:end,1)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,2))) + ...
       0.5*(xx_r(1,:)'*(1 - abs(y(1,2)-theta(1,1).*y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,2))));
   gr3 = gr3 + xx_r(2:end,:)'*(1 - abs(y(2:end,3) - theta(2,2:end).'.*y(2:end,1) - theta(3,2:end).'.*y(2:end,2)).^2.*exp(-xx_r(2:end,:)*Beta_1(:,3)))+...
       0.5*(xx_r(1,:)'*(1 - abs(y(1,3) - theta(2,1).*y(1,1) - theta(3,1).*y(1,2))^2.*exp(-xx_r(1,:)*Beta_1(:,3))));   
   temp_mat_41 = bsxfun(@times, xx_r(2:end,:),rk4);
   temp_mat_42 = bsxfun(@times, ck4, bsxfun(@times, xx_r(2:end,:),xx_r(2:end,:)*Beta_1(:,4)));
   gr4 = gr4 + sum(bsxfun(@times, (temp_mat_41+temp_mat_42), exp(-xx_r(2:end,:)*Beta_1(:,2)) ))' +...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(-y(1,1).*conj(y(1,2)) - y(1,2).*conj(y(1,1)))*xx_r(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,4))*xx_r(1,:)');
   temp_mat_51 = bsxfun(@times, xx_r(2:end,:),rk5);
   temp_mat_52 = bsxfun(@times, ck5, bsxfun(@times, xx_r(2:end,:),xx_r(2:end,:)*Beta_1(:,5)));
   temp_mat_53 = bsxfun(@times, xx_r(2:end,:),dk5);
   gr5 = gr5 + sum(bsxfun(@times, (temp_mat_51 + temp_mat_52 + temp_mat_53), exp(-xx_r(2:end,:)*Beta_1(:,3)) ))'+...
       0.5* ( exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,1).*conj(y(1,3)) - y(1,3).*conj(y(1,1)))*xx_r(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,5))*xx_r(1,:)'+...
       exp(-xx_r(1,:)*Beta_1(:,3))*(y(1,2).*conj(y(1,1)).*(b(1).') + conj(y(1,2).*conj(y(1,1)).*(b(1).')))*xx_r(1,:)');
   temp_mat_61 = bsxfun(@times, xx_r(2:end,:),rk6);
   temp_mat_62 = bsxfun(@times, ck6, bsxfun(@times, xx_r(2:end,:),xx_r(2:end,:)*Beta_1(:,6)));
   temp_mat_63 = bsxfun(@times, xx_r(2:end,:),dk6);
   gr6 = gr6 + sum(bsxfun(@times, (temp_mat_61 + temp_mat_62 + temp_mat_63), exp(-xx_r(2:end,:)*Beta_1(:,3)) ))'+...
       0.5* (exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,2).*conj(y(1,3)) - y(1,3).*conj(y(1,2)))*xx_r(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,2)).^2*(xx_r(1,:)*Beta_1(:,6))*xx_r(1,:)'+...
       exp(-xx_r(1,:)*Beta_1(:,3))*(y(1,1).*conj(y(1,2)).*(a(1).') + conj(y(1,1).*conj(y(1,2)).*(a(1).')))*xx_r(1,:)');
       temp_mat_71 = bsxfun(@times, ik7, xx_i(2:end,:));
       temp_mat_72 = bsxfun(@times, ck7, bsxfun(@times,xx_i(2:end,:),xx_i(2:end,:)*Beta_2(:,1)));
       gr7 = gr7 + sum(bsxfun(@times, (temp_mat_71 + temp_mat_72), exp(-xx_r(2:end,:)*Beta_1(:,2))))' + ...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(imag(-y(1,1).*conj(y(1,2)) + y(1,2).*conj(y(1,1))))*xx_i(1,:)' +...
            exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,1))*xx_i(1,:)');
   temp_mat_81 = bsxfun(@times, ik8, xx_i(2:end,:));
   temp_mat_82 = bsxfun(@times, ck8, bsxfun(@times,xx_i(2:end,:),xx_i(2:end,:)*Beta_2(:,2))); 
   temp_mat_83 = bsxfun(@times, xx_i(2:end,:),dk8);
   gr8 = gr8 + sum(bsxfun(@times, (temp_mat_81 + temp_mat_82 + temp_mat_83), exp(-xx_r(2:end,:)*Beta_1(:,3))))' + ...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(imag(-y(1,1).*conj(y(1,3)) + y(1,3).*conj(y(1,1))))*xx_i(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,2))*xx_i(1,:)'+...
       sqrt(-1)*exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,1).*conj(y(1,2)).*(b(1).') + conj(y(1,1).*conj(y(1,2)).*(b(1).')))*xx_i(1,:)');
   temp_mat_91 = bsxfun(@times, ik9, xx_i(2:end,:));
   temp_mat_92 = bsxfun(@times, ck9, bsxfun(@times,xx_i(2:end,:),xx_i(2:end,:)*Beta_2(:,3))); 
   temp_mat_93 = bsxfun(@times, xx_i(2:end,:),dk9);
   gr9 = gr9 + sum(bsxfun(@times, (temp_mat_91 + temp_mat_92 + temp_mat_93), exp(-xx_r(2:end,:)*Beta_1(:,3))))' + ...
       0.5 * (exp(-xx_r(1,:)*Beta_1(:,3))*(imag(-y(1,2).*conj(y(1,3)) + y(1,3).*conj(y(1,2))))*xx_i(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,2)).^2*(xx_i(1,:)*Beta_2(:,3))*xx_i(1,:)'+...
       sqrt(-1)*exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,1).*conj(y(1,2)).*(a(1).') + conj(y(1,1).*conj(y(1,2)).*(a(1).')))*xx_i(1,:)'); 
 else
  %%%%%%%%%%%%%%%%%%%%%%%%
  %gradient
  %%%%%%%%%%%%%%%%%%%%%%%%
   rk4 = -y(2:nfreq,1).*conj(y(2:nfreq,2)) - y(2:nfreq,2).*conj(y(2:nfreq,1));
   ck4 = 2*abs(y(2:nfreq,1)).^2;
   rk5 = -y(2:nfreq,1).*conj(y(2:nfreq,3)) - y(2:nfreq,3).*conj(y(2:nfreq,1));
   ck5 = 2*abs(y(2:nfreq,1)).^2;
   b = theta(3,:);
   dk5 = y(2:nfreq,2).*conj(y(2:nfreq,1)).*(b(2:nfreq).') + conj(y(2:nfreq,2)).*y(2:nfreq,1).*conj(b(2:nfreq).');
   rk6 = -y(2:nfreq,2).*conj(y(2:nfreq,3)) - y(2:nfreq,3).*conj(y(2:nfreq,2));
   ck6 = 2*abs(y(2:nfreq,2)).^2;
   a = theta(2,:);
   dk6 = y(2:nfreq,1).*conj(y(2:nfreq,2)).*(a(2:nfreq).') + conj(y(2:nfreq,1).*conj(y(2:nfreq,2)).*(a(2:nfreq).'));
   ik7 = sqrt(-1)*(-y(2:nfreq,1).*conj(y(2:nfreq,2)) + y(2:nfreq,2).*conj(y(2:nfreq,1)));
   ck7 = 2*abs(y(2:nfreq,1)).^2;
   ik8 = sqrt(-1)*(-y(2:nfreq,1).*conj(y(2:nfreq,3)) + y(2:nfreq,3).*conj(y(2:nfreq,1)));
   ck8 = 2*abs(y(2:nfreq,1)).^2;
    dk8 = sqrt(-1)*(-y(2:nfreq,2).*conj(y(2:nfreq,1)).*(b(2:nfreq).') + conj(y(2:nfreq,2).*conj(y(2:nfreq,1)).*(b(2:nfreq).')));
   ik9 = sqrt(-1)*(-y(2:nfreq,2).*conj(y(2:nfreq,3)) + y(2:nfreq,3).*conj(y(2:nfreq,2)));
   ck9 = 2*abs(y(2:nfreq,2)).^2;
    dk9 = sqrt(-1)*(-y(2:nfreq,1).*conj(y(2:nfreq,2)).*(a(2:nfreq).') + conj(y(2:nfreq,1).*conj(y(2:nfreq,2)).*(a(2:nfreq).')));

    gr1 = gr1 + xx_r(2:nfreq,:)'*(1-abs(y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,1))) + ...
       0.5*(xx_r(1,:)'*(1-abs(y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,1))))+...
       0.5*(xx_r(end,:)'*(1-abs(y(end,1)).^2.*exp(-xx_r(end,:)*Beta_1(:,1))));
   gr2 = gr2 + xx_r(2:nfreq,:)'*(1 - abs(y(2:nfreq,2)-theta(1,2:nfreq).'.*y(2:nfreq,1)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,2))) + ...
       0.5*(xx_r(1,:)'*(1 - abs(y(1,2)-theta(1,1).*y(1,1)).^2.*exp(-xx_r(1,:)*Beta_1(:,2)))) +...
       0.5*(xx_r(end,:)'*(1 - abs(y(end,2)-theta(1,end).*y(end,1)).^2.*exp(-xx_r(end,:)*Beta_1(:,2))));
   gr3 = gr3 + xx_r(2:nfreq,:)'*(1 - abs(y(2:nfreq,3) - theta(2,2:nfreq).'.*y(2:nfreq,1) - theta(3,2:nfreq).'.*y(2:nfreq,2)).^2.*exp(-xx_r(2:nfreq,:)*Beta_1(:,3)))+...
       0.5*(xx_r(1,:)'*(1 - abs(y(1,3) - theta(2,1).*y(1,1) - theta(3,1).*y(1,2))^2.*exp(-xx_r(1,:)*Beta_1(:,3))))+...
       0.5*(xx_r(end,:)'*(1 - abs(y(end,3) - theta(2,end).*y(end,1) - theta(3,end).*y(end,2))^2.*exp(-xx_r(end,:)*Beta_1(:,3))));   
   temp_mat_41 = bsxfun(@times, xx_r(2:nfreq,:),rk4);
   temp_mat_42 = bsxfun(@times, ck4, bsxfun(@times, xx_r(2:nfreq,:),xx_r(2:nfreq,:)*Beta_1(:,4)));
   gr4 = gr4 + sum(bsxfun(@times, (temp_mat_41+temp_mat_42), exp(-xx_r(2:nfreq,:)*Beta_1(:,2)) ))' +...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(-y(1,1).*conj(y(1,2)) - y(1,2).*conj(y(1,1)))*xx_r(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,4))*xx_r(1,:)')+...
       0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*(-y(end,1).*conj(y(end,2)) - y(end,2).*conj(y(end,1)))*xx_r(end,:)' +...
       exp(-xx_r(end,:)*Beta_1(:,2))*2*abs(y(end,1)).^2*(xx_r(end,:)*Beta_1(:,4))*xx_r(end,:)');
   temp_mat_51 = bsxfun(@times, xx_r(2:nfreq,:),rk5);
   temp_mat_52 = bsxfun(@times, ck5, bsxfun(@times, xx_r(2:nfreq,:),xx_r(2:nfreq,:)*Beta_1(:,5)));
   temp_mat_53 = bsxfun(@times, xx_r(2:nfreq,:),dk5);
   gr5 = gr5 + sum( bsxfun(@times, (temp_mat_51 + temp_mat_52 + temp_mat_53), exp(-xx_r(2:nfreq,:)*Beta_1(:,3)) ))'+...
            0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,1).*conj(y(1,3)) - y(1,3).*conj(y(1,1)))*xx_r(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,1)).^2*(xx_r(1,:)*Beta_1(:,5))*xx_r(1,:)'+...
       exp(-xx_r(1,:)*Beta_1(:,3))*(y(1,2).*conj(y(1,1)).*(b(1).') + conj(y(1,2).*conj(y(1,1)).*(b(1).')))*xx_r(1,:)')+...
       0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*(-y(end,1).*conj(y(end,3)) - y(end,3).*conj(y(end,1)))*xx_r(end,:)' +...
       exp(-xx_r(end,:)*Beta_1(:,3))*2*abs(y(end,1)).^2*(xx_r(end,:)*Beta_1(:,5))*xx_r(end,:)'+...
        exp(-xx_r(end,:)*Beta_1(:,3))*(y(end,2).*conj(y(end,1)).*(b(end).') + conj(y(end,2).*conj(y(end,1)).*(b(end).')))*xx_r(end,:)');
   temp_mat_61 = bsxfun(@times, xx_r(2:nfreq,:),rk6);
   temp_mat_62 = bsxfun(@times, ck6, bsxfun(@times, xx_r(2:nfreq,:),xx_r(2:nfreq,:)*Beta_1(:,6)));
   temp_mat_63 = bsxfun(@times, xx_r(2:nfreq,:),dk6);
   gr6 = gr6 + sum(bsxfun(@times, (temp_mat_61 + temp_mat_62 + temp_mat_63), exp(-xx_r(2:nfreq,:)*Beta_1(:,3)) ))'+...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,2).*conj(y(1,3)) - y(1,3).*conj(y(1,2)))*xx_r(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,2)).^2*(xx_r(1,:)*Beta_1(:,6))*xx_r(1,:)'+...
       exp(-xx_r(1,:)*Beta_1(:,3))*(y(1,1).*conj(y(1,2)).*(a(1).') + conj(y(1,1).*conj(y(1,2)).*(a(1).')))*xx_r(1,:)')+...
       0.5*(exp(-xx_r(end,:)*Beta_1(:,3))*(-y(end,2).*conj(y(end,3)) - y(end,3).*conj(y(end,2)))*xx_r(end,:)' +...
       exp(-xx_r(end,:)*Beta_1(:,3))*2*abs(y(end,2)).^2*(xx_r(end,:)*Beta_1(:,6))*xx_r(end,:)'+...
       exp(-xx_r(end,:)*Beta_1(:,3))*(y(end,1).*conj(y(end,2)).*(a(end).') + conj(y(end,1).*conj(y(end,2)).*(a(end).')))*xx_r(end,:)');
   temp_mat_71 = bsxfun(@times, ik7, xx_i(2:nfreq,:));
   temp_mat_72 = bsxfun(@times, ck7, bsxfun(@times,xx_i(2:nfreq,:),xx_i(2:nfreq,:)*Beta_2(:,1)));
   gr7 = gr7 + sum(bsxfun(@times, (temp_mat_71 + temp_mat_72), exp(-xx_r(2:nfreq,:)*Beta_1(:,2))))' + ...
       0.5*(exp(-xx_r(1,:)*Beta_1(:,2))*(imag(-y(1,1).*conj(y(1,2)) + y(1,2).*conj(y(1,1))))*xx_i(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,2))*2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,1))*xx_i(1,:)')+...
       0.5*(exp(-xx_r(end,:)*Beta_1(:,2))*(imag(-y(end,1).*conj(y(end,2)) + y(end,2).*conj(y(end,1))))*xx_i(end,:)' +...
       exp(-xx_r(end,:)*Beta_1(:,2))*2*abs(y(end,1)).^2*(xx_i(end,:)*Beta_2(:,1))*xx_i(end,:)');
   temp_mat_81 = bsxfun(@times, ik8, xx_i(2:nfreq,:));
   temp_mat_82 = bsxfun(@times, ck8, bsxfun(@times,xx_i(2:nfreq,:),xx_i(2:nfreq,:)*Beta_2(:,2))); 
   temp_mat_83 = bsxfun(@times, xx_i(2:nfreq,:),dk8);
   gr8 = gr8 + sum(bsxfun(@times, (temp_mat_81 + temp_mat_82 + temp_mat_83), exp(-xx_r(2:nfreq,:)*Beta_1(:,3))))' + ...
       0.5 * (exp(-xx_r(1,:)*Beta_1(:,3))*(imag(-y(1,1).*conj(y(1,3)) + y(1,3).*conj(y(1,1))))*xx_i(1,:)' +...
       exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,1)).^2*(xx_i(1,:)*Beta_2(:,2))*xx_i(1,:)'+...
       sqrt(-1)*exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,1).*conj(y(1,2)).*(b(1).') + conj(y(1,1).*conj(y(1,2)).*(b(1).')))*xx_i(1,:)')+...
       0.5 * (exp(-xx_r(end,:)*Beta_1(:,3))*(imag(-y(end,1).*conj(y(end,3)) + y(end,3).*conj(y(end,1))))*xx_i(end,:)' +...
       exp(-xx_r(end,:)*Beta_1(:,3))*2*abs(y(end,1)).^2*(xx_i(end,:)*Beta_2(:,2))*xx_i(end,:)'+...
       sqrt(-1)*exp(-xx_r(end,:)*Beta_1(:,3))*(-y(end,1).*conj(y(end,2)).*(b(end).') + conj(y(end,1).*conj(y(end,2)).*(b(end).')))*xx_i(end,:)');
   temp_mat_91 = bsxfun(@times, ik9, xx_i(2:nfreq,:));
   temp_mat_92 = bsxfun(@times, ck9, bsxfun(@times,xx_i(2:nfreq,:),xx_i(2:nfreq,:)*Beta_2(:,3))); 
   temp_mat_93 = bsxfun(@times, xx_i(2:nfreq,:),dk9);
   gr9 = gr9 + sum(bsxfun(@times, (temp_mat_91 + temp_mat_92 + temp_mat_93), exp(-xx_r(2:nfreq,:)*Beta_1(:,3))))' + ...
       0.5 * (exp(-xx_r(1,:)*Beta_1(:,3))*(imag(-y(1,2).*conj(y(1,3)) + y(1,3).*conj(y(1,2))))*xx_i(1,:)' +...
        exp(-xx_r(1,:)*Beta_1(:,3))*2*abs(y(1,2)).^2*(xx_i(1,:)*Beta_2(:,3))*xx_i(1,:)'+...
       sqrt(-1)*exp(-xx_r(1,:)*Beta_1(:,3))*(-y(1,1).*conj(y(1,2)).*(a(1).') + conj(y(1,1).*conj(y(1,2)).*(a(1).')))*xx_i(1,:)')+...
        0.5 * (exp(-xx_r(end,:)*Beta_1(:,3))*(imag(-y(end,2).*conj(y(end,3)) + y(end,3).*conj(y(end,2))))*xx_i(end,:)' +...
       exp(-xx_r(end,:)*Beta_1(:,3))*2*abs(y(end,2)).^2*(xx_i(end,:)*Beta_2(:,3))*xx_i(end,:)'+...
       sqrt(-1)*exp(-xx_r(end,:)*Beta_1(:,3))*(-y(end,1).*conj(y(end,2)).*(a(end).') + conj(y(end,1).*conj(y(end,2)).*(a(end).')))*xx_i(end,:)');
 end
 gr = [gr1;gr2;gr3;gr4;gr5;gr6;gr7;gr8;gr9];
 gr_index = (1:(dimen^2*nBeta)).*kron(chol_index(Phi_temp,:),ones(nBeta,1)');
 gr_index = gr_index(find(gr_index~=0));
 gr = gr(gr_index);
end

gr=-gr;

function [gr] = Gradient2(yobs_tmp, chol_index, Phi_temp, tau_temp_1,...
       tau_temp_2, Beta_temp_1, Beta_temp_2, sigmasqalpha, nbasis, nseg)

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% Calculate log gradients for coefficients selected to be then same
%
% Input:
%  1) x - initial values for coefficient of basis functions need to
%  be optimized
%  2) yobs_tmp - time series data within the segment
%  3) chol_index - index matrix
%  4) Phi_temp - which component changed
%  5) tau_temp_1 - smoothing parameters for the first segment
%  6) tau_temp_2 - smoothing parameters for the second segment
%  7) Beta_temp_1 - current coefficients for the first segment
%  8) Beta_temp_2 - current coefficients for the second segment
%  9) sigmasqalpha - smoothing parameters for the constant in real
%  components
%  10) nbasis - number of basis functions used
%  11) nseg - number of observations in the first segment
%  Main Outputs:
%  2) gr - gradients for optimization process
%
%  Required programs: lin_basis_func, Beta_derive1
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

yobs_tmp_1 = yobs_tmp(1:nseg,:);
yobs_tmp_2 = yobs_tmp(nseg+1:end,:);
[grad1] = Gradient1(yobs_tmp_1, chol_index, Phi_temp, tau_temp_1, Beta_temp_1, sigmasqalpha, nbasis);
[grad2] = Gradient1(yobs_tmp_2, chol_index, Phi_temp, tau_temp_2, Beta_temp_2, sigmasqalpha, nbasis);

gr = grad1 + grad2;


function [Beta_out, m_out, m, yobs_tmp]=Hamilt1(chol_index, Phi_temp,...
           j, yobs, tau_temp, Beta_temp, xi_temp)
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% Does HMC updates for the coefficents that are different
%
%  Input:
%    1) chol_index - index matrix
%    2) Phi_temp - indicate variable that determine which coefficient
%    should be different
%    3) j - one of two segments
%    4) yobs - time series observations
%    5) tau_temp - current smoothing parameters
%    6) Beta_temp - current coefficients
%    7) xi__temp - current partitions
%  Main Outputs:
%    1) Beta_out - vector of coefficients
%    2) m_out - updated momentum variables
%    3) m - initial momentum variables
%    4) yobs_tmp - time series observations in selected segment
%
%  Required programs: Gradient1
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
 global nbasis sigmasqalpha dimen nBeta M ee

 %pick right portion of the data
 if j>1
   yobs_tmp = yobs((xi_temp(j-1)+1):xi_temp(j),:);
 else
   yobs_tmp = yobs(1:xi_temp(j),:);
 end

 select = chol_index(Phi_temp,:).*(1:dimen^2);
 select = select(select~=0);


 ll = nBeta*length(select);
 [gr]=Gradient1(yobs_tmp, chol_index, Phi_temp, tau_temp,...
           Beta_temp, sigmasqalpha, nbasis);
 Beta_old = reshape(Beta_temp(:,select),ll,1);
 Beta_out = Beta_old;

 % generate momentum variable
 m = mvnrnd(zeros(ll,1),M*eye(ll))';
 % determine leap number and step
 stepsize = unifrnd(0,2*ee);
 leap = randsample(1:2*ceil(1/ee),1);
 %leap = randsample(1:(1/stepsize),1);

 m_out = m + 0.5*gr*stepsize;
 for i=1:leap
   Beta_out = Beta_out + stepsize*(1/M)*eye(ll)*m_out;
   Beta_temp(:,select) = reshape(Beta_out,nBeta,length(select));
   if i==leap
     [gr] = Gradient1(yobs_tmp, chol_index, Phi_temp, tau_temp,...
              Beta_temp, sigmasqalpha, nbasis);
     m_out = m_out + 0.5*gr*stepsize;
   else
     [gr] = Gradient1(yobs_tmp, chol_index, Phi_temp, tau_temp,...
             Beta_temp, sigmasqalpha, nbasis);
     m_out = m_out + 1*gr*stepsize;
   end
 end
 m_out = -m_out;


function[Beta_out, m_out, m, yobs_tmp] = Hamilt2(chol_index, Phi_temp, j, yobs,...
        tau_temp_1, tau_temp_2, Beta_temp_1, Beta_temp_2, xi_temp, nseg)
     
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% Does HMC updates for the coefficents that are the same
%
%  Input:
%    1) chol_index - index matrix
%    2) Phi_temp - indicate variable that determine which coefficient
%    should be different
%    3) j - the first segment
%    4) yobs - time series observations
%    5) tau_temp_1 - current smoothing parameters for the first segment
%    6) tau_temp_2 - current smoothing parameters for the second
%    segment
%    7) Beta_temp_1 - current coefficients for the first segment
%    8) Beta_temp_2 - current coefficients for the second segment
%    9) xi__temp - current partitions
%    10) nseg - number of observation in the first segment
%  Main Outputs:
%    1) Beta_out - vector of coefficients
%    2) m_out - updated momentum variables
%    3) m - initial momentum variables
%    4) yobs_tmp - time series observations in selected segment
%
%  Required programs: Gradient2
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%    
 global dimen nbasis sigmasqalpha nBeta M ee


 %pick right portion of the data
 if j==1
   yobs_tmp = yobs(1:xi_temp(j+1),:);
 elseif j==length(xi_temp)-1
   yobs_tmp = yobs(xi_temp(j-1)+1:end,:);
 else
   yobs_tmp = yobs(xi_temp(j-1)+1:xi_temp(j+1),:);
 end

 aa = zeros(2^(dimen^2),1);
 for i=1:2^(dimen^2)
   aa(i)=sum(chol_index(i,:)~=chol_index(Phi_temp,:));
 end
 Phi_inv = find(aa==dimen^2);

 select = chol_index(Phi_inv,:).*(1:dimen^2);
 select = select(select~=0);

 ll = nBeta*length(select);
 [gr] = Gradient2(yobs_tmp, chol_index, Phi_inv, tau_temp_1,...
      tau_temp_2, Beta_temp_1, Beta_temp_2, sigmasqalpha, nbasis, nseg);
 Beta_old = reshape(Beta_temp_1(:,select),ll,1);
 Beta_out = Beta_old;
 % generate momentum variable
 m = mvnrnd(zeros(ll,1),M*eye(ll))';
 % determine leap number and stepsize
 stepsize = unifrnd(0,2*ee);
 leap = randsample(1:2*ceil(1/ee),1);

 m_out = m + 0.5*gr*stepsize;
 for i=1:leap
   Beta_out = Beta_out + stepsize*(1/M)*eye(ll)*m_out;
   Beta_temp_1(:,select) = reshape(Beta_out,nBeta,length(select));
   Beta_temp_2(:,select) = reshape(Beta_out,nBeta,length(select));
   if i==leap
     [gr] = Gradient2(yobs_tmp, chol_index, Phi_inv, tau_temp_1,...
          tau_temp_2, Beta_temp_1, Beta_temp_2, sigmasqalpha, nbasis, nseg);
     m_out = m_out + 0.5*gr*stepsize;
   else
     [gr] = Gradient2(yobs_tmp, chol_index, Phi_inv, tau_temp_1,...
          tau_temp_2, Beta_temp_1, Beta_temp_2, sigmasqalpha, nbasis, nseg);
     m_out = m_out + 1*gr*stepsize;
   end
 end
 m_out = -m_out;

function [xx_r,xx_i]= lin_basis_func(freq_hat)% freq are the frequencies

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Produces linear basis functions
%
%  Input:
%    1) xx_r - linear basis function for real Cholesky components
%    2) ts - linear basis function for imaginary Cholesky components
%  Main Outputs:
%    1) freq_hat - frequencies used
%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
 global nBeta dimen
 nfreq_hat=length(freq_hat);
 xx_r = ones((nfreq_hat),nBeta);
 xx_i = ones((nfreq_hat),nBeta);
 for j=2:nBeta
   xx_r(:,j) = sqrt(2)* cos(2*pi*(j-1)*freq_hat)/(2*pi*(j-1));
 end
 for j =1:nBeta
   xx_i(:,j) = sqrt(2)*sin(2*pi*j*freq_hat)/(2*pi*j);
 end

function[ai] = matpower(a,alpha)
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Calculate matrix power
%
%  Input:
%    1) a - a matrix
%    2) alpha - desired power
%  Main Outputs:
%    1) ai - matrix with desired power
%
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
   small = .000001;
   if numel(a) == 1
      ai=a^alpha;
   else
    [p1, ~] = size(a);
    [eve, eva] = eig(a);
    eva = diag(eva);
    eve = eve./(repmat((diag((eve)'*eve).^0.5),1,p1));
    index = 1:p1;
    index = index(eva>small);
    evai = eva;
    evai(index) = (eva(index)).^(alpha);
    ai = eve*diag(evai)*(eve)';
   end
end

function[Beta_mean,Beta_var,yobs_tmp] = postBeta1(chol_index, Phi_temp,...
            j, yobs, tau_temp, Beta_temp, xi_temp)
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% Calculate the mean and variance of normal approximation for coefficient
% of basis functions that are different across segments
%
%  Input:
%    1) chol_index - index matrix
%    2) Phi_temp - indicate variable that determine which coefficient
%    should be different
%    3) j - one of two segments
%    4) yobs - time series observations
%    5) tau_temp - current smoothing parameters
%    6) Beta_temp - current coefficients
%    7) xi__temp - current partitions
%  Main Outputs:
%    1) Beta_mean - mean of approximated distribution
%    2) Beta_var - variance of approximated distribution
%    3) yobs_tmp - time series observations in selected segment
%
%  Required programs: Beta_derive1
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

 global nbasis nBeta sigmasqalpha dimen options

 %var_inflate_1 is for variance inflation for first real part of Cholesky components;
 %var_inflate_1 is for variance inflation for rest of components

 %pick right portion of the data
 if j>1
    yobs_tmp = yobs((xi_temp(j-1)+1):xi_temp(j),:);
 else
    yobs_tmp = yobs(1:xi_temp(j),:);
 end

 %provide initial values
 x = zeros(chol_index(Phi_temp,:)*repmat(nBeta,dimen^2,1),1);

 %optimization process
 [Beta_mean,~,~,~,~,Beta_inv_var] = fminunc(@Beta_derive1, x, options, ...
       yobs_tmp, chol_index, Phi_temp, tau_temp, Beta_temp, sigmasqalpha, nbasis);
 %Beta_var = Beta_inv_vareye(size(Beta_inv_var));
 Beta_var = matpower(Beta_inv_var,-1);

function[Beta_mean,Beta_var,yobs_tmp] = postBeta2(chol_index, Phi_temp, j, yobs,...
       tau_temp_1, tau_temp_2, Beta_temp_1, Beta_temp_2, xi_temp, nseg)
 global dimen options nbasis nBeta sigmasqalpha

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% Calculate the mean and variance of normal approximation for coefficient
% of basis functions that are the same across segments
%
%  Input:
%    1) chol_index - index matrix
%    2) Phi_temp - indicate variable that determine which coefficient
%    should be different
%    3) j - the first segment
%    4) yobs - time series observations
%    5) tau_temp_1 - current smoothing parameters for the first segment
%    6) tau_temp_2 - current smoothing parameters for the second
%    segment
%    7) Beta_temp_1 - current coefficients for the first segment
%    8) Beta_temp_2 - current coefficients for the second segment
%    9) xi__temp - current partitions
%    10) nseg - number of observation in the first segment
%  Main Outputs:
%    1) Beta_mean - mean of approximated distribution
%    2) Beta_var - variance of approximated distribution
%    3) yobs_tmp - time series observations in selected segment
%
%  Required programs: Beta_derive2
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

 %pick right portion of the data
 if j==1
   yobs_tmp = yobs(1:xi_temp(j+1),:);
 elseif j==length(xi_temp)-1
   yobs_tmp = yobs(xi_temp(j-1)+1:end,:);
 else
   yobs_tmp = yobs(xi_temp(j-1)+1:xi_temp(j+1),:);
 end

 aa = zeros(2^(dimen^2),1);
 for i=1:2^(dimen^2)
   aa(i)=sum(chol_index(i,:)~=chol_index(Phi_temp,:));
 end
 Phi_inv = find(aa==dimen^2);

 %provide initial values
 x = zeros(chol_index(Phi_inv,:)*repmat(nBeta,dimen^2,1),1);

 %optimization process
 [Beta_mean,~,~,~,~,Beta_inv_var] = fminunc(@Beta_derive2, x, options, yobs_tmp, chol_index, Phi_inv,...
               tau_temp_1, tau_temp_2, Beta_temp_1, Beta_temp_2, sigmasqalpha, nbasis, nseg);
 %Beta_var = Beta_inv_vareye(size(Beta_inv_var));
 Beta_var = matpower(Beta_inv_var,-1);

function [log_whittle] = whittle_like(yobs_tmp, Beta)

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% Calculate local Whittle likelihood
%
%  Input:
%    1) yobs_tmp - time series in the segment
%    2) Beta - coefficient of basis functions
%  Main Outputs:
%    1) log_whitle - log local whittle likelihood
%  Require programs: lin_basis_function
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

global dimen
Beta_1 = Beta(:,1:(dimen + dimen*(dimen-1)/2));
Beta_2 = Beta(:,(dimen + dimen*(dimen-1)/2 + 1): end);

dim = size(yobs_tmp);
n = dim(1);
nfreq = floor(n/2);
tt = (0:nfreq)/(2*nfreq);
yy = fft(yobs_tmp)/sqrt(n);
y = yy(1:(nfreq+1),:);
nf = length(y);

[xx_r, xx_i]=lin_basis_func(tt);

%theta's
theta = zeros(dimen*(dimen-1)/2,nf);
for i=1:(dimen*(dimen-1)/2)
 theta_real = xx_r * Beta_1(:,i+dimen);
 theta_imag = xx_i * Beta_2(:,i);
 theta(i,:) = theta_real + sqrt(-1)*theta_imag;
end

%delta's
delta_sq = zeros(dimen,nf);
for i=1:dimen
 delta_sq(i,:) = exp(xx_r * Beta_1(:,i));
end

if dimen==2
  if (mod(n,2)==1) %odd n
    log_whittle = -sum(log(delta_sq(1,2:end))' + log(delta_sq(2,2:end))' + ...
         conj(y(2:end,1)).*y(2:end,1).*exp(-xx_r(2:end,:)*Beta_1(:,1)) + ...
         conj(y(2:end,2) - theta(2:end).'.*y(2:end,1)).*(y(2:end,2) - theta(2:end).'.*y(2:end,1)).*exp(-xx_r(2:end,:)*Beta_1(:,2))) - ...
         0.5*(log(delta_sq(1,1))' + log(delta_sq(2,1))' + conj(y(1,1)).*y(1,1).*exp(-xx_r(1,:)*Beta_1(:,1)) - ...
         conj(y(1,2) - theta(1).'.*y(1,1)).*(y(1,2) - theta(1).'.*y(1,1)).*exp(-xx_r(1,:)*Beta_1(:,2)));
 else
   log_whittle = -sum(log(delta_sq(1,2:nfreq))' + log(delta_sq(2,2:nfreq))' + ...
               conj(y(2:nfreq,1)).*y(2:nfreq,1).*exp(-xx_r(2:nfreq,:)*Beta_1(:,1)) + ...
         conj(y(2:nfreq,2) - theta(2:nfreq).'.*y(2:nfreq,1)).*(y(2:nfreq,2) - theta(2:nfreq).'.*y(2:nfreq,1)).*exp(-xx_r(2:nfreq,:)*Beta_1(:,2))) - ...
         0.5*(log(delta_sq(1,1)) + log(delta_sq(2,1)) + conj(y(1,1)).*y(1,1).*exp(-xx_r(1,:)*Beta_1(:,1)) + ...
         conj(y(1,2) - theta(1).'.*y(1,1)).*(y(1,2) - theta(1).'.*y(1,1)).*exp(-xx_r(1,:)*Beta_1(:,2))) - ...
         0.5*(log(delta_sq(1,end)) + log(delta_sq(2,end)) + conj(y(end,1)).*y((nfreq+1),1).*exp(-xx_r(end,:)*Beta_1(:,1)) + ...
         conj(y(end,2) - theta(end).'.*y(end,1)).*(y(end,2) - theta(end).'.*y(end,1)).*exp(-xx_r(end,:)*Beta_1(:,2)));
 end
elseif dimen==3
 if (mod(n,2)==1) %odd n
   log_whittle = -sum(log(delta_sq(1,2:end))' + log(delta_sq(2,2:end))' + log(delta_sq(3,2:end))' + ...
         conj(y(2:end,1)).*y(2:end,1).*exp(-xx_r(2:end,:)*Beta_1(:,1)) + ...
         conj(y(2:end,2) - theta(1,2:end).'.*y(2:end,1)).*(y(2:end,2) - theta(1,2:end).'.*y(2:end,1)).*exp(-xx_r(2:end,:)*Beta_1(:,2))+...
         conj(y(2:end,3) -(theta(2,2:end).'.*y(2:end,1)+ theta(3,2:end).'.*y(2:end,2))).*(y(2:end,3) -(theta(2,2:end).'.*y(2:end,1)+ theta(3,2:end).'.*y(2:end,2))).*...
         exp(-xx_r(2:end,:)*Beta_1(:,3))) - ...
         0.5*(log(delta_sq(1,1))' + log(delta_sq(2,1))' + log(delta_sq(3,1))' + ...
                conj(y(1,1)).*y(1,1).*exp(-xx_r(1,:)*Beta_1(:,1)) + ...
         conj(y(1,2) - theta(1,1).'.*y(1,1)).*(y(1,2) - theta(1,1).'.*y(1,1)).*exp(-xx_r(1,:)*Beta_1(:,2))+...
         conj(y(1,3) -(theta(2,1).'.*y(1,1)+ theta(3,1).'.*y(1,2))).*(y(1,3) -(theta(2,1).'.*y(1,1)+ theta(3,1).'.*y(1,2))).*...
         exp(-xx_r(1,:)*Beta_1(:,3)));

 else
   log_whittle = -sum(log(delta_sq(1,2:nfreq))' + log(delta_sq(2,2:nfreq))' + log(delta_sq(3,2:nfreq))' + ...
         conj(y(2:nfreq,1)).*y(2:nfreq,1).*exp(-xx_r(2:nfreq,:)*Beta_1(:,1)) + ...
         conj(y(2:nfreq,2) - theta(1,2:nfreq).'.*y(2:nfreq,1)).*(y(2:nfreq,2) - theta(1,2:nfreq).'.*y(2:nfreq,1)).*exp(-xx_r(2:nfreq,:)*Beta_1(:,2))+...
         conj(y(2:nfreq,3) -(theta(2,2:nfreq).'.*y(2:nfreq,1)+ theta(3,2:nfreq).'.*y(2:nfreq,2))).*(y(2:nfreq,3) -(theta(2,2:nfreq).'.*y(2:nfreq,1)+ theta(3,2:nfreq).'.*y(2:nfreq,2))).*...
         exp(-xx_r(2:nfreq,:)*Beta_1(:,3))) - ...
         0.5*(log(delta_sq(1,1))' + log(delta_sq(2,1))' + log(delta_sq(3,1))' + ...
         conj(y(1,1)).*y(1,1).*exp(-xx_r(1,:)*Beta_1(:,1)) + ...
         conj(y(1,2) - theta(1,1).'.*y(1,1)).*(y(1,2) - theta(1,1).'.*y(1,1)).*exp(-xx_r(1,:)*Beta_1(:,2))+...
         conj(y(1,3) -(theta(2,1).'.*y(1,1)+ theta(3,1).'.*y(1,2))).*(y(1,3) -(theta(2,1).'.*y(1,1)+ theta(3,1).'.*y(1,2))).*...
         exp(-xx_r(1,:)*Beta_1(:,3)))-...
         0.5*(log(delta_sq(1,end))' + log(delta_sq(2,end))' + log(delta_sq(3,end))' + ...
         conj(y(end,1)).*y(end,1).*exp(-xx_r(end,:)*Beta_1(:,1)) + ...
         conj(y(end,2) - theta(1,end).'.*y(end,1)).*(y(end,2) - theta(1,end).'.*y(end,1)).*exp(-xx_r(end,:)*Beta_1(:,2))+...
                conj(y(end,3) -(theta(2,end).'.*y(end,1)+ theta(3,end).'.*y(end,2))).*(y(end,3) -(theta(2,end).'.*y(end,1)+ theta(3,end).'.*y(end,2))).*...
         exp(-xx_r(end,:)*Beta_1(:,3)));
  end
end

function[A,nseg_new,xi_prop,tau_prop,Beta_prop,Phi_prop,seg_temp]=...
within(chol_index,ts,nexp_temp,tau_temp,xi_curr_temp,nseg_curr_temp,Beta_curr_temp,Phi_temp)
global nobs dimen nBeta M prob_mm1 tmin

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% Does the within-model move in the paper
%
%  Input:
%    1) chol_index - index matrix
%    2) ts - TxN matrix of time series data
%    3) nexp_temp - number of segments
%    5) tau_temp - vector of smoothing parameters
%    6) xi_curr_temp - current partitions
%    7) nseg_curr_temp - current number of observations in each segment
%    8) Beta_curr_temp - current vector of coefficients
%    9) Phi_temp - which component changed
%    10) Z - together with Phi_temp, indicating how components should
%    be update
%  Main Outputs:
%    1) PI - acceptance probability
%    2) nseg_new - new number of observations in each segment
%    3) xi_prop - proposed partitions
%    4) tau_prop - proposed smoothing parameters
%    5) Beta_prop - proposed coefficients
%    6) Phi_prop - proposed indicator variable
%
%  Required programs: postBeta1, postBeta2, Beta_derive1, Beta_derive2,
%  whittle_like
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

xi_prop = xi_curr_temp;
Beta_prop = Beta_curr_temp;
nseg_new = nseg_curr_temp;
tau_prop = tau_temp;
Phi_prop = Phi_temp;

if nexp_temp>1
  %*********************************************************
  % If contains more than one segments
  %*********************************************************

 seg_temp = unidrnd(nexp_temp-1); %Drawing Segment to cut
 u = rand;
 cut_poss_curr = xi_curr_temp(seg_temp);
 nposs_prior = nseg_curr_temp(seg_temp) + nseg_curr_temp(seg_temp+1) - 2*tmin+1;

 %Determing if the relocation is a big jump or small jump
 if u<prob_mm1
   if nseg_curr_temp(seg_temp)==tmin && nseg_curr_temp(seg_temp+1)==tmin
      nposs=1; %Number of possible locations for new cutpoint
          new_index=unidrnd(nposs);%Drawing index of new cutpoint
          cut_poss_new = xi_curr_temp(seg_temp)- 1 + new_index;
   elseif nseg_curr_temp(seg_temp)==tmin
          nposs=2; %Number of possible locations for new cutpoint
          new_index=unidrnd(nposs);%Drawing index of new cutpoint
          cut_poss_new = xi_curr_temp(seg_temp)- 1 + new_index;
   elseif nseg_curr_temp(seg_temp+1)==tmin
          nposs=2; %Number of possible locations for new cutpoint
          new_index = unidrnd(nposs); %Drawing index of new cutpoint
          cut_poss_new = xi_curr_temp(seg_temp) + 1 - new_index;
   else
          nposs=3;% Number of possible locations for new cutpoint
          new_index = unidrnd(nposs);%Drawing index of new cutpoint
          cut_poss_new = xi_curr_temp(seg_temp) - 2 + new_index;
   end
 else
       new_index=unidrnd(nposs_prior);%
       cut_poss_new = sum(nseg_curr_temp(1:seg_temp-1))- 1 + tmin+new_index;
 end

 xi_prop(seg_temp)=cut_poss_new;
 if seg_temp>1
   %Number of observations in lower part of new cutpoin
                 nseg_new(seg_temp) = xi_prop(seg_temp) - xi_curr_temp(seg_temp-1);
 else
       nseg_new(seg_temp) = xi_prop(seg_temp);
 end
 %Number of observations in upper part of new cutpoint
     nseg_new(seg_temp+1) = nseg_curr_temp(seg_temp) + nseg_curr_temp(seg_temp+1) - nseg_new(seg_temp);

%===========================================================
%Evaluating the cut Proposal density for the cut-point at the cureent
%and proposed values
%===========================================================
 if(abs(cut_poss_new-cut_poss_curr)>1)
   log_prop_cut_prop=log(1-prob_mm1)-log(nposs_prior);
   log_prop_cut_curr=log(1-prob_mm1)-log(nposs_prior);
 elseif nseg_curr_temp(seg_temp)==tmin &&#38; nseg_curr_temp(seg_temp+1)==tmin
   log_prop_cut_prop=0;
   log_prop_cut_curr=0;
 else
   if (nseg_curr_temp(seg_temp)==tmin || nseg_curr_temp(seg_temp+1)==tmin)
    %log_prop_cut_prop=log(1-prob_mm1)-log(nposs_prior)+log(1/2)+log(prob_mm1);
    log_prop_cut_prop=log(1/2)+log(prob_mm1);
   else
    %log_prop_cut_prop=log(1-prob_mm1)-log(nposs_prior)+log(1/3)+log(prob_mm1);
    log_prop_cut_prop=log(1/3)+log(prob_mm1);
   end
   if(nseg_new(seg_temp)==tmin || nseg_new(seg_temp+1)==tmin)
     %log_prop_cut_curr=log(1-prob_mm1)-log(nposs_prior)+log(1/2)+log(prob_mm1);
     log_prop_cut_curr=log(1/2)+log(prob_mm1);
   else
     %log_prop_cut_curr=log(1-prob_mm1)-log(nposs_prior)+log(1/3)+log(prob_mm1);
     log_prop_cut_curr=log(1/3)+log(prob_mm1);
   end
 end


 need = sum(Beta_curr_temp(:,:,seg_temp) - Beta_curr_temp(:,:,seg_temp+1));
 need = need./need;
 need(isnan(need))=0;
 aa = zeros(2^(dimen^2),1);
 for i=1:2^(dimen^2)
   aa(i)=sum(chol_index(i,:)==need);
 end
 Phi_need = find(aa==dimen^2);
 select = find(chol_index(Phi_need,:)~=0);
 select_inv = find(chol_index(Phi_need,:)==0);

%=============================================================================
%Evaluating the Loglikelihood, Priors and Proposals at the current values
%=============================================================================

 loglike_curr = 0;
 for j=seg_temp:seg_temp+1
    if j>1
      yobs_tmp = ts((xi_curr_temp(j-1)+1):xi_curr_temp(j),:);
     else
      yobs_tmp = ts(1:xi_curr_temp(j),:);
     end
    %Loglikelihood at current values
    [log_curr_spec_dens] = whittle_like(yobs_tmp,Beta_curr_temp(:,:,j));
    loglike_curr = loglike_curr + log_curr_spec_dens;
 end

%=============================================================================
%Evaluating the Loglikelihood, Priors and Proposals at the proposed values
%=============================================================================

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%For coefficient of basis functions that are the same across two segments %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

 if Phi_need ~=2^(dimen^2)
   [Beta_out, m_out, m, ~] = Hamilt2(chol_index, Phi_need, seg_temp,...
           ts, tau_prop(:,seg_temp), tau_prop(:,seg_temp+1),...
            Beta_prop(:,:,seg_temp), Beta_prop(:,:,seg_temp+1), xi_prop, nseg_new(seg_temp));        
           
   Beta_prop(:,select_inv,seg_temp) = reshape(Beta_out,nBeta,length(select_inv));
   Beta_prop(:,select_inv,seg_temp+1) = Beta_prop(:,select_inv,seg_temp);
   m_curr_1 = -0.5*m'*((1/M)*eye(length(m)))*m;
   m_prop_1 = -0.5*m_out'*((1/M)*eye(length(m_out)))*m_out;
 else
   m_curr_1 = 0;
   m_prop_1 = 0;
 end
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% For coefficient of basis functions that are different across two segments  %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
 loglike_prop = 0;
 yobs_tmp_2 = cell(2,1);
 m_prop_2 = 0;
 m_curr_2 = 0;
 for j=seg_temp:seg_temp+1

   [Beta_out, m_out, m, yobs_tmp]=Hamilt1(chol_index, Phi_need,...
              j, ts, tau_prop(:,j), Beta_prop(:,:,j), xi_prop);          
   if j==seg_temp
     yobs_tmp_2{1} = yobs_tmp;
   else
     yobs_tmp_2{2} = yobs_tmp;
   end      
 
   Beta_prop(:,select,j) = reshape(Beta_out,nBeta,length(select));
    m_curr_2 = m_curr_2 - 0.5*m'*((1/M)*eye(length(m)))*m;       
   m_prop_2 = m_prop_2 - 0.5*m_out'*((1/M)*eye(length(m_out)))*m_out;
 end

 %Loglikelihood at proposed values
 for j=seg_temp:seg_temp+1
   if j==seg_temp
     [log_curr_spec_dens] = whittle_like(yobs_tmp_2{1},Beta_prop(:,:,j));
   else
     [log_curr_spec_dens] = whittle_like(yobs_tmp_2{2},Beta_prop(:,:,j));
   end
   loglike_prop = loglike_prop+log_curr_spec_dens;
 end

 %proposal density
 log_proposal_curr = log_prop_cut_curr;
 log_proposal_prop = log_prop_cut_prop;

 %target density
 log_prior_cut_prop=0;
   log_prior_cut_curr=0;
 for k=1:nexp_temp-1
   if k==1
     log_prior_cut_prop=-log(nobs-(nexp_temp-k+1)*tmin+1);
           log_prior_cut_curr=-log(nobs-(nexp_temp-k+1)*tmin+1);
        else
                          log_prior_cut_prop=log_prior_cut_prop - log(nobs-xi_prop(k-1)-(nexp_temp-k+1)*tmin+1);
           log_prior_cut_curr=log_prior_cut_curr - log(nobs-xi_curr_temp(k-1)-(nexp_temp-k+1)*tmin+1);
   end
 end

 log_target_prop = loglike_prop + log_prior_cut_prop + m_prop_1 + m_prop_2;
 log_target_curr = loglike_curr + log_prior_cut_curr + m_curr_1 + m_curr_2;

else
 %*********************************************************
 % If contains only one segment
 %*********************************************************
 nseg_new = nobs;
 seg_temp = 1;
 %===========================================================================
 %Evaluating the Loglikelihood, Priors and Proposals at the proposed values
 %===========================================================================
 Phi_need = 2^(dimen^2);

 [Beta_out, m_out, m]=Hamilt1(chol_index, Phi_need,...
            1, ts, tau_temp, Beta_curr_temp, xi_curr_temp);
 Beta_prop(:,:,1) = reshape(Beta_out,nBeta,dimen^2);
 m_curr = - 0.5*m'*((1/M)*eye(length(m)))*m;       
 m_prop = - 0.5*m_out'*((1/M)*eye(length(m_out)))*m_out;

 %Loglike at proposed values
 [loglike_prop] = whittle_like(ts,Beta_prop(:,:,1));

 %Loglike at current values
 [loglike_curr] = whittle_like(ts,Beta_curr_temp(:,:,1));

 log_target_prop = loglike_prop + m_prop;
 log_target_curr = loglike_curr + m_curr;
 log_proposal_curr = 0;
 log_proposal_prop = 0;
end

%*************************************************************
%Calculations acceptance probability
%*************************************************************
A = min(1,exp(log_target_prop-log_target_curr+log_proposal_curr-log_proposal_prop));

function [out, fitparams] = MultiSpect_diag(ts,varargin)

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% Program for the MCMC nonstationary multivariate spectrum analysis paper
% 07/02/2016
% Does the MCMC iterations for bivariate time series
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% Extract information from the option parameters
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% If params is empty, then use the default parameters
if nargin==1
  params = OptsMultiSpect();
else
  params = varargin{1};
end;

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%II) Run the estimation proceedure
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

warning('off','MATLAB:nearlySingularMatrix')
nloop = params.nloop;
%number of total MCMC iterations
nwarmup = params.nwarmup;
%number of warmup period
nexp_max = params.nexp_max;
%Input maximum number of segments

global dimen nobs nbasis nBeta sigmasqalpha tau_up_limit ...
    prob_mm1 tmin var_inflate_1 var_inflate_2 options
options = optimset('Display','off','GradObj','on','Hessian','on','MaxIter',10000,...
 'MaxFunEvals',10000,'TolFun',1e-5,'TolX',1e-5);

nbasis = params.nbasis;   
%number of linear smoothing spline basis functions
nBeta = nbasis + 1;
%number of coefficients for real part of components
sigmasqalpha = params.sigmasqalpha;
%smoothing parameter for alpha
tau_up_limit = params.tau_up_limit;
%the prior for smoothing parameters
prob_mm1 = params.prob_mml;
%the probability of small jump, 1-prob of big jump
tmin = params.tmin;  
%minimum number of observation in each segment
var_inflate_1 = params.v1(1); 
var_inflate_2 = params.v1(1);
nfreq_hat = params.nfreq;
freq_hat=(0:nfreq_hat)'/(2*nfreq_hat);

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% II) Run the estimation proceedure
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

dim = size(ts);
nobs = dim(1);
dimen = dim(2);

spect_hat = cell(nexp_max,1);
for j=1:nexp_max
  spect_hat{j} = zeros(2,2,nfreq_hat+1,j,nloop+1);
end

nexp_curr = params.init; %initialize the number of segments

%initialize tausq
for j=1:nexp_curr
    tausq_curr(:,j,1)=rand(4,1)*tau_up_limit;
end

%initilize the location of the changepoints for j=1:nexp_curr(1)
xi_curr = zeros(nexp_curr,1);
nseg_curr = zeros(nexp_curr,1);
for j=1:nexp_curr
  if nexp_curr==1
   xi_curr = nobs;
   nseg_curr = nobs;
  else
   if j==1
           nposs = nobs-nexp_curr*tmin+1;
           xi_curr(j) = tmin + unidrnd(nposs)-1;
           nseg_curr(j) = xi_curr(j);
   elseif j>1 && j<nexp_curr
            nposs=nobs-xi_curr(j-1)-tmin*(nexp_curr-j+1)+1;
            xi_curr(j)=tmin+unidrnd(nposs)+xi_curr(j-1)-1;
            nseg_curr(j)=xi_curr(j)-xi_curr(j-1);
   else
            xi_curr(j)=nobs;
            nseg_curr(j)=xi_curr(j)-xi_curr(j-1);
   end
 end
end

%index matrix for which components of cholesky decomposition changed
chol_index = [0 0 0 0; 1 0 0 0; 0 1 0 0; 0 0 1 0; 0 0 0 1;...
      1 1 0 0; 1 0 1 0; 1 0 0 1; 0 0 1 1; 0 1 0 1;...
      0 1 1 0; 1 1 1 0; 1 1 0 1; 0 1 1 1; 1 0 1 1;...
      1 1 1 1];
% 1: no change; 2-5: one changes; 6-11: two changes; 12-15: three changes
% 16: all changed

Beta_curr = zeros(nBeta,4,nexp_curr);
Phi_temp = repmat(16,nexp_curr,1);
for j=1:nexp_curr
    [Beta_mean, Beta_var,~] = postBeta1(chol_index, Phi_temp(j), j, ts, tausq_curr, Beta_curr, xi_curr);
    Beta_curr(:,:,j) = reshape(mvnrnd(Beta_mean,0.5*(Beta_var+Beta_var')),nBeta,4);
end
Phi_curr=[repmat(16,nexp_curr-1,1);0];
%jumping probabilities
epsilon=zeros(nloop,1);
met_rat=zeros(nloop,1);
rat_birth=[];
rat_death=[];


%%%initialize .mat file
S.('ts') = ts;
S.('varargin') = varargin;
S.('nexp') = [];
S.('nseg') = [];
S.('xi') = {};
S.('Beta') = {};
S.('tausq') = {};
S.('spect_hat') = {};
S.('Spec_1') = {};
S.('Spec_2') = {};
S.('Coh') = {};
S.('Phi') = {};
if params.convdiag==1
  S.('conv_diag') = [];
end

save(params.fname, '-struct', 'S', '-v7.3');

%create matfile object to use in MCMC sampler
out = matfile(params.fname, 'Writable', true);

%create data structures to store temporary results in between ouputs
Beta_tmp = cell(params.batchsize,1);
spect_hat_tmp = cell(params.batchsize,1);
nexp_tmp = zeros(params.batchsize,1);
nseg_tmp = cell(params.batchsize,1);
tausq_tmp = cell(params.batchsize,1);
xi_tmp = cell(params.batchsize,1);
Phi_tmp = cell(params.batchsize,1);
Spect_1_tmp = cell(params.batchsize,1);
Spect_2_tmp = cell(params.batchsize,1);
Coh_tmp = cell(params.batchsize,1);
if params.convdiag==1
  convdiag_tmp = zeros(params.batchsize,4*nBeta+4);
end

% preallocate for the worst case memory use
for i=1:params.batchsize
  Beta_tmp{i} = zeros(nBeta, 4, params.nexp_max);
  spect_hat_tmp{i} = zeros(2,2,nfreq_hat+1,params.nexp_max);
  nexp_tmp(i) = params.nexp_max;
  nseg_tmp{i} = zeros(params.nexp_max,1);
  tausq_tmp{i} = zeros(4,params.nexp_max);
  xi_tmp{i} = zeros(params.nexp_max,1);
  Phi_tmp{i} = zeros(params.nexp_max,1);
  Spect_1_tmp{i} = zeros(nfreq_hat+1,nobs);
  Spect_2_tmp{i} = zeros(nfreq_hat+1,nobs);
  Coh_tmp{i} = zeros(nfreq_hat+1,nobs);
  if params.convdiag==1
   convdiag_tmp(i,:)=zeros(1,4*nBeta+4);
  end
end

%set batch index
batch_idx = 1;


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% run the loop
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
for p=1:nloop
 tic;
 if(mod(p,100)==0)
   fprintf('iter: %g of %g 
' ,p, nloop)
 end
 if p<nwarmup
   var_inflate_1=1;
   var_inflate_2=1;
 else
   var_inflate_1=1;
   var_inflate_2=1;
 end

 %========================
 %BETWEEN MODEL MOVE
 %========================

 kk = length(find(nseg_curr>2*tmin)); %Number of available segments

 %===========================
 %Deciding on birth or death
 if kk==0 %Stay where you (if nexp_curr=1) or join segments if there are no available segments to cut
   if nexp_curr==1
     nexp_prop = nexp_curr; %Stay
     log_move_prop = 0;
     log_move_curr = 0;
   else
     nexp_prop = nexp_curr-1; %death
     log_move_prop = 0;
     if nexp_prop==1
      log_move_curr = 1;
     else
      log_move_curr = log(0.5);
     end
   end
 else
   if nexp_curr==1
     nexp_prop = nexp_curr + 1; %birth
     log_move_prop = 0;
     if nexp_prop==nexp_max
      log_move_curr = 0;
     else
      log_move_curr = log(0.5);
     end
   elseif nexp_curr==nexp_max
     nexp_prop = nexp_curr-1;  %death
     log_move_prop = 0;
     if nexp_prop==1
      log_move_curr = 0;
     else
      log_move_curr = log(0.5);
     end
   else
     u = rand;
     if u<0.5
       nexp_prop = nexp_curr+1; %birth
       if nexp_prop==nexp_max;
        log_move_curr = 0;
        log_move_prop = log(0.5);
       else
         log_move_curr=log(0.5);
         log_move_prop=log(0.5);
       end
     else
       nexp_prop = nexp_curr-1; %death
       if nexp_prop==1
        log_move_curr = 0;
        log_move_prop = log(0.5);
       else
         log_move_curr = log(0.5);
         log_move_prop = log(0.5);
       end
     end
   end
 end


 if nexp_prop<nexp_curr
   %Do Death step
   [met_rat(p),nseg_prop,xi_prop,tausq_prop,Beta_prop, Phi_prop]= death(chol_index,ts,nexp_curr,nexp_prop,...
            tausq_curr,xi_curr,nseg_curr,Beta_curr,Phi_curr, log_move_curr,log_move_prop);
   rat_death=[rat_death met_rat(p)];
 elseif nexp_prop>nexp_curr
   %Do Birth step
   [met_rat(p),nseg_prop,xi_prop,tausq_prop,Beta_prop, Phi_prop]= birth(chol_index,ts,nexp_curr,nexp_prop,...
       tausq_curr,xi_curr,nseg_curr,Beta_curr,Phi_curr,log_move_curr,log_move_prop);
   rat_birth=[rat_birth met_rat(p)];
 else
   xi_prop=xi_curr;
   nseg_prop=nseg_curr;
   tausq_prop=tausq_curr;
   Beta_prop=Beta_curr;
   Phi_prop=Phi_curr;
   met_rat(p) = 1;
 end
 u = rand;
 if u<met_rat(p)
        nexp_curr=nexp_prop;
        xi_curr=xi_prop;
        nseg_curr=nseg_prop;
        tausq_curr=tausq_prop;
        Beta_curr=Beta_prop;
   Phi_curr=Phi_prop;
 end
 %========================
 %WITHIN MODEL MOVE
 %========================
   %Drawing a new cut point and Beta simultaneously
 %update coeffiecient of linear basis function
 [epsilon(p),nseg_new,xi_prop,~,Beta_prop,Phi_prop,seg_prop]= ...
 within(chol_index,ts,nexp_curr,tausq_curr, xi_curr,...
                    nseg_curr, Beta_curr, Phi_curr);
 u = rand;
 if (u<epsilon(p)|| p==1)
   if nexp_curr>1
     for j=seg_prop:seg_prop+1
       Beta_curr=Beta_prop;
       xi_curr=xi_prop;
       nseg_curr=nseg_new;
       Phi_curr=Phi_prop;
     end
   else
     Beta_curr=Beta_prop;
   end
 end

 %Drawing tau
 for j=1:nexp_curr
   for i=1:3
     tau_a = nbasis/2;
     tau_b = sum(Beta_curr(2:end,i,j).^2)/2;
     u=rand;
     const1 = gamcdf(1/tau_up_limit,tau_a,1/tau_b);
     const2 = 1-u*(1-const1);
     tausq_curr(i,j) = 1/gaminv(const2,tau_a,1/tau_b);
   end
   tau_a = nbasis/2;
   tau_b = sum(Beta_curr(1:nBeta,4,j).^2)/2;
   u=rand;
   const1 = gamcdf(1/tau_up_limit,tau_a,1/tau_b);
   const2 = 1-u*(1-const1);
   tausq_curr(4,j) = 1/gaminv(const2,tau_a,1/tau_b);
 end

 %==================================
 %Estimating Spectral Density
 %==================================
 [xx_r, xx_i] = lin_basis_func(freq_hat); %produce linear basis functions

 for j =1:nexp_curr
   %getting the coefficients of linear basis functions
   g1 = Beta_curr(:,1:3,j);
   g2 = Beta_curr(1:nBeta,4,j);
   theta_real = xx_r * g1(:,3);
   theta_imag = xx_i * g2;
   theta = theta_real+sqrt(-1)*theta_imag;
 
   delta_sq_hat = zeros(2,nfreq_hat+1);
   for q=1:2
     delta_sq_hat(q,:) = exp(xx_r * g1(:,q))';
   end
   %produce the spectral density matrix
   for k=1:(nfreq_hat+1)
     TT = eye(2);
     TT(2,1) = -theta(k);
     spect_hat{nexp_curr}(:,:,k,j,p+1) = ...
     inv(TT)*diag(delta_sq_hat(:,k))*inv(TT');
   end
 end

 %============================================
 % Get time-varying spectra and coherence
 %============================================
 Spec_1_est = zeros(nfreq_hat+1,nobs);
 Spec_2_est = zeros(nfreq_hat+1,nobs);
 Coh_est = zeros(nfreq_hat+1,nobs);

 % first spectrum
 if((p+1)>nwarmup)
   spec_hat_curr=squeeze(spect_hat{nexp_curr}(1,1,:,:,p+1));
   for j=1:nexp_curr
     if(j==1)
       Spect_1_est(:,1:xi_curr(j))= repmat(spec_hat_curr(:,j),1,xi_curr(j));
     else
       Spect_1_est(:,xi_curr(j-1)+1:xi_curr(j))= ...
       repmat(spec_hat_curr(:,j),1,xi_curr(j)-xi_curr(j-1));
     end
   end
 end

 % second spectrum
 if((p+1)>nwarmup)
   spec_hat_curr=squeeze(spect_hat{nexp_curr}(2,2,:,:,p+1));
   for j=1:nexp_curr
     if(j==1)
       Spect_2_est(:,1:xi_curr(j))= repmat(spec_hat_curr(:,j),1,xi_curr(j));
     else
       Spect_2_est(:,xi_curr(j-1)+1:xi_curr(j))= ...
       repmat(spec_hat_curr(:,j),1,xi_curr(j)-xi_curr(j-1));
     end
   end
 end

 % coherence
 if((p+1)>nwarmup)
   spec_hat_curr=abs(squeeze(spect_hat{nexp_curr}(2,1,:,:,p+1))).^2./...
    (squeeze(spect_hat{nexp_curr}(1,1,:,:,p+1)).*squeeze(spect_hat{nexp_curr}(2,2,:,:,p+1)));
   for j=1:nexp_curr
     if(j==1)
      Coh_est(:,1:xi_curr(j))=repmat(spec_hat_curr(:,j),1,xi_curr(j));
     else
      Coh_est(:,xi_curr(j-1)+1:xi_curr(j))=repmat(spec_hat_curr(:,j),1,xi_curr(j)-xi_curr(j-1));
     end
   end
 end

 %create convergence diagnostics
 if params.convdiag==1
   convdiag_curr=convdiag(nobs, nBeta, nseg_curr, nexp_curr, Beta_curr, tausq_curr);
 end

 %output to temporary container
 Beta_tmp{batch_idx} = Beta_curr;
 spect_hat_tmp{batch_idx} = spect_hat;
 nexp_tmp(batch_idx) = nexp_curr;
 nseg_tmp{batch_idx} = nseg_curr;
 tausq_tmp{batch_idx} = tausq_curr;
 xi_tmp{batch_idx} = xi_curr;
 Phi_tmp{batch_idx} = Phi_curr;
 if((p+1)>nwarmup)
   Spect_1_tmp{batch_idx} = Spect_1_est;
   Spect_2_tmp{batch_idx} = Spect_2_est;
   Coh_tmp{batch_idx} = Coh_est;
 end
 if params.convdiag==1
   convdiag_tmp=convdiag_curr;
 end

 if(mod(p,params.batchsize)==0)
  
   %output data to .mat file
   out.Beta((p-params.batchsize+1):p,1) = Beta_tmp;
   out.spect_hat((p-params.batchsize+1):p,1) = spect_hat_tmp;
   out.nexp((p-params.batchsize+1):p,1)=nexp_tmp;
   out.nseg((p-params.batchsize+1):p,1)=nseg_tmp;
   out.tausq((p-params.batchsize+1):p,1) = tausq_tmp;
   out.xi((p-params.batchsize+1):p,1) = xi_tmp;
   out.Phi((p-params.batchsize+1):p,1) = Phi_tmp;
   out.Spect_1((p-params.batchsize+1):p,1) = Spect_1_tmp;
   out.Spect_2((p-params.batchsize+1):p,1) = Spect_2_tmp;
   out.Coh((p-params.batchsize+1):p,1) = Coh_tmp;
   if params.convdiag==1
     out.convdiag((p-params.batchsize+1):p,1:nbeta+1)=convdiag_tmp;
   end
 
   %reset temporary data containers
   Beta_tmp = cell(params.batchsize,1);
   spect_hat_tmp = cell(params.batchsize,1);
   nexp_tmp = zeros(params.batchsize,1);
   nseg_tmp = cell(params.batchsize,1);
   tausq_tmp = cell(params.batchsize,1);
   xi_tmp = cell(params.batchsize,1);
   Phi_tmp = cell(params.batchsize,1);
   Spect_1_tmp = cell(params.batchsize,1);
   Spect_2_tmp = cell(params.batchsize,1);
   Coh_tmp = cell(params.batchsize,1);
   if params.convdiag==1
     convdiag_tmp = zeros(params.batchsize,4*nBeta+4);
   end
 
   %reset batch index
   batch_idx=0;
 
 end

 %increment batch index
 batch_idx = batch_idx + 1;

 tms(p) = toc;
 if params.verb ==1
   fprintf('sec / min hr: %g %g %g 
' ,[tms(p),sum(tms(1:p))/60, sum(tms(1:p))/(60*60)]') 
 end

end

fitparams = struct('nloop', nloop, ...
       'nwarmup', nwarmup, ...
       'timeMean', mean(tms), ...
       'timeMax', max(tms(2:end)), ...
       'timeMin', min(tms),...
       'timeStd', std(tms(2:end)));
end

function[spectra, coh] = MultiSpect_interval(zt, spect_matrices, fit, params, alphalevel)

dim = size(zt); nobs = dim(1); dimen=dim(2);
spectra = cell(dimen,1);
coh = cell(dimen,1);
if dimen==3
   temp1 = zeros(params.nfreq+1,(params.nloop-params.nwarmup));
   temp2 = zeros(params.nfreq+1,(params.nloop-params.nwarmup));
   temp3 = zeros(params.nfreq+1,(params.nloop-params.nwarmup));
  temp4 = zeros(params.nfreq+1,(params.nloop-params.nwarmup));
   temp5 = zeros(params.nfreq+1,(params.nloop-params.nwarmup));
  temp6 = zeros(params.nfreq+1,(params.nloop-params.nwarmup));
  spect11 = zeros(params.nfreq+1,nobs,3);
  spect22 = zeros(params.nfreq+1,nobs,3);
  spect33 = zeros(params.nfreq+1,nobs,3);
  coh21 = zeros(params.nfreq+1,nobs,3);
  coh31 = zeros(params.nfreq+1,nobs,3);
  coh32 = zeros(params.nfreq+1,nobs,3);
  for j=1:nobs
   if(mod(j,10)==0)
    fprintf('Completed: %g of %g observations 
' ,j, nobs)
   end
   for p=1:params.nloop
    if(p>params.nwarmup)
     if length(fit(fit(1).nexp_curr(p)).xi(:,p))==1
       k=1;
     else
       k = min(find(j<=fit(fit(1).nexp_curr(p)).xi(:,p)));
     end
     temp1(:,p-params.nwarmup) = squeeze(spect_matrices{fit(1).nexp_curr(p)}(1,1,:,k,p));
     temp2(:,p-params.nwarmup) = squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,2,:,k,p));
     temp3(:,p-params.nwarmup) = squeeze(spect_matrices{fit(1).nexp_curr(p)}(3,3,:,k,p));
     temp4(:,p-params.nwarmup) = abs(squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,1,:,k,p))).^2./...
(squeeze(spect_matrices{fit(1).nexp_curr(p)}(1,1,:,k,p)).*squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,2,:,k,p)));
     temp5(:,p-params.nwarmup) = abs(squeeze(spect_matrices{fit(1).nexp_curr(p)}(3,1,:,k,p))).^2./...
(squeeze(spect_matrices{fit(1).nexp_curr(p)}(1,1,:,k,p)).*squeeze(spect_matrices{fit(1).nexp_curr(p)}(3,3,:,k,p)));
     temp6(:,p-params.nwarmup) = abs(squeeze(spect_matrices{fit(1).nexp_curr(p)}(3,2,:,k,p))).^2./...
(squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,2,:,k,p)).*squeeze(spect_matrices{fit(1).nexp_curr(p)}(3,3,:,k,p)));         
    end
  end
  spect11(:,j,1) = mean(temp1,2); spect11(:,j,2:3) = quantile(temp1,[alphalevel/2, 1-alphalevel/2],2);
  spect22(:,j,1) = mean(temp2,2); spect22(:,j,2:3) = quantile(temp2,[alphalevel/2, 1-alphalevel/2],2);
  spect33(:,j,1) = mean(temp3,2); spect33(:,j,2:3) = quantile(temp3,[alphalevel/2, 1-alphalevel/2],2);
  coh21(:,j,1) = mean(temp4,2); coh21(:,j,2:3) = quantile(temp4,[alphalevel/2, 1-alphalevel/2],2);
  coh31(:,j,1) = mean(temp5,2); coh31(:,j,2:3) = quantile(temp5,[alphalevel/2, 1-alphalevel/2],2);
  coh32(:,j,1) = mean(temp6,2); coh32(:,j,2:3) = quantile(temp6,[alphalevel/2, 1-alphalevel/2],2);
 end

 spectra{1}=spect11;spectra{2}=spect22;spectra{3}=spect33;
 coh{1}=coh21;coh{2}=coh31; coh{3}=coh32;
elseif dimen==2

 temp1 = zeros(params.nfreq+1,(params.nloop-params.nwarmup));
 temp2 = zeros(params.nfreq+1,(params.nloop-params.nwarmup));
 temp3 = zeros(params.nfreq+1,(params.nloop-params.nwarmup));
 spect11 = zeros(params.nfreq+1,nobs,3);
 spect22 = zeros(params.nfreq+1,nobs,3);
 coh21 = zeros(params.nfreq+1,nobs,3);
 for j=1:nobs
   if(mod(j,10)==0)
    fprintf('Completed: %g of %g observations 
' ,j, nobs)
   end
   for p=1:params.nloop
     if(p>params.nwarmup)
      if length(fit(fit(1).nexp_curr(p)).xi(:,p))==1
        k=1;
       else
        k = min(find(j<=fit(fit(1).nexp_curr(p)).xi(:,p)));
       end
       temp1(:,p-params.nwarmup) = squeeze(spect_matrices{fit(1).nexp_curr(p)}(1,1,:,k,p));
       temp2(:,p-params.nwarmup) = squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,2,:,k,p));
       temp3(:,p-params.nwarmup) = abs(squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,1,:,k,p))).^2./...
(squeeze(spect_matrices{fit(1).nexp_curr(p)}(1,1,:,k,p)).*squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,2,:,k,p)));
     end
   end
   spect11(:,j,1) = mean(temp1,2); spect11(:,j,2:3) = quantile(temp1,[alphalevel/2, 1-alphalevel/2],2);
   spect22(:,j,1) = mean(temp2,2); spect22(:,j,2:3) = quantile(temp2,[alphalevel/2, 1-alphalevel/2],2);
   coh21(:,j,1) = mean(temp3,2); coh21(:,j,2:3) = quantile(temp3,[alphalevel/2, 1-alphalevel/2],2);
 end

 spectra{1}=spect11;spectra{2}=spect22;
 coh{1}=coh21;

end

function[post_partitions]=MultiSpect_partition(zt, fit, params)

dim = size(zt);nobs = dim(1);
post_partitions = (histc(fit(1).nexp_curr(params.nwarmup+1:params.nloop),1:params.nexp_max)'/length(params.nwarmup+1:params.nloop))';

figure
histogram(fit(1).nexp_curr(params.nwarmup+1:params.nloop),'FaceAlpha',1,...
   'Normalization','probability','BinMethod','integers','BinWidth',.5,'BinLimits',[1,10])
title('Histogram of the Number of Partitions')
xlabel('The number of partitions')
ylabel('Probability')
for j=1:params.nexp_max
    kk=find(fit(1).nexp_curr(params.nwarmup+1:params.nloop)==j);
 if ∼isempty(kk) && j>1
   figure
   hold
   title(['Plot of Partition Points Given ',int2str(j), ' Segments'])
   for k=1:j-1
     plot(fit(j).xi(k,kk+params.nwarmup))
   end
   for k=1:j-1
     figure
     hold
     title(['Histogram of Location of Partition ', int2str(k), ',' ' Given ',int2str(j), ' Segments'])
     histogram(fit(j).xi(k,kk+params.nwarmup),'FaceAlpha',1,...
       'Normalization','probability','BinMethod','integers','BinWidth',20,'BinLimits',[1,nobs])
  end
 end
end

end

function[spectra, coh] = MultiSpect_surface(zt, spect_matrices, fit, params)
dim = size(zt); nobs = dim(1); dimen=dim(2);
spectra = cell(dimen,1);
coh = cell(dimen,1);
if dimen==3
  s_11=zeros(params.nfreq+1,nobs);
  s_22=zeros(params.nfreq+1,nobs);
  s_33=zeros(params.nfreq+1,nobs);
  for p=1:params.nloop
   if(p>params.nwarmup)
    xi_curr=fit(fit(1).nexp_curr(p)).xi(:,p);
    spec_hat_curr_11=squeeze(spect_matrices{fit(1).nexp_curr(p)}(1,1,:,:,p));
    spec_hat_curr_22=squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,2,:,:,p));
    spec_hat_curr_33=squeeze(spect_matrices{fit(1).nexp_curr(p)}(3,3,:,:,p));
    for j=1:fit(1).nexp_curr(p)
      if(j==1)
s_11(:,1:xi_curr)=s_11(:,1:xi_curr(j))+repmat(spec_hat_curr_11(:,j),1,xi_curr(j))/(params.nloop-params.nwarmup);
s_22(:,1:xi_curr)=s_22(:,1:xi_curr(j))+repmat(spec_hat_curr_22(:,j),1,xi_curr(j))/(params.nloop-params.nwarmup);
s_33(:,1:xi_curr)=s_33(:,1:xi_curr(j))+repmat(spec_hat_curr_33(:,j),1,xi_curr(j))/(params.nloop-params.nwarmup);
      else
        s_11(:,xi_curr(j-1)+1:xi_curr(j))=s_11(:,xi_curr(j-1)+1:xi_curr(j))+...
         repmat(spec_hat_curr_11(:,j),1,xi_curr(j)-xi_curr(j-1))/(params.nloop-params.nwarmup);
        s_22(:,xi_curr(j-1)+1:xi_curr(j))=s_22(:,xi_curr(j-1)+1:xi_curr(j))+...
         repmat(spec_hat_curr_22(:,j),1,xi_curr(j)-xi_curr(j-1))/(params.nloop-params.nwarmup);
         s_33(:,xi_curr(j-1)+1:xi_curr(j))=s_33(:,xi_curr(j-1)+1:xi_curr(j))+...
         repmat(spec_hat_curr_33(:,j),1,xi_curr(j)-xi_curr(j-1))/(params.nloop-params.nwarmup);
      end
     end
   end
 end

 coh_21=zeros(params.nfreq+1,nobs);
 coh_31=zeros(params.nfreq+1,nobs);
 coh_32=zeros(params.nfreq+1,nobs);
 for p=1:params.nloop
   if(p>params.nwarmup)
     xi_curr=fit(fit(1).nexp_curr(p)).xi(:,p);
     spec_hat_curr_21=abs(squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,1,:,:,p))).^2./...
(squeeze(spect_matrices{fit(1).nexp_curr(p)}(1,1,:,:,p)).*squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,2,:,:,p)));
     spec_hat_curr_31=abs(squeeze(spect_matrices{fit(1).nexp_curr(p)}(3,1,:,:,p))).^2./...
(squeeze(spect_matrices{fit(1).nexp_curr(p)}(1,1,:,:,p)).*squeeze(spect_matrices{fit(1).nexp_curr(p)}(3,3,:,:,p)));
     spec_hat_curr_32=abs(squeeze(spect_matrices{fit(1).nexp_curr(p)}(3,2,:,:,p))).^2./...
(squeeze(spect_matrices{fit(1).nexp_curr(p)}(3,3,:,:,p)).*squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,2,:,:,p)));
     for j=1:fit(1).nexp_curr(p)
       if(j==1)
coh_21(:,1:xi_curr(j))=coh_21(:,1:xi_curr(j))+repmat(spec_hat_curr_21(:,j),1,xi_curr(j))/(params.nloop-params.nwarmup);
 coh_31(:,1:xi_curr(j))=coh_31(:,1:xi_curr(j))+repmat(spec_hat_curr_31(:,j),1,xi_curr(j))/(params.nloop-params.nwarmup);
 coh_32(:,1:xi_curr(j))=coh_32(:,1:xi_curr(j))+repmat(spec_hat_curr_32(:,j),1,xi_curr(j))/(params.nloop-params.nwarmup);
       else
         coh_21(:,xi_curr(j-1)+1:xi_curr(j))=coh_21(:,xi_curr(j-1)+1:xi_curr(j))+...
          repmat(spec_hat_curr_21(:,j),1,xi_curr(j)-xi_curr(j-1))/(params.nloop-params.nwarmup);
         coh_31(:,xi_curr(j-1)+1:xi_curr(j))=coh_31(:,xi_curr(j-1)+1:xi_curr(j))+...
          repmat(spec_hat_curr_31(:,j),1,xi_curr(j)-xi_curr(j-1))/(params.nloop-params.nwarmup);
         coh_32(:,xi_curr(j-1)+1:xi_curr(j))=coh_32(:,xi_curr(j-1)+1:xi_curr(j))+...
          repmat(spec_hat_curr_32(:,j),1,xi_curr(j)-xi_curr(j-1))/(params.nloop-params.nwarmup);
       end
     end
   end
 end

 spectra{1}=s_11;spectra{2}=s_22;spectra{3}=s_33;
 coh{1}=coh_21;coh{2}=coh_31; coh{3}=coh_32;
elseif dimen==2
 s_11=zeros(params.nfreq+1,nobs);
 s_22=zeros(params.nfreq+1,nobs);
 for p=1:params.nloop
   if(p>params.nwarmup)
    xi_curr=fit(fit(1).nexp_curr(p)).xi(:,p);
    spec_hat_curr_11=squeeze(spect_matrices{fit(1).nexp_curr(p)}(1,1,:,:,p));
    spec_hat_curr_22=squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,2,:,:,p));
    for j=1:fit(1).nexp_curr(p)
     if(j==1)
s_11(:,1:xi_curr)=s_11(:,1:xi_curr(j))+repmat(spec_hat_curr_11(:,j),1,xi_curr(j))/(params.nloop-params.nwarmup);
s_22(:,1:xi_curr)=s_22(:,1:xi_curr(j))+repmat(spec_hat_curr_22(:,j),1,xi_curr(j))/(params.nloop-params.nwarmup);
     else
       s_11(:,xi_curr(j-1)+1:xi_curr(j))=s_11(:,xi_curr(j-1)+1:xi_curr(j))+...
        repmat(spec_hat_curr_11(:,j),1,xi_curr(j)-xi_curr(j-1))/(params.nloop-params.nwarmup);
       s_22(:,xi_curr(j-1)+1:xi_curr(j))=s_22(:,xi_curr(j-1)+1:xi_curr(j))+...
        repmat(spec_hat_curr_22(:,j),1,xi_curr(j)-xi_curr(j-1))/(params.nloop-params.nwarmup);
       end
     end
   end
 end

 coh_21=zeros(params.nfreq+1,nobs);
 for p=1:params.nloop
  if(p>params.nwarmup)
    xi_curr=fit(fit(1).nexp_curr(p)).xi(:,p);
    spec_hat_curr_21=abs(squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,1,:,:,p))).^2./...
(squeeze(spect_matrices{fit(1).nexp_curr(p)}(1,1,:,:,p)).*squeeze(spect_matrices{fit(1).nexp_curr(p)}(2,2,:,:,p)));
    for j=1:fit(1).nexp_curr(p)
     if(j==1)
coh_21(:,1:xi_curr(j))=coh_21(:,1:xi_curr(j))+repmat(spec_hat_curr_21(:,j),1,xi_curr(j))/(params.nloop-params.nwarmup);
     else
      coh_21(:,xi_curr(j-1)+1:xi_curr(j))=coh_21(:,xi_curr(j-1)+1:xi_curr(j))+...
      repmat(spec_hat_curr_21(:,j),1,xi_curr(j)-xi_curr(j-1))/(params.nloop-params.nwarmup);
       end
     end
   end
 end

 spectra{1}=s_11;spectra{2}=s_22;
 coh{1}=coh_21;
end

function [spect_hat, freq_hat, diagparams, fitparams] = MultiSpect(zt,varargin)

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% Program for the MCMC nonstationary multivariate spectrum % analysis paper
% Does the MCMC iterations for bivariate or trivariate time series
% Input:
%  1)zt - time series data. It should be T by N matrix. T is
%  the length of the time series; N is the dimension (2 or 3)
%  2)varargin - Tunning parameters, which can be set % by the program %  OptsMultiSpec.m
%     See the documnetation in that program for details and default %     values.
%  Main Outputs:
%   1) spec_hat - a cell contains spectral matrix
%   2) freq_hat - vector of frequencies considered.
%   3) diagparams - contains draws of parameters, including:
%    diagparams.tausq - smoothing parameters,
%    diagparams.Beta - coefficients,
%    diagparams.xi - locations of partitions
%    diagparams.nseg - number of observations in each block,
%    diagparams.nexp_curr - number of partitions,
%    diagparams.Phi - set of coefficients that changed,
%    diagparams.epsilon - acceptance rate of within-model move,
%    diagparams.bet_birth - number of accepted birth step,
%    diagparams.bet_death - number of accepted death step,
%    diagparams.bet_within - number of accepted within step,
%    diagparams.tms - tum time in second,
%    diagparams.convdiag_out - an optional convergence diagnostic %    output
%   4) fitparams - 6 dimensional structural array
%    fitparams.nloop - number of iterations run
%    fitparams.nwarmup - length of the burn-in 
%    fitparams.timeMean - average iteration run time time in %    seconds
%    fitparams.timeMax - maximum iteration run time in seconds
%    fitparams.timeMin - minimum iteration run time in seconds
%    fitparams.timeStd - standard deviation of iteration run times %           in seconds
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% % I) Extract information from the option parameters
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% If params is empty, then use the default parameters
if nargin==1
  params = OptsMultiSpect();
else
  params = varargin{1};
end

nloop = params.nloop; 
%number of total MCMC iterations
nwarmup = params.nwarmup; 
%number of warmup period
nexp_max = params.nexp_max;
%Input maximum number of segments

global dimen nobs nbasis nBeta sigmasqalpha tau_up_limit ...
    prob_mm1 tmin M ee options
if verLessThan('matlab','9.2') 
  options = optimset('Display','off','GradObj','on','Hessian','on','MaxIter',10000,...
  'MaxFunEvals',10000,'TolFun',1e-6,'TolX',1e-6);
else
 options = optimoptions(@fminunc,'Display','off','GradObj','on','Hessian','on','MaxIter',10000,...
  'MaxFunEvals',10000,'TolFun',1e-6,'TolX',1e-6,'Algorithm','trust-region');
end
nbasis = params.nbasis;     
%number of linear smoothing spline basis functions
nBeta = nbasis + 1;        
%number of coefficients
sigmasqalpha = params.sigmasqalpha; 
%smoothing parameter for alpha
tau_up_limit = params.tau_up_limit; 
%the prior for smoothing parameters
prob_mm1 = params.prob_mml;    
%the probability of small jump, 1-prob of big jump
tmin = params.tmin;        
%minimum number of observation in each segment
M = tau_up_limit;        
%scale value for momentum variable
ee = params.ee;          
%step size in HMC
nfreq_hat = params.nfreq;
freq_hat=(0:nfreq_hat)'/(2*nfreq_hat);

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% II) Run the estimation proceedure
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

dim = size(zt);
nobs = dim(1);
dimen = dim(2);
ts = zt;
% ts = zeros(nobs,dimen);
% for i=1:dimen
%  x = zt(:,i);
%  xmat = [ones(length(x),1) linspace(1,length(x),%   length(x))'];
%  linfit=inv(xmat'*xmat)*xmat'*x;
%  ts(:,i)=zt(:,i)- xmat*linfit;
% end

tausq = cell(nexp_max,1); 
%big array for smoothing parameters
Beta = cell(nexp_max,1);
%big array for coefficients
spect_hat = cell(nexp_max,1);
xi = cell(nexp_max,1); 
%Cutpoint locations xi_1 is first cutpoint, xi_) is beginning of timeseries
nseg = cell(nexp_max,1);
%Number of observations in each segment
Phi = cell(nexp_max,1); 
%index for component change
tms = zeros(1,nloop);
for j=1:nexp_max
 tausq{j}=ones(dimen^2,j,nloop+1);
 Beta{j} = zeros(nBeta,dimen^2,j,nloop+1);
 spect_hat{j} = zeros(dimen,dimen,nfreq_hat+1,j,nloop+1);
 xi{j}=ones(j,nloop+1);
    nseg{j}=ones(j,nloop+1);
 Phi{j}=zeros(j,nloop+1);
end

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% initilize the MCMC iteration
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

nexp_curr=nexp_max*ones(nloop+1,1); 
%big array for number of segment
nexp_curr(1) = params.init;   
%initialize the number of segments

%initialize tausq
for j=1:nexp_curr(1)
    tausq{nexp_curr(1)}(:,j,1)=rand(dimen^2,1)*tau_up_limit;
end

%initilize the location of the changepoints for j=1:nexp_curr(1)
for j=1:nexp_curr(1)
 if nexp_curr(1)==1
   xi{nexp_curr(1)}(j,1) = nobs;
   nseg{nexp_curr(1)}(j,1) = nobs;
 else
  if j==1
          nposs = nobs-nexp_curr(1)*tmin+1;
          xi{nexp_curr(1)}(j,1) = tmin + unidrnd(nposs)-1;
          nseg{nexp_curr(1)}(j,1) = xi{nexp_curr(1)}(j,1);
  elseif j>1 && j<nexp_curr(1)
          nposs=nobs-xi{nexp_curr(1)}(j-1,1)-tmin*(nexp_curr(1)-j+1)+1;
          xi{nexp_curr(1)}(j,1)=tmin+unidrnd(nposs)+xi{nexp_curr(1)}(j-1,1)-1;
          nseg{nexp_curr(1)}(j,1)=xi{nexp_curr(1)}(j,1)-xi{nexp_curr(1)}(j-1,1);
  else
          xi{nexp_curr(1)}(j,1)=nobs;
          nseg{nexp_curr(1)}(j,1)=xi{nexp_curr(1)}(j,1)-xi{nexp_curr(1)}(j-1,1);
  end
 end
end

%index matrix for which components of cholesky decomposition changed
chol_index = chol_ind(dimen);

xi_temp = xi{nexp_curr(1)}(:,1);
tau_temp = tausq{nexp_curr(1)}(:,:,1);
Beta_temp = Beta{nexp_curr(1)}(:,:,:,1);
Phi{nexp_curr(1)}(1:(end-1),1)=2^(dimen^2);
Phi_temp = repmat(2^(dimen^2),nexp_curr(1));
for j=1:nexp_curr(1)
    [Beta_mean, Beta_var,~] = postBeta1(chol_index, Phi_temp(j), j, ts, tau_temp(:,j), Beta_temp(:,:,j,1), xi_temp);
 if min(eig(0.5*(Beta_var+Beta_var')))<0
   Beta{nexp_curr(1)}(:,:,j,1) = reshape(mvnrnd(Beta_mean,0.5*(eye(nBeta*dimen^2))),nBeta,dimen^2);
 else
   Beta{nexp_curr(1)}(:,:,j,1) = reshape(mvnrnd(Beta_mean,0.5*(Beta_var+Beta_var')),nBeta,dimen^2);
 end
end

%jumping probabilities
epsilon=zeros(nloop,1);
met_rat=zeros(nloop,1);
bet_death = 0;
bet_birth = 0;
with = 0;

%create convergence diagnostics
if params.convdiag==1
  convdiag_out = zeros((dimen^2)*nBeta+dimen^2,nloop);
  convdiag_out(:,1)=conv_diag(nobs, nBeta, nseg{nexp_curr(1)}(:,1),...
       nexp_curr(1), Beta{nexp_curr(1)}(:,:,:,1), tausq{nexp_curr(1)}(:,:,1),dimen);
end
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% run the loop
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
for p=1:nloop
 tic;
 if(mod(p,50)==0)
   fprintf('iter: %g of %g 
' ,p, nloop)
 end

 %========================
 %BETWEEN MODEL MOVE
 %========================
 kk = length(find(nseg{nexp_curr(p)}(:,p)>2*tmin)); %Number of available segments

 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
 %Deciding on birth or death
 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

 if kk==0 %Stay where you (if nexp_curr=1) or join segments if there are no available segments to cut
   if nexp_curr(p)==1
    nexp_prop = nexp_curr(p); %Stay
    log_move_prop = 0;
    log_move_curr = 0;
   else
    nexp_prop = nexp_curr(p)-1; %death
    log_move_prop = 0;
    if nexp_prop==1
      log_move_curr = 1;
    else
      log_move_curr = log(0.5);
    end
   end
 else
   if nexp_curr(p)==1
    nexp_prop = nexp_curr(p) + 1; %birth
    log_move_prop = 0;
    if nexp_prop==nexp_max
      log_move_curr = 0;
    else
      log_move_curr = log(0.5);
    end
   elseif nexp_curr(p)==nexp_max
    nexp_prop = nexp_curr(p)-1; %death
    log_move_prop = 0;
    if nexp_prop==1
      log_move_curr = 0;
    else
     log_move_curr = log(0.5);
    end
   else
    u = rand;
    if u<0.5
      nexp_prop = nexp_curr(p)+1; %birth
      if nexp_prop==nexp_max;
       log_move_curr = 0;
       log_move_prop = log(0.5);
      else
       log_move_curr=log(0.5);
       log_move_prop=log(0.5);
      end
    else
      nexp_prop = nexp_curr(p)-1; %death
      if nexp_prop==1
       log_move_curr = 0;
       log_move_prop = log(0.5);
      else
       log_move_curr = log(0.5);
       log_move_prop = log(0.5);
       end
     end
   end
 end

 xi_curr_temp = xi{nexp_curr(p)}(:,p);
 Beta_curr_temp = Beta{nexp_curr(p)}(:,:,:,p);
 nseg_curr_temp = nseg{nexp_curr(p)}(:,p);
 tau_curr_temp = tausq{nexp_curr(p)}(:,:,p);
 Phi_curr_temp = Phi{nexp_curr(p)}(:,p);

 if nexp_prop<nexp_curr(p)
   %Death step
   [met_rat(p),nseg_prop,xi_prop,tausq_prop,Beta_prop, Phi_prop]= death(chol_index,ts,nexp_curr(p),nexp_prop,...
       tau_curr_temp,xi_curr_temp,nseg_curr_temp,Beta_curr_temp,Phi_curr_temp, log_move_curr,log_move_prop);
 elseif nexp_prop>nexp_curr(p)
   %Birth step
   [met_rat(p),nseg_prop,xi_prop,tausq_prop,Beta_prop, Phi_prop]= birth(chol_index,ts,nexp_curr(p),nexp_prop,...
    tau_curr_temp,xi_curr_temp,nseg_curr_temp,Beta_curr_temp,Phi_curr_temp,log_move_curr,log_move_prop);
 else
   xi_prop=xi{nexp_curr(p)}(:,p);
   nseg_prop=nseg{nexp_curr(p)}(:,p);
   tausq_prop=tausq{nexp_curr(p)}(:,:,p);
   Beta_prop=Beta{nexp_curr(p)}(:,:,p);
   Phi_prop=Phi{nexp_curr(p)}(:,p);
   met_rat(p) = 1;
 end
 u = rand;
 if u<met_rat(p)
   if nexp_prop<nexp_curr(p)
    bet_death = bet_death + 1;
   elseif nexp_prop>nexp_curr(p)
    bet_birth = bet_birth + 1;
   end
        nexp_curr(p+1)=nexp_prop;
        xi{nexp_curr(p+1)}(:,p+1)=xi_prop;
        nseg{nexp_curr(p+1)}(:,p+1)=nseg_prop;
        tausq{nexp_curr(p+1)}(:,:,p+1)=tausq_prop;
        Beta{nexp_curr(p+1)}(:,:,:,p+1)=Beta_prop;
   Phi{nexp_curr(p+1)}(:,p+1)=Phi_prop;
 else
        nexp_curr(p+1)=nexp_curr(p);
        xi{nexp_curr(p+1)}(:,p+1)=xi{nexp_curr(p+1)}(:,p);
        nseg{nexp_curr(p+1)}(:,p+1)=nseg{nexp_curr(p+1)}(:,p);
        tausq{nexp_curr(p+1)}(:,:,p+1)=tausq{nexp_curr(p+1)}(:,:,p);
        Beta{nexp_curr(p+1)}(:,:,:,p+1)=Beta{nexp_curr(p+1)}(:,:,:,p);
   Phi{nexp_curr(p+1)}(:,p+1)=Phi{nexp_curr(p+1)}(:,p);
 end

 %========================
 %WITHIN MODEL MOVE
 %========================

   %Drawing a new cut point and Beta simultaneously
 %update coeffiecient of linear basis function
 xi_curr_temp=xi{nexp_curr(p+1)}(:,p+1);
 Beta_curr_temp=Beta{nexp_curr(p+1)}(:,:,:,p+1);
 tau_temp=tausq{nexp_curr(p+1)}(:,:,p+1);
 nseg_curr_temp=nseg{nexp_curr(p+1)}(:,p+1);
 Phi_temp = Phi{nexp_curr(p+1)}(:,p+1);
 [epsilon(p),nseg_new,xi_prop,tausq_prop,Beta_prop,Phi_prop,seg_temp]= ...
 within(chol_index,ts,nexp_curr(p+1),tau_temp, xi_curr_temp,...
              nseg_curr_temp, Beta_curr_temp,Phi_temp);
 u = rand;
 if (u<epsilon(p)|| p==1)
   with = with + 1;
   if nexp_curr(p+1)>1
     for j=seg_temp:seg_temp+1
      Beta{nexp_curr(p+1)}(:,:,j,p+1)=Beta_prop(:,:,j);
      xi{nexp_curr(p+1)}(j,p+1)=xi_prop(j);
      nseg{nexp_curr(p+1)}(j,p+1)=nseg_new(j);
      Phi{nexp_curr(p+1)}(j,p+1)=Phi_prop(j);
     end
   else
     Beta{nexp_curr(p+1)}(:,:,p+1)=Beta_prop;
   end
 else
   Beta{nexp_curr(p+1)}(:,:,:,p+1)=Beta_curr_temp;
   xi{nexp_curr(p+1)}(:,p+1)=xi_curr_temp;
   nseg{nexp_curr(p+1)}(:,p+1)=nseg_curr_temp;
   Phi{nexp_curr(p+1)}(:,p+1)=Phi_temp;
 end

 %Drawing tau
 for j=1:nexp_curr(p+1)
  for i=1:dimen^2
   if ismember(i,1:dimen + dimen*(dimen-1)/2)
     tau_a = nbasis/2;
     tau_b = sum(Beta{nexp_curr(p+1)}(2:end,i,j,p+1).^2)/2;
     u=rand;
     const1 = gamcdf(1/tau_up_limit,tau_a,1/tau_b);
     const2 = 1-u*(1-const1);
     tausq{nexp_curr(p+1)}(i,j,p+1) = 1/gaminv(const2,tau_a,1/tau_b);
   else
     tau_a = nBeta/2;
     tau_b = sum(Beta{nexp_curr(p+1)}(1:nBeta,i,j,p+1).^2)/2;
     u=rand;
     const1 = gamcdf(1/tau_up_limit,tau_a,1/tau_b);
     const2 = 1-u*(1-const1);
     tausq{nexp_curr(p+1)}(i,j,p+1) = 1/gaminv(const2,tau_a,1/tau_b);
    end
   end 
 end

 tms(p) = toc;
 if params.verb ==1
   fprintf('sec / min hr: %g %g %g 
' ,[tms(p),sum(tms(1:p))/60, sum(tms(1:p))/(60*60)]') 
 end
 %==================================
 %Estimating Spectral Density
 %==================================
 [xx_r, xx_i] = lin_basis_func(freq_hat); %produce linear basis functions

 for j =1:nexp_curr(p+1)
   %getting the coefficients of linear basis functions
   g1 = Beta{nexp_curr(p+1)}(:,1:(dimen + dimen*(dimen-1)/2),j,p+1);
   g2 = Beta{nexp_curr(p+1)}(1:nBeta,(dimen + dimen*(dimen-1)/2 + 1):end,j,p+1);
 
   theta = zeros(dimen*(dimen-1)/2,(nfreq_hat+1));
   for i=1:dimen*(dimen-1)/2
    theta_real = xx_r * g1(:,i+dimen);
    theta_imag = xx_i * g2(:,i);
    theta(i,:) = (theta_real + sqrt(-1)*theta_imag).';
   end
 
   delta_sq_hat = zeros(2,nfreq_hat+1);
   for q=1:dimen
    delta_sq_hat(q,:) = exp(xx_r * g1(:,q)).';
   end
 
   %produce the spectral density matrix
   for k=1:(nfreq_hat+1)
    TT = eye(dimen);
    TT(2,1) = -theta(1,k);
    if dimen==3
      TT(3,1) = -theta(2,k);
      TT(3,2) = -theta(3,k);
    end
    spect_hat{nexp_curr(p+1)}(:,:,k,j,p+1) = ...
    inv(TT)*diag(delta_sq_hat(:,k))*inv(TT');
   end

 end

 %create convergence diagnostics
 if params.convdiag==1
   convdiag_out(:,p+1)=conv_diag(nobs, nBeta, nseg{nexp_curr(p+1)}(:,p+1),...
    nexp_curr(p+1), Beta{nexp_curr(p+1)}(:,:,:,p+1), tausq{nexp_curr(p+1)}(:,:,p+1),dimen);
 end

end
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% III) Collect outputs
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
fitparams = struct('nloop', nloop, ...
      'nwarmup', nwarmup, ...
      'timeMean', mean(tms), ...
      'timeMax', max(tms(2:end)), ...
      'timeMin', min(tms),...
      'timeStd', std(tms(2:end)));
if params.convdiag==1   
  diagparams = struct('tausq', tausq,...
      'Beta', Beta,...
      'xi', xi,...
      'nseg', nseg,...
      'nexp_curr', nexp_curr,...
      'Phi', Phi,...
      'epsilon', epsilon,...     
      'bet_birth', bet_birth,...
      'bet_death', bet_death,...
      'with', with,...
      'time', tms,...
      'convdiag_out', convdiag_out);
else
  diagparams = struct('tausq', tausq,...
      'Beta', Beta,...
      'xi', xi,...
      'nseg', nseg,...
      'nexp_curr', nexp_curr,...
      'Phi', Phi,...
      'epsilon', epsilon,...     
      'bet_birth', bet_birth,...
      'bet_death', bet_death,...
      'with', with,...
      'time', tms);
end

end
function [param] = OptsMultiSpect(varargin)
% This function sets the optional input arguments for the function
% MCBSpec().
%
% (I) ONLY DEFAULT PARAMETERS
%   If only defult values for the parameters are desired, then either:
%
%     a) the SECOND argument in MultiSpec() can be left missing, or
%
%     b) params=setOptions() can be defined and used as the second
%     argument of MultiSpec().
%
% (II) USING NONDEFAULT PARAMETERS
%   If some options other than the default are desired:
%
%     1) Set all default parameters using (Ia) above.
%
%     2) Change desired parameters.
%
%
% PARAMETERS
%
%  nloop  - The number of iterations run.
%       Default: 6000.
%  nwarmup - The length of the burn-in.
%       Default: 200.
%  nexp_max- The maximum number of segments allowed, adjust % when         time series get longer.
%       Default: 10
%  tmin  - The minimum number of observation in each segment.
%       Default: 100
%  prob_mml- The probability of small jump, 1-prob of big
%  jump.
%       Default: 0.8
%  nbasis -  The number of linear smoothing spline basis
% functions.
%       Default: 10
%  tau_up_limit - The normal variance prior for smoothing
% parameters.
%       Default: 10^4.
%  sigmasqalpha -  The smoothing parameter for alpha.
%       Default: 10^4
%  init -  Initial number of partitions
%       Default: 3
%  nfreq -  The number of frequencies for spectrm
%       Default: 50
%  verb -  An indicator if the iteration number should be printed
%       at the completion of each iteration.  1 is
% yes, 0 is
%       no.
%  covdiag -  An indicator if the diagnostic should be
%  ee   -  Step size in Hamiltonian Monte Carlo
%        Default is 0.1


param = struct('nloop',10000, 'nwarmup',2000, ...
   'nexp_max',10, 'tmin',60, 'prob_mml',0.8 , 'nbasis',10, ...
   'tau_up_limit',10^5 , 'sigmasqalpha',10^5, 'init',3,...
   'nfreq',50, 'verb',1, 'convdiag',0, 'ee',0.1);
   
param.bands = {};
   
end


%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% read in data and implement the analysis
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%

fdata = csvread('C:/Bookdata/WW9.csv',1,1);
dimen = size(fdata);
nobs = dimen(1);

startDate = datenum('04-01-1990');
endDate = datenum('12-31-2011');
xData = linspace(startDate,endDate,nobs);
figure
subplot(1,3,1)
plot(xData,fdata(:,1))
title('DJIA')
datetick('x','yyyy','keeplimits')
subplot(1,3,2)
plot(xData,fdata(:,2))
title('NASDAQ')
datetick('x','yyyy','keeplimits')
subplot(1,3,3)
plot(xData,fdata(:,3))
title('S&P500')
datetick('x','yyyy','keeplimits')

params = struct('nloop',12000, 'nwarmup',4000, ...
      'nexp_max',10, 'tmin',60, 'prob_mml',0.8 , 'nbasis',10, ...
      'tau_up_limit',10^5, 'sigmasqalpha',10^5, 'init',2,...
      'nfreq',50, 'verb',1, 'convdiag',0, 'ee', 0.05);
   
%%% Note: he code can be run using default parameters
%%% [spect_matrices, freq_hat, fit, fit_diag] = MultiSpect(zt)
          
rng(201701204);
[spect_matrices, freq_hat, fit, fit_diag] = MultiSpect(fdata,params);

[spect, coh] = MultiSpect_surface(fdata, spect_matrices, fit, params);

figure
subplot(1,3,1); meshc( xData, freq_hat,real(spect{1})); set(gca,'YDir','reverse'); title('$ f_{11}$','Interpreter','LaTex'); xlabel('time'); ylabel('freq'); datetick('x','yyyy','keeplimits')
subplot(1,3,2); meshc( xData, freq_hat,real(spect{2})); set(gca,'YDir','reverse'); title('$ f_{22}$','Interpreter','LaTex'); xlabel('time'); ylabel('freq'); datetick('x','yyyy','keeplimits')
subplot(1,3,3); meshc( xData, freq_hat,real(spect{3})); set(gca,'YDir','reverse'); title('$ f_{33}$','Interpreter','LaTex'); xlabel('time'); ylabel('freq'); datetick('x','yyyy','keeplimits')
figure
subplot(1,3,1); meshc( xData, freq_hat,real(coh{1})); set(gca,'YDir','reverse'); title('$ 
ho_{21}^2$','Interpreter','LaTex'); xlabel('time'); ylabel('freq'); datetick('x','yyyy','keeplimits')
subplot(1,3,2); meshc( xData, freq_hat,real(coh{2})); set(gca,'YDir','reverse'); title('$ 
ho_{31}^2$','Interpreter','LaTex'); xlabel('time'); ylabel('freq'); datetick('x','yyyy','keeplimits')
subplot(1,3,3); meshc( xData, freq_hat,real(coh{3})); set(gca,'YDir','reverse'); title('$ 
ho_{32}^2$','Interpreter','LaTex'); xlabel('time'); ylabel('freq'); datetick('x','yyyy','keeplimits')

Projects

Find a m‐dimensional stationary vector time series data set of your interest with m no less than 5. Complete your detailed nonparametric analysis of multivariate spectrum with a written report and analysis software code.
Find a m‐dimensional stationary vector time series data set of your interest with m no less than 5. Complete your detailed VARMA spectral analysis with a written report and analysis software code.
Use the data set from Project 1 to perform a VARMA spectral analysis and compare the results from the two methods.
Use the data set from Project 2 to perform a nonparametric analysis of multivariate spectrum, and compare the results from the two methods.
Find a m‐dimensional nonstationary vector time series data set of your interest and complete your detailed analysis of multivariate spectral analysis with a written report and analysis software code.

References

Bandyopadhyay, S., Jentsch, C., and Rao, S.S. (2017). A spectral domain test for stationarity of spatio‐temporal data. Journal of Time Series Analysis 38: 326–351.
Bartlett, M.S. (1950). Periodogram analysis and continuous spectra. Biometrika 37: 1–16.
Blackman, R.B. and Tukey, J.W. (1959). The Measurements of Power Spectrum from the Point of View of Communications Engineering. Dover Publications.
Brillinger, D.R. (2002). Time Series: Data Analysis and Theory. Philadelphia: SIAM.
Chang, J., Hall, P., and Tang, C.Y. (2017). A frequency domain analysis of the error distribution from noisy high‐frequency data. Biometrika 103: 1–16.
Dahlhaus, R. (1996). Fitting time series model to nonstationary process. Annals of Statistics 25: 1–37.
Dahlhaus, R. (2000). A likelihood approximation for locally stationary process. Annals of Statistics 28: 1762–1794.
Dai, M. and Guo, W. (2004). Multivariate spectral analysis using Cholesky decomposition. Biometrika 91: 629–643.
Daniell, P.J. (1946). Discussion on symposium on autocorrelation in time series. Journal of the Royal Statistical Society (Suppl. 8): 88–90.
Davis, R.A., Lee, T.C.M., and Rodriguez‐Yam, G.A. (2006). Structural break estimation for nonstationary time series models. Journal of the American Statistical Association 101: 223–239.
Eubank, R.L. and Hsing, T. (2008). Canonical correlation for stochastic processes. Stochastic Processes and their Applications 118: 1634–1661.
Goodman, N. (1963). Statistical analysis based on a certain multivariate complex Gaussian distribution. Annals of Mathematical Statistics 34: 152–177.
Grant, A.J. and Quinn, B.G. (2017). Parametric spectral discrimination. Journal of Time Series Analysis 38: 838–864.
Gu, C. and Wahba, G. (1993). Semiparametric analysis of variance with tensor product thin plate splines. Journal of the Royal Statistical Society, Series B 55: 353–368.
Guo, W. and Dai, M. (2006). Multivariate time‐dependent spectral analysis using Cholesky decomposition. Statistica Sinica 16: 825–845.
Hannan, E.J. (1970). Multiple Time Series. Wiley.
Hansen, M.H. and Yu, B. (2001). Model selection and the principle of minimum description length. Journal of the American Statistical Association 96: 746–774.
Krafty, R.T. and Collinge, W.O. (2013). Penalized multivariate Whittle likelihood for power spectrum estimation. Biometrika 100: 447–458.
Li, Z. and Krafty, R.T. (2018). Adaptive Bayesian time‐frequency analysis of multivariate time series. To appear in Journal of the American Statistical Association.
Ombao, H., von Sachs, R., and Guo, W. (2005). SLEX analysis of multivariate nonstationary time series. Journal of the American Statistical Association 100: 519–531.
Parzen, E. (1961). Mathematical considerations in the estimation of spectra. Technometrics 3: 167–190.
Parzen, E. (1963). Notes on Fourier analysis and spectral windows. Technical report No. 48, Office of Naval Research.
Pawitan, Y. (1996). Automatic estimation of the cross‐spectrum of a bivariate time series. Biometrika 83: 419–432.
Pawitan, Y. and O'Sullivan, F. (1994). Nonparametric spectral density estimation using penalized whittle likelihood. Journal of the American Statistical Association 89: 600–610.
Priestley, M.B. (1965). Evolutionary spectral and non‐stationary process. Journal of the Royal Statistical Society, Series B 27: 204–237.
Priestley, M.B. (1966). Design relations for non‐stationary processes. Journal of the Royal Statistical Society, Series B 28: 228–240.
Priestley, M.B. (1967). Power spectral analyses of non‐stationary random processes. Journal of Sound and Vibration 6: 86–97.
Priestley, M.B. (1981). Spectral Analysis and Time Series, vol. 1 and 2. Academic Press.
Qin, L. and Wang, Y. (2008). Nonparametric spectral analysis with applications to seizure characterization using EEG time series. Annals of Applied Statistics 2: 1432–1451.
Rao, T.S. and Terdick, G. (2017). On the frequency variogram and on frequency domain methods for the analysis of spatio‐temporal data. Journal of Time Series Analysis 38: 308–325.
Ray, E.L., Sakrejda, K., Lauer, S.A., Johansson, M.A., and Reich, N.G. (2017). Infectious disease prediction with kernel conditional density estimation. Statistics in Medicine 36: 4908–4929.
Riedel, K. and Sidorenko, A. (1995). Minimum bias multiple taper spectral estimation. IEEE Transaction on Signal Processing 43: 188–195.
Rissanen, J. (1989). Stochastic Complexity in Statistical Inquiry. Singapore: Word Scientific.
Rosen, O. and Stoffer, D.S. (2007). Automatic estimation of multivariate spectra via smoothing splines. Biometrika 94: 335–345.
Schwarz, K. and Krivobokova, T. (2016). A unified framework for spline estimators. Biometrika 103: 103–120.
Slepian, D. (1978). Prolate spheroidal wave functions, Fourier analysis, and uncertainty – V: the discrete case. Bell System Technical Journal 57: 1371–1430.
Thompson, D.J. (1982). Spectrum estimation and harmonic analysis. Proceedings of the IEEE 70: 1055–1096.
Wahba, G. (1990). Spline Models for Observational Data, CBMS‐NSF Regional Conference Series in Applied Mathematics. Philadelphia: SIAM.
Walden, A.T. (2000). A unified view of multitaper multivariate spectral estimation. Biometrika 87: 767–788.
Wei, W.W.S. (2006). Time Series Analysis Univariate & Multivariate Methods, 2e. Pearson Addison‐Wesley.
Wei, W.W.S. (2008). Spectral analysis. In: Handbook of Longitudinal Research, Design, Measurement, and Analysis (ed. S. Menard), 601–620. Academic Press.
Whittle, P. (1953). Estimation and information in stationary time series. Arkiv för Matematik 2: 423–434.
Whittle, P. (1954). Some recent contributions to the theory of stationary processes. In: A Study in the Analysis of Stationary Time Series, 2e (ed. H.O. Wold), 196–228. Stockhol: Almqristand Witsett.
Wilson, G.T. (2017). Spectral estimation of the multivariate impulse response. Journal of Time Series Analysis 38: 381–391.
Wood, S.N. (2011). Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. Journal of the Royal Statistical Society, Series B 73: 3–36.
Zhang, S. (2016). Adaptive spectral estimation for nonstationary multivariate time series. Computational Statistics & Data Analysis 103: 330–349.

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for 9 Multivariate spectral analysis of time series

Create new playlist

Sign In

Sign Up

9.1 Introduction

9.2 Spectral representations of multivariate time series processes

9.3 The estimation of the spectral density matrix

9.3.1 The smoothed spectrum matrix

9.3.2 Multitaper smoothing

9.3.3 Smoothing spline

9.3.4 Bayesian method

9.3.5 Penalized Whittle likelihood

9.3.6 VARMA spectral estimation

9.4 Empirical examples of stationary vector time series

9.4.1 Sample spectrum

9.4.2 Bayesian method

9.4.3 Penalized Whittle likelihood method

9.4.4 Example of VAR spectrum estimation

9.5 Spectrum analysis of a nonstationary vector time series

9.5.1 Introduction

9.5.2 Spectrum representations of a nonstationary multivariate process

9.5.2.1 Time‐varying autoregressive model

9.5.2.2 Smoothing spline ANOVA model

9.5.2.3 Piecewise vector autoregressive model

9.5.2.4 Bayesian methods

9.6 Empirical spectrum example of nonstationary vector time series

Projects

References

Table of Contents for
9 Multivariate spectral analysis of time series