Chapter 8 - Correlation-Driven Nonparametric Learning (2/3)

Search in book...
Toggle Font Controls
Create new playlist

Name your new playlist

Playlist description (optional)
Sign In

Email address

Password

Forgot Password?

or

Continue with Facebook

Continue with Google
Sign Up

Full Name

Email address

Confirm Email Address

Password

or

Continue with Facebook

Continue with Google

52 CORRELATION-DRIVEN NONPARAMETRIC LEARNING

Algorithm 8.1: CORrelation-driven Nonparametric expert: CORN (w, ρ).

Input: w: Window size; ρ: Correlation coefﬁcient threshold; x

: Historical

market sequence; t: Index of current period.

Output: b

t+1

: Expert’s portfolio for period t +1.

begin

Initialize C

=∅, b

t+1



,...,



;

if t ≤ w then

return b

t+1

;

end

for i = w +1,w+2,...,t do

if corrcoef (x

i−1

i−w

, x

t−w+1

) ≥ ρ then

= C

∪i;

end

if C

=∅then

Search for an optimal portfolio: b

t+1

= arg max

b∈



i∈C

(b ·x

);

end

return b

t+1

;

end

period t, we propose to learn an optimal portfolio following the idea of BCRP (Cover

1991), which maximizes the expected multiplicative return over the sequence of

similar price relatives, that is,

t+1

(w, ρ) = arg max

b∈



i∈C

(w,ρ)

(b ·x

), (8.1)

where 

represents an m-dimensional simplex. In case that C

(w, ρ) is empty (espe-

cially for a large ρ value), we will simply adopt uniform portfolio



,...,



. Note

that the correlation-similar set usually contains a large number of correlated price

relatives. If one similar price relative vector has occurred frequently in history, it will

also appear multiple times in the correlation-similar set. In other words, Equation 8.1

has more or less considered the occurrence/conﬁdence of the correlated price relative

vectors, which would avoid simply taking an extreme case in history.

We further combine all experts according to their historical performance S

(w, ρ)

and a probability distribution function q(w,ρ). Speciﬁcally, CORN combines experts’

portfolios and calculates the ﬁnal portfolio for period t +1as

t+1



w,ρ

q(w,ρ)S

(w, ρ)b

t+1

(w, ρ)



w,ρ

q(w,ρ)S

(w, ρ)

, (8.2)

where b

t+1

(w, ρ) represents the portfolio computed by expert E(w, ρ) and S

(w, ρ)

represents its historical performance. For an individual expert, the higher its historical

return, the higher its weight assigned in the ﬁnal portfolio.

T&F Cat #K23731 — K23731_C008 — page 52 — 9/28/2015 — 21:18

ALGORITHMS 53

After releasing the price relative vector of x

t+1

, CORN updates the cumulative

wealth,

t+1

= S

×(b

t+1

·x

t+1

For the underlying experts, CORN updates their cumulative wealth,

t+1

(w, ρ) = S

(w, ρ) ×(b

t+1

(w, ρ) ·x

t+1

where S

(w, ρ) represents the cumulative wealth achieved by expert E(w, ρ) till

period t.

Therefore, it is straightforward that the cumulative wealth achieved by the pro-

posed CORN strategy after n periods is equivalent to a q-weighted sum of all experts’

returns,



w,ρ

q(w,ρ)S

(w, ρ). (8.3)

Clearly, the ﬁnal cumulative return is affected by all underlying experts, and the

portions of contributions made by each expert are determined by the predeﬁned

distribution q(w,ρ) and expert’s performance S

(w, ρ).

Ideally, indexed by (w, ρ), we can choose CORN experts such that they cover all

possible parametersettings,thus eliminating their effects.However,the computational

cost of such a combination is inhibitively high. To boost the efﬁciency, we can choose

ﬁnite discrete dimensions of the parameters, that is, a speciﬁed number of (w, ρ)

combinations.

The selection of experts also trades off an individual expert’s performance and its

computational time. First, Equation 8.3 clearly shows that each expert contributes to

the ﬁnal cumulative wealth by its performance; thus, choosing a worse expert may

lower the ﬁnal performance. Second, the mixture’s computation time is generally the

summation of all experts’individual time. In other words, choosing too many experts,

which cost too much time, may affect its practical scalability.

In this study, we ﬁrst adopted uniform combination, which chooses a uniform

distribution of q(w,ρ), and named it “CORN uniform combination” (CORN-U).

Algorithm 8.2 shows the details of the proposed CORN-U algorithm. In particular,

we assign the same weights to all CORN experts, although the weights can be adjusted

if we have more information. Moreover, CORN-U only considers P = 1 and chooses

a speciﬁc value of ρ.

The above uniform combination algorithm may include some poor experts, lead-

ing to the degradation of overall performance. To overcome such limitations, the

second algorithm, “CORN top-K combination” (CORN-K), combines only the top

K best experts. Algorithm 8.3 illustrates the proposed CORN-K algorithm. In partic-

ular, it chooses the top K experts with the highest historical returns and uniformly

combines them. That is, the strategy assigns the set of top K experts a uniform distri-

bution q(w,ρ) =

, while the weights assigned to other experts are simply set to 0.

Moreover, for the proposed CORN-K algorithm, we deﬁne P ≥ 1 associated experts,

each of which has a different ρ value.

T&F Cat #K23731 — K23731_C008 — page 53 — 9/28/2015 — 21:18

54 CORRELATION-DRIVEN NONPARAMETRIC LEARNING

Algorithm 8.2: Online portfolio selection with CORN uniform algorithm

(CORN-U).

Input: W : Maximum window size; ρ: Correlation coefﬁcient threshold;

= (x

,...,x

): Historical market sequence.

Output: S

: Final cumulative wealth.

begin

Initialize S

and W experts: S

= 1, b

1, q(w,ρ) =

(w, ρ) = 1, b

(w, ρ) =

1,w = 1,...,W;

for t = 1 to n do

Rebalance the portfolio to b

;

Receive current price relatives: x

;

Update the cumulative wealth: S

= S

t−1

×(b

·x

);

Update the experts’ cumulative wealth:

(w, ρ) = S

t−1

(w, ρ) ×(b

(w, ρ) ·x

);

Update next portfolio:

begin

for w = 1 to W do

CORN expert (Algorithm 8.1) ﬁnds a portfolio:

t+1

(w, ρ) = CORN(w, ρ)

end

Combine experts’ portfolios:

t+1



q(w,ρ)S

(w, ρ)b

t+1

(w, ρ)



q(w,ρ)S

(w, ρ)

end

Remarks on Aggregation: Note that the aggregation or combination rule

described in Equation 8.2 is a special case of the general concept of exponential

weighting. For a learning parameter η > 0, put

t+1



w,ρ

q(w,ρ)e

η log S

(w,ρ)b

t+1

(w,ρ)



w,ρ

q(w,ρ)e

η log S

(w,ρ)

If η = 1, then one gets the rule in Equation 8.2. The proof of universal consistency of

, B

, and B

works without any difﬁculties if η ≤ 1. However, the exponential

results are superior if η is much larger than 1, but there is no theoretical support for this

phenomenon. The large η corresponds to CORN-K with K = 1 (i.e., this rule is the

T&F Cat #K23731 — K23731_C008 — page 54 — 9/28/2015 — 21:18

ALGORITHMS 55

Algorithm 8.3: Online portfolio selection with CORN top-K algorithm

(CORN-K).

Input: W : Maximum window size; P : The number of correlation coefﬁcient

thresholds; K: The number for top K experts; x

= (x

,...,x

historical market sequence.

Output: S

: Final cumulative wealth.

begin

Initialize S

and W ×P experts: S

= 1, P =



,...,

P −1



q(w,ρ) =

W ×P

, S

(w, ρ) = 1,w = 1,...,W,ρ ∈ P;

for t = 1 to n do

Rebalance the portfolio to b

;

Receive current price relatives: x

;

Update the cumulative wealth: S

= S

t−1

×(b

·x

);

Update the experts’ cumulative wealth:

(w, ρ) = S

t−1

(w, ρ) ×(b

(w, ρ) ·x

);

Update experts’ weights:

begin

Select top K experts K ={(w, ρ)} w.r.t. S

(w, ρ) ;

Set weights for the top K experts: q(w,ρ) =

,(w,ρ) ∈ K;

Set zero weights for other experts: q(w,ρ) = 0,(w,ρ) ∈ K;

end

Update next portfolio:

begin

for w = 1 to W do

for ρ ∈ P do

CORN expert (Algorithm 8.1) ﬁnds a portfolio:

t+1

(w, ρ) = CORN(w, ρ)

end

Combine top K experts’ portfolios:

t+1



w,ρ

q(w,ρ)S

(w, ρ)b

t+1

(w, ρ)



w,ρ

q(w,ρ)S

(w, ρ)

end

follow the winner rule, which experts called follow the leader in the machine-learning

literature) (Cesa-Bianchi and Lugosi 2006). It would be nice to prove or disprove that

the follow the leader aggregation results in universally consistent strategies (i.e.,

asymptotically it is of growth optimal for any stationary and ergodic market process).

T&F Cat #K23731 — K23731_C008 — page 55 — 9/28/2015 — 21:18

56 CORRELATION-DRIVEN NONPARAMETRIC LEARNING

8.4 Analysis

In this section, we ﬁrst analyze CORN’s universal consistency with respect to the

class of all ergodic process.

Theorem 8.1 The portfolio scheme CORN is universal with respect to the class of

all ergodic processes such that E{|log X

|} < ∞, for j = 1,...,m.

Proof The proof can be found in Appendix B.1.1.

In the CORN expert learning procedure, there are two key parameters: the cor-

relation coefﬁcient threshold ρ and the window size w. Below, we analyze how they

affect the algorithms.

As shown in the motivating example, the correlation coefﬁcient threshold ρ is

critical to a correlation-similar set. If ρ is negative, the correlation-similar set would

contain some negatively correlated price relative vectors or irrelevant vectors. On

the other hand, if ρ is too large, for example, ρ ≥ 0.5, the correlation-similar set

would neglect some positively correlated vectors. Since the correlation-similar set is

crucial in selecting optimal portfolios, it would harm the learning performance if it

either contains negatively correlated vectors/irrelevant vectors or discards positively

correlated vectors. Empirically, we found that the optimal ρ value is often dataset

dependent, but often close to 0, which will be veriﬁed in Section 13.3.1. Moreover,

we note that CORN would degrade to a special case when ρ → 1. As ρ → 1, fewer

market windows are highly positively correlated to the latest window. In the extreme

case of ρ = 1, C

(w, ρ) becomes almost empty, which thus reduces to the uniform

CRP strategy.

∗

Another key parameter for the CORN expert learning process is window size.

Since the calculation of correlation coefﬁcient treats market windows as a vector, the

window size does not have a signiﬁcant impact on the ﬁnal portfolio. When certain

experts give verybadperformance,theﬁnal result tends to be relatively stable sincethe

proposed combination methods (viz., CORN-U and CORN-K) will reduce the impact

of these experts and thus provide a stable portfolio. We will numerically analyze the

effect of window size in Section 13.3.1, which shows that the proposed combination

can effectively smoothen the performance curve.

The simplicity and effectiveness of CORN raise a fundamental question: Is it

reasonable to select a portfolio using only market price information?” While our

goal is not to resolve the philosophical debates between fundamental and techni-

cal analysts, we believe this work goes a long way to provide empirical evidence

endorsing the effectiveness of technical analysis. Moreover, note that the success of

CORN depends on three basic assumptions that form the basis of most technical anal-

ysis methods, including: (i) market action discounts everything; (ii) price moves in

trends; and (iii) history tends to repeat itself. The ﬁrst point assumes that stock prices

at any given time reﬂect everything that has or could affect a company, including

fundamental factors. And the second and third points directly lead to our proposed

∗

This is not a general case, which depends on the initial portfolio and default values if a similarity set

is empty.

T&F Cat #K23731 — K23731_C008 — page 56 — 9/28/2015 — 21:18

..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.

Table of Contents for Chapter 8 - Correlation-Driven Nonparametric Learning (2/3)

Create new playlist

Sign In

Sign Up

Table of Contents for
Chapter 8 - Correlation-Driven Nonparametric Learning (2/3)