The Mathematics behind portfolio VAR

Introduction

In this article we define VaR and give a brief overview of how it is used to quantify the risk of a portfolio. We will then go into a step-by-step mathematical derivation of VaR, whilst highlighting and discussing the various assumptions used along the way. The awareness of such assumptions is important to ensure reliable results and the use appropriate of models in order to avoid underestimation of risk.

Definition of VaR

Suppose that we have a portfolio of assets. Each asset has an associated price which changes from one time unit to the next. Hence the total value of the portfolio changes from one time unit to the next. The time unit could be a day, 5-day period, 1 week, 1 month, and so on. The choice of the time unit depends on how the historic data of prices is presented and the time horizon of the VaR calculation (as we shall see).

The term VaR stands for Value-At-Risk and has an associated time horizon and level of confidence. Let us say that the time horizon is 1 time unit for now. An \alpha 1-time unit VaR is the value v by which with a level of confidence (or probability) of \alpha, the value of the loss suffered during 1 time unit would be not be greater than v. As an example, if the 95% 5-day VaR is €100,000, then with a level of confidence of 95% , it is expected that the loss suffered during a 5-day period will not be greater than €100,000.

Consider the 1-time unit portfolio rate of return R defined by the percentage change in the value of the portfolio from one time point to the next. Suppose that we know the distribution of the R. Let r be the the value of \mathbb{P}[R\geq r]=\alpha. This means that the 1-time unit rate of return of the portfolio will be greater than r, 100\alpha \% of the time.

Then the \alpha 1-time unit VaR is equal to -rV, where V is the total value of the portfolio.

Derivation of VaR

Suppose we have a portfolio of n assets. Let i \in \left\{ 1,2,\cdots,n \right\}. Let X_i be the number of units of asset i. Let P_i be the price of each unit of asset i. Let V_i be the total value of asset i. Hence V_i=X_i P_i. Let R_i be the rate of return from asset i during one time unit and let S_i be the return from asset i. Hence we have S_i=V_iR_i.

The portfolio can be described by the set X=\lbrace X_1,X_2,\cdots,X_n\rbrace. Let the total current value of the portfolio be V, where V=V_1+V_2+\cdots+V_n. The total return on the portfolio S during one time unit is the sum of the return from each asset present in the portfolio. Thus S=S_1+S_2+\cdots+S_n. Finally the rate of return R on the whole portfolio is given by:

    \begin{equation*}\begin{split}R&=\frac{S}{V}\\  &=\frac{S_1+S_2+\cdots+S_n}{V}\\  &=\frac{V_1R_1+V_2R_2+\cdots+V_nR_n}{V}\\  &=\frac{V_1}{V} R_1+\frac{V_2}{V} R_2+\cdots+\frac{V_n}{V} R_n\\  &=w_1 R_1+w_2 R_2+\cdots+w_n R_n \end{split}\end{equation*}

where w_i=\frac{V_i}{V} for i \in \left\{1,2,\cdots,n\right\} and represents the weight of asset i in the portfolio at the current time.

For every i \in \left\{1,2,\cdots,n\right\}, let R_i be a random variable with mean \mathbb{E}(R_i) and variance \mbox{Var}(R_i) (not to be confused with VaR which represents the Value-At-Risk).

Then the mean \mathbb{E}(R) of the portfolio rate of return R is given by:

    \begin{equation*}\begin{split}\mathbb{E}(R)&=\mathbb{E}(w_1 R_1+w_2 R_2+\cdots+w_n R_n)\\&=w_1\mathbb{E}(R_1)+w_2\mathbb{E}(R_2)+\cdots+w_n\mathbb{E}(R_n)\end{split}\end{equation*}

and the variance \mbox{Var}(R) of the portfolio rate of return R is given by:

    \begin{equation*}\begin{split}\mbox{Var}(R)&=\mbox{Var}(w_1 R_1+w_2 R_2+\cdots+w_n R_n)\\&=\mbox{Cov}(w_1 R_1+w_2 R_2+\cdots+w_n R_n,w_1 R_1+w_2 R_2+\cdots+w_n R_n)\\ &=\sum_{i =1}^{n}\sum_{j=1}^n w_iw_j\mbox{Cov}(R_i,R_j)\\ &=\sum_{i =1}^{n} w_i^2 \mbox{Var}(R_i) + 2\sum_{i,j: i<j} w_iw_j\mbox{Cov}(R_i,R_j) \end{split}\end{equation*}

where \mbox{Cov}(R_i,R_j) is the covariance between R_i and R_j.

Hence R is a random variable with mean \mathbb{E}(R) and variance \mbox{Var}(R), given in terms of the means, variance and covariances of R_1,R_2,\cdots,R_n. In order to characterise R completely, we need to know the values of \mathbb{E}(R) and \mbox{Var}(R), and also its distribution function.

Estimating the mean of the portfolio return \mathbb{E}(R) and its variance \mbox{Var}(R)

It is common practise that a series of prices is gathered for each asset over a time horizon common for all assets. Then the percentage change of a price of an asset, from one time unit to the next is the rate of return for that asset at that particular time interval. Hence a series of rates of return is obtain for each asset in the portfolio. The sample mean and the sample variance of each of these series are taken to be estimates of \mathbb{E}(R_1),\mathbb{E}(R_2),\cdots,\mathbb{E}(R_n) and \mbox{Var}(R_1),\mbox{Var}(R_2),\cdots,\mbox{Var}(R_n).

Hence it is assumed that the means and the variances are independent of time. So time should not have any effect on the means and variances (no heteroscedasticity). In the case when the mean is increasing over the time horizon of the historic data, the risk is overestimated and vice-versa. In the case when the variance of the rates of return is increasing over the time horizon of the historic data, the risk is underestimated, and vice-versa. So one must check that the means and variances are more or less constant over the period of study.

Moreover the sample covariances between the n series of rates of return are taken to be estimates of the covariances between the R_i‘s. Here it is assumed that the correlation structure is uniform over the values of the R_i‘s. Consider for example the case of positively correlated series (of rates of return). It is assumed that the correlation when the R_i‘s are positive is the same as that when the R_i‘s are negative. In the case when the correlation is higher when the R_i‘s are negative, the risk is underestimated.

Estimating the distribution of R

It is common practise that each R_i (for i \in \lbrace 1,2,\cdots,n\rbrace) is assumed to be normally distributed. In the case where the actual distribution is symmetric but has fatter tails than the normal distribution (positive excess kurtosis), the risk is underestimated, and vice-versa. This is known as kurtosis risk. In the case of an actual distribution which is skewed to the left (negatively skewed), there is underestimation of risk and vice-versa. This is known as skewness risk.

When we assume that each R_i follows the normal distribution and moreover that the joint distribution of the R_i‘s is the multivariate normal distribution, then since R is a linear combination of the R_i‘s, R itself is normally distributed with mean \mathbb{E}(R) and variance \mbox{Var(R)}, estimated as described above.

Estimating the 1-time unit \alpha VaR

We would like to find r such that \mathbb{P}[R\geq r]=\alpha. We have:

    \begin{equation*}\begin{split}\mathbb{P}[R\geq r]&=\alpha\\\mathbb{P}[\frac{R-\mathbb{E}(R)}{\sqrt{\mbox{Var}(R)}}\geq \frac{r-\mathbb{E}(R)}{\sqrt{\mbox{Var}(R)}}]&=\alpha\end{split}\end{equation*}

The random variable \frac{R-\mathbb{E}(R)}{\sqrt{\mbox{Var(R)}}} follows the standard normal distribution (since R is assumed to follow the normal distribution). Hence \frac{r-\mathbb{E}(R)}{\sqrt{\mbox{Var}(R)}}=z_{1-\alpha} where z_{1-\alpha} is the z-score with probability 1-\alpha. Therefore:

    \begin{equation*} r=\mathbb{E}(R)+z_{1-\alpha}\sqrt{\mbox{Var}(R)}. \end{equation*}

 

The \alpha\% VaR is given by:

    \begin{equation*} \begin{split} &-rV\\ =&-V\mathbb{E}(R)-z_{1-\alpha}V\sqrt{\mbox{Var}(R)}\\ =&-V\mathbb{E}(R)+z_{\alpha}V\sqrt{\mbox{Var}(R)}. \end{split} \end{equation*}

Sometimes when the value of \mathbb{E}(R) is very small, it is assumed that \mathbb{E}(R)=0. This simplifies the VaR equation even further to just:

    \begin{equation*} z_{\alpha}V\sqrt{\mbox{Var}(R)}. \end{equation*}

Extending to the m-time unit \alpha VaR

The 1-time unit \alpha \% VaR could be extended to m-time units \alpha \% VaR. Let R_{(m)} be the portfolio rate of return over m time units. In this case, the distribution of the portfolio logarithmic rate of return \ln(1+R_{(m)}) is calculated and used as a proxy of the distribution of R_{(m)}.

For i \in \lbrace 1,2,\cdots,m\rbrace, let R^{(i)} be the rate of return during the i^{th} time unit. Then:

    \begin{equation*} \begin{split} (1+R_{(m)})&=(1+R^{(1)})(1+R^{(2)})\cdots(1+R^{(m)})\\ \ln(1+R_{(m)})&=\ln(1+R^{(1)})+\ln(1+R^{(2)})+\cdots+\ln(1+R^{(m)}). \end{split} \end{equation*}

Let us find the expected value and the variance fo the random variable \ln(1+R_{(m)}).

Approximating \mathbb{E}(\ln(1+R_{(m)})) and \mbox{Var}(\ln(1+R_{(m)}))

    \begin{equation*}\begin{split} \mathbb{E}(\ln(1+R_{(m)}))&=\mathbb{E}(\ln(1+R^{(1)})+\ln(1+R^{(2)})+\cdots+\ln(1+R^{(m)}))\\&=\mathbb{E}(\ln(1+R^{(1)})+\mathbb{E}(\ln(1+R^{(2)}))+\cdots+\mathbb{E}(\ln(1+R^{(m)}))\\&=m\mathbb{E}(\ln(1+R))\end{split}\end{equation*}


since \ln(1+R^{(1)}),\ln(1+R^{(2)}),\cdots,\ln(1+R^{(m)}) are identically distributed 1 time unit portfolio logarithmic rates of return.

Recall that the Maclaurin series of \ln(1+x)=x-\frac{x^2}{2}+\frac{x^3}{3}-\frac{x^4}{4}+\cdots. When the value of x is small, we have the approximation \ln(1+x)=x, which we shall refer to as the “Maclaurin approximation”. By using the Maclaurin approximation on the formula for the expected value, we obtain:

    \begin{equation*} \mathbb{E}(\ln(1+R_{(m)}))=m\mathbb{E}(R). \end{equation*}

Consider now the variance.

    \begin{equation*}\begin{split} \mbox{Var}(\ln(1+R_{(m)}))&=\mbox{Var}(\ln(1+R^{(1)})+\ln(1+R^{(2)})+\cdots+\ln(1+R^{(m)}))\\&=\sum_{i=1}^m \mbox{Var}(\ln(1+R^{(i)})+2\sum_{i,j:i<j}^m \mbox{Cov}(\ln(1+R^{(i)}),\ln(1+R^{(j)}))\end{split}\end{equation*}

It is assumed that \ln(1+R^{(1)}),\ln(1+R^{(2)}),\cdots,\ln(1+R^{(m)}) are not correlated, that is, there is no serial correlation present. Hence, in particular, situations in which a negative rate of return in one time point triggers a negative return in the time point, violates this assumption. By assuming that there is no serial correlation, the variance of the logarithmic rate of return reduces to:

    \begin{equation*} \mbox{Var}(\ln(1+R_{(m)}))=\sum_{i=1}^m \mbox{Var}(\ln(1+R^{(i)})) \end{equation*}

and moreover

    \begin{equation*} \mbox{Var}(\ln(1+R_{(m)}))=m\mbox{Var}(\ln(1+R)) \end{equation*}

since \ln(1+R^{(1)}),\ln(1+R^{(2)}),\cdots,\ln(1+R^{(m)}) are identically distributed 1 time unit portfolio logarithmic rates of return. This is the well-known square root of time rule because here the standard deviation of the logarithmic rate of return over a period of m units is equal to the standard deviation of the logarithmic rate of return over one time unit multiplied by \sqrt{m} (where m is the time).

By the Maclaurin approximation, the formula for the variance reduces to

    \begin{equation*} \mbox{Var}(\ln(1+R_{(m)}))=m\mbox{Var}(R). \end{equation*}

Approximate distribution of R_{(m)}

If \ln(1+R_{(m)}) and each \ln(1+R^{(i)}) is approximated by the Maclaurin approximation in the equation \ln(1+R_{(m)})=\ln(1+R^{(1)})+\ln(1+R^{(2)})+\cdots+\ln(1+R^{(m)}), and carry on with the assumption that the portfolio rate of return is normally distributed, then we assume that R_{(m)} is normally distributed with mean m\mathbb{E}(R) and variance m\mbox{Var}(R).

Hence the m-time units \alpha VaR is -Vm\mathbb{E}(R)+z_{\alpha}V\sqrt{m}\sqrt{\mbox{Var}(R)}. Again, if \mathbb{E}(R) is small and assumed to be zero, the VaR would reduce to z_{\alpha}V\sqrt{m}\sqrt{\mbox{Var}(R)}.

Conclusion

We have gone through the mathematical derivation on the VaR equation for 1 time unit and for larger time horizons. The assumptions taken at each steps are highlighted. This is important for the risk manager to be aware of the assumptions being made and to make sure that the movement in the prices do not violate such assumptions. This will ensure that the portfolio risk is not underestimated and that the results are reliable.