Also applies to fixed income positions
Addresses what will happen to bond price if interest rises by 1 basis point
Price bond at current interest rates
Price bond assuming rate rise by 1 bp
Calculate loss as current minus prospective bond prices
Pros
Cons
Staff had to
Main elements of system working by around 1990
Then decided to use the '4:15' report
Found that it worked well
Sensitised senior management to risk-expected return tradeoffs, etc.
New system publicly launched in 1993 and attracted a lot of interest
Other firms working on their systems
JPM decided to make a lower-grade version of its system publicly available
This was RiskMetrics system launched in Oct 1994
Subsequent development of other VaR systems, applications to credit, liquidity, op risks, etc.
PT interprets risk as std of portfolio return, VaR interprets it as maximum likely loss
PT assumes returns are normal or near normal, whilst VaR systems can accommodate wider range of distributions
VaR approaches can be applied to a wider range of problems
VaR systems not all based on portfolio theory
VaR provides a single summary measure of possible portfolio losses
VaR provides a common consistent measure of risk across different positions and risk factors
VaR takes account of correlations between risk factors
Can be used to set overall firm risk target
Can use it to determine capital allocation
Can provide a more consistent, integrated treatment of different risks
Can be useful for reporting and disclosing
Can be used to guide investment, hedging, trading and risk management decisions
Can be used for remuneration purposes
Can be applied to credit, liquidity and op risks
High cl if we want to use VaR to set capital requirements
Lower if we want to use VaR
Depends on investment/reporting horizons
Daily common for cap market institutions
10 days (or 2 weeks) for banks under Basel
Can depend on liquidity of market – hp should be equal to liquidation period
Short hp makes it easier to justify assumption of unchanging portfolio
Short hp preferable for model validation/backtesting requirements
Given: A portfolio consists of $600,000 in Stock A and $400,000 in Stock B.
| Parameter | Stock A | Stock B |
|---|---|---|
| Daily expected return | 0.05% | 0.03% |
| Daily volatility | 2.0% | 1.0% |
| Correlation |
0.40 |
Step 1: Portfolio daily expected return and volatility
Step 2: 1-day 99% VaR (
Step 3: Diversification benefit
Undiversified VaR:
Diversification reduces VaR by 16.4%, driven by imperfect correlation (
Given: 500 daily portfolio P&L observations (sorted, worst to best, in $thousands):
| Rank | -25 | -24 | -23 | ... | -6 | -5 |
|---|---|---|---|---|---|---|
| P&L | ... |
99% VaR: The 5th worst loss out of 500 observations (99% →
95% VaR: The 25th worst loss out of 500 (
Interpretation: With 99% confidence, the portfolio will not lose more than $27,100 in one day.
VaR estimates subject to error
VaR models subject to (considerable!) model risk
VaR systems subject to implementation risk
But these problems common to all risk measurement systems
VaR tells us most we can lose at a certain probability, i.e., if tail event does not occur
VaR does not tell us anything about what might happen if tail event does occur
Trader can spike firm by selling out of the money options
Two positions with equal VaRs not necessarily equally risky, because tail events might be very different
Solution to use more VaR information – estimate VaR at higher cl
VaR-based decision calculus can be misleading, because it ignores low-prob, high-impact events
Additional problems if VaR is used in a decentralized system
VaR-constrained traders/managers have incentives to 'game' the VaR constraint
VaR of diversified portfolio can be larger than VaR of undiversified one
Example
Aggregating individual risks does not increase overall risk
Important because: Adding risks together gives conservative (over-) estimate of portfolio risk – want bias to be conservative
If risks not subadditive and VaR used to measure risk
Subadditivity is highly desirable
But VaR is only subadditive if risks are normal or elliptical
VaR not subadditive for arbitrary distributions
背景: 2008年金融危机期间,多家大型金融机构的VaR模型严重低估了实际风险暴露。
Lehman Brothers(2008年8月):
根本原因分析:
| VaR假设 | 实际情况 |
|---|---|
| 资产回报近似正态 | 尾部极厚,极端事件频率远高于正态假设 |
| 历史波动率/相关性稳定 | 波动率急剧上升,相关性趋近于1 |
| 市场流动性恒定 | 危机中流动性蒸发,买卖价差扩大10-100倍 |
| 分散化有效 | 所有风险资产同步下跌,分散化失效 |
关键教训:
Basel II.5(2009年修订):
Basel III / FRTB(Fundamental Review of the Trading Book, 2016/2019最终版):
| 维度 | 旧框架(Basel II) | 新框架(FRTB) |
|---|---|---|
| 风险度量 | VaR (99%, 10-day) | ES (97.5%, 12-month stressed) |
| 资本计算 | 标准法或内部模型法 | 标准法(SA)或内部模型法(IMA) |
| 交易台层级 | 全行汇总 | 必须在交易台层面验证和计算 |
| 非流动性风险 | 未显式处理 | 引入流动性期限调整(LR) |
| 尾部风险 | VaR不关注尾部 | ES关注尾部平均损失 |
| 回测 | 基于VaR例外 | 基于ES的"超越比率"(exceedance)检验 |
为什么从VaR转向ES?
Let
Homogeneity and Monotonicity imply convexity, which is important
Translation invariance means that adding a sure amount to our end-period portfolio will reduce loss by amount added
it is also called expected shortfall, tail conditional expectation, conditional loss, or expected tail loss
VaR tells us the most we can lose if a tail event does not occur, CVaR tells us the amount we expect to lose if a tail event does occur
CVaR is coherent
Given: A portfolio with normally distributed returns,
99% VaR (1-day):
99% CVaR (for normal distribution):
where
| Measure | Value (in % of portfolio) |
|---|---|
| 99% VaR | 3.409 |
| 99% CVaR | 3.925 |
| CVaR/VaR ratio | 1.15 |
CVaR exceeds VaR by approximately 15%, reflecting the expected severity of tail losses.
Standard-Portfolio Analysis Risk (SPAN, CME)
Considers 14 scenarios (moderate/large changes in vol, changes in price) + 2 extreme scenarios
Positions revalued under each scenario, and the risk measure is the maximum loss under the first 14 scenarios plus 35% of the loss under the two extreme scenarios
SPAN risk measure can be interpreted as maximum of expected loss under each of 16 probability measures, and is therefore coherent
Try to verify whether the following popular risk measures are coherent measures of risk or not
STs are procedures that gauge vulnerability to 'what if' events
Early stress tests 'back of envelope' exercises
Major improvements in recent years, helped by developments in computer power
Modern ST much more sophisticated than predecessors
Scenario ('what if') analysis
Mechanical stress tests
Can provide good risk information
Can guide decision making
ST can help firms to design systems to deal with bad events
Difficulty of working through scenarios in a consistent way
Need to follow through scenarios, and consequences can be very complex and can easily become unmanageable
Need to take account of interrelationships in a reasonable way
Need to take account of zero arbitrage relationships
Interpretation of ST results
STs do not address prob issues as such
Hence, always an issue of how to interpret results
This implies some informal notion of likelihood
Integrating ST and prob analysis
Can integrate ST and prob analysis using Berkowitz's coherent framework
This approach is judgemental, but does give an integrated analysis of both probs and STs
moving key variables one at a time
using historical scenarios
creating prospective scenarios
reverse stress tests
Simulated movement in major IRs, stock prices, etc.
Long been used in ALM analysis
Problem is to keep scenarios down in number, without missing plausible important ones
Based on actual historical events
Advantages
Can choose scenarios from a catalogue, which might include
Moderate market changes
More extreme events such as reruns of major crashes
These might be natural, political, legal, major defaults, counterparty defaults, economic crises, etc.
Can obtain them by alternate history exercises
Can also look to historical record to guide us in working out what such scenarios might look like
Having specified our set of scenarios, we then need to evaluate their effects
Key is to get an understanding of the sensitivities of our positions to changes in the risk factors being changed
Easy for some positions
Harder for options
Must pay particular attention to impact of stylised events on markets
Very unwise to assume that liquidity will remain strong in a crisis
If futures contracts are used hedges, must also take account of funding implications
Otherwise well-hedged positions can unravel because of interim funding
These approaches try to reduce subjectivity of SA and put ST on firmer foundation
Mechanical ST more systematic and thorough than SA, but also more intensive
Push each price or factor by a certain amount
Specify confidence level
Push factors up/down by
Revalue positions each time,
Work out most disadvantageous combination
This gives us worst-case maximum loss (ML)
FP is easy to program, at least for simple positions
Does not require very restrictive assumptions
Can be modified for correlations,
Results of FP are coherent
If we are prepared to make further assumptions, FP can also give us an indication of likelihoods
But FP rests on assumption that highest losses occur when factors move most, and this is not true for some positions
Solution to this problem is to search over interim values of factor ranges
This is MLO
MLO is more intensive, but will pick up high losses that occur within factor ranges
MLO is better for more complex positions, where it will uncover losses that FP might miss
Backtesting is the process to compare systematically the VaR forecasts with actual returns.
Backtesting compares the daily VaR forecast with the realized profit and loss (P&L) the next day.
Trading outcome
Need to obtain suitable P/L data
Accounting (e.g., GAAP) data often inappropriate because of smoothing, prudence etc
Want P/L data that reflect market risks taken
Need to clean data or use hypothetical P/L data (obtained by revaluing periods from day to day)
P/L data typically random
Portfolios and dfs often change from day to day
How to compare P/L data if underlying pdfs change?
Good practice to map P/L data to predicted percentile
This standardizes data to make observations comparable given changes in pdf or portfolio
Binomial Distribution
So, we would expect to observe 8.1% of samples with zero exceptions under the null hypothesis. We can repeat this calculation with different values for
Decision Rule for Backtests
Consider a VaR measure over a daily horizon defined at the 99% level of confidence
| Number of Exceptions | Probability | Cumulative Probability | Type 1 Error Rate |
|---|---|---|---|
| 0 | 0.0811 | 0.0811 | 100.00% |
| 1 | 0.2047 | 0.2858 | 91.89% |
| 2 | 0.2574 | 0.5432 | 71.42% |
| 3 | 0.2150 | 0.7581 | 45.68% |
| 4 | 0.1341 | 0.8922 | 24.19% |
| 5 | 0.0666 | 0.9588 | 10.78% |
| 6 | 0.0275 | 0.9863 | 04.12% |
| 7 | 0.0097 | 0.9960 | 01.37% |
| 8 | 0.0030 | 0.9989 | 00.40% |
| 9 | 0.0008 | 0.9998 | 00.10% |
| 10 | 0.0002 | 0.9999 | 00.03% |
| Zone | Number of Exceptions | Potential Increase in |
|---|---|---|
| Green | 0 to 4 | 0.00 |
| Yellow | 5 | 0.40 |
| Yellow | 6 | 0.50 |
| Yellow | 7 | 0.65 |
| Yellow | 8 | 0.75 |
| Yellow | 9 | 0.85 |
| Red | 1.00 |
If the number of exceptions falls within the yellow zone, the supervisor has discretion to apply a penalty, depending on the causes for the exceptions. The Basel Committee uses these categories:
Exception tests focus only on the frequency of occurrences
It ignores the time pattern of losses
假设我们采用100天的数据来对VaR进行回溯测试,VaR所采用的置信水平为99%,在100天的数据中我们观察到5次例外。如果我们因此而拒绝该VaR模型,犯第一类错误的概率为多少?
A risk management team reports the following VaR backtesting results for a trading desk over 500 days at 99% confidence: 12 exceptions were observed.
(a) Under the null hypothesis that the VaR model is correct, what is the expected number of exceptions?
(b) Using the normal approximation, compute the test statistic. Would you reject the model at the 5% significance level? (
(c) According to the Basel traffic-light system, which zone does this result fall in, and what is the regulatory consequence?
Two VaR models are being compared for a portfolio:
| Model A | Model B | |
|---|---|---|
| VaR confidence | 99% | 99% |
| Backtest window | 250 days | 250 days |
| Exceptions observed | 1 | 7 |
| Average loss beyond VaR | $2.5M | $1.2M |
(a) Using the binomial distribution, which model is more likely to be rejected at the 5% significance level?
(b) Despite having fewer exceptions, why might Model A actually be worse?
(c) What additional test could help distinguish the two models?
Key takeaways:
Next lecture: L03 — Volatility Modeling & Multivariate Risk Models (GARCH, copula, EVT, VaR methods)